39
Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008

Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University

Embed Size (px)

Citation preview

Page 1: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University

Distributed Data Management

Distributed Data Management

Jeremy Ticey, Hampton UniversityUniversity of Michigan, CERN REU Program

August 14, 2008

Jeremy Ticey, Hampton UniversityUniversity of Michigan, CERN REU Program

August 14, 2008

Page 2: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University

Introduction

ATLAS Project

Distributed Data Management Group

Don Quijote 2

DQ2 End-User Tools

Python Programming

Django Web Development

Comparison/Progression

Cultural Experiences

Page 3: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University

ATLAS Project

An experiment at the Large Hadron Collider (LHC) at CERN Laboratories

Uses high energy collisions of protons that are propelled around the radius of the accelerator

General purpose detector that aims to study the basic forces of the universe

Page 4: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University

Distributed Data Management Group

Distributed Data Management Group manages access to ATLAS data that is distributed to different locations around the world

Ultimate goal is to make the search and download of datasets as simple and efficient as possible

Numerous possibilities

Further development of current tools

Peer-To-Peer (P2P)

Web Interface

Page 5: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University

Don Quijote 2 (DQ2)

Developed by the Distributed Data Management Group

The ATLAS Experiment Data Management System

Bookkeeping system that stores data as datasets

Its goal is to integrate all Grid data management services used by the ATLAS Experiment

Provides physicists access to file-resident event data

https://twiki.cern.ch/twiki/bin/view/Atlas/WorkBookDQ2

Page 6: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University

DQ2 ClientsEnd-User Tools End-User Tools

DQ2-Ls -- Search for Datasets

[ -f ] -- List Files in Dataset

DQ2-Get -- Download Datasets

[ -f ] -- Download a single File

DQ2-Put

Page 7: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University

project goal

To develop a prototype for a dataset search and retrieval system using a web interface

Page 8: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University

Python Script

Python

Developed a Dataset Search Engine

Through terminal, you can run the program to find any dataset produced by the ATLAS experiment

Aims to reduce the repetition of issuing commands

Page 9: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University

Django web developmentWeb Framework tool that implements the Python programming language to create web pages

Main components

urls.py

views.py

templates (HTML)

Page 10: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University

Comparison/ progression

This is a prototype of one method of obtaining access to the datasets

It will be used to determine if this is a viable option for the searching and retrieving datasets

DQ2 -----------> Python -----------> Web Interface

Page 11: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University

SetupSetup

Page 12: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University
Page 13: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University
Page 14: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University
Page 15: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University

SearchSearch

Page 16: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University
Page 17: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University
Page 18: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University
Page 19: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University

File ResultsFile Results

Page 20: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University
Page 21: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University
Page 22: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University
Page 23: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University

DownloadDownload

Page 24: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University
Page 25: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University
Page 26: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University
Page 27: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University

Conclusion

Produced two functional dataset search and retrieval systems that will serve as prototypes for further testing

To improve:

“Behind the scenes” downloader

Create a “waiting room” while datasets are being retrieved

Page 28: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University

Cultural Experiences

✦ Interlaken, Switzerland

Page 29: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University
Page 30: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University

Cultural Experiences

✦ Interlaken, Switzerland

✦ Paris, France

Page 31: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University
Page 32: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University

Cultural Experiences

✦ Interlaken, Switzerland

✦ Paris, France

✦ Barcelona, Spain

Page 33: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University
Page 34: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University

Cultural Experiences

✦ Interlaken, Switzerland

✦ Paris, France

✦ Barcelona, Spain

✦ Jura Mountains

Page 35: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University
Page 36: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University

Cultural Experiences

✦ Interlaken, Switzerland

✦ Paris, France

✦ Barcelona, Spain

✦ Jura Mountains

✦ Geneva, Switzerland

Page 37: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University
Page 38: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University

Acknowledgements

Miguel Branco

Mario Lassnig

Vincent Garonne

Dr. Homer Neal

Dr. Jean Krisch

Dr. Myron Campbell

Colleagues

Page 39: Distributed Data Management Jeremy Ticey, Hampton University University of Michigan, CERN REU Program August 14, 2008 Jeremy Ticey, Hampton University

Questions??

Questions??