12
Presented by SciDAC-2 Petascale Data Storage Institute Philip C. Roth Future Technologies Group Computer Science and Mathematics Division

Presented by SciDAC-2 Petascale Data Storage Institute Philip C. Roth Future Technologies Group Computer Science and Mathematics Division

Embed Size (px)

Citation preview

Page 1: Presented by SciDAC-2 Petascale Data Storage Institute Philip C. Roth Future Technologies Group Computer Science and Mathematics Division

Presented by

SciDAC-2Petascale Data Storage Institute

Philip C. RothFuture Technologies Group

Computer Science and Mathematics Division

Page 2: Presented by SciDAC-2 Petascale Data Storage Institute Philip C. Roth Future Technologies Group Computer Science and Mathematics Division

2 Roth_PDSI_SC07

Petascale computing makes petascale demands on storage.

Performance

Capacity

Concurrency

Reliability

Availability

Manageability

Parallel file systems are barely keeping pace at terascale; the challenges will be much greater at petascale.

Cray XT

Cray X1E

The petascale storage problem

at ORNL

Page 3: Presented by SciDAC-2 Petascale Data Storage Institute Philip C. Roth Future Technologies Group Computer Science and Mathematics Division

3 Roth_PDSI_SC07

Petascale Data Storage Institute

The PDSI is an institute in the Department of Energy (DOE) Office of Science’s Scientific Discovery through Advanced Computing (SciDAC-2) program.

Using diverse expertise with applications and fileand storage systems, members will collaborate on requirements, standards, algorithms, and analysis tools.

Led by Dr. Garth Gibson, Carnegie Mellon University

http://www.pdsi-scidac.org

Page 4: Presented by SciDAC-2 Petascale Data Storage Institute Philip C. Roth Future Technologies Group Computer Science and Mathematics Division

4 Roth_PDSI_SC07

Carnegie Mellon University

Participating institutions

Lawrence Berkeley National Laboratory/NERSC

Los Alamos National Laboratory

Pacific Northwest National Laboratory

Sandia National Laboratories

Oak Ridge National Laboratory

University of California at Santa Cruz

University of Michigan at Ann Arbor

Page 5: Presented by SciDAC-2 Petascale Data Storage Institute Philip C. Roth Future Technologies Group Computer Science and Mathematics Division

5 Roth_PDSI_SC07

Novel storage mechanisms Novel storage mechanisms

IT automation IT automation

Standards and APIs Standards and APIs

Community building Community building

Failure data collection Failure data collection

Performance data collection Performance data collection

Petascale Data Storage Institute agenda

Collection Collection

Dissemination Dissemination

Innovation Innovation

Main thrusts Projects

Page 6: Presented by SciDAC-2 Petascale Data Storage Institute Philip C. Roth Future Technologies Group Computer Science and Mathematics Division

6 Roth_PDSI_SC07

Collection: Performance analysis

Performance data collection and analysis

Workload characterization

Benchmark collection and publication

6 Roth_PDSI_0711

Led by William Kramer,National Energy ResearchScientific Computing Center (NERSC)

Page 7: Presented by SciDAC-2 Petascale Data Storage Institute Philip C. Roth Future Technologies Group Computer Science and Mathematics Division

7 Roth_PDSI_SC07

0 10 20 30 40 50 600

10

30

40

50

60

70

80

20

Months in production use

Fai

lure

s p

er m

on

th

UnknownHumanEnvironmentNetworkSoftwareHardware

http://institutes.lanl.gov/datahttp://www.pdl.cmu.edu/FailureData

Collection: Failure analysis

Capture and analyze failure, error, and usage data from high-end computing systems

Initial example: Los Alamos failure data available for 22 systems over 9 years with extensive analysis by Bianca Schroeder, Carnegie Mellon University

Led by Gary Grider, Los Alamos National Laboratory

Page 8: Presented by SciDAC-2 Petascale Data Storage Institute Philip C. Roth Future Technologies Group Computer Science and Mathematics Division

8 Roth_PDSI_SC07

Dissemination: Outreach

Our approach Workshops (SC07 Petascale Data Storage Workshop, November

11) Tutorials and course materials Online, open repository with documents, tools, and performance

and failure data

Target audience Computational scientists Academia (professors and students) Industry (storage researchers and developers)

Led by Dr. Garth Gibson, Carnegie Mellon University

Goal: To disseminate information about techniques, mechanisms, best practices,

and available tools

Goal: To disseminate information about techniques, mechanisms, best practices,

and available tools

Page 9: Presented by SciDAC-2 Petascale Data Storage Institute Philip C. Roth Future Technologies Group Computer Science and Mathematics Division

9 Roth_PDSI_SC07

Dissemination: Standards and APIs

Some work under way POSIX extensions

e.g., support for weak data and metadata consistency http://www.pdl.cmu.edu/posix

Parallel Network File System (pNFS) In IETF NFSv4.1 standard draft University of Michigan Center for Information Technology

Integration producing reference implementation http://www.pdl.cmu.edu/pNFS

Led by Gary Grider, Los Alamos National Laboratory

Goals: To facilitate standards development and deployment and to validate and demonstrate

new extensions and protocols

Goals: To facilitate standards development and deployment and to validate and demonstrate

new extensions and protocols

Page 10: Presented by SciDAC-2 Petascale Data Storage Institute Philip C. Roth Future Technologies Group Computer Science and Mathematics Division

10 Roth_PDSI_SC07

Innovation

IT automation appliedto high-end computing systems and problems

Novel mechanisms forcore high-end computingstorage problems

Storage system instrumentationfor machine learning

Data layout andaccess planning

Automated diagnosis,tuning, and failure recovery

WAN/global storage access

High-performancecollective operations

Rich metadata at scale

Integration with system virtualization technology

Led by Dr. Garth Gibson,Carnegie Mellon University

Led by Darrell Long,University of California at Santa Cruz

Page 11: Presented by SciDAC-2 Petascale Data Storage Institute Philip C. Roth Future Technologies Group Computer Science and Mathematics Division

11 Roth_PDSI_SC07

Summary

The Petascale Data Storage Institute brings together individuals with expertise in file and storage systems, applications, and performance analysis.

PDSI is a focal point for computational scientists, academia, and industry for storage-related information and tools, both within and outside SciDAC-2.

http://www.pdsi-scidac.org

Page 12: Presented by SciDAC-2 Petascale Data Storage Institute Philip C. Roth Future Technologies Group Computer Science and Mathematics Division

12 Roth_PDSI_SC07

Contact

Philip C. RothFuture Technologies GroupComputer Science and Mathematics Division(865) [email protected]

12 Roth_PDSI_SC07