22
A Data Management Life- Cycle By David Ferderer Project Chief Chris Skinner Contractor Greg Gunther Contractor dferdere @usgs.gov

A Data Management Life-Cycle By David Ferderer Project Chief Chris SkinnerContractor Greg GuntherContractor [email protected]

Embed Size (px)

Citation preview

Page 1: A Data Management Life-Cycle By David Ferderer Project Chief Chris SkinnerContractor Greg GuntherContractor dferdere@usgs.gov

A Data Management Life-Cycle

By

David Ferderer Project ChiefChris Skinner ContractorGreg Gunther Contractor

[email protected]

Page 2: A Data Management Life-Cycle By David Ferderer Project Chief Chris SkinnerContractor Greg GuntherContractor dferdere@usgs.gov

Presentation Outline

• USGS Landscape

• Life-Cycle Model and Strategy

• Component Descriptions (Skinner)

• Demonstration (Gunther)

• Conclusions and Future Directions

Page 3: A Data Management Life-Cycle By David Ferderer Project Chief Chris SkinnerContractor Greg GuntherContractor dferdere@usgs.gov

USGS Landscape - Energy Program

• What We Do– Provides Science-Based Energy Assessments

• Organization Issues – Regional Centers and Competitive Funding Process

– Multiple Project Areas, Applications, Data Types, and Platforms

• Information Issues– Technology and Data Explosion

– Access, Delivery, and Archive Requirements

– Diverse Client and Product Needs

• Policy and Mandates

Page 4: A Data Management Life-Cycle By David Ferderer Project Chief Chris SkinnerContractor Greg GuntherContractor dferdere@usgs.gov

USGS Landscape - Central Energy Team

• 125 Full and Part-Time Employees– Independent Thinkers and Researchers

• Multiple Application Platforms– UNIX (ArcInfo 8, ArcView 3x, SDE 3, ORACLE 8,

EarthVision, Seismic, PETROMOD)

– PC/NT (ArcInfo 8, ArcView 3x, Geographix)

• Centralized and Distributed Data Storage

• 100mb Fast Ethernet Network

Page 5: A Data Management Life-Cycle By David Ferderer Project Chief Chris SkinnerContractor Greg GuntherContractor dferdere@usgs.gov

Central Energy Team “Information” Shift

Data ManagementInformation Services

GIS

ProjectLife-Cycle

Integration

Page 6: A Data Management Life-Cycle By David Ferderer Project Chief Chris SkinnerContractor Greg GuntherContractor dferdere@usgs.gov

Life-Cycle Model and Strategy

• Life-Cycle Model (Conceptual)– A Series of Processes and Utilities that Manage the Flow of Data

to Information, Products, and Knowledge

• Life-Cycle Implementation Strategy (Actual)– Processes are Translated into the Find, Get, Use, Deliver, and

Maintain Strategy

– Strategy Defines Tasks, Components, and Deliverables

Page 7: A Data Management Life-Cycle By David Ferderer Project Chief Chris SkinnerContractor Greg GuntherContractor dferdere@usgs.gov

Implementation Strategy

• DM Finds Internal and External Data Resources

• DM Gets the Data Organized, Documented, and

Accessible to Team Projects

• Projects Use the Data and Other Resources in Research

• DM Assists Projects in Delivering Products to Public

• DM Maintains the System and Upgrades Components

Page 8: A Data Management Life-Cycle By David Ferderer Project Chief Chris SkinnerContractor Greg GuntherContractor dferdere@usgs.gov

Strategy Components and Utilities (Internal USGS)

FindExternal Data

and Information GetData Organized

UseData and Other Resources

In Research Projects

FindInternal Data

andInformation(Archive and

Reuse)

DeliverData and Knowledge to Projects and the Public

Maintain(Upgrades and

Documentation)

Team Data Library

ArchiveLibrary

InventoryDatabase

MetadataUtilities

Data ProcessingUtilitiesProject

DesignIntranetResources

HypermediaPublications

CD-ROMTemplates

Data Life-Cycle

Page 9: A Data Management Life-Cycle By David Ferderer Project Chief Chris SkinnerContractor Greg GuntherContractor dferdere@usgs.gov

Team Data Library

• Centralized Storage

– Team Data Resources (primarily spatial)

– Theme and Sub-Theme Organization

• Standardized– Naming Conventions

– Directory Structure

– Storage Formats (e00, shape, SDE)

– Common Data Projection (geographic)

– Metadata

– Browse Graphics

Team Data Library

Page 10: A Data Management Life-Cycle By David Ferderer Project Chief Chris SkinnerContractor Greg GuntherContractor dferdere@usgs.gov

Team Archive Library

• Offline Storage of Team Data Resources

• Contains – Publications

– USGS Digital Data Products (DLG, DEM, DOQ)

– Team Archives

• Standardized File Names and Directory Structure

ArchiveLibrary

Page 11: A Data Management Life-Cycle By David Ferderer Project Chief Chris SkinnerContractor Greg GuntherContractor dferdere@usgs.gov

Inventory Database

• MS Access Database Tracking Team’s Data Holdings

• Contains– 60 Information Fields (10 Required) in 21 Tables

– 28 Fields Corresponding to FGDC Metadata Elements

– Inventoried 4600 Datasets and 680 Archives (> 500 GB)

InventoryDatabase

Page 12: A Data Management Life-Cycle By David Ferderer Project Chief Chris SkinnerContractor Greg GuntherContractor dferdere@usgs.gov

Inventory Database

• Features– Tracks Multiple Types of Data (Spatial, Text, Graphic and Tabular)

– Separately Tracks Archives, Publications, and Individual datasets

– Automatic Loading and Editing Scripts

– Serves as the Engine to DART…

InventoryDatabase

Page 13: A Data Management Life-Cycle By David Ferderer Project Chief Chris SkinnerContractor Greg GuntherContractor dferdere@usgs.gov

DART

• Data Access, Retrieval, and Tracking System

– Easy Access to Team Data Resources via Web Browsers

– Customized Search and Browse of Archives, Publications, and Datasets

– Direct Data and Metadata Download to User’s Desktop

– Object-Oriented Application

– Java Server Pages on ServeletExec 3.1

– Stay Tuned for the Demonstration!

Page 14: A Data Management Life-Cycle By David Ferderer Project Chief Chris SkinnerContractor Greg GuntherContractor dferdere@usgs.gov

Metadata Utilities

• Web-Based Metadata Entry and Creation System – Users Generate, Modify, and Save Compliant Metadata Output

to the Desktop– Provides a Simplified and Comprehensive Online Help System

• Contains– Links to Other Metadata Tools and Resources

– Library of Metadata

MetadataUtilities

Page 15: A Data Management Life-Cycle By David Ferderer Project Chief Chris SkinnerContractor Greg GuntherContractor dferdere@usgs.gov

Other Data Management Products

• Data Processing and Automation Utilities– Portal to ‘How-To’, AMLs, and FAQ Documents Residing in

the Team and On the WWW

• Project and Workspace Design Recommendations– Templates Promote Efficient Work-Flow, Data Organization,

Archives, and Rapid Publication

• CD-ROM Templates and Hypermedia Distribution

Data ProcessingUtilities

ProjectDesign

HypermediaPublications

CD-ROMTemplates

Page 16: A Data Management Life-Cycle By David Ferderer Project Chief Chris SkinnerContractor Greg GuntherContractor dferdere@usgs.gov

Maintenance

• DM Provides Continual Maintenance and Upgrades of System Components

• Develop Publications and Documentation – User Manuals

– Formal Component Documentation

– Templates, Guidelines, and Policies

– Fact Sheets and Bulletins

Page 17: A Data Management Life-Cycle By David Ferderer Project Chief Chris SkinnerContractor Greg GuntherContractor dferdere@usgs.gov

Demonstration

Greg Gunther

Page 18: A Data Management Life-Cycle By David Ferderer Project Chief Chris SkinnerContractor Greg GuntherContractor dferdere@usgs.gov

System Summary

• Easy Access to Datasets

• Generate Metadata Quickly and Easily

• Find External Data with Over 1000 WWW Links

• Simplify Data Processing Tasks

• Organizes Projects with Workspace Templates

• Streamlines CDROM Publications

• Provides One-Stop Shopping For Shared Internal Resources

Page 19: A Data Management Life-Cycle By David Ferderer Project Chief Chris SkinnerContractor Greg GuntherContractor dferdere@usgs.gov

Future Directions

• Increase Inventory Effort

• Integrate GeoDatabase Model (ArcGIS) for Proprietary Datasets

• Formalize Metadata Extension to FDGC Standard

• Streamline Product Delivery - Implement IMS

• Publish Documented Tools and Utilities

• Implement Enterprise Architecture and Planning

Page 20: A Data Management Life-Cycle By David Ferderer Project Chief Chris SkinnerContractor Greg GuntherContractor dferdere@usgs.gov

Future Architecture

Enterprise Planning*

*Modified from Spewak Model

Planning & Initiatives

BusinessProcesses

Current Systems

Getting Started

Where We Are Today

Where We Want To Be

Plan To The Future Implementation and Migration Plans

Data Architecture

GIS &ApplicationArchitecture

IS/ITArchitecture

Page 21: A Data Management Life-Cycle By David Ferderer Project Chief Chris SkinnerContractor Greg GuntherContractor dferdere@usgs.gov

Conclusions – What We Have Learned

Data Management:

• It’s ESSENTIAL for Survival But Needs to be Promoted

• Distributed Projects REQUIRE Data Centralization

• Projects RARELY Account for Data Management Planning and

Costs

• Data Stewardship MUST Begin at the Onset of Projects

• The Terms EASY and USEFUL - Lead to Implementation

• Component Model Must be FLEXIBLE to Adapt to Technology

Trends

Page 22: A Data Management Life-Cycle By David Ferderer Project Chief Chris SkinnerContractor Greg GuntherContractor dferdere@usgs.gov

The End

And

The Beginning Of a New Cycle…