MGDS Project Overview and Sample Metadata (Arko) 1 of 18 SESAR–IGSN Workshop (February 26-27, 2007)

PROJECT TEAM:

Joyce Alsop*
Robert Arko
Suzanne Carbotte (lead)
Dale Chayes
John Diebold
Vicki Ferrini
Andrew Goodwillie*
Kerstin Lehnert
Andrew Melkonian
Suzanne O’Hara
William Ryan
R.A. Weissel

MGDS PROJECT OVERVIEW AND SAMPLE METADATA


OUTLINE

1. PROJECT OVERVIEW

2. CURRENT HOLDINGS

3. DATA MODEL

4. METADATA SUBMISSION

5. CHALLENGES


OVERVIEW: MISSION STATEMENT

Design and maintain an integrated data repository for MG&G communities:

• Ridge 2000 Program

• MARGINS Program

• U.S. Antarctic Program

• Legacy - Multibeam Synthesis

• Seismic Reflection

Joint funding from NSF OCE + EAR + OPP


OVERVIEW: SCOPE AND PARTNERS

Data from marine and terrestrial realms

Data from all disciplines - biological, physical, chemical, geological

Project partners:
• WHOI (Ridge 2000 Program)
• TAMU (MARGINS Program)
• RPSC (U.S. Antarctic Program)
• NGDC, CCOM (Legacy - Multibeam Synthesis)
• UTIG (Seismic Reflection)

Collaborative partners:
• DLESE (education modules)
• MMI (community/ontology development)
• SESAR (sample registration)


OVERVIEW: SCIENTIFIC RATIONALE

• Ensure ability to verify research results

• Preserve expensive/unique/unrepeatable data

• Supplement traditional publication methods

• Facilitate cross-disciplinary research

• Increase data availability to non-specialists

• Enable automated analysis + synthesis


OVERVIEW: SYSTEM COMPONENTS

PRODUCTS

• Metadata catalog (1500+ collections)

• Data repository (210,000+ files totaling 5+ TB, in partnership with SDSC)

• Global syntheses (e.g. multi-resolution DEM)

SERVICES

• Web portals (search + download)

• GeoMapApp® (integrate + visualize data from multiple sources)

• Web services (OAI, OGC, etc.)


CURRENT HOLDINGS: SOLID EARTH SAMPLES

50 NEW DATA SETS

OVER 3500 SAMPLES

(growing rapidly…)

COLLECTION            TYPE      INVESTIGATORS
AT03-24               Rock      Fisher
AT03-38               Rock      Fisher
AT11-07               Rock      Perfit
AT11-07               Rock      Schouten
AT11-09               Rock      Blake
AT11-09               Rock      Reysenbach
AT11-09               Rock      Von Damm
AT11-09               Sediment  Inderbitzen
AT11-10               Rock      Vetriani
AT11-20               Rock      Edwards
AT11-26               Rock      Vetriani
AT15-06               Rock      Perfit, Sievert, Haymon
AT15-09               Rock      Kelley
AT15-12               Rock      Bright
COOK06MV              Rock      Fryer
COOK07MV              Rock      Bloomer
DANA01RR              Rock      Lonsdale
DANA02RR              Rock      Lonsdale
DANA07RR              Rock      Lonsdale
DANA08RR              Rock      Lonsdale
EW0004                Rock      Sinton
EW0104                Sediment  Underwood
KM0417                Rock      Langmuir
KM0502                Sediment  Kuehl
KM0503                Sediment  Alexander
KN182-13              Rock      Forsyth
Mariana_Forearc_2002  Rock      Reagan
MGLN07MV              Rock      Langmuir
TAN0613               Sediment  Alexander
TCS06NH               Rock      Perfit
TCS06NH               Rock      Perfit, Rubin
TN154                 Rock      Fryer
TN154                 Sediment  Fryer
TUIM05MV              Rock      Tivey
VANC02MV              Sediment  Underwood, Spinelli
VANC13MV              Sediment  Ogston
VANC14MV              Sediment  Nittrouer
VANC15MV              Sediment  Nittrouer
VANC16MV              Sediment  Ogston
VANC19MV              Sediment  Ogston
VANC20MV              Sediment  Nittrouer
VANC21MV              Sediment  Goni
VANC21MV              Sediment  Nittrouer
VANC22MV              Sediment  Driscoll
VANC23MV              Sediment  Driscoll
VANC27MV              Sediment  Ogston
VANC28MV              Sediment  Nittrouer
VANC29MV              Sediment  Nittrouer
VANC30MV              Sediment  Nittrouer
WF2983                Rock      Gill


DATA MODEL:

MULTIPLE WEB PORTALS TO SERVE DIFFERENT COMMUNITIES

::

SINGLE INTEGRATED DATABASE BACKEND


DATA MODEL:

COLLECTION (registration = ?)
• Field
• Observatory
• Expedition
• Derived

SET (registration = STD-DOI)
• group of data objects having common provenance

OBJECT (registration = IGSN)
• Data File
• Real-time
• Processed
• Sample

COLLECTION > SET > OBJECT
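The three levels on this slide could be sketched roughly as follows. This is only an illustration in Python, not the MGDS schema: the class names (`Collection`, `DataSet`, `DataObject`), the field names, and the identifiers `EXAMPLE-01`, `sample-A`, and `sample-B` are all assumptions. The slide gives only the level names and the registration scheme for each level, and leaves the collection-level scheme open ("?").

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class DataObject:
    """Lowest level: a data file or a sample (registration = IGSN on the slide)."""
    name: str
    kind: str                            # e.g. "Data File" or "Sample"
    registration: Optional[str] = None   # hypothetical slot for an identifier


@dataclass
class DataSet:
    """Group of data objects having common provenance (registration = STD-DOI)."""
    name: str
    registration: Optional[str] = None
    objects: List[DataObject] = field(default_factory=list)


@dataclass
class Collection:
    """Top level; the slide leaves its registration scheme undecided."""
    name: str
    kind: str                            # e.g. "Expedition"
    sets: List[DataSet] = field(default_factory=list)

    def count_objects(self) -> int:
        """Total number of objects across all sets in this collection."""
        return sum(len(s.objects) for s in self.sets)


# Hypothetical usage with placeholder names.
cruise = Collection(name="EXAMPLE-01", kind="Expedition")
rocks = DataSet(name="rock samples")
rocks.objects.append(DataObject(name="sample-A", kind="Sample"))
rocks.objects.append(DataObject(name="sample-B", kind="Sample"))
cruise.sets.append(rocks)
print(cruise.count_objects())  # 2
```

Keeping the registration value as an optional string on each level mirrors the slide's point that each level may be registered under a different scheme.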


DATA MODEL: COLLECTION METADATA

related collections
collection aliases (at other repositories)
platform/operator
funding agency/awards
project titles/urls
science party (field + lab personnel)
lat/lon bins
location (physio features, place names)
supporting documents (cruise reports etc.)
references (citations)


DATA MODEL: ACQUISITION EVENTS

1. LAUNCH (independent, navigated)
• daughter platforms e.g. Submersible, Drone, Small Boat

2. LINE (navigated)
• towed platforms e.g. Camera, MCS, TowYo

3. STATION (only start/stop)
• lowered platforms e.g. Core, Grab, CTD, BLISP
• towed platforms e.g. Dredge, Net
• deployed platforms e.g. OBS, Marker, Float, Probe

Events can be nested (e.g. Dive > Station)
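Event nesting such as "Dive > Station" could be represented with a parent reference on each event. The sketch below is a minimal illustration under assumptions: the `Event` class, its field names, and the identifiers `dive-1` and `stn-1` are hypothetical; only the three event types and the example platform names come from the slide.

```python
from dataclasses import dataclass
from typing import List, Optional

# The three acquisition event types named on the slide.
EVENT_TYPES = ("LAUNCH", "LINE", "STATION")


@dataclass
class Event:
    event_id: str
    event_type: str
    platform: str
    parent: Optional["Event"] = None   # enclosing event, if this one is nested

    def __post_init__(self) -> None:
        # Reject anything outside the three known event types.
        if self.event_type not in EVENT_TYPES:
            raise ValueError(f"unknown event type: {self.event_type}")

    def path(self) -> List[str]:
        """Return event ids from the outermost event down to this one."""
        chain = []
        node: Optional["Event"] = self
        while node is not None:
            chain.append(node.event_id)
            node = node.parent
        return list(reversed(chain))


# Hypothetical example: a marker deployed during a submersible dive.
dive = Event("dive-1", "LAUNCH", "Submersible")
station = Event("stn-1", "STATION", "Marker", parent=dive)
print(" > ".join(station.path()))  # dive-1 > stn-1
```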


DATA MODEL: SAMPLE METADATA

collection_id

sample_id
sample_name (investigator’s pet name)
parent_id

data_type (e.g. “Rock Sample”)
sample_type (e.g. “Igneous: Volcanic: Mafic”)

launch_id
line_id
station_id ---> station_type (e.g. “Bottom: Towed”) + station_platform (e.g. “Dredge”)

start_date
start_longitude/latitude/elevation
stop_date
stop_longitude/latitude/elevation
navfix_type

local_origin/units (e.g. for dive programs)
start_local_x/y
stop_local_x/y

location_id (physiographic feature)
tectonic_setting (e.g. “Back-Arc Basin”)

investigator_id
contact_id
contributor_id

repository_id (holds authoritative metadata)
facility_id (holds physical sample)

other/details
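The arrow from station_id to station_type and station_platform suggests that a sample record stores only the station identifier, with the type and platform looked up from the station. A rough sketch of that lookup is shown below. The `STATIONS` table, the `describe_sample` helper, and the identifiers `stn-001`, `smp-001`, and `EXAMPLE-01` are hypothetical; the field names and example values are taken from the slide.

```python
from typing import Dict, Optional, TypedDict


class Station(TypedDict):
    station_type: str
    station_platform: str


# Hypothetical station lookup table, keyed by station_id.
STATIONS: Dict[str, Station] = {
    "stn-001": {"station_type": "Bottom: Towed", "station_platform": "Dredge"},
}


def describe_sample(sample: Dict[str, str]) -> Dict[str, str]:
    """Return a copy of the record with station fields filled in, if known."""
    expanded = dict(sample)
    station_id: Optional[str] = sample.get("station_id")
    station = STATIONS.get(station_id) if station_id is not None else None
    if station is not None:
        expanded.update(station)
    return expanded


# Placeholder record using a subset of the fields listed above.
sample = {
    "collection_id": "EXAMPLE-01",
    "sample_id": "smp-001",
    "data_type": "Rock Sample",
    "sample_type": "Igneous: Volcanic: Mafic",
    "station_id": "stn-001",
    "tectonic_setting": "Back-Arc Basin",
}
print(describe_sample(sample)["station_platform"])  # Dredge
```

Storing only the identifier on the sample keeps the station's type and platform in one place, so a correction to the station applies to every sample taken there.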


DATA MODEL: CONTROLLED VOCABULARIES

(both types and identifiers)

collection_id collection_type data_type device_type dive_type feature_id feature_type format_id initiative_id language_id launch_platform_type launch_type line_platform_type line_type location_id nav_type organization_id person_id platform_id platform_type role_id role_type station_platform_type station_type status_id

(and still growing…)
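One way controlled vocabularies like these might be applied is to check submitted values against the list of accepted terms. The sketch below assumes a hypothetical `VOCABULARIES` mapping and `vocabulary_errors` helper; the two tiny vocabularies contain only the example terms that appear on the sample metadata slide and are not the actual MGDS term lists.

```python
from typing import Dict, List, Set

# Hypothetical, deliberately tiny vocabularies for illustration only.
VOCABULARIES: Dict[str, Set[str]] = {
    "data_type": {"Rock Sample"},
    "station_type": {"Bottom: Towed"},
}


def vocabulary_errors(record: Dict[str, str]) -> List[str]:
    """List every field whose value is not an accepted term.

    Fields absent from the record are skipped; checking for missing
    fields is a separate concern.
    """
    errors = []
    for field_name, allowed in VOCABULARIES.items():
        value = record.get(field_name)
        if value is not None and value not in allowed:
            errors.append(f"{field_name}: {value!r} is not a controlled term")
    return errors


print(vocabulary_errors({"data_type": "Rock Sample"}))  # []
print(vocabulary_errors({"data_type": "rock"}))
```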


METADATA SUBMISSION: POLICY

Records made public immediately:

• people/projects/awards

• primary navigation

• catalog of acquisition events

• catalog of data sets

• catalog of samples


METADATA SUBMISSION: FORMS

1. Contact chief scientist in advance; designate a science party liaison

2. Follow up with liaison (60 days)

3. Register/submit data sets to appropriate partner repositories


METADATA SUBMISSION: FORMS

Example: Sediment Cores (based on LDEO Repository log sheet)


CHALLENGES:

1. Metadata form submission
• completeness
• consistent identifiers and formats

2. Globally unique identifiers

3. Evolving/shared vocabularies
• Physiographic Feature (gazetteer + local features e.g. Vents)
• Tectonic Setting
• Sample Type (domain specific)
• Station Platform/Type
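The completeness challenge in item 1 could be addressed in part with an automated check on each submitted form. The following is a minimal sketch under assumptions: which fields are mandatory is not stated in the slides, so `REQUIRED_FIELDS` is a made-up choice, and `missing_fields` and `EXAMPLE-01` are hypothetical names.

```python
from typing import Dict, List, Optional, Sequence

# Assumption: the slides do not say which fields are mandatory.
REQUIRED_FIELDS = ("collection_id", "sample_id", "data_type")


def missing_fields(
    record: Dict[str, Optional[str]],
    required: Sequence[str] = REQUIRED_FIELDS,
) -> List[str]:
    """Return required field names that are absent, None, or blank."""
    return [
        name for name in required
        if not str(record.get(name) or "").strip()
    ]


# A submission with a blank sample_id and no data_type.
submitted = {"collection_id": "EXAMPLE-01", "sample_id": " "}
print(missing_fields(submitted))  # ['sample_id', 'data_type']
```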


Questions?

marine-geo.org