13
R2R+BCO-DMO – Linked Oceanographic Datasets Adila Krisnadhi 1,5 Robert Arko 2 Suzanne Carbotte 2 Cynthia Chandler 3 Michelle Cheatham 1 Pascal Hitzler 1 Yingji Hu 4 Krzysztof Janowciz 4 Peng Ji 2 Nazifa Karima 1 Adam Shepherd 3 Peter Wiebe 3 1 Data Semantics Lab, Wright State University 2 Lamont-Doherty Observatory, Columbia University 3 Woods Hole Oceanographic Institution 4 Geography Department, University of California, Santa Barbara 5 Faculty of Computer Science, Universitas Indonesia Diversity++ 2015 Krisnadhi, et al Diversity++ 2015 1 / 13

Diversity++2015 talk: R2R+BCO-DMO - Linked Oceanographic Datasets

Embed Size (px)

Citation preview

R2R+BCO-DMO – Linked Oceanographic Datasets

Adila Krisnadhi1,5 Robert Arko2 Suzanne Carbotte2 CynthiaChandler3 Michelle Cheatham1 Pascal Hitzler1 Yingji Hu4

Krzysztof Janowciz4 Peng Ji2 Nazifa Karima1 Adam Shepherd3

Peter Wiebe3

1Data Semantics Lab, Wright State University

2Lamont-Doherty Observatory, Columbia University

3Woods Hole Oceanographic Institution

4Geography Department, University of California, Santa Barbara

5Faculty of Computer Science, Universitas Indonesia

Diversity++ 2015

Krisnadhi, et al Diversity++ 2015 1 / 13

Why Linked Data for Oceanography

Data proliferation

Increased number of repositories ⇒ increased heterogeneity.

Need to discover, access, and integrate data cross repositories

R2R & BCO-DMO are repositories.

Both hold datasets of field observations.Linked data is for metadata of those datasets.Linked data objective: starting point to enable dataset discovery.Additional benefit: attribution of datasets to contributors in the formof links.

Krisnadhi, et al Diversity++ 2015 2 / 13

Rolling Deck Repository (R2R)

Screen shot (10/10/2015) from: http://www.rvdata.us/catalog/Kilo_Moana

Krisnadhi, et al Diversity++ 2015 3 / 13

R2R

http://www.rvdata.us

Every NSF-funded cruise on a vessel in theacademic fleet creates an R2R record.

Environmental sensor data on-board vessels.

Catalog of vessels, instrumentation systems, expeditions, datasets,investigators, organizations, funding awards, cruise reports, andnavigation tracks.

>530k triples, 25 in-service vessels, >4.3k cruises, >18 mil. archivedfiles

60,000 page views per month.

Krisnadhi, et al Diversity++ 2015 4 / 13

R2R: Architecture

Original picture from: http://www.rvdata.us/system/files/overview.png as displayed on (10/10/2015) at

http://www.rvdata.us/overview

Krisnadhi, et al Diversity++ 2015 5 / 13

R2R - http://data.rvdata.us

Krisnadhi, et al Diversity++ 2015 6 / 13

Biological and Chemical Oceanography Data ManagementOffice (BCO-DMO)

Screen shot (10/10/2015) from: http://mapservice.bco-dmo.org/mapserver/maps-ol/index.php

Krisnadhi, et al Diversity++ 2015 7 / 13

BCO-DMO: Architecture

BCO-DMO Data Management Architectural Overview

Metadata Database and Web Content

Data ServerData ServerData Server

Geospatial Access

MapServer-cartography

OpenLayers-interface; interrogate and draw features

ExtJS and other JavaScript libraries-environment

MySQL-metadata

BCO-DMO Website

Public access via Drupal

Web content and metadata

JGOFS/GLOBEC Backend Data Storage and

Retrieval

Supporting Software- Drupal- PHP, Perl- Load navigation and date information into Location table (Perl)- Report modules- NSF Tracker subsystem

Data Manager access

Metadata and web content insert, update, delete and display

Perl LibraryPerl code calling REST

API via Drupal

November 21, 2013

HighlightsText based interface; Geospatial (MapServer) interface; Metadata database stored in Drupal CMS; Distributed backend data management system; Fitness for purpose tools in MapServer and JGOFS/GLOBEC; Browser clients, also distributed; Ability to support other data management backends; Semantic elements (contributed vs standard names); Advanced search using triple stores from several sources; No login required; Access to metadata; Access to actual data; Data manager interface via Drupal; Direct transfer of data and metadata to appropriate national archive, such as NODC, when data are final.

Original picture from: http://www.bco-dmo.org/sites/default/files/BCO-DMO_System_Architecture.pdf as displayed on

10/10/2015

Krisnadhi, et al Diversity++ 2015 8 / 13

BCO-DMO

http://bco-dmo.org

PI of NSF-funded research expedition mustsubmit data from their expedition toBCO-DMO.

PI may bring own instruments.

Catalog of datasets, instrumentation systems, measurementparameters, investigators, organizations, funding awards, projects,programs, and deployments.

Deployments involve than just vessels (i.e., not just cruises).

>2.1 mil triples, 7,500 datasets including information about >1.7kresearchers, >2.1k deployments, 500 projects.

6.5k page views per month.

Krisnadhi, et al Diversity++ 2015 9 / 13

BCO-DMO

Krisnadhi, et al Diversity++ 2015 10 / 13

Overlaps

Only a few dozens oceanographic research vessels being deployed.

R2R is vessel-centric. BCO-DMO is PI-centric and has more than justcruise.

Overlapping set of people, cruise identifiers (linked between eachother).

341 person instances (exact match)

External links

R2R organization to dbpedia: 288/520BCO-DMO instruments to dbpedia: 42/409BCO-DMO organization to dbpedia: 81/488

Krisnadhi, et al Diversity++ 2015 11 / 13

Acknowledgements

GeoLink Project (NSF)

ISWC 2015 Travel Award

Krisnadhi, et al Diversity++ 2015 12 / 13

Thank you!

Krisnadhi, et al Diversity++ 2015 13 / 13