R2R+BCO-DMO – Linked Oceanographic Datasets
Adila Krisnadhi1,5 Robert Arko2 Suzanne Carbotte2 Cynthia Chandler3 Michelle Cheatham1 Pascal Hitzler1 Yingji Hu4
Krzysztof Janowicz4 Peng Ji2 Nazifa Karima1 Adam Shepherd3
Peter Wiebe3
1Data Semantics Lab, Wright State University
2Lamont-Doherty Observatory, Columbia University
3Woods Hole Oceanographic Institution
4Geography Department, University of California, Santa Barbara
5Faculty of Computer Science, Universitas Indonesia
Diversity++ 2015
Krisnadhi, et al Diversity++ 2015 1 / 13
Why Linked Data for Oceanography
Data proliferation
Increased number of repositories ⇒ increased heterogeneity.
Need to discover, access, and integrate data across repositories.
R2R & BCO-DMO are repositories.
Both hold datasets of field observations.
Linked data is for the metadata of those datasets.
Linked data objective: a starting point to enable dataset discovery.
Additional benefit: attribution of datasets to contributors in the form of links.
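A minimal sketch of the idea above: dataset metadata expressed as subject-predicate-object triples, with attribution recorded as a link from a dataset to a contributor. The `EX` namespace, the predicate names, and all identifiers are hypothetical and chosen only for illustration; they are not the vocabulary used by R2R or BCO-DMO.

```python
# Hypothetical namespace and identifiers, for illustration only.
EX = "http://example.org/"

# Each triple is (subject, predicate, object); objects may be links or literals.
TRIPLES = [
    (EX + "dataset/d1", EX + "type", EX + "Dataset"),
    (EX + "dataset/d1", EX + "collectedOn", EX + "cruise/c1"),
    (EX + "dataset/d1", EX + "hasContributor", EX + "person/p1"),
    (EX + "dataset/d2", EX + "type", EX + "Dataset"),
    (EX + "dataset/d2", EX + "collectedOn", EX + "cruise/c2"),
    (EX + "dataset/d2", EX + "hasContributor", EX + "person/p1"),
    (EX + "dataset/d3", EX + "type", EX + "Dataset"),
    (EX + "dataset/d3", EX + "hasContributor", EX + "person/p2"),
]


def datasets_by_contributor(triples, person):
    """Discover datasets by following attribution links to one contributor."""
    return sorted(
        s for s, p, o in triples
        if p == EX + "hasContributor" and o == person
    )


if __name__ == "__main__":
    for dataset in datasets_by_contributor(TRIPLES, EX + "person/p1"):
        print(dataset)
```

Because the contributor is itself an identifier rather than a free-text name, the same link serves both discovery (find datasets) and attribution (credit the person).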
Rolling Deck Repository (R2R)
Screen shot (10/10/2015) from: http://www.rvdata.us/catalog/Kilo_Moana
R2R
http://www.rvdata.us
Every NSF-funded cruise on a vessel in the academic fleet creates an R2R record.
Environmental sensor data from on-board vessel systems.
Catalog of vessels, instrumentation systems, expeditions, datasets, investigators, organizations, funding awards, cruise reports, and navigation tracks.
>530k triples, 25 in-service vessels, >4.3k cruises, >18 million archived files.
60,000 page views per month.
R2R: Architecture
Original picture from: http://www.rvdata.us/system/files/overview.png as displayed on 10/10/2015 at http://www.rvdata.us/overview
Biological and Chemical Oceanography Data Management Office (BCO-DMO)
Screen shot (10/10/2015) from: http://mapservice.bco-dmo.org/mapserver/maps-ol/index.php
BCO-DMO: Architecture
[Figure: BCO-DMO Data Management Architectural Overview, November 21, 2013]
Components labeled in the figure:
Metadata Database and Web Content
Data Server (three instances)
Geospatial Access: MapServer (cartography); OpenLayers (interface; interrogate and draw features); ExtJS and other JavaScript libraries (environment); MySQL (metadata)
BCO-DMO Website: public access via Drupal; web content and metadata
JGOFS/GLOBEC Backend Data Storage and Retrieval
Supporting Software: Drupal; PHP, Perl; load navigation and date information into Location table (Perl); report modules; NSF Tracker subsystem
Data Manager access: metadata and web content insert, update, delete, and display
Perl Library: Perl code calling REST API via Drupal
Highlights:
Text-based interface
Geospatial (MapServer) interface
Metadata database stored in Drupal CMS
Distributed backend data management system
Fitness-for-purpose tools in MapServer and JGOFS/GLOBEC
Browser clients, also distributed
Ability to support other data management backends
Semantic elements (contributed vs. standard names)
Advanced search using triple stores from several sources
No login required
Access to metadata
Access to actual data
Data manager interface via Drupal
Direct transfer of data and metadata to the appropriate national archive, such as NODC, when data are final
Original picture from: http://www.bco-dmo.org/sites/default/files/BCO-DMO_System_Architecture.pdf as displayed on 10/10/2015
BCO-DMO
http://bco-dmo.org
The PI of an NSF-funded research expedition must submit data from that expedition to BCO-DMO.
The PI may bring their own instruments.
Catalog of datasets, instrumentation systems, measurement parameters, investigators, organizations, funding awards, projects, programs, and deployments.
Deployments involve more than just vessels (i.e., not just cruises).
>2.1 million triples, 7,500 datasets, including information about >1.7k researchers, >2.1k deployments, and 500 projects.
6.5k page views per month.
Overlaps
Only a few dozen oceanographic research vessels are deployed.
R2R is vessel-centric. BCO-DMO is PI-centric and covers more than just cruises.
Overlapping sets of people and cruise identifiers (linked to each other).
341 person instances (exact match)
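A rough sketch of how exact-match linking between two repositories could work. The slides do not say which field was compared, so this assumes matching on a whitespace- and case-normalized full name; the record layout, URIs, and function names are hypothetical.

```python
def normalize(name):
    """Collapse whitespace and ignore case so trivial variants compare equal."""
    return " ".join(name.split()).casefold()


def exact_matches(people_a, people_b):
    """Return (uri_a, uri_b) pairs whose normalized names are identical.

    Each input is a list of (uri, name) tuples. Names that occur more than
    once within a repository are skipped as ambiguous.
    """
    def unique_index(people):
        index, seen_twice = {}, set()
        for uri, name in people:
            key = normalize(name)
            if key in index:
                seen_twice.add(key)
            index[key] = uri
        return {k: v for k, v in index.items() if k not in seen_twice}

    index_a = unique_index(people_a)
    index_b = unique_index(people_b)
    return sorted((index_a[k], index_b[k]) for k in index_a.keys() & index_b.keys())


if __name__ == "__main__":
    # Hypothetical records, for illustration only.
    repo_a = [("a:person/1", "Jane  Doe"), ("a:person/2", "John Roe")]
    repo_b = [("b:person/9", "jane doe"), ("b:person/8", "J. Roe")]
    print(exact_matches(repo_a, repo_b))
```

Exact matching is conservative: abbreviated or reordered names (such as "J. Roe" above) are left unlinked rather than risking a wrong link.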
External links
R2R organizations to DBpedia: 288/520
BCO-DMO instruments to DBpedia: 42/409
BCO-DMO organizations to DBpedia: 81/488
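As a worked check on the figures above, the short sketch below converts each count into a coverage percentage. It assumes each pair reads as "linked instances / total instances"; the labels and the `coverage` helper are illustrative names, not part of either repository.

```python
# (linked, total) pairs taken from the slide; labels are illustrative.
LINKS = {
    "R2R organizations": (288, 520),
    "BCO-DMO instruments": (42, 409),
    "BCO-DMO organizations": (81, 488),
}


def coverage(linked, total):
    """Percentage of instances that have an external DBpedia link."""
    return 100.0 * linked / total


if __name__ == "__main__":
    for label, (linked, total) in LINKS.items():
        print(f"{label}: {linked}/{total} = {coverage(linked, total):.1f}%")
```

Under that reading, roughly 55.4% of R2R organizations, 10.3% of BCO-DMO instruments, and 16.6% of BCO-DMO organizations carry a DBpedia link.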
Acknowledgements
GeoLink Project (NSF)
ISWC 2015 Travel Award