38
Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT eter Fox (RPI/ Tetherless World Constellation and WHOI/AOP&E) [email protected] (@taswegian) and collaborators on the SF INTEROP ECO-OP project. http://tw.rpi.edu/web/project/ECOO

Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Embed Size (px)

Citation preview

Page 1: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Toward verifiable science: iPython meets PROV-O

(Semantics in Ecosystems Assessments).

April 16, 2014 ERRT

Peter Fox (RPI/ Tetherless World Constellation and WHOI/AOP&E) [email protected] (@taswegian) and collaborators on the NSF INTEROP ECO-OP project. http://tw.rpi.edu/web/project/ECOOP

Page 2: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Stace Beaulieu

Page 3: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Marine ecosystems

Page 4: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Stace Beaulieu

Page 5: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Stace Beaulieu

Page 6: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Fish, science, decision

Page 7: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Stace Beaulieu

Page 8: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation
Page 9: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Why important?

Stace Beaulieu

Page 10: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Vision?

• “Our vision is to develop, facilitate, and maintain sustained multi-way engagement of natural and social scientists and many practitioners in multi-scale local to global networks for LMEs”.

• Goal: We want to perform routine assessments of LMEs involving all (or as many) stakeholders and we want robust science data presented in forms that various end-users can consume… and verify

Page 11: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Environmental Assessment

Page 12: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Northeast Status Report

?

Page 13: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Generation

Stace Beaulieu

Page 14: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Tracing

Stace Beaulieu

Page 15: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

W3C PROV

Stace Beaulieu

Page 16: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Past Experience with PROV

Stace Beaulieu

Page 17: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Extended Use Case

Stace Beaulieu

Page 18: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Stace Beaulieu

Page 19: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Instrument a tool - iPython

Stace Beaulieu

Page 20: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Stace Beaulieu

Page 21: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Stace Beaulieu

Page 22: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

http://ipython.org/Screenshot of IPython Notebook used to track both data and workflow

provenance

Code inPython,Matlab,R, other Stace Beaulieu

Page 23: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

http://ipython.org/Screenshot of IPython Notebook used to track both data and workflow

provenance

Notebook can be shared, or output as script, HTML, PDF,other Stace Beaulieu

Page 24: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

PDF output of IPython Notebook with clickable links to data and code

Stace Beaulieu

Page 25: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Screenshot of csv file at GitHub

Access not only to the data that are plotted, but also to provenance metadata for reproducibility

Stace Beaulieu

Page 26: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Outcomes

• Implemented PROV-O in iPython NoteBook*

• Extended GCIS ontology for ECOOP to include dataset generation

• Improved the practice of annotation and generation of report materials

Page 27: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Beaulieu - Indicators

Stace Beaulieu

Page 28: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Indicators

Stace Beaulieu

Page 29: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Current (Proposal)

Stace Beaulieu

Page 30: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Indicators Explained

Stace Beaulieu

Page 31: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Next … iPython meets NCA

NCA=National Climate Assessment

Stace Beaulieu

Page 32: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Thanks!

Page 33: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

If needed

Page 34: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Informatics enables a new approach

• Use cases• Stakeholders• Distributed

authority• Access control• Ontologies• Maintaining

Identity

Page 35: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Framework - DPSIR

Page 36: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Semantics of DPSIR?

Page 37: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Drivers/ Pressures

• Physical Drivers– North Atlantic Oscillation– Atlantic Multi-decadal Oscillation

• Human Drivers– Population– Income

• Human Pressures (Fishery Removals)

– Number Groundfish Vessels

– Landings, Principal Groundfish

– Landings, Other Fish

– Landings, Small Pelagics

– Landings, Crustaceans

– Landings, Molluscs

• Temperature– Extended Reconstructed SST

– Coastal Temperature, Virginia

– Coastal Temperature, Woods Hole

– Costal Temperature, Boothbay Harbor

– Survey sea surface temperature

– Survey bottom sea temperature

– Thermal Habitat <4oC

– Thermal Habitat >5oC and <15 oC

– Thermal Habitat >16oC

• River Discharge– River Flow-Gulf of Maine

– River Flow-Middle Atlantic Bight

– River Flow-Southern New England

• Wind Fields– Wind Stress, Cape Hatteras

– Wind Stress, New York

– Wind Stress, Georges Bank

– Wind Stress East-West, Cape Hatteras

– Wind Stress East-West, New York

– Wind Stress East-West, Georges Bank

– Wind Stress North-South, Cape Hatteras

– Wind Stress North-South, New York

– Wind Stress North-South, Georges Bank

• Other– Stratification

– Survey surface salinity

– Survey bottom salinity

– Gulf Stream Location

– %Labrador-Subarctic Slope Water in GoM

Page 38: Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation

Ecosystem State Variables

Plankton

•Continuous Plankton Recorder Color Index.

•Zooplankton Ecosystem Biovolume

•Ratio of Small to Large Zooplankton

Nekton/Benthos

•Relative Abundance, Crustaceans

•Relative Abundance, Elasmobranch

•Relative Abundance, Ground Fish

•Relative Abundance, Molluscs

•Relative Abundance, Other Fish

•Relative Abundance, Small Pelagics

•Relative Abundance, All Species

Demography/Trophic Level•Mean Trophic Level Catch•Mean Trophic Level Survey•Primary Production Required, Landings•Mean Length Community Composition•Thermal Preference•Pelagic to Demersal Ratio•Elasmobranch to Demersal Groundfish Ratio•Impacts •Groundfish Fishery Revenue