FAIR Data, Operations and Model management for Systems Biology and Systems Medicine Projects

Prof Carole Goble, The FAIRDOM Consortium
[email protected]
http://fair-dom.org, http://fairdomhub.org

1st Conference of the European Association of Systems Medicine, 26-28 October 2016, Berlin

Page 2

Asset Management and Sharing

• Access to publicly funded research
• Reproducible results
• Value and cite all research outcomes
• Sustained data infrastructure

Page 3

Findable

Accessible

Interoperable

Reusable

(Intelligible)

(Reproducible)

(Citable)

(Trackable)

https://www.force11.org/group/fairgroup/fairprinciples

Page 4

Projects ... and Programmes ... funder and research project legacy

P1. BaCell-SysMO: The transition from growing to non-growing Bacillus subtilis cells - A systems biology approach
P2. COSMIC: Systems Biology of Clostridium acetobutylicum - a possible answer to dwindling crude oil reserves
P3. SUMO: Systems Understanding of Microbial Oxygen Responses (Escherichia coli)
P4. KOSMOBAC: Ion and solute homeostasis in enteric bacteria (Escherichia coli)
P5. SysMO-LAB: Comparative Systems Biology: Lactic Acid Bacteria: Lactococcus lactis, Enterococcus faecalis, Streptococcus pyogenes
P6. PSYSMO: Systems analysis of biotech induced stresses: towards a quantum increase in process performance in the cell factory Pseudomonas putida
P7. SCaRAB: Systems Biology of a genetically engineered Pseudomonas fluorescens with inducible exo-polysaccharide production: analysis of the dynamics and robustness of metabolic networks
P8. MOSES: MicroOrganism Systems Biology: Energy and Saccharomyces cerevisiae
P9. TRANSLUCENT: Gene interaction networks and models of cation homeostasis in Saccharomyces cerevisiae
P10. STREAM: Global metabolic switching in Streptomyces coelicolor
P11. SulfoSYS: Silicon cell model for the central carbohydrate metabolism of the archaeon Sulfolobus solfataricus under temperature variation
P12. SysMO-DB: Data management group

• Reuse
• Compliance
• Retention
• Dissemination
• Collaboration
• Reproducibility
• Resource & Skills Limitations

Page 5

Findable, Accessible, Interoperable, Reusable

Data, Operations, Models

Sponsors

Page 6

FAIRDOM Association e.V. Partners
open innovation, not for profit

LifeGlimmer GmbH, SB-ScienceManagement GmbH, New Forest Ventures Ltd

Page 7

FAIRDOM Pillars

Project Support

Community Actions

Platforms, Tools

Public Project Commons

Page 8

Systems Approach… people, assets, processes pragmatics

• Multiple, interrelated assets
• Multiple, dispersed repositories
• Multi-partner, -discipline projects
• Team science practices
• Experiment – Asset lifecycles
• Academic innovation drivers

Page 9

Multiple, interrelated assets
structured formats, standards, ontologies, context

• Analytics & Pipelines
• Literature
• SBML, CellML, PharmML; Matlab, Mathematica; Fortran, R, Python
• SOPs
• Multiple omics: genomics, transcriptomics, proteomics, metabolomics, fluxomics, reactomics
• Images; reaction kinetics; samples, specimens, strains; human data
• STANDARDS; versioning; tracking: provenance, parameters, citation

Operations, Data, SOPs, Models

Page 10

FAIR Data and Metadata Standards that help to improve understanding and exchange….

Nicolas Le Novère, Babraham Institute, UK.

Page 11

…researchers do not always use them....

Format, Metadata, Metadata Ontologies

*top three most popular

The evolution of standards and data management practices in systems biology (2015). Stanford et al, Molecular Systems Biology, 11(12):851

Page 12

… makes model reuse tricky…

Stanford et al The evolution of standards and data management practices in systems biology, Molecular Systems Biology (2015) 11: 851 DOI 10.15252/msb.20156053

Page 13

Multi Repository Repertoire
Access, Reuse

• Specialist Public Repositories
• General archives
• Local Data Stores

The evolution of standards and data management practices in systems biology (2015). Stanford et al, Molecular Systems Biology, 11(12):851

Page 14

sharing/publishing assets in public archives…

Data Models

*top three most popular

The evolution of standards and data management practices in systems biology (2015). Stanford et al, Molecular Systems Biology, 11(12):851

Page 15

Multi-partner, multi-disciplinary projects
where sharing and metadata collection isn’t second nature

Consortia: Grp 1, Grp 2, Grp 3

Page 16

Multi-partner, multi-disciplinary projects
SOPs and Yellow Pages top request…

• Who is working with which organism?
• What methods are being used to determine enzyme activity?
• Under which experimental conditions are my partners working for the measurement of glucose concentration?
• What is the provenance of the parameters for this version of the model?
• What SOP was used for this sample?
• Where is the validation data for this model?
• Is there any group generating kinetic data?
• Is this data available?
• Track versions of my model
• What's the relationship between the data and model?
• Which data belong to which publications?

Page 17

Downstream assets discovery and sharing

Organisation, Communication, Dissemination

• Navigate through assets
• Reuse later
• Enable team to reuse/reproduce
• Help others find out
• Reuse with new partners
• Tell more, take credit

Standardised metadata practices

Assets

Page 18

Find… own hard disk for storage…

The evolution of standards and data management practices in systems biology (2015). Stanford et al, Molecular Systems Biology, 11(12):851

Page 19

Samiul Hasan, GSK
Biocuration need in Pharma: Drivers from a Translational Bioinformatics Perspective, Poster S16

Page 20

The FAIR Project

Challenge
Track collection of data and metadata X X
Maintain the experimental context X
Find and exchange assets X X
Retain results beyond a project X X
Share, disseminate and publish assets sensitively X X X
Consistent reporting for interpretation, interoperability & comparison X X
Promote standardised metadata practices X X
Organise and link assets X X
Reuse tools and community archives X
Respect local and legacy solutions X X
Support reproducible publications X X X X
Credit owners X X

Page 21

FAIRDOM Pillars

Project Support

Community Actions

Platforms, Tools

Public Project Commons

Page 22

Community, Knowledge Hub
http://www.fair-dom.org

Know-how, Guides, Templates, Workshops, Training, Webinars, Standards and Policy Forums

Page 23

Project Support
Processes, Practices, People… take time and persuasion

• Community support
• Special project support
• Special project support
• PALs project ambassadors
• best practices, forums, training
• curation handholding
• SBML model technical curation

Page 24

Asset Management Platforms
an ecosystem of resources

Front end (http://seek4science.org)
• Web based rich interface
• Catalogue and Commons
• All about the metadata
• Results repository

Back end (https://sis.id.ethz.ch/software/openbis.html)
• Scaled LIMS and analytics
• Auto-archiving
• Instruments data repository

Page 25

A community Commons….self managed workspaces

Controlled sharing and publishing

Page 26

FAIR Play Practices

• Licenses
• Negotiated access
• Embargos
• Permission controls
• Staged sharing
• Private walled gardens

Using FAIRDOM my own lab colleagues saw what I was doing and called to collaborate!

Jurgen Haanstra, Vrije Universiteit Amsterdam, Netherlands
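
The sharing controls listed on this slide (permission controls, embargoes, staged sharing, private walled gardens) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the `Access` levels, the `SharingPolicy` class and the group names are hypothetical and are not the data model of any FAIRDOM platform.

```python
from dataclasses import dataclass, field
from datetime import date
from enum import IntEnum
from typing import Dict, Optional


class Access(IntEnum):
    NONE = 0      # asset is not visible at all
    VIEW = 1      # metadata visible, content not downloadable
    DOWNLOAD = 2  # content can be downloaded


@dataclass
class SharingPolicy:
    """Hypothetical per-asset policy: private by default, opened in stages."""
    public_access: Access = Access.NONE
    group_access: Dict[str, Access] = field(default_factory=dict)
    embargo_until: Optional[date] = None  # public access is withheld before this date

    def access_for(self, group: Optional[str], today: date) -> Access:
        """Effective access for a member of `group` (None means anonymous)."""
        public = self.public_access
        if self.embargo_until is not None and today < self.embargo_until:
            public = Access.NONE
        granted = self.group_access.get(group, Access.NONE) if group else Access.NONE
        return max(public, granted)


# Staged sharing: own lab first, consortium may view, public after the embargo.
policy = SharingPolicy(
    public_access=Access.DOWNLOAD,
    group_access={"own-lab": Access.DOWNLOAD, "consortium": Access.VIEW},
    embargo_until=date(2017, 1, 1),
)

print(policy.access_for("consortium", date(2016, 10, 26)).name)
print(policy.access_for(None, date(2017, 6, 1)).name)
```

The sketch keeps the embargo separate from group permissions so that collaborators can work with an asset before it becomes public, which is one plausible reading of "staged sharing".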

Page 27

Investigation
Study
Analysis (Assay)
Data, Model, SOP

… organised in an ISA (Investigation, Study, Assay/Analysis) format.
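
As a rough illustration of the ISA organisation named above, the following Python sketch nests assets (data, models, SOPs) under assays, assays under studies, and studies under an investigation. The class names, fields and example titles are assumptions for illustration only; they do not reproduce the ISA specification or any FAIRDOM schema.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Asset:
    kind: str   # "data", "model" or "sop"
    title: str


@dataclass
class Assay:
    title: str
    assets: List[Asset] = field(default_factory=list)


@dataclass
class Study:
    title: str
    assays: List[Assay] = field(default_factory=list)


@dataclass
class Investigation:
    title: str
    studies: List[Study] = field(default_factory=list)

    def assets_of_kind(self, kind: str) -> List[Asset]:
        """Walk Investigation -> Study -> Assay and collect matching assets."""
        return [
            asset
            for study in self.studies
            for assay in study.assays
            for asset in assay.assets
            if asset.kind == kind
        ]


# Hypothetical example content, not taken from any real project.
investigation = Investigation(
    title="Example investigation",
    studies=[
        Study(
            title="Example study",
            assays=[
                Assay("Experimental assay",
                      [Asset("data", "Measurements"), Asset("sop", "Protocol")]),
                Assay("Modelling assay",
                      [Asset("model", "Kinetic model"), Asset("data", "Simulation output")]),
            ],
        )
    ],
)

data_titles = [asset.title for asset in investigation.assets_of_kind("data")]
print(data_titles)
```

Because every asset hangs off an assay, a question such as "which data belong to this investigation?" becomes a simple traversal, and the experimental context travels with each asset.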

Page 28

Linking, “Packaging” & Citing Codes, Data, Models, SOPs, Samples, Strains, Articles, People, Projects….

• Packaging
• Retaining Context
• Supporting Decision making

Page 29

INVESTIGATION, STUDY, ASSAY

Experimental assay

Modeling assay

Publication

[Maksim Zakhartsev]

Page 30

... a “Research Object” Catalogue
metadata aggregated across repositories
retaining context to support decision making and reuse

Local Stores

External Databases

Publishing services

Secure Stores

Model Resources

Page 31

… with integrated tooling
metadata annotation against standards
model validation, comparison and simulation

• SBML Model simulation
• Model comparison
• Model versioning
• Reproducing simulations

[Jacky Snoep, Dagmar Waltemath, Martin Peters, Martin Scharm]

Page 32

Retaining context, supporting decision making
Towards data harmonisation and indexing

[Susanna Sansone]

Page 33

Stealthy Ramps for helping with Metadata Standards
Tooling for annotations and templates for different types of assay data, towards data harmonisation. Incentive by side effect.

Embed ontologies into Excel templates

Excel spreadsheets enriched with ontology annotations

Upload, extract metadata and register

http://www.rightfield.org.uk
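
The idea on this slide, spreadsheet cells restricted to ontology terms so that metadata can be extracted on upload, can be sketched in Python without reading any Excel file. This is an illustration under stated assumptions: the `TEMPLATE` vocabularies and the `check_rows` helper are hypothetical and do not represent RightField's actual format or API.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical controlled vocabularies; a real template would draw them from an ontology.
TEMPLATE: Dict[str, Set[str]] = {
    "organism": {"Escherichia coli", "Saccharomyces cerevisiae"},
    "assay_type": {"transcriptomics", "proteomics", "metabolomics"},
}


def check_rows(rows: List[Dict[str, str]]) -> Tuple[List[Dict[str, str]], List[str]]:
    """Split rows into those that conform to the template and a list of problems."""
    valid: List[Dict[str, str]] = []
    problems: List[str] = []
    for index, row in enumerate(rows, start=1):
        bad = [
            f"row {index}: {column}={row.get(column)!r} is not an allowed term"
            for column, allowed in TEMPLATE.items()
            if row.get(column) not in allowed
        ]
        if bad:
            problems.extend(bad)
        else:
            valid.append(row)
    return valid, problems


rows = [
    {"organism": "Escherichia coli", "assay_type": "proteomics"},
    {"organism": "E. coli", "assay_type": "proteomics"},  # free text, not a template term
]
valid, problems = check_rows(rows)
print(len(valid), problems)
```

Constraining the choice at entry time, rather than cleaning free text afterwards, is what makes the annotation a side effect of filling in the sheet.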

Page 34

Exchange and Publishing
Supplementary information

Annotation file

Stoichiometric matrix

SBML Stationary fluxes

[Maksim Zakhartsev]

Page 35

https://doi.org/10.15490/seek.1.investigation.56

Penkler et al (2015) FEBSJ 282:1481-1511.

Page 36

Reproducible Exchange and Publishing
and better credit

reviewer

Author List: Joe Bloggs; Jane Doe
Title: My Investigation
Date: September 2016
DOI: https://doi.org/10.15490/seek##

information travels with the data and models

Page 37

FAIRDOM-SEEK local or public commons

*Troup, E.; Clark, I; Swain, P; Millar, AJ; Zielinski, T (2015) Practical evaluation of SEEK and openBIS for biological data management in SynthSys http://hdl.handle.net/1842/12236

FAIRDOMHub.org

Vrije Universiteit

Yellow Pages

Page 38

IMOMESIC pathway
Integrating Modelling of Metabolism and Signalling towards an Application in Liver Cancer
https://fairdomhub.org/projects/24

[Ursula Klingmüller, Martin Böhm]

Page 39

What about FAIR Systems Medicine?
Olaf Wolkenhauer et al, Enabling multiscale modeling in systems medicine, Genome Medicine 2014 6:21*

1. Samples

2. Access to sensitive data

3. Multi-models

*DOI: 10.1186/gm538, http://genomemedicine.com/content/6/3/21

Page 40

Samples metadata framework
BBMRI, ELIXIR, Biosamples, FAIRDOM, UKCRC Tissue Directory, UK Synthetic Biology Centres

User defined sample models

Interlinking between sample types

Sample type defines a sharable standard

Template tooling

Auto extraction

Tied to assay processes
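
A user-defined sample type that acts as a shareable standard, with links between sample types, might look like the following Python sketch. The `SampleType` and `Sample` classes, their fields and the example values are assumptions made for illustration and are not the framework's real schema.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class SampleType:
    """Hypothetical user-defined type: a name, required attributes, optional parent type."""
    name: str
    required: List[str]
    parent_type: Optional[str] = None  # name of the sample type this one derives from


@dataclass
class Sample:
    sample_type: SampleType
    attributes: Dict[str, str] = field(default_factory=dict)
    derived_from: Optional["Sample"] = None

    def missing_attributes(self) -> List[str]:
        """Required attributes of the type that this sample does not provide."""
        return [name for name in self.sample_type.required if name not in self.attributes]

    def link_is_consistent(self) -> bool:
        """Check that the parent sample has the type the sample type expects."""
        expected = self.sample_type.parent_type
        if expected is None:
            return self.derived_from is None
        return self.derived_from is not None and self.derived_from.sample_type.name == expected


# Illustrative types and values only.
culture_type = SampleType("culture", required=["strain", "medium"])
extract_type = SampleType("extract", required=["method"], parent_type="culture")

culture = Sample(culture_type, {"strain": "example strain", "medium": "example medium"})
extract = Sample(extract_type, {}, derived_from=culture)

print(extract.missing_attributes(), extract.link_is_consistent())
```

Keeping the required attributes on the type rather than on each sample is what lets a type be shared between groups as a common reporting standard.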

Page 41

FAIR Sensitive Data, certified repositories
walled gardens and registration flags for Catalogue
legal restrictions for sharing anonymised and non-anonymised data

• Open Data: register metadata, upload data, register link
• Closed Data: register access method, register metadata
• Closed Data: register access method, local AAI service, register metadata
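
The registration pattern on this slide, where the catalogue always holds metadata but holds either a link to open content or only an access method for closed content, can be sketched as follows. The `CatalogueEntry` class, the `register` function and the example.org addresses are hypothetical and serve only to make the distinction concrete.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class CatalogueEntry:
    metadata: Dict[str, str]
    is_open: bool
    content_link: Optional[str] = None   # where open data can be fetched
    access_method: Optional[str] = None  # how access to closed data is requested


def register(metadata: Dict[str, str], is_open: bool,
             content_link: Optional[str] = None,
             access_method: Optional[str] = None) -> CatalogueEntry:
    """Register an asset; closed data contributes metadata and an access method only."""
    if is_open and not content_link:
        raise ValueError("open data needs a link to the uploaded content")
    if not is_open and not access_method:
        raise ValueError("closed data needs a registered access method")
    if not is_open and content_link:
        raise ValueError("closed data must not expose a direct content link")
    return CatalogueEntry(dict(metadata), is_open, content_link, access_method)


# Placeholder values for illustration.
open_entry = register({"title": "Open dataset"}, is_open=True,
                      content_link="https://example.org/data/1")
closed_entry = register({"title": "Patient-derived dataset"}, is_open=False,
                        access_method="request via data access committee")

print(open_entry.content_link, closed_entry.access_method)
```

In this sketch a closed dataset stays findable through its metadata while the data itself never enters the catalogue.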

Page 42

Model Laissez-Faire

• Navigation between
• Single standards at 1 scale
• Multi-model hosting

Linking models…

• connecting (experimental/simulation) data to models
• connecting the single standards?
• interfacing between the different scales?

Page 43

In summary… Pragmatic FAIR support for projects
people, assets, processes

• Multiple, interrelated assets
• Multiple, dispersed repositories
• Multi-partner, -discipline projects
• Multiple community tools
• Team science practices
• Experiment – Asset lifecycles
• Academic innovation drivers

ISA structured “Research Objects”

Repository spanning catalogue metadata

Standards-based tools

Page 44

Challenges to FAIR Asset Management

Free Puppies

Page 45

FAIR Play
microscopes -> data scopes, sharing citizenship, incentives by side effects

• PI leadership
• Sticking to conventions
• Local responsibility
• Time and resource
• Curation recognition

Trust
• Tribal trading behaviours
• Enclave sharing
• Not public donation
• Reciprocity & credit

Drivers …
• External dominate
• Personal productivity

affecting behavioural change through libertarian paternalism

[Kristian Garza]

Page 46

Jon Olav Vik, Norwegian University of Life Sciences
Maksim Zakhartsev, University of Hohenheim, Stuttgart, Germany
Alexey Kolodkin, Siberian Branch, Russian Academy of Sciences
Tomasz Zieliński, SynthSys Centre, University of Edinburgh, UK
Martin Peters, Martin Scharm, Systems Biology Bioinformatics, University of Rostock, Germany

Page 47

Developers Foundry
Support developers of Systems Biology tools and platforms

3rd Foundry meeting, Dec 1-2 2016, Frankfurt