32
@nataliestanford [email protected] k SEEKing our way to better presentation of data and models from scientific investigations.

SEEKing our way to better presentation of data and models from scientific investigations

Embed Size (px)

Citation preview

Page 1: SEEKing our way to better presentation of data and models from scientific investigations

@nataliestanford

[email protected]

SEEKing our way to better presentation of data and models

from scientific investigations.

Page 2: SEEKing our way to better presentation of data and models from scientific investigations

Carole Goble

Stuart Owen

Jacky Snoep

Wolfgang Mueller

Olga Krebs Quyen Nguyen

Natalie Stanford

Katy WolstencroftPeter Kunszt Bernd Rinn

also contributing:VLN SEEK team

also contributing:UK SEEK team

Page 3: SEEKing our way to better presentation of data and models from scientific investigations

Systems biology projects produce complex and heterogeneous datasets.

Page 4: SEEKing our way to better presentation of data and models from scientific investigations

The data is saved and stored in convenient, but non-standard formats.

Page 5: SEEKing our way to better presentation of data and models from scientific investigations

This is the case for each researcher within groups across large consortia

projects.

Consortia

Grp 3

Grp 1

Grp 2

Page 6: SEEKing our way to better presentation of data and models from scientific investigations

The data contained within the files can be very ambiguous.

Page 7: SEEKing our way to better presentation of data and models from scientific investigations

Sharing within labs, across projects, and publicly becomes difficult.

Page 8: SEEKing our way to better presentation of data and models from scientific investigations

The availability and reusability of the data in the long-term is compromised.

Page 9: SEEKing our way to better presentation of data and models from scientific investigations

This all leads to issues with conveying what a project has achieved to

funders. • Papers?• Data produced?• Discoveries?• Presentations?• Workshops?• Tutorials?

Defining success and impact of

project.

Page 10: SEEKing our way to better presentation of data and models from scientific investigations

We need better ways of formatting, storing, and sharing

data and models.

Page 11: SEEKing our way to better presentation of data and models from scientific investigations

SEEK is a commons originally designed for centralizing information and assets for large

consortia projects.

Page 12: SEEKing our way to better presentation of data and models from scientific investigations

Each user has their own profile.

Page 13: SEEKing our way to better presentation of data and models from scientific investigations

…and their data and models are uploaded to projects within the SEEK database.

Page 14: SEEKing our way to better presentation of data and models from scientific investigations

SEEK has varied functionality.

Yellow pages, manage SOPs and

link to investigations, studies, assays, specimens and

samples.

Find my peers.

Creating and sharing SOPs

across projects.

Track my specimens.

Track different

versions of my model.

Data viewing functionality; ISA

framework for linking studies to

data, models, SOPs, samples,

publications.

Browse experimental data

without downloading

them.

How data, models and SOPs fit

together.

Which data belong with

which publication.

Page 15: SEEKing our way to better presentation of data and models from scientific investigations

It works as aggregated asset manager, allowing storage on SEEK, or linking assets

from disparate databases.

Page 16: SEEKing our way to better presentation of data and models from scientific investigations

It allows published work and all associated data and files to be organised in an ISA (Investigation, Study, Assay) format.

Page 17: SEEKing our way to better presentation of data and models from scientific investigations

Construction Validation

Metabolomics

Metabolomics

Mass SpecTranscriptomics

Proteomics

Fluxomics

Investigations

Studies

AssaysTowards Interoperable Bioscience Data, Nature Genetics, 2012

Assays

The ISA structure reflects an intuitive structure and storage of scientific findings.

Page 18: SEEKing our way to better presentation of data and models from scientific investigations

SEEK also integrates with other tools.

Page 19: SEEKing our way to better presentation of data and models from scientific investigations

Have now set up FAIRdom to further develop SEEK as an open platform where all assets can be uploaded and linked to

with DOI.

Page 20: SEEKing our way to better presentation of data and models from scientific investigations

“There is no greater impediment to the advancement of knowledge than

the ambiguity of words.”

-Thomas Reid + Natalie Stanford

Data + Models.

Page 21: SEEKing our way to better presentation of data and models from scientific investigations

The data contained within the files can be very ambiguous.

Page 22: SEEKing our way to better presentation of data and models from scientific investigations

There are many Systems Biology standards available.

MinimalInformationModels

Standard Formats

Ontologies

Data Models Simulation Results

[Nicolas Le Novere]

MAGE-TABStandardFormats

RDF annotations

Page 23: SEEKing our way to better presentation of data and models from scientific investigations

..But, the barrier to standard formats and annotation usage by researchers can seem great.

Page 24: SEEKing our way to better presentation of data and models from scientific investigations

There are tools available to assist users.

Page 25: SEEKing our way to better presentation of data and models from scientific investigations

We develop RightField, a semantic annotation tool for data files.

Page 26: SEEKing our way to better presentation of data and models from scientific investigations

We use it to generate templates for different types of assay data.

Excel workbook loaded into RightField with multiple worksheets

Page 27: SEEKing our way to better presentation of data and models from scientific investigations

Suitable ontologies are selected and used to annotate cells for associated data input.

Selected parent term from the ontology

Methods for specifying ontology terms

Term lists for selected cells

Value Type and Property

Page 28: SEEKing our way to better presentation of data and models from scientific investigations

Scientists are able to use the templates in Excel, where the annotations take the form of drop down menus or data entry

cells.

Page 29: SEEKing our way to better presentation of data and models from scientific investigations

The usage of tools like RightField are reducing the uptake barriers for generating formatted and annotated data and models.

Page 30: SEEKing our way to better presentation of data and models from scientific investigations
Page 31: SEEKing our way to better presentation of data and models from scientific investigations

“Ruin is the destination toward which all men rush, each pursuing his own best interest in a society that believes in the the freedom

of the commons.”

- Garrett Hardin, The Tragedy of the Commons.

Page 32: SEEKing our way to better presentation of data and models from scientific investigations

To find out more about FAIRdom please visit our website.

www.fair-dom.org