32
| brian m. bot| senior scientist | community manager | Synapse enabling transparent, reproducible research | michael kellen | director, technology platform and services |

enabling transparent, reproducible research

Embed Size (px)

Citation preview

Page 1: enabling transparent, reproducible research

| brian m. bot| senior scientist | community manager |

Synapseenabling transparent, reproducible research

| michael kellen | director, technology platform and services |

Page 2: enabling transparent, reproducible research

a tool to improve transparency and reproducibility of data intensive

science by recording analyses in real-time

Synapse

Page 3: enabling transparent, reproducible research

a collection of living research projects enabling researchers to contribute to large-scale

collaborative science pre- and post-publication

Synapse

Page 4: enabling transparent, reproducible research

Attractor Metagenes

Columbia Professor Dimitris Anastassiou

MPEG-2 compression of digital audio and video signals

Modules of co-expressed genes shared across cancers

Belief that these ‘attractors’ represent underlying biological mechanisms (bioinformatic ‘hallmarks of cancer’1)

1D. Hanahan, R. A. Weinberg. Hallmarks of cancer: The next generation. Cell 144, 646–674 (2011)

Page 5: enabling transparent, reproducible research

21 february 2013

17 april 2013

Page 6: enabling transparent, reproducible research
Page 7: enabling transparent, reproducible research

21 february 2013

17 april 2013

???

Page 8: enabling transparent, reproducible research
Page 9: enabling transparent, reproducible research
Page 10: enabling transparent, reproducible research

...

Page 11: enabling transparent, reproducible research

...

Page 12: enabling transparent, reproducible research

TCGA Pan-Cancer Consortium

Attractor Metagenes openly evolving research projects

collaboration around common data

Page 13: enabling transparent, reproducible research

Omberg,  et  al.  Nature  Gene*cs

•Analysis of: 12 Tumor types, 6 molecular profiling platforms •Focus series of: 4 papers in Nature Genetics, with 14 more to follow in other NPG journals

TCGA Pan-Cancer Consortium

Page 14: enabling transparent, reproducible research

18papers in press

Page 15: enabling transparent, reproducible research

68core projects

Page 16: enabling transparent, reproducible research

248researchers

Page 17: enabling transparent, reproducible research

28institutions

Page 18: enabling transparent, reproducible research

1070datasets

Page 19: enabling transparent, reproducible research

1723results

Page 20: enabling transparent, reproducible research

versioned data, analysis freezes

Page 21: enabling transparent, reproducible research

data versioning versus data provenance

Page 22: enabling transparent, reproducible research

TCGA Pan-Cancer Consortium collaboration around common data

CRC Subtyping Consortiumcollaboration around common question

Page 23: enabling transparent, reproducible research

CRC Subtyping Consortium

Page 24: enabling transparent, reproducible research

A

B

C

D

E

F

1

2

3

4

5

6

datasets subtypesanalysis groups

Page 25: enabling transparent, reproducible research

A

B

C

D

E

F

1

2

3

4

5

6

datasetsanalysis groups

G ...

subtypes

Page 26: enabling transparent, reproducible research

A

B

C

D

E

F

1

2

3

4

5

6

datasetsanalysis groups

G ...

subtypes

Page 27: enabling transparent, reproducible research

analysis groups

G

Page 28: enabling transparent, reproducible research

A

B

C

D

E

F

1

2

3

4

5

6

datasetsanalysis groups

G ...

subtypes

Page 29: enabling transparent, reproducible research

CRC Subtyping Consortium

Phase I: per-group subtyping ‣ subtyping calls on common data ‣ assess agreement between methods ‣ assess associations with phenotypic traits

Phase II: meta-analysis and de novo subtyping ‣ consensus subytping ‣ assess associations with clinical outcomes ‣ strategy for adoption from clinicians

Page 30: enabling transparent, reproducible research

enables transparency and reproducibility

facilitates large scale collaboration

encourages communication pre- and post-publication

summary

Page 31: enabling transparent, reproducible research

commenting / peer review mechanisms

recognition metrics for individuals and teams

deeper integration with cloud compute services

project snapshots linked to publications

future directions

Page 32: enabling transparent, reproducible research

Acknowledgements

Sage Bionetworks Synapse Development Team Alfred P. Sloan Foundation Nature Publishing Group

AAAS-Science PLoS