Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

DESCRIPTION

This is the Ignite talk that I gave at ScienceOnline2010 (#sci010) in Research Triangle Park, North Carolina, on January 16th, 2010. It was supposed to be a 5-minute talk highlighting the quality of chemistry data on the internet. OK, it was a little tongue in cheek because it was an after-dinner talk late at night, but the data are real, the problem is real, and the need to curate chemistry data online is real. On ChemSpider we have provided a platform to deposit and curate data. The video is here on YouTube: http://tinyurl.com/yekmmfg

Page 1: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

Page 2: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

www.dhmo.org

Page 3: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

www.dhmo.org

Page 4: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

Di-Hydrogen Monoxide

Page 5: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

Di-Hydrogen Monoxide

2H

Page 6: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

Di-Hydrogen Monoxide

2H + 1O

Page 7: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

Di-Hydrogen Monoxide

H2O

Page 8: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

Di-Hydrogen Monoxide

H2O

Water

Page 9: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

It’s all on Wikipedia…

Page 10: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

It’s all on Wikipedia…

Page 11: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

Chemistry on The Internet Is Messy

Page 12: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

It’s Methane…

Page 13: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

What’s Methane?

Page 14: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

What’s Methane?

Page 15: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

What ELSE is Methane???

Page 16: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

What ELSE is Methane???

Page 17: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

What ELSE is Methane???

Page 18: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

Truly “I Love You”

Page 19: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

Chemistry is REALLY Messy

Page 20: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

Vancomycin

Who will curate?

How would you clean such a large dataset?

3 days of effort with EBI

Assertions!!!

Page 21: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

Trust the AUTHORITY!

Page 22: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

Bonds ARE important!

Page 23: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

It’s Better to Have an Entire Molecule!

Page 24: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

Does Stereochemistry Matter??

Page 25: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

Yes, ONE stereocenter matters!!!

Thalidomide

Page 26: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

The EXPERTS must get it right?!

Page 27: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

Wikipedia?

American Chemical Society?

C&E News (from ACS)

Page 28: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

Online Chemistry

Online Chemistry is DIRTY

Wikipedia “structures” are very good! A curation team of many people working on them for 2 years

Chemistry databases are polluted

Online Chemistry needs curating…

Page 29: Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

Crowdsourced Curation

Use the wisdom of the crowd to curate chemistry!

www.chemspider.com

23.5 Million chemicals

300 Data Sources

Open platform for deposition and curation