Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

Preview:

DESCRIPTION

This is the Ignite talk that I gave at ScienceOnline2010 #sci010 in the Research Triangle Park in North Carolina on January 16th 2010. This was supposed to be a 5 minute talk highlighting the quality of chemistry data on the internet. Ok, it was a little tongue in cheek because it was an after dinner talk and late at night but the data are real, the problem is real and the need for data curation of chemistry data online is real. On ChemSpider we have provided a platform to deposit and curate data.The video is here on YouTube: http://tinyurl.com/yekmmfg

Citation preview

Crowdsourced Chemistry – Why Online Chemistry Data Needs Your Help

www.dhmo.org

www.dhmo.org

Di-Hydrogen Monoxide

Di-Hydrogen Monoxide

2H

Di-Hydrogen Monoxide

2H + 1O

Di-Hydrogen Monoxide

H2O

Di-Hydrogen Monoxide

H2OWater

It’s all on Wikipedia…

It’s all on Wikipedia…

Chemistry on The Internet Is Messy

It’s Methane…

What’s Methane?

What’s Methane?

What ELSE is Methane???

What ELSE is Methane???

What ELSE is Methane???

Truly “I Love You”

Chemistry is REALLY Messy

Vancomycin

Who will curate?

How would you clean such a large dataset?

3 days of effort with EBI

Assertions!!!

Trust the AUTHORITY!

Bonds ARE important!

It’s Better to Have an Entire Molecule!

Does Stereochemistry Matter??

Yes, ONE stereocenter matters!!!

Thalidomide

The EXPERTS must get it right?!

Wikipedia? American Chemical Society? C&E News (from ACS)

Online Chemistry

Online Chemistry is DIRTY Wikipedia “structures” are very good! A curation

team of many people working on then for 2 years Chemistry databases are polluted

Online Chemistry needs curating…

Crowdsourced Curation

Use the wisdom of the crowd to curate chemistry!

www.chemspider.com 23.5 Million chemicals 300 Data Sources Open platform for deposition and curation

Recommended