45
Online Resources to Support Open Drug Discovery Systems Antony Williams 3 rd Annual Drug Discovery Partnership: Filling the Pipeline, October 2011

Online Resources to Support Open Drug Discovery Systems

Embed Size (px)

DESCRIPTION

This is a presentation given at the Opal Events meeting ""Drug Discovery Partnerships: Filling the Pipeline". I was speaking in a session with Jean-Claude Bradley regarding "Pre-competitive Collaboration: Sharing Data to Increase Predictability". This presentation discussed some of the work we are doing on Open PHACTS. My thanks especially to Carole Goble, Lee Harland and Sean Ekins for their comments.

Citation preview

Page 1: Online Resources to Support Open Drug Discovery Systems

Online Resources to Support Open Drug Discovery Systems

Antony Williams3rd Annual Drug Discovery Partnership: Filling the Pipeline, October 2011

Page 2: Online Resources to Support Open Drug Discovery Systems

Open Drug Discovery

Pharma Companies spend >$50 billion annually on R&D

How much historical data/knowledge/information is in the public domain? And where is it?

How much generated data is truly competitive? Pre-competitive and public domain data could

deliver high value to drug discovery Data mining Model-building Integrating into in-house and online systems

Page 3: Online Resources to Support Open Drug Discovery Systems

Internal and external content Built to meet primary use-case Tailored indexes and GUIs Internal unique language & metadata Poor interoperability/integration Powerpoint, Documents, Excel Many suppliers of systems and content in

a single workflow

Literature Patents NewsPipeline SAR CSRs SafetyIn vivo Etc

Pharma Information Tombs

Page 4: Online Resources to Support Open Drug Discovery Systems

What could create change?

Harvard Business Review (2010)

“One change would make a substantial difference [to drug R&D]: the creation of agreed-upon standards for digitally

representing drug assets.”

Page 5: Online Resources to Support Open Drug Discovery Systems

It is so difficult to navigate…

What’s the structure?What’s the structure?

Are they in our file?

Are they in our file?

What’s similar?What’s

similar?

What’s the target?

What’s the target?Pharmacology

data?Pharmacology

data?

Known Pathways?

Known Pathways?

Working On Now?

Working On Now?Connections

to disease?Connections to disease?

Expressed in right cell type?Expressed in

right cell type?

Competitors?Competitors?

IP?IP?

Page 6: Online Resources to Support Open Drug Discovery Systems

Where is chemistry online? Encyclopedic articles (Wikipedia) Chemical vendor databases Metabolic pathway databases Property databases Patents with chemical structures Drug Discovery data Scientific publications Compound aggregators Blogs/Wikis and Open Notebook Science

Page 7: Online Resources to Support Open Drug Discovery Systems

PubChem

Page 8: Online Resources to Support Open Drug Discovery Systems

ChEMBL

Page 9: Online Resources to Support Open Drug Discovery Systems

ChemSpider

Page 10: Online Resources to Support Open Drug Discovery Systems

SciDBs Wiki

Page 11: Online Resources to Support Open Drug Discovery Systems
Page 12: Online Resources to Support Open Drug Discovery Systems

Pharma are accessing, processing, storing & re-processing

LiteraturePubChem

GenbankPatents

DatabasesDownloads

Data Integration Data AnalysisFirewalled Databases

Public Domain Drug Discovery Data

Page 13: Online Resources to Support Open Drug Discovery Systems

New trend: Set Data Free on the Web

Page 14: Online Resources to Support Open Drug Discovery Systems

Open Algorithms, Descriptors and Closed Data – Can We Unlock It?

Page 15: Online Resources to Support Open Drug Discovery Systems

The Innovative Medicines Initiative EC funded public-private partnership for

pharmaceutical research

Focus on key problems Efficacy Safety Education & Training Knowledge Management

Page 16: Online Resources to Support Open Drug Discovery Systems

Open PHACTS Project Develop a set of robust standards… Implement the standards in a semantic integration hub Deliver services to support drug discovery programs in

pharma and public domain 22 partners, 8 pharmaceutical companies, 3 biotechs 36 months project

Guiding principle is open access, open usage, open source- Key to standards adoption -

Guiding principle is open access, open usage, open source- Key to standards adoption -

Page 17: Online Resources to Support Open Drug Discovery Systems

Open PHACTS Project Partners

Page 18: Online Resources to Support Open Drug Discovery Systems

Example Research questions Give all compounds with IC50 < xxx for target Y in species

W and Z plus assay data

What substructures are associated with readout X (target, pathway, disease, …)

Give all experimental and clinical data for compound X

Give all targets for compound X or a compound with a similarity > y%

Page 19: Online Resources to Support Open Drug Discovery Systems

Prioritised Research Questions Analysis Prevalent Concepts

Compound Bioassay Target Pathway Disease

Prevalent data relationships Compound – target Compound – bioassay Bioassay – target Compound – target – mode of action Target – target classification Target – pathway and disease

Required cheminformatics functionality

– Chemical substructure searching– Chemical similarity searching

Required bioinformatics functionality

Sequence and similarity searching

Bioprofile similarity searching

Page 20: Online Resources to Support Open Drug Discovery Systems

Selection of prioritised data sources Chemistry

ChEMBL DrugBank ChEBI PubChem ChemSpider Human Metabolome DB Wombat (commercial)

Ontologies AmiGo (The Gene Ontology) KEGG (Kyoto Encyclopedia of Genes and Genomes) OBI (The Ontology for Biomedical Investigations) Bioassay Ontology EFO (Experimental Factor Ontology)

Biology– EntrezGene– HGNC– Uniprot– Interpro– SCOP– Wikipathways– OMIM– IUPHAR

Page 21: Online Resources to Support Open Drug Discovery Systems
Page 22: Online Resources to Support Open Drug Discovery Systems

Linking “Flavors” of Chemistry

Page 23: Online Resources to Support Open Drug Discovery Systems

Improve Linked Data Access… Coordinate effort to clean up chemistry related data

Open tools – require good validation studies

Support scientists making data open

Support companies/groups promoting software for data sharing

Engage community to help create what they want.

Page 24: Online Resources to Support Open Drug Discovery Systems

Openness and Quality IssuesWilliams and Ekins, DDT, 16: 747-750 (2011)

Science Translational Medicine 2011

Page 25: Online Resources to Support Open Drug Discovery Systems

Chemistry Databases on the Internet

Some public databases are “trusted” as primary sources

Trust is granted without investigation or understanding of the content

What do we know about some of the online resources?

Page 26: Online Resources to Support Open Drug Discovery Systems

PHYSPROP Database

The freely downloadable database under the EPI Suite prediction software

Very Basic filters suggest data quality issues

Page 27: Online Resources to Support Open Drug Discovery Systems

The Stereochemistry challenge.12500 chemicals with “missed” stereo

Page 28: Online Resources to Support Open Drug Discovery Systems

Searches on ChemSpider

Most searches are text-based: people searching for information about known chemicals

Creating accurate name-structure dictionaries is critical

Page 29: Online Resources to Support Open Drug Discovery Systems

NIST Webbook

Page 30: Online Resources to Support Open Drug Discovery Systems

PubChem

Page 31: Online Resources to Support Open Drug Discovery Systems

NPC Browser http://tripod.nih.gov/npc/

Page 32: Online Resources to Support Open Drug Discovery Systems
Page 33: Online Resources to Support Open Drug Discovery Systems

Cyclic Data Sharing

Data-sharing between open databases is cyclic

Page 34: Online Resources to Support Open Drug Discovery Systems

Synonyms on PubChem

1,3-DICHLORO-PROPAN-2-ONE

(2R,3R)-Butanediol bis(methanesulfonate)

Ethyl-1-propenyl ether, mixture of cis and trans

PSS-[2-[(Chloromethyl)phenyl]ethyl]-Heptaisobutyl substituted

1-Chlorobenzylethyl-3,5,7,9,11,13,15-heptaisobutylpentacyclo [9.5.1.1(3,9).1(5,15).1(7,13)]octasiloxane

Page 35: Online Resources to Support Open Drug Discovery Systems

Synonyms on PubChem

Page 36: Online Resources to Support Open Drug Discovery Systems

Data Proliferation

Page 37: Online Resources to Support Open Drug Discovery Systems
Page 38: Online Resources to Support Open Drug Discovery Systems
Page 39: Online Resources to Support Open Drug Discovery Systems
Page 40: Online Resources to Support Open Drug Discovery Systems
Page 41: Online Resources to Support Open Drug Discovery Systems

www.chemspider.com

Page 42: Online Resources to Support Open Drug Discovery Systems

ChemSpider…

>26 million unique molecules Links together >400 internet resources Linking patents, publications, chemical vendors and

online chemical compound databases Crowdsourced depositions and curations

Page 43: Online Resources to Support Open Drug Discovery Systems

ChemSpider…

>26 million unique molecules Links together >400 internet resources Linking patents, publications, chemical vendors and

online chemical compound databases Crowdsourced depositions and curations

A focus on data quality – cleaning data on the web

The structure database under Open PHACTS

Page 44: Online Resources to Support Open Drug Discovery Systems

Acknowledgments Sean Ekins – Collaborations in Chemistry

RSC|ChemSpider team

Open PHACTS consortium – especially Lee Harland and Carole Goble

Data depositors and curators

Software providers – ACD/Labs, OpenEye, GGA Software Inc, Open Source Cheminformatics

Page 45: Online Resources to Support Open Drug Discovery Systems

Thank you

Email: [email protected] Twitter: ChemConnectorBlog: www.chemspider.com/blogPersonal Blog: www.chemconnector.comSLIDES: www.slideshare.net/AntonyWilliams