25
software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel chemical tools chemical probes high-thoughput screening (HTS) ChemBank PDSP pharmaceutical chemical biology cheminformatics ological pathways disease networks structural biology biomedical knowledge technology end point ATP Luciferin Coupled activity viability Beta-Lactamase Induction binding based calcium redistribution caspase activity dehydrogenase activity cyclic AMP redistribution energy transfer enzyme reporter enzyme substrate based Fluorogenic substrate GFP induction standards controlled vocabula indexing subject indexing schem authorized terms taxonomies thesauri subject headings natural language library tags homographs synonyms polysemes concepts structure search knowledge specificity article meta-data information exchang classification nomenclature semantic domain properties annotation object classes individuals BioAssay Ontology (BAO) Stephan Schürer, PhD ICBO, Buffalo, July 30 2011 [email protected]

Software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel

Embed Size (px)

Citation preview

Page 1: Software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel

software

serverssearch tool

versioning

programming

XML

RDFOWL

data sets

semantic web

PubChem

screening

fluorescence

small molecule

biological assay

novel chemical tools

chemical probes

high-thoughput screening (HTS)

ChemBank

PDSP

pharmaceutical

chemical biology

cheminformatics

biological pathways

disease networks

structural biology

biomedical knowledge

technology end point

ATP Luciferin Coupled

activityviability

Beta-Lactamase Induction

binding based

calcium redistribution

caspase activity

dehydrogenase activity

cyclic AMP redistribution

energy transfer

enzyme reporter

enzyme substrate based

Fluorogenic substrateGFP induction

standards

controlled vocabulary

indexing

subject indexing schemes

authorized terms

taxonomies

thesauri

subject headings

natural language

library

tags homographs

synonyms

polysemes

conceptsstructure

searchknowledge

specificity

article

meta-data

information exchange

classification

nomenclaturesemantic

domain

propertiesannotation

object

classes

individuals

BioAssay Ontology (BAO)

Stephan Schürer, PhD

ICBO, Buffalo, July 30 2011

[email protected]

Page 2: Software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel

One of the most important approaches to find novel entry points for drug discovery programs

Historically in pharmaceutical companies Since ~2005, massive NIH effort (MLI) to make HTS

accessible to public sector research PubChem is the major repository of HTS data More recently: EU-OpenScreen project

Background for BioAssay Ontology

High-throughput screening

2

Page 3: Software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel

Lack of standardized assay annotations No standardized endpoint names or formats

Data is rarely re-used(!)Common queries cannot be askedAnalysis across different data sets is difficultIntegration with other databases is difficult

No knowledge model for assays and screening results

Motivation for BioAssay Ontology

Large public screening data setsPubChem, ChEMBL, PDSP, ChemBank, Binding DB

3

Page 4: Software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel

• Identify inhibitors of kinases in biochemical assays.• Identify compounds active in multiple luciferase reporter

gene assays.• Identify compounds active in cell viability assays and

organize by cell lines and assay types.• Identify active compounds in assays related to pathway X.• …

Queries the Ontology should be able to answer

4

Page 5: Software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel

5

Leverage the aggregated corpus of publically available HTS data to infer molecular mechanism of actions (MMOA) of small molecule perturbagens in biological model systems.

Schürer et al. “BioAssay Ontology Annotations Facilitate Cross-Analysis of Diverse High-throughput Screening Data Sets” J Biomol Screen 2011 (16), 415-426.

Page 6: Software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel

BAOSearch Software (beta):http://baosearch.ccs.miami.edu Query, explore, download BAO-annotated PubChem content Some semantic search capabilities

Project Website and Wiki with relevant materials and documentation:http://www.bioassayontology.org/http://www.bioassayontology.org/wiki

BAO Products and Resources

6

Page 7: Software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel

Application / user focus vs. “universal” ontologies Efficiency vs. “realism” of representations Rapid application development

Orthogonal ontologies vs. Ontology mapping Universal “realism” vs. domain or application-specific

Chemical bond: 2D structure graph, 3D rule based, molecular mechanics, semi-empirical, up-initio QM

Disease Virtual world

Questions / Discussion points

7

Page 8: Software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel

Collaborative ontology development Collaborative vs. individual effort Control over development and focus / application focus Rapid application development Quality

Aligning BAO to upper level ontology (BFO) Benefits vs. required resources Do upper level ontologies matter for specialized

applications?

Questions / Discussion points

8

Page 9: Software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel

Aligning BAO with OBI Some level of overlap OBI: process-oriented (model the investigation) BAO: purpose of categorization and analysis of HTS data BAO model becomes more complex if based on OBI

How do we do it practically Define missing assays to OBI and MIREOT back? Quick term templates (QTT)? Define our relations as short-cut relationships (using RO)?

Questions / Discussion points

9

Page 10: Software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel

Additional slides

10

Page 11: Software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel

BAO-facilitated Example for Analysis(Luciferase Assays)

Details in: Schürer et al. “BioAssay Ontology Annotations Facilitate Cross-Analysis of Diverse High-throughput Screening Data Sets” J Biomol Screen 2011 (16), 415-426.

11

Page 12: Software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel

Panel AssaySingle ConcOtherConc-responseA

ssa

y C

ou

nt

Most promiscuous reporter gene compounds

Page 13: Software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel

Most promiscuous reporter gene compoundsR

epor

ter D

RR

epor

ter S

CVi

abilit

y D

RVi

abilit

y SC

Enz

Activ

DR

Enz

Activ

SC

ATP

DR

ATP

SCLu

cife

rin D

RLu

cife

rin S

C

Promiscuity Index

0 10.2

Com

poun

ds

Luciferase Enzyme Inhibitors

Generally cytotoxic

Page 14: Software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel

Examples: Cytotoxic Series

Cluster Reporter PCIdx: 0.56Cluster Reporter Active: 58Cluster Viability PCIdx: 0.64Cluster Viability Active 27 Cluster Reporter PCIdx: 0.48

Cluster Reporter Active: 23Cluster Viability PCIdx: 0.45Cluster Viability Active 10

Cluster Reporter PCIdx: 0.41Cluster Reporter Active: 29Cluster Viability PCIdx: 0.57Cluster Viability Active 13

Daunorubicin

Emetine

Page 15: Software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel

Examples: Luciferase Inhibitor Series

Cluster Size: 6Cluster Reporter PCIdx: 0.61Cluster Reporter Active: 101Cluster EnzActivity PCIdx: 0.58Cluster EnzActivity: 15

Cluster Size: 4Cluster Reporter PCIdx: 0.38Cluster Reporter Active: 52Cluster EnzActivity PCIdx: 0.61Cluster EnzActivity: 11

Cluster Size: 5Cluster Reporter PCIdx: 0.46Cluster Reporter Active: 77Cluster EnzActivity PCIdx: 0.58Cluster EnzActivity: 14

Schürer et al. “BioAssay Ontology Annotations Facilitate Cross-Analysis of Diverse High-throughput Screening Data Sets” J Biomol Screen 2011 (16), 415-426.

Page 16: Software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel

1) Development of the Bioassay Ontology

2) Annotation of assays and assay results(content curation)

3) Development of software tools

BAO Project: Three major components

16

Page 17: Software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel

BAO design to describe assays

Page 18: Software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel

Application of BAO: BAO Search Software

18

Page 19: Software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel

19

http://baosearch.ccs.miami.edu/baosearch/

Page 20: Software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel

20

BAO: Concept Search

Page 21: Software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel

21

Biochemical Assays with IC50 < 1 mM

Page 22: Software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel

22

Page 23: Software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel

23

Chemical structure search

Page 24: Software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel

BioAssay Ontology (NCBO bioportal and project site):http://bioportal.bioontology.org/ontologies/45410http://www.bioassayontology.org/visualize/

Terminology / annotations for biochemical assays: http://www.bioassayontology.org/>Assay Annotation Template

Over 1000 BAO-annotated assays from PubChem (available in BAOSearch)

BAO Products and Resources

24

Page 25: Software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel

• Chris Mader• Amar Koleti• Nakul Datar• Sreeharsha

Venkatapuram• Felimon Gayanilo

• Mark Southern

• Saminda Abeyruwan• Uma Vempati• Magdalena Przydzial• Kunie Sakurai• Robin Smith• Yuanyuan Jia• Caty Chung

• Ubbo Visser• Vance Lemmon• Mitsunori Ogihara

• Nick Tsinoremas

http://bioassayontology.org

[email protected]

Acknowledgements

25