19
Principles and Practice of Ontology Development: Making Definitions Computable Chris Mungall LBL

Principles and Practice of Ontology Development: Making Definitions Computable

Embed Size (px)

DESCRIPTION

Principles and Practice of Ontology Development: Making Definitions Computable. Chris Mungall LBL. Why make definitions computable?. Doing the work all by yourself is hard, boring and error prone Automate using reasoning Bonus: Makes the definitions easier for humans - PowerPoint PPT Presentation

Citation preview

Page 1: Principles and Practice of Ontology Development: Making Definitions Computable

Principles and Practice of Ontology Development: Making Definitions

ComputableChris Mungall

LBL

Page 2: Principles and Practice of Ontology Development: Making Definitions Computable

Why make definitions computable?

• Doing the work all by yourself is hard, boring and error prone

• Automate using reasoning• Bonus:– Makes the definitions easier for humans– Enables better visualization

Page 3: Principles and Practice of Ontology Development: Making Definitions Computable

Biology is modular, most ontology classes are compositional

distalphalanx

proximalphalanx

phalanx

autopod

foot hand

p

dp pp

dpf dph pphppf

pf ph

(adapted from diagram by David Hill and Joel Richardson)

Page 4: Principles and Practice of Ontology Development: Making Definitions Computable

History

• “Cross-products”– Hill, D. P. and Blake, J. A. and Richardson, J. E. and

Ringwald M: Extension and integration of the gene ontology (GO): combining GO vocabularies with external vocabularies. Genome Res 2002

• Formal Concept Analysis• Description Logics 1980s-present– Galen – Rector et al– GONG – Wroe, Stevens et al

Page 5: Principles and Practice of Ontology Development: Making Definitions Computable

Logical definitions

• Genus-differentia form– Text definition is genus-differentia form

• Distal phalanx of hand: – “A distal phalanx that is part of a hand.”– or “A distal phalanx[ID:2] that is part of a hand[ID:3].”

– OWL:• ‘distal phalanx of hand’ EquivalentTo ‘distal phalanx’ and

part_of some hand– OBO-Format

• [Term]• id: ID:1 ! distal phalanx of hand• intersection_of: ID:2 ! distal phalanx• intersection_of: part_of ID:3 ! hand

Page 6: Principles and Practice of Ontology Development: Making Definitions Computable

Editing logical definitions

• OBO-Edit– Cross-product tab• Genus• Differentia

– Parent Editor• select links to make them ‘intersections’

• Protégé 4– Equivalent Classes• Enter expression

Page 7: Principles and Practice of Ontology Development: Making Definitions Computable

Using a reasoner• Given:

– ‘distal phalanx of hand’ EquivalentTo ‘distal phalanx’ and part_of some hand

– ‘phalanx of hand’ EquivalentTo phalanx and part_of some hand– ‘distal phalanx’ is_a phalanx

• A reasoner can infer that:– ‘distal phalanx of hand’ is_a ‘phalanx of hand’

dp

dph

ph

Page 8: Principles and Practice of Ontology Development: Making Definitions Computable

Uses of a reasoner

• Ontology authoring– Time saving• Automatically inferring is_a polyhierarchy

– Quality Control• detecting inconsistencies

• Data integration and discovery– Less applications… so far

Page 9: Principles and Practice of Ontology Development: Making Definitions Computable

Reasoners

• OBO-Edit– Rule Based Reasoner

• OWL Reasoners (Protégé 4)– Pellet– FaCT++– HermiT

Page 10: Principles and Practice of Ontology Development: Making Definitions Computable

Difference between OBO and OWL reasoning

• OBO format expresses a subset of OWL– We thought this would be enough for most users– But this turned out not to be true– E.g. Lacks part– See talk later today

• Sufficient for basic anatomical classification

Page 11: Principles and Practice of Ontology Development: Making Definitions Computable

Modularity and external ontologies

• Modularity principle:– Multiple orthogonal ontologies– Use classes from O1 as building blocks in O2

– E.g.• astrocyte differentiation• anucleate cell

• BUT: Can pose problems for large external ontologies– CHEBI– PRO

Page 12: Principles and Practice of Ontology Development: Making Definitions Computable

Import

• application can choose to follow import chain– if it doesn’t, then there are

dangling references• the full import closure can be

large!

GO-BPGO-BP CLCL

imports

ontology basedapplication

Page 13: Principles and Practice of Ontology Development: Making Definitions Computable

Large Import Closure

GO-BPGO-BP CLCL

ontology basedapplication

PROPRO

CHEBICHEBI

GO-CCGO-CCAOAO

Page 14: Principles and Practice of Ontology Development: Making Definitions Computable

Bridge files

BP-xp-CLBP-xp-CL

BPBP CLCL

basic applicationmoreadvanced application

• current solution used by NIF, GO

Page 15: Principles and Practice of Ontology Development: Making Definitions Computable

MIREOT - Minimal Information for Retrieval of External Ontology Terms

BPBP

CL (full)CL (full)

CL subsetCL subsetcopycopy

• current solution used by OBI, PRO, hemo-CL

• ontology author “MIREOT”s in subset of external ontology

Page 16: Principles and Practice of Ontology Development: Making Definitions Computable

Deploying cell.obo

• Typical User– inferred links materialized– external ontology links removed

• Advanced User– asserted links only– external links present– minimal external classes included

Page 17: Principles and Practice of Ontology Development: Making Definitions Computable

Ontology publishing pipeline

cell_edit.obocell_edit.obo

GO/PRO/etc subsetGO/PRO/etc subset

cell_public.obocell_public.obo

GO/PRO/etc subsetGO/PRO/etc subset

cell_basic.obocell_basic.obo

cell_edit.owlcell_edit.owl

GO/PRO/etc subsetGO/PRO/etc subset

cell_public.owlcell_public.owl

GO/PRO/etc subsetGO/PRO/etc subset

cell_basic.owlcell_basic.owl

reasonerreasoner

reasonerreasoner

filterfilter

filterfilter

obo2owl obo2owl obo2owl

Page 18: Principles and Practice of Ontology Development: Making Definitions Computable

Summary

• Many ontologies are seeing the benefits of using computable definitions– e.g. fly anatomy– GO slow to adopt, lots of legacy issues to work out

• Better to employ them from the outset– DC_CL– Hemo_CL

Page 19: Principles and Practice of Ontology Development: Making Definitions Computable

•END