76
New York State Center of Excellence in Bioinformatics & Life R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U Clinical Trial Ontology Meeting How to build an Ontology ? Some basic principles NIH, May 16-17, 2007 Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences Department of Psychiatry, University at Buffalo, NY, USA http://www.org.buffalo.edu/RTU

Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

  • Upload
    loki

  • View
    26

  • Download
    0

Embed Size (px)

DESCRIPTION

Clinical Trial Ontology Meeting How to build an Ontology ? Some basic principles NIH, May 16-17, 2007. Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences Department of Psychiatry, University at Buffalo, NY, USA http://www.org.buffalo.edu/RTU. - PowerPoint PPT Presentation

Citation preview

Page 1: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Clinical Trial Ontology Meeting

How to build an Ontology ?Some basic principles

NIH, May 16-17, 2007

Werner CEUSTERS, MDCenter of Excellence in Bioinformatics and Life Sciences

Department of Psychiatry, University at Buffalo, NY, USA

http://www.org.buffalo.edu/RTU

Page 2: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

2

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Short personal history

1959 - 20061977

1989

1992

19982002

2004

Page 3: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

3

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Mainstream interpretations of “ontology”• An explicit specification of an agreed upon

conceptualization of a domain– Tom Grüber

• Anything what is given the name ‘ontology’ and that can be described in terms of 6 axes: expressiveness, structure, intended use, granularity, automated reasoning, prescriptive/descriptive– Ontology Summit 2007

Page 4: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

4

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Problems with mainstream ontologies• Based upon the confusing notion of “concept”

– Unit of thought or knowledge concerning anything perceivable or conceivable

– The meaning of a term– …

• Confuse information representation with domain representation

Information about X part_of information about Y

X part of Y

Page 5: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

5

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

What I mean with the word “ontology”• A representation of some pre-existing domain of reality (a

portion of reality) which

1. reflects the properties of the entities within its domain in such a way that there obtains a systematic correlation between reality and the representation itself,

2. is intelligible to a domain expert

3. is formalized in a way that allows it to support automatic information processing

reality

Page 6: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

6

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Three levels of reality

1. The world exists ‘as it is’ prior to a cognitive agent’s perception thereof;

2. Cognitive agents build up ‘in their minds’ cognitive representations of the world;

3. To make these representations publicly accessible in some enduring fashion, they create representational artifacts that are fixed in some medium.

Smith B, Kusnierczyk W, Schober D, Ceusters W. Towards a Reference Terminology for Ontology Research and Development in the Biomedical Domain. Proceedings of KR-MED 2006, November 8, 2006, Baltimore MD,

USA

Page 7: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

7

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

O

RU1O1

concretization

R1st level reality

Represent what exist and is relevant

BRU1

B1 Cognitiverepresentation

Page 8: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

8

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Some characteristics of representational units1. each unit is assumed by the creators of the

representation to be veridical, i.e. to conform to some relevant POR as conceived on the best current scientific understanding;

2. several units may correspond to the same POR by presenting different though still veridical views or perspectives;

3. what is to be represented by the units in a representation depends on the purposes which the representation is designed to serve.

Page 9: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

9

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Some characteristics of an optimal ontology• Each representational unit in such an ontology

would designate – (1) a single portion of reality (POR), which is – (2) relevant to the purposes of the ontology and such

that – (3) the authors of the ontology intended to use this unit

to designate this POR, and– (4) there would be no PORs objectively relevant to

these purposes that are not referred to in the ontology.

Page 10: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

10

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Three types of ontologies• Upper level ontologies:

– (should) describe the most generic structure of reality• Domain ontologies:

– (should) describe the portion of reality that is dealt with in some domain

– Special case: reference ontologies• Application ontologies:

– To be used in a specific context and to support some specific application

Page 11: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

11

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Clinical trial ontologies• As domain ontologies:

– Cover all entity types relevant in the clinical trial domain

• As application ontologies:– A subset of the above which is large enough to support

all functions the application has to serve:• CT protocol development• Study management• Data analysis• …

Page 12: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Key question

How to build an optimal clinical trial domain ontology ?

Page 13: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Rule 1:

Analyze the domain

Page 14: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Rule 2a:

Try to be lazy:re-use what others have done.

Page 15: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

15

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

The BRIDG (domain analysis) model• NOT an ontology• A computable clinical trials protocol

representation – that supports the entire life-cycle of clinical trial

protocols, and– that will serve as a foundation for caBIG modules

• that support all phases of the clinical trials life cycle, (including protocol authoring) and

• be developed to meet user needs and requirements.The BRIDG Project: Creating a model of the semantics of clinical trials research. Douglas B. Fridsma. July 26, 2006

Page 16: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

16

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Reasons for selecting BRIDG

• BRIDG tries to solve an important problem• Does not completely ignore reality as many other

initiatives do: – If the tools and models don’t work with reality, it is

probably the tools and the models that need to change• The BRIDG Project: Creating a model of the semantics of clinical trials research. Douglas B. Fridsma.

July 26, 2006

• Intended to become the next best thing on earth (after HL7, I assume)

(although one has to search hard to find evidence and sometimes it looks as if some contributors observed reality from outer space)

Page 17: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

17

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

http://www.bridgproject.org/status.html

Page 18: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

18

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

BRIDG_Model_V1_49

Page 19: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

19

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

BRIDG model organization

Image from: The BRIDG Project: Creating a model of the semantics of clinical trials research. Douglas B. Fridsma. July 26, 2006

Page 20: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Rule 2b:

Try to be lazy: re-use what others have done,But… remain critical at all times!

Page 21: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

21

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Being critical ≠ being negative

RFQ-NCI-60001-NG: Review of NCI Thesaurus and Development of Plan to Achieve OBO-Compliance

Grant to Apelon (H. Solbrig) to improve NCIT

Page 22: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Rule 3:

Don’t have a blind trust in the power of representation and modeling

languages, and certainly not in UML

Page 23: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

23

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

‘Death by UML Fever’• It is important to emphasize that UML itself is not the direct cause

of any maladies described herein. • Instead, UML is largely an innocent victim caught in the midst of

poor process, no process, or sheer incompetence of its users. • UML sometimes does amplify the symptoms of some fevers as the

result of the often divine-like aura attached to it. • For example, it is not uncommon for people to believe that no

matter what task they may be engaged in, mere usage of UML somehow legitimizes their efforts or guarantees the value of the artifacts produced.

Alex E. Bell. Death by UML Fever. Queue 2(1), March 2004, ACM Press, 72 – 80, 2004

Page 24: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

24

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Who would not be impressed ?

• Fig. 10: BRIDG Comprehensive Class and attribute diagram - (Logical diagram), p99

Page 25: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

25

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

I’m not !• I have come to appreciate domain modeling in

UML as an implementation-independent approach which is more likely to uncover “the truth” about the underlying semantics.– Dr. Diane Wold. Modeling Trial Design with BRIDG. July 26, 2006

• The UML diagram helped us to keep separate an activity, which exists independent of any schedule, and an activity-at-a-visit, (the X), which is a plan to perform that activity at a particular time.

Page 26: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Rule 4:

Limit the number of developers/contributors

Page 27: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

27

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Contributors to the BRIDG model

A chain is as

strong as its

weakest link

Image from: The BRIDG Project: Creating a model of the semantics of clinical trials research. Douglas B. Fridsma. July 26, 2006

Page 28: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Rule 5:

• Be consistent in what you describe: – either representational units, or – the entities represented by them.

• Thus: keep the levels of reality all the time in mind

Page 29: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

29

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

LivingSubject (BRIDG logical model p1031)• Type: Class• Status: . Version . Phase .• Package: Entities and Roles Keywords:• Detail: Created on 02/09/2006. Last modified on

02/09/2006.• GUID: {7C04F8D8-30B9-4942-B2A8-4CF93E8913D9}• An object representing an organism or complex animal,

alive or not. Examples: person, dog, microorganism, plant of any taxonomic group, tissue sample, bacteria, fungi, and viruses.

Page 30: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

30

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

SubstanceAdministration (BRIDG logical model p84)

• Type: Class PerformedActivity• Status: Proposed. Version 1.0. Phase 1.0.• Package: CTOM Elements Keywords:• Detail: Created on 01/05/2005. Last modified on

12/14/2006.• GUID: {2289C0E8-855D-42e3-86FA-

2ECBE59D8982}• The description of applying, dispensing or giving

agents or medications to subjects.

Page 31: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

31

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Person (BRIDG logical model p106 a.f., HE!)

• Type: Class• Status: Proposed. Version 1.0. Phase 1.0.• Package: Clinical Research Entities Keywords:• Detail: Created on 06/09/2005. Last modified on

01/13/2007.• GUID: {6F49F110-7B36-4c03-A7EA-

F456CE1E739D}• A human being.

Page 32: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

32

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Some Person Attributes• administrativeGenderCode (p107)

– The classification of the sex or gender role of the patient. Values include: Female, Male, and Unknown.

• genderCode (p108)– The text that describes the assemblage of physical

properties or qualities by which male is distinguished from female; the physical difference between male and female within a person. [Explanatory Comment: Identification of sex is usually based upon self-report and may come from a form, questionnaire, interview, etc.]

Page 33: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

33

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T UA better example:

Clinical Trial Ontology under DOLCE

Crenguta Bogdan, Daniela Luzi, Fabrizio L. Ricci, Luca D. Serbanati. Towards a Clinical Trial Ontology using a Concern-Oriented Approach. W.P. n. 10, October 2006.

Page 34: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Rule 6:

Use a Realism-based Upper Ontology to classify the representational units

in your Domain Ontology

Page 35: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

35

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Realism in Basic Formal Ontology (BFO)• The world consists of

– entities that are • Either particulars or universals; • Either occurrents or continuants;• Either dependent or independent; and,

– relationships between these entities of the form• <particular , universal> e.g. is-instance-of, • <particular , particular> e.g. is-member-of• <universal , universal> e.g. isa (is-subtype-of)

Smith B, Kusnierczyk W, Schober D, Ceusters W. Towards a Reference Terminology for Ontology Research and Development in the Biomedical Domain. Proceedings of KR-MED 2006, November 8, 2006, Baltimore MD, USA

Page 36: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

36

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Only what exists (or existed) can be represented• Anything else can be imagined• Examples of what exist:

– Body parts– Disorders– Abortions– Women with prevented abortions– Plans about my future activities

• What does not exist– Prevented abortions– My future activities

Page 37: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

37

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

PlannedActivity (BRIDG logical model p202, HE!)

Page 38: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Rule 7:

• Use formal ontological methods to: – distinguish distinct entities– assess in what way distinct entities are

distinct

Page 39: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

39

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Organism (BRIDG logical model p160, HE!)• Type: Class• Status: Proposed. Version 1.0. Phase 1.0.• Package: Clinical Research Roles Keywords:• Detail: Created on 12/13/2006. Last modified on

01/19/2007.• GUID: {B9F321DB-365F-4155-B8F6-3D….• The role that a biological entity has, and that role

participates in a microbiology test in two ways: first, it can be identified as the result of a microbiology test. It can also participate as a specimen in the microbiology test. [HL7 Perspective]

Page 40: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

40

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

An example: ONTOCLEAN

• Identity, essence, unity, dependence

C. Welty, N. Guarino"Supporting ontological analysis of taxonomic relationships", Data and Knowledge Engineering vol. 39, no. 1, pp. 51-74, 2001

Page 41: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Rule 8:

• Don’t confuse reality with our means to access that reality, f.i.:

• Don’t confuse the observation of an entity with the entity observed

Page 42: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

42

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

AdverseEvent (BRIDG logical model p168, HE!)• Type: Class Assessment• Status: Proposed. Version 1.0. Phase 1.0.• Package: Clinical Research Activities Keywords:• Detail: Created on 05/24/2006. Last modified on

01/26/2007.• GUID: {CD620136-3CB9-4382-802B-F6CA82F98C10}• An observation of a change in the state of a subject that is

assessed as being untoward by one or more interested parties within the context of protocol-driven research or public health.

Page 43: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

43

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Example: medical ‘findings’ and ‘observations’• A particular pathological entity may at a certain

time be undetectable by any observation method or technique available to an observer, including the person exhibiting the pathological entity itself.

Page 44: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

44

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Example: medical ‘findings’ and ‘observations’ (1)• A particular pathological entity may at a certain

time be undetectable by any observation method or technique available to an observer, including the person exhibiting the pathological entity itself.

• A particular observation (‘act of looking’) may produce false results and thus simulate the existence of a pathological entity.

Page 45: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

45

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Example: medical ‘findings’ and ‘observations’ (1)• A particular pathological entity may at a certain

time be undetectable by any observation method or technique available to an observer, including the person exhibiting the pathological entity itself.

• A particular observation may produce false results and thus simulate the existence of a pathological entity.

• An observer may observe or fail to observe a detectable particular pathological entity.

Page 46: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

46

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

On ‘findings’ and ‘observations’ (2)• When an observer perceives a particular pathological

entity, he might judge it – (1) to be an instance of the universal of which it is indeed an

instance in reality, – (2) to be an instance of another universal (and thus be in error),

or – (3) he might be not able to make an association with any

universal at all.• Distinct manifestations of ‘the same type’ may be

pathological or not:– Singing naked under the shower versus in front of The White

House• ...

Page 47: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Rule 9:

Do not accept silly suggestions, whomever they come from

Page 48: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

48

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Device (BRIDG logical model, p100, HE!)• Type: Class Material• Status: Proposed. Version 1.0. Phase 1.0.• Package: Clinical Research Entities Keywords:• Detail: Created on 02/22/2006. Last modified on

01/04/2007.• GUID: {3546A977-C51F-4860-A09A-

2ADAE896D74B}• <PROPOSED> A therapeutic or diagnostic intervention

utilizing a piece of equipment or a mechanism designed to serve a special purpose or perform a special function whose basic characteristics are not altered in the course of the intervention.

Page 49: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

49

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

The latter could also go under other rules:• Stop working when you are tired• Be careful with cut and paste• Proof-read your work• …

Page 50: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Rule 10:

Use distinct names for distinct representational units that denote

distinct entities

Page 51: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

51

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

AdverseEvent (BRIDG logical model p504)• Type: Class HealthProblem• Status: Proposed. Version 1.0. Phase 1.0.• Package: Adverse Event Keywords:• Detail: Created on 05/01/2006. Last modified on

05/02/2006.• GUID: {6783F6F2-8837-4b7d-B81B-

A25206D36689}• A toxic reaction to a medical therapy, or to an

experience such as consuming a meal.

Page 52: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

52

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

AdverseEvent (BRIDG logical model p91)• Type: Class• Status: Proposed. Version 1.0. Phase 1.0.• Package: SDTM Keywords:• Detail: Created on 12/14/2005. Last modified on 12/28/2006.• GUID: {F1786F01-F973-426d-B765-0107B5823A18}• Any untoward medical occurrence in a patient or clinical

investigation subject administered a pharmaceutical product and which does not necessarily have a causal relationship with this treatment. An adverse event (AE) can therefore be any unintended sign (including an abnormal laboratory finding), symptom, or disease temporally associated with the use of a medicinal (investigational) product, whether or not related to the medicinal investigational) product.

Page 53: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

53

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

AdverseEvent (BRIDG logical model p36)• Type: Class Assessment• Status: Proposed. Version 1.0. Phase 1.0.• Package: CTOM (imported package) Keywords:• Detail: Created on 01/05/2005. Last modified on 09/26/2005.• GUID: {C0F30FE6-EE1E-443e-A7AB-256342B193B3}• An unfavorable and unintended reaction, symptom, syndrome, or

disease encountered by a subject while on a clinical trial regardless of whether or not it is considered related to the product or procedure. . The concept refers to assessments that could be medically related, dose related, route related, patient related, caused by an interaction with another therapy or procedure, or dose escalation.

Page 54: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Rule 11:

Avoid contradictions

Page 55: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

55

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

ObjectiveResult (BRIDG logical model p191, HE!)• Type: Class InvestigativeResult, Observation• Status: Proposed. Version 1.0. Phase 1.0.• Package: Clinical Research Activities Keywords:• Detail: Created on 01/20/2005. Last modified on

12/28/2006.• GUID: {F388CFB0-77DE-4008-B222-EB…• An act of monitoring, recognizing and noting

reproducible measurement of some magnitude with suitable instruments or established scientific processes.

• <EXAMPLE> A laboratory test with standardized instruments, ECG measurement or question on a validated questionnaire such as SF36.

Page 56: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

56

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Some attributes of ObjectiveResult• missedIndicator boolean

– This is an indicator flag that flags a performed observation as "not done". (default: CDISC) …… p193

• missedReason– This captures SDTM's ---REASND. In HL7, there is a list of

permissible missing value types, and we need to ensure that HL7's list is a superset of what is needed by SDTM.

– <EXAMPLE> A planned observation was not done because the equipment failed, so the corresponding "performed observation" exists as a placeholder to describe why that performed observation was not done…….……p193

– Default: [CDISC SDTM IG v3.1.1 = REASND ]

Page 57: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Rule 12:

Avoid circular definitions

Page 58: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

58

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Ingredient (BRIDG logical model p507) • Status: Proposed. Version 1.0. Phase 1.0.• Package: Adverse Event Keywords:• Detail: Created on 03/01/2006. Last modified on

03/01/2006.• GUID: {7D53B2A1-CEC4-49ae-8BD6-

611E2CF4D862}• A substance that acts as an ingredient within a

product. Note, that ingredients may also have ingredients.

Page 59: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Rule 13:

Do not use names with a precise meaning in general language to designate entities which are of a more specific or totally

different type in the context of a specific application

Page 60: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

60

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Animal (BRIDG logical model p526) • Type: Class InvestigatedParty• Status: Proposed. Version 1.0. Phase 1.0.• Package: InvestigatedSubject Keywords:• Detail: Created on 03/10/2006. Last modified on

03/10/2006.• GUID: {996CB91C-04EC-4b1d-9AFF-57B878D532D7}• A non-person living entity which is chosen to be the

subject of an investigation, or which is the subject of an• implicated act.

Page 61: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Rule 14:

Provide a mechanism to let the ontology evolve in line with changes

in reality and in are understanding thereof

Page 62: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

62

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Reality versus beliefs, both in evolution

IUI-#3

O-#2

O-#1

tU1

U2

p3Reality

BeliefO-#0

= “denotes” = what constitutes the meaning of representational units …. Therefore: O-#0 is meaningless

Page 63: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

63

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Changes in reality, beliefs, representations

tU1

U2

p3

IUI-#3

O-#2

O-#1

R

BO-#0

Relationships amongst universals (R) or beliefs therein (B)

Page 64: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

64

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Mistakes, discoveries, being lucky, having bad luck

tU1

U2

p3

IUI-#3

O-#2

O-#1

R

BO-#0

Mistakes

Page 65: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

65

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Mistakes, discoveries, being lucky, having bad luck

tU1

U2

p3

IUI-#3

O-#2

O-#1

R

BO-#0

discoveries

Page 66: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

66

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Mistakes, discoveries, being lucky, having bad luck

tU1

U2

p3

IUI-#3

O-#2

O-#1

R

BO-#0

Page 67: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

67

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Mistakes, discoveries, being lucky, having bad luck

tU1

U2

p3

IUI-#3

O-#2

O-#1

R

BO-#0

Page 68: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

68

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Key requirement for versioning

Any change in an ontology or data repository should be

associated with the reason for that change to be able to assess later what kind of mistake has been made !

Page 69: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

69

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Example: a person’s gender in the EHR• In John Smith’s EHR:

– At t1: “male” at t2: “female”

Page 70: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

70

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Example: a person’s gender in the EHR• In John Smith’s EHR:

– At t1: “male” at t2: “female”

• What are the possibilities ?

Page 71: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

71

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Example: a person’s gender in the EHR• In John Smith’s EHR:

– At t1: “male” at t2: “female”

• What are the possibilities ?• Change in reality:

Page 72: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

72

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Example: a person’s gender in the EHR• In John Smith’s EHR:

– At t1: “male” at t2: “female”

• What are the possibilities ?• Change in reality:

• transgender surgery

Page 73: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

73

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Example: a person’s gender in the EHR• In John Smith’s EHR:

– At t1: “male” at t2: “female”

• What are the possibilities ?• Change in reality:

• transgender surgery• change in legal self-identification

Page 74: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

74

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Example: a person’s gender in the EHR• In John Smith’s EHR:

– At t1: “male” at t2: “female”

• What are the possibilities ?• Change in reality:

• transgender surgery• change in legal self-identification

• Change in understanding: it was female from the very beginning but interpreted wrongly

Page 75: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

75

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Example: a person’s gender in the EHR• In John Smith’s EHR:

– At t1: “male” at t2: “female”

• What are the possibilities ?• Change in reality:

• transgender surgery• change in legal self-identification

• Change in understanding: it was female from the very beginning but interpreted wrongly

• Correction of data entry mistake: it was understood as male, but wrongly transcribed

Page 76: Werner CEUSTERS, MD Center of Excellence in Bioinformatics and Life Sciences

76

New York State Center of Excellence in Bioinformatics & Life Sciences

R T U New York State Center of Excellence in Bioinformatics & Life Sciences

R T U

Conclusion• Building high quality ontologies is hard.• Experts in driving cars are not necessarily experts in car

mechanics (and the other way round). – Good computer scientists are usually lousy ontologists

• Ontologies should represent the state of the art in a domain, i.e. the science. – Science is not a matter of consensus or democracy.

• Natural language relates more to how humans talk about reality or perceive it, than to how reality is structured.

• No high quality ontology without the involvement of ontologists.