27
Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA Feb 22, 2008

Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

Embed Size (px)

Citation preview

Page 1: Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base

Rob RaskinNASA/Jet Propulsion Laboratory

Pasadena, CA

Feb 22, 2008

Page 2: Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

Data to Knowledge

Data Information Knowledge

Basic Elements Bytes Numbers Models FactsServices Save Visualize Infer Understand PredictStorage File Database GIS Ontology MindVolume High LowDensity Low High

Syntax Semantics

Page 3: Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

What is Knowledge? Facts, relations, meanings, contexts,

common sense Information with context Shared understanding of meaning Suitable for reasoning/inference Dynamic, expandable

Page 4: Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

Application:Intelligent Search for Data Consults knowledge base to find

alternative meanings Clustered by: synonyms, parent, children

Enables discovery of resources without exact keyword match

Semantic understanding is crucial Common search engines (Google) use these

capabilities only minimally, at present

Page 5: Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

Application: Intelligent Search for Data (more) Noesis ontology-aided search tool

http://noesis.itsc.uah.edu Provides access to:

Data Journal articles Web pages Experts (people)

Page 6: Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

Semantic Understanding is Semantic Understanding is Difficult!Difficult!

Variable t: temperatureVariable t: time

Sea surface temperature: measured 3 m above surface Sea surface temperature: measured at surface

Data quality= 5 Data quality= 3

Page 7: Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

Let’s eat, Grandma.Let’s eat Grandma.

Time flies like an arrow.Fruit flies like a pie.

Semantic Understanding is Semantic Understanding is Difficult!Difficult!(more)(more)

LA Times headline

“Mission accomplished. Major combat operations in Iraq have ended”

-Pres. Bush, 2003

“not a nobody” “ain’t nobody” “yeah, right”

Page 8: Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

Knowledge Base of Facts as Triples

Subject-Verb-Object representation

Flood is a WeatherPhenomena GeoTIFF is a FileFormat Soil Type is a PhysicalProperty Pacific Ocean is a Ocean

Ocean has substance Water Sensor measures Temperature

Page 9: Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

Semantic Web Vision Web page creators place XML tags around

technical terms on web pages XML tags point to knowledge base

(namespace) where term is “defined” This vision has not caught on, in part due to

the success of free-text searching without semantic aid

Page 10: Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

Ontologies

Method to store “facts” General definition: “all that is known” Computer science definition: Machine-readable definition

of terms and how they relate to one another As with a dictionary, terms are defined in terms of other terms

Provide shared understanding of concepts Support knowledge reuse Support machine-to-machine communications with

deeper semantics than controlled vocabulary

Page 11: Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

Ontology Properties

Standard languages make it easy to extend (specialize) concepts developed by others

Synonym support (multiple terms with same meaning) Label available to indicate preferred term for each

community Homonym support (multiple meanings of

same term) Separate namespaces (President:Bush vs Plant:Bush)

Page 12: Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

Plate tectonics - before

Plate Tectonics Ontology

Page 13: Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

Atmosphere Ontology…

Atmosphere Ontology

Page 14: Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

Ontology Languages:RDF and OWL

W3C has adopted XML-based standard ontology languages

Resource Description Formulation (RDF) Ontology Web Language (OWL)

Languages predefine specific tags RDF: Class, subclass, property, subproperty, … OWL: Extends RDF to predefine further tags such as cardinality

Three flavors of OWL (Lite, DL, and Full)

Use of standard languages make it easy to extend (specialize) work of others

Page 15: Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

Semantic Web for Earth and Environmental Terminology (SWEET)

Concept space written in OWL Initial focus to assist search for data resources

Funded by NASA Later focus to serve as community standard Enables scalable classification of Earth system science

concepts

Page 16: Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

Non-LivingSubstances

LivingSubstances

PhysicalProcesses

Earth Realm

PhysicalProperties

Time

NaturalPhenomena

Human Activities

Integrative Ontologies

Space

Data

Faceted Ontologies

Units

Numerics

SWEET 1.0 Ontologies

Page 17: Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

SWEET 2.0 Ontologies: Modular Design

Mathematics Units

Electric/Magnetism

TimeSpace

RadiationTransfer

GeophysFluid Dynam

HumanActivities

ClimateChange

Troposphere

Thermo

Land Surface

Waves

Heliosphere Cryosphere

Mechanics

BasicScience/Math

SupportingGeophysicalPhenomena

PlanetaryRealms

ApplicationsAir Pollution

WaterResources

Geosphere

Ecosphere

Biogeo-chemistry

PlanetaryStructures

import

OceanUpper

Atmosphere

Geo-magnetism

PlanetaryGravity

BiogeochemCycles

Energy etc.

Chemistry

Page 18: Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

SWEET as an Upper Level Earth Science Ontology

Math Physics Chemistry

Space

TimeProperty

EarthRealmProcess, Phenomena

SubstanceData

StratosphericChemistry

Biogeochemistry Specialized domains

import

import

SWEET

Page 19: Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

SWEET Numerical Ontologies

Intervals, numeric relations (<,>) Cartesian products Functions, derivatives Fuzzy concepts

“near” Spatial concepts

0-D, 1-D, 2-D, and 3-D objects Coordinate systems Above, inside, etc.

Temporal concepts Instant, durations, geological time scales

Page 20: Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

SWEET Data Ontology Dataset characteristics

Format, data model, dimensions, … Provenance

Source, processing history, … Parameters

Scale factors, offsets, … Data services

Subsetting, reprojection, … Quality measures Special values

Missing, land, sea, ice, ...

Page 21: Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

Web Services Ontology OWL-S

Enables semantic descriptions of Web service inputs and outputs

Provides binding to actual implementation of Web service

Page 22: Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

Best Practices Keep ontologies small, modular

Be careful that “Owl:Import” imports everything Use higher level ontologies where possible

Identify hierarchy of concept spaces Model schemas Try to keep dependencies unidirectional

Gain community buy-in Involve respected leaders

Page 23: Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

PlanetOnt.org

Page 24: Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

Tec

hn

olo

gy

Geospatial semantic services

established

Geospatial semantic services proliferate

Scientific semantic assisted services

SWEET 3.0 with semantic callable

interfaces via standard programming languages

SWEET core 2.0 based on best practices decided from community

Scientific reasoning

Reasoners able to utilize

SWEET 4.0

Local processing + data exchange

Basic data tailoring services (data as

service), verification/ validation

Interoperable geospatial services

(analysis as service), explanation

Common vocabulary based

product search and access

Inte

rop

erab

le

Info

rmat

ion

In

fras

tru

ctu

re

Ass

iste

d

Dis

cove

ry &

M

edia

tio

n

Metadata-driven data fusion

(semantic service chaining), trust

Semantic agent-based integration

Semantic agent-based

searches

Geospatial reasoning, OWL-

Time

Cap

ab

ilit

yR

esu

lts

Improved Information

Sharing

Increased Collaboration & Interdisciplinary

Science

Acceleration of Knowledge Production

Revolutionizing how science is

done

RDF, OWL, OWL-S

Semantic Web Roadmap

Current Near Term Mid Term Long Term

Autonomous inference of

science results

Numerical reasoning

Semantic geospatial search & inference, access

SWEET core 1.0 based on GCMD/CF

Vo

cab

ula

ryL

ang

uag

es/

Rea

son

ing

Ou

tpu

tO

utc

om

e

Page 25: Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

Ontology Development Tools: CMAP Free, downloadable tool for knowledge

representation and ontology development

Visual language with input/export to OWL Supports subset of OWL language

http://cmap.ihmc.us/coe

Page 26: Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

Other Ontology Editors Public Domain

SWOOP Mindswap Lab (UMBC) http://www.mindswap.org/2004/SWOOP

Protégé Stanford http://protege.stanford.edu

Commercial Cerebra Construct

http://cerebra.com TopBraid Composer

http://www.topquadrant.com/topbraid/composer

Page 27: Ontologies for Earth System Science: Experiences in the Development of a Community Knowledge Base Rob Raskin NASA/Jet Propulsion Laboratory Pasadena, CA

Resources SWEET

http://sweet.jpl.nasa.gov Ontology development/sharing site

http://PlanetOnt.org Noesis (search tool)

http://noesis.itsc.uah.edu SESDI

http://sesdi.hao.ucar.edu