22
http://www.daml.org Jim Hendler Chief Scientist - Information Systems Office DARPA

Http:// Jim Hendler Chief Scientist - Information Systems Office DARPA

Embed Size (px)

Citation preview

Page 1: Http:// Jim Hendler Chief Scientist - Information Systems Office DARPA

http://www.daml.org

Jim HendlerChief Scientist - Information Systems Office

DARPA

Page 2: Http:// Jim Hendler Chief Scientist - Information Systems Office DARPA

2

DAML: Building the Semantic Web

• Creating the next revolution in Internet technology...

…by enabling “agent” communication at a Web-wide scale.

• The first revolution, the Internet, funded by DARPA, enabled disparate machines to exchange data. • The second revolution, the World Wide Web, (funded by DARPA/NSF) enabled a wide range of new applications on top of the growing Internet. The Web made a huge amount of information available, in human-readable form, allowing a revolution in new applications, environments, and b2c e-commerce.• The next revolution of the net is an “agent-enabled” resource (the “Semantic Web”) which makes a huge amount of information available in machine-readable form creating a revolution in new applications, environments, and b2b e-commerce. DARPA is funding this now!

Page 3: Http:// Jim Hendler Chief Scientist - Information Systems Office DARPA

3

The Evolving Web

Web ofKnowledge

HyperText Markup LanguageHyperText Transfer Protocol

Resource Description FrameworkeXtensible Markup Language Self-Describing Documents

Foundation of the Current Web

Proof, Logic andOntology Languages Shared terms/terminology

Machine-Machine communication

1990

2000

2010

Based on Berners-Lee, Hendler; Nature, 2001

Page 4: Http:// Jim Hendler Chief Scientist - Information Systems Office DARPA

4

What is an Ontology?

Catalog/ID

GeneralLogical

constraints

Terms/glossary

Thesauri“narrower

term”relation

Formalis-a

Frames(properties)

Informalis-a

Formalinstance

Value Restrs.

Disjointness, Inverse, part-

of…

TAXONOMY ONTOLOGY

Page 5: Http:// Jim Hendler Chief Scientist - Information Systems Office DARPA

5

TSE Ontology

<DEF-RELATION NAME="hasEconomy"> <DEF-ARG POS=1 TYPE="Location"> <DEF-ARG POS=2 TYPE=".STRING"></DEF-RELATION>

<DEF-RELATION NAME="hasSurveillanceBy"> <DEF-ARG POS=1 TYPE="Location"> <DEF-ARG POS=2 TYPE="Organization"></DEF-RELATION>

<DEF-RELATION NAME="surveillanceLevel"> <DEF-ARG POS=1 TYPE="Location"> <DEF-ARG POS=2 TYPE=".NUMBER"></DEF-RELATION>

<!-- Disease_Agent definitions --><DEF-CATEGORY NAME="Disease_Agent" ISA="base.SHOEEntity"><DEF-CATEGORY NAME="TSE" ISA="Disease_Agent"><DEF-CATEGORY NAME="CJD" ISA="TSE"><DEF-CATEGORY NAME="Genetic_CJD" ISA="CJD"><DEF-CATEGORY NAME="Spontaneous_CJD" ISA="CJD"><DEF-CATEGORY NAME="Iatrogenic_CJD" ISA="CJD"><DEF-CATEGORY NAME="GSS" ISA="TSE"><DEF-CATEGORY NAME="FFI" ISA="TSE"><DEF-CATEGORY NAME="Kuru" ISA="TSE"><DEF-CATEGORY NAME="BSE" ISA="TSE"><DEF-CATEGORY NAME="FSE" ISA="BSE"><DEF-CATEGORY NAME="NV-CJD" ISA="BSE"><DEF-CATEGORY NAME="Scrapie" ISA="TSE"><DEF-CATEGORY NAME="TME" ISA="TSE"><DEF-CATEGORY NAME="CWD" ISA="TSE">

Page 6: Http:// Jim Hendler Chief Scientist - Information Systems Office DARPA

6

How do we attack this problem?

The key enabler of current interoperability in both military and commercial systems is the “HyperText Mark-up Language” (HTML) Allows a machine readable, formal language, to be expressed on web

pages for the presentation of data limited set of tags

not useful for machine search

<Title> How do we attack this problem? </title>

Page 7: Http:// Jim Hendler Chief Scientist - Information Systems Office DARPA

7

Beyond HTML: adding syntax

Current languages attack this by adding syntactic data handling abilities XML (eXtensible Markup Language)

Extensible keyword set Solves syntactic inequalities between data formats

DB 1 -> ADDRESS <- DB 2 Useful for Data Sharing

Not search

<!Element TITLE-BLOCK EMPTY>

<!ATTLIST TITLE-BLOCK Title #cdata required Subtitle #cdata implied>

<Title-Block>

<title> Beyond HTML <title>

<subtitle> adding syntax </subtitle>

</title-block>

Page 8: Http:// Jim Hendler Chief Scientist - Information Systems Office DARPA

8

Semantic Problems for XML

Synonymy and polysemy* <PERSON> vs. <INDIVIDUAL>* is <SPIDER> an arachnid or software?

Structural differences

<PERSON> <NAME>John Smith</NAME></PERSON>

<PERSON> <NAME><FNAME>John</FNAME><LNAME>Smith</LNAME></NAME></PERSON>

<PERSON NAME=“John Smith”>

Page 9: Http:// Jim Hendler Chief Scientist - Information Systems Office DARPA

9

Beyond XML:Agent Semantics

DARPA will lead the way with the development of Agent markup Language (DAML) A “semantic” language that ties the information on a page to

machine readable semantics (ontology)Currently being fielded at University level

SHOE (Maryland), Ontobroker(Karlsruhe),OWL(Washington Univ) Largely grows from past DARPA programs (I3, ARPI)

But not transitioning Even with SW activity, W3C membership focused on short-term gain:HTML/XML

<Title> Beyond XML

<subtitle> agent semantics </subtitle> </title>

<USE-ONTOLOGY ID=”PPT-ontology" VERSION="1.0" PREFIX=”PP" URL= "http://iwp.darpa.mil/ppt..html">

<CATEGORY NAME=”pp.presentation” FOR="http://iwp.darpa.mil/jhendler/agents.html">

<RELATION-VALUE POS1 = “Agents” POS2 = “/jhendler”>

<ONTOLOGY ID=”powerpoint-ontology" VERSION="1.0" DESCRIPTION=”formal model for powerpoint presentations">

<DEF-CATEGORY NAME=”Title" ISA=”Pres-Feature" > <DEF-CATEGORY NAME=”Subtitle" ISA=”Pres-Feature" >

<DEF-RELATION NAME=”title-of" SHORT="was written by"> <DEF-ARG POS=1 TYPE=”presentation"> <DEF-ARG POS=2 TYPE=”presenter" >

Page 10: Http:// Jim Hendler Chief Scientist - Information Systems Office DARPA

10

A distributed ontological representation

Small communities define common semantics Technical Vocabularies abound

Mission specific Technical jargons Shared values

Larger communities form around shared terms Mapping and “articulation” become crucial

Interoperability at web languages level Top-Down (AIA defines critical aircraft properties) or bottom up (Oh, a “foxbat” is a Mig29) Business case for improving communication!

Page 11: Http:// Jim Hendler Chief Scientist - Information Systems Office DARPA

11

uses

uses

uses

uses

uses

uses

uses

uses

This leads to a radically new view of ontologies!

BEFORE

* Monolithic

* Fully Shared

* Necessarily

Consistent

* Difficult to change

Before

* monolithic

* consistent

* fully shared

* logic-based

uses

uses

uses

uses

usesuses

uses

uses

AFTERAFTER

Distributed,partially mapped, inconsistent -- but very flexible!

Page 12: Http:// Jim Hendler Chief Scientist - Information Systems Office DARPA

12

A new/old model of DoD partnering

Semantic Web Res. (EU)

W3CDAML

www.semanticweb.org

www.daml.orgwww.w3.org/RDF/

C2 linkRDF

XML

RDF-S

DAML-ONT

DAML-LOGIC

US/EU Joint Efforts

(S. Decker, Coord)

Horus

Research efforts:SHOEOILEC OntoWeb

Intl Workshops

Tools

Lang Spec

Demos

Ctr for Army Lessons Learned

EU W3CMembers/directors

(Dan Brickley, coord)

DARPA: Funds a new

generation of www technology

Works closely with W3C to create a web standard

Works closely with EU on international acceptance

Brings DoD users (J2,J3,J6) in as early adopters

Page 13: Http:// Jim Hendler Chief Scientist - Information Systems Office DARPA

13

DoD and W3CWorking Together

DAML is being built on existing web “standards”, by many of the same people who developed them

PI Team includes MIT researchers who also head up World Wide Web Consortium:

PI/Co-PI: Tim Berners-Lee, Ralph Swick, Dan Connolly

XML Existing W3C Recommendation

RDF Existing W3C Recommendation

RDF-Schema-extensions

DAML-OntologyDAML-LogicDAMLDAML

Page 14: Http:// Jim Hendler Chief Scientist - Information Systems Office DARPA

14

Horus transitioning to use of DAML languageEmerging DAML tools tested in Intelink environment via DAML-Horus TIEsMature DAML tools integrated into Horus Toolkit Horus Toolkit applied in Horus Evaluation Site & Horus-enabled user sitesExperiments with Intelink Users

DAML – HorusIntelink Transition

Objective

Expected ResultsMethodology

Simpler, More Accurate and Focused Searches

Search on Information Organized by Content Capability to Follow Threads of Logic Help Independent Developers Tie Into

Common Framework

Changing the way we produce intelligence

More linking, less traditional data entry

Lots of grabbing Horus entities and

dropping into products

Software performs technical markup detail

Page 15: Http:// Jim Hendler Chief Scientist - Information Systems Office DARPA

15

DAML Status

DAML+OIL ontology language released on World Wide Web Annotated “walkthrough” Examples Full definition

RDFS = Resource Description Framework Schema Provides 100% mapping to XML

Open discussion group run by W3C: [email protected] Denotational (and axiomatic) Semantics published

First formal semantics for a web language Proposal to W3C for standardization ongoing

Page 16: Http:// Jim Hendler Chief Scientist - Information Systems Office DARPA

16

DAML Examples

“Transparent” MarkupContent-Based SearchService advertising

Reading “DAML”! Using “DAML”!

Writing “DAML”!

Page 17: Http:// Jim Hendler Chief Scientist - Information Systems Office DARPA

17

www.daml.org

Language SpecificationsDAML Newsletter (you can subscribe)Collection of web toolsOntology library

134 ontologies as of May 15, 01

DAML crawler over 14,000 pages w/2,000,000+ DAML statements, 5/15/01

Web tracking software used for baselining DAML use Over 500,000 hits in first 6 months

Page 18: Http:// Jim Hendler Chief Scientist - Information Systems Office DARPA

18

DAMLNotional Schedule

Now

Later

Page 19: Http:// Jim Hendler Chief Scientist - Information Systems Office DARPA

19

Web “travel” agents

Query processed: A satellite image taken yesterday at 10 AM is available on the web at http://…

A new satellite image, to be taken today at 10AM, will be available for $100 — click here to authorize transfer of funds and obtain image (you will need a valid credit card number from one of the following providers: …)

In an emergency situation, a Coast Guard observer plane can be sent to any location within the area you indicate. Service Note: You will be responsible for cost of flight if the situation does not result in emergency pickup. Click Here for more information.

A high altitude observer can be sent to your location in 13 hours. Click here to initiate procedure. (You will need to provide US military authorization, A valid military unit code, and the name of commanding officer)

A service entitled “commercial service for providing satelite images” is advertised as becoming available in 2004. See http://… for more information

Page 20: Http:// Jim Hendler Chief Scientist - Information Systems Office DARPA

20

Service Descriptions

Page 21: Http:// Jim Hendler Chief Scientist - Information Systems Office DARPA

21

Web Logics

Page 22: Http:// Jim Hendler Chief Scientist - Information Systems Office DARPA

22

Conclusions

The Semantic Web is coming! Joint development between DARPA/EU/and W3C communities Languages and tools are available to play with

Http://www.daml.org/

W3C interest group available for those wishing to join the discussion [email protected] (live or archived)

Ongoing DoD and commercial projectsCome join us

Submit ontologies/marked up pages Develop tools or help test ours

Get in on the next big thing early!