44
13.12.2005 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe, Germany)

13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

Embed Size (px)

Citation preview

Page 1: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 1

Intelligent Systems

Lecture 24

Ontologies. Semantic WEB

(based on presentation of Forschungszentrum Informatik at the University of Karlsruhe, Germany)

Page 2: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 2

Definitions

• An ontology is a specification of a conceptualization

• An ontology is a description (like a formal specification of a program) of the concepts and relationships that can exist for an agent or a community of agents.

• Purpose of enabling knowledge sharing and reuse. In that context, an ontology is a specification used for making ontological commitments.

Page 3: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 3

Definitions (2)• The subject of ontology is the study of the categories of

things that exist or may exist in some domain.• The product of such a study, called an ontology, is a

catalog of the types of things that are assumed to exist in a domain of interest D from the perspective of a person who uses a language L for the purpose of talking about D.

• The types in the ontology represent the predicates, word senses, or concept and relation types of the language L when used to discuss topics in the domain D. An uninterpreted logic, such as predicate calculus, conceptual graphs, or KIF, is ontologically neutral. It imposes no constraints on the subject matter or the way the subject may be characterized. By itself, logic says nothing about anything, but the combination of logic with an ontology provides a language that can express relationships about the entities in the domain of interest.

Page 4: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 4

Definitions (2)• An informal ontology may be specified by a catalog of types

that are either undefined or defined only by statements in a natural language. A formal ontology is specified by a collection of names for concept and relation types organized in a partial ordering by the type-subtype relation. Formal ontologies are further distinguished by the way the subtypes are distinguished from their supertypes: an axiomatized ontology distinguishes subtypes by axioms and definitions stated in a formal language, such as logic or some computer-oriented notation that can be translated to logic; a prototype-based ontology distinguishes subtypes by a comparison with a typical member or prototype for each subtype. Large ontologies often use a mixture of definitional methods: formal axioms and definitions are used for the terms in mathematics, physics, and engineering; and prototypes are used for plants, animals, and common household items

Page 5: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 5

Agenda13.12.2005

• Introduction & Motivation

• Introduction - Semantic Web

• Semantic Web Applications

• Semantic Web Technology

• Next Steps

Page 6: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 6

KnowledgeManagement

Ontologies andMetadata

MachineLearning

Web Technologies

and Standards

E-Learning Data, TextWeb Mining

InformationExtraction

Web PortalsSearch engines

Metadata-drivenApplications

BasicTechnologies

ApplicationFields

Application fields and technologies

Page 7: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 7

Motivation

• WWW is a success, measured in

– the number of users

– the number of available documents

• Goal-driven access to information is problematic, because Web content has to be interpreted, combined and processed by humans.

• We are currently on the way to a next generation Web, building on the existing WWW - the Semantic Web which will make contents also for machines accessible and interpretable !

Page 8: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 8

Arpanet Internet/WWW Semantic Web

1965 1985 20001975 1995

PacketsObjects

Concepts

2005

...

On the Way to a Global Information Structure

Page 9: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 9

Agenda13.12.2005

• Introduction & Motivation

• Introduction - Semantic Web

• Semantic Web Applications

• Semantic Web Technology

• Next Steps

Page 10: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 10

“Information Management: A Proposal“,Tim Berners-Lee, CERN,

1989

The Origin of the WWW

Page 11: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 11

Semantic Web – Bringing the Web to its Full Potential

HTML mit Hyperlinks

Relational

Metadata

URI-SHA

URI-STEFAND

URI-DAMLPROJWORKS-IN

COOPERATES-WITH

WORKS-IN

„ Darpa Agent Markup Language“

PROJECT

RESEARCHER

PERSON

subClassOf

rangedomain

Ontology

TOP

COOPERATESWITH

WORKS-IN

NAME domaindomain

range

NAME

subClassOf

subClassOf

SYMMETRIC

Page 12: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 12

Ontologies

• In its classical sense ontology is a philosophical discipline.

• In Computer Science: Formal specification of a domain of interest in the form of a concept system

• Targets:

– Shared understanding of a domain of interest

– Formal description of the meaning of terms and relations

– Machine executable (e.g. query for all relations of the concept „HOTEL“)

Page 13: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 13

• Metadata are „data about data“, e.g.– Library classification systems– The Yahoo! Categorization– Microsoft Office Document Properties

• Metadata in the Semantic Web is complex structured (based on predefined ontologies):

Relational Metadata

http://www.w3c.org/http://www.w3c.org

http://www.w3.org/Home/Lassilahttp://www.w3.org/ Lassila

Ora LassilaOra Lassila

s:Name

s.Organizations.

s:Personrdf:type

rdf:type

s:email

[email protected]

s:worksAt

s:email

s:Name

Page 14: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 14

Agenda13.12.2005

• Introduction & Motivation

• Introduction - Semantic Web

• Semantic Web Applications

– Skills & Human Resources

– Semantic Intranet Portals

– Interoperability in Tourism

– Web Services

– Virtual Museum

• Semantic Web Technology

• Next Steps

Page 15: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 15

Ontologies/Metadata in Human Resources

• Usage of skill ontologies:

– Automatic extraction of skills (from applications)

– Semantic Ranking

– Competency Analysis via

Data Mining

– Relation to E-Learning with

skills

Page 16: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 16

2. Digitalization &Text Generation via OCR

3. Automatic Skill Extraction using Shallow Parsing

3. Intranet – EmployeeMarketplace

1. Paper-basedApplication

Ontologies/Metadata in Human Resources

Page 17: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 17

Automatic Generation of Metadata

Via OCR from written documents extracted

Predefined skill ontologywith metadata and lexicon

Page 18: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 18

Semantic-Driven Intranet Portal (I)

• Requirements:• Develop domain-specific terminology for topics• Automatically generate Yahoo-like structure for this

terminology• Allow to add further, complex structured information

to the terminology• Techniques:

• Ontology Engineering• Discovering of Web Documents via Focused

Crawling• Automatic Classification of Documents into Ontology• Cooperative Metadata Engineering

Page 19: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 19

Semantic-Driven Intranet Portal (II)

Human ResourceStrategy:

- Define relevant topics in the form of an ontology

- Cooperatively add further information in the form of metadata!

Semantic Portals for HR strategy

- Search relevant Web resources

Page 20: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 20

Virtual Museum (I)

Page 21: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 21

Virtual Museum (II)

Page 22: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 22

News Services - Content Syndication with RSS (I)13.12.2005

NEWSARE FREE!

Page 23: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 23

News Services - RDF Site Summary RSS (II)

Page 24: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 24

Content Services - OntoWeb Community Portal

OntoWeb Community

AnnotatedWeb PagesGenerated

Content Objects

Participating Site2

{ }

Participating Siten

{ }

Participating Site1

{ }

...

OntologyBrowse & Query

Front End

ContentSyndication

Service

http://www.ontoweb.org

Page 25: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 25

General Web Services13.12.2005

• Web services

– perform functions, which can be anything from simple requests to complicated business processes!

– will transform the Web from a collection of information to a distributed device of computation

• Web services clearly require

a semantic-driven description!

=> Semantic Web Enabled

Web Services

Page 26: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 26

HARMONISE – Interoperability in

Tourism • The tourism industry is

essentially an information business where data interoperability is necessary to create dynamic markets and cooperation.

• Build bridges between different tourism marketplaces via Semantic Web technologies MAPPING

DISCOVERY!• An ontology will mediate

between the different underlying representations.

Page 27: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 27

Agenda13.12.2005

• Introduction & Motivation

• Introduction - Semantic Web

• Semantic Web Applications

• Semantic Web Technology

– A Layered Approach

– RDF(S)

– KAON Open Source Infrastructure

• Next Steps

Page 28: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 28

The Semantic Web As By its Inventor

Page 29: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 29

XML and its relation to the Semantic Web

• „XML only provides an alphabet, not a vocabulary“. [Forrester Report, December 2001]

• The languages french and english use the same alphabet.

=> Can all french people communicate with english people?

• Adopted to the WWW: – XML provides an alphabet and further important means for

validation and modularization! – XML does not offer any possibilities to transport conceptual

content!

Page 30: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 30

• RDF: Standard for metadata representation– Basis for interoperability in applications– Cost effective development of tools and

applications– Basis for very different users: Digital libraries,

content rating, B2B, etc.

• RDF-Schema: Definition of simple ontologies in the WWW.

• W3C Recommendation RDF is used by different software companies and standardization organisations

RDF – Data Model for the Semantic Web

Page 31: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 31

• Not the subatomic particle ...KArlsruhe Ontology

• Based on RDF(S), with several extensions, e.g. for typed, multilingual lexical expressions

• Component-based, easily extendable application framework

• Open-Source Tool Suite, supporting

KAON – A RDF-based Software Infrastructure

Page 32: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 32

KAON Architecture

RDFFiles

P2PRelationalDatabase

RelationalDatabase

NLP Service

Ap

pli

cati

on

s

& S

erv

ice

s

Web Application Framework

HTMLBrowser

Ontology and Metadata Editing

Reverse Engineering

SYNDICATION

KAON PortalPortal Maker

OntoMat App Framework

Focused Crawler

Text Mining Evolution

Legacy Portals

KAON-API

RDF-APIK-EdutellaWrapper

KAON-Server

J2EEMid

dle

war

e NLP-API

QEL-Wrapper

NLP-API

Reasoning Service

DOC-API

Doc-Manag. Service

Dat

aA

nd

Rem

ote

Ser

vice

s

Page 33: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 33

Ontology Engineering Plugin - SOEP

• …

Page 34: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 34

Database Reverse Engineering Plugin - REVERSE

• …

Page 35: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 35

Text Extraction Plugin

• …

Page 36: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 36

Focused Document/Metadata Crawling Plugin

• …

Page 37: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 37

Further Plugins 13.12.2005

• Automatic Ontology Extraction Component - TextToOnto

• Ontology-based Document Clustering

• Hierarchical Text classification – Automatic Yahoo generation

• View definition component

• Peer-2-Peer-based document annotation and authoring

(for HTML, PDF, JPEG, GIF)

• Graphical Query Interface based on QEL

• SVG-based visualization

• Versioning component

Page 38: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 38

KAON Portal13.12.2005

• KAON Portal is a set of tools supporting ontology-based web site management

• It supports web-based presentation of information for users (generated and extracted by other components)

• It also provides means for defining information (cooperatively!)

Page 39: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 39

Rapid Prototyping a Semantic Portal

KAON EngineeringFrontend

KAON User RapidPrototypFrontend

KAON Backend

KAON Server

Page 40: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 40

Finally: What is behind ? KAON Server!

• Middleware connects applications with data and network services

• Generic API’s for

– Access to ontologies and metadata

– Access to documents

– Access to language processing tools

– P2P Access

• API‘s are implemented, e.g. by Stanfords RDF-API, by J2EE complient implementation,etc.

ConnectorsJXTA HTTP, IIOP

WebDAVJava API

Security

Authorization AuthenticationEncryption Auditing

Management

Transaction ReplicationNaming Services Storage

Data Access

Query UpdateValidatation Inferencing

External Services

TP Monitors DatabasesInference Engines

Page 41: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 41

Summarization KAON• KAON is basis for approx. 10

research and industry projects. It is also used by external projects all over the world.

• Open Source Community is growing, currently 35 persons.

• KAON is basis for building knowledge-intensive and semantics-based applications.

Page 42: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 42

Agenda13.12.2005

• Introduction & Motivation

• Introduction - Semantic Web

• Semantic Web Applications

• Semantic Web Technology

• Next Steps

Page 43: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 43

Conclusion

• We are on the way to a global information structure, being

based on the World Wide Web and it‘s successor Semantic

Web

• The main vision is: Support machine-processable and

interpretable data to provide a higher degree of automatization

(e.g. Web Services, Query Answering, etc.)

• Standards and tools for ontologies and metadata are ready to

use!

Page 44: 13.12.2005 1 Intelligent Systems Lecture 24 Ontologies. Semantic WEB (based on presentation of Forschungszentrum Informatik at the University of Karlsruhe,

13.12.2005 13.12.2005 44

Next Steps

• Semantic Web technology should be the basis for the

Agricultural Ontology Service (AOS)

• KAON already provides ready-to-run, open-source tools on

which the specific AOS functionalities may be built!

• Rapid prototyping approach is promising:

Convert AGROVOC in RDF(S), connect it with existing data

sources and present the information in the Web browser!