55
Semantic Web 6. Linked Open Data Gerd Gr¨ oner, Matthias Thimm {groener,thimm}@uni-koblenz.de Institute for Web Science and Technologies (WeST) University of Koblenz-Landau July 12, 2013 Gerd Gr¨ oner, Matthias Thimm Semantic Web 1 / 37

Semantic Web - 6. Linked Open Data - WeST – Uni … Web 6. Linked Open Data Gerd Gr oner, Matthias Thimm fgroener,[email protected] Institute for Web Science and Technologies

  • Upload
    vanthuy

  • View
    214

  • Download
    0

Embed Size (px)

Citation preview

Semantic Web6. Linked Open Data

Gerd Groner, Matthias Thimm

{groener,thimm}@uni-koblenz.de

Institute for Web Science and Technologies (WeST)University of Koblenz-Landau

July 12, 2013

Gerd Groner, Matthias Thimm Semantic Web 1 / 37

RDF, RDFS, OWL vocabularies

I RDFI rdf:StatementI rdf:typeI rdf:subject, rdf:predicate, rdf:objectI . . .

I RDFSI rdfs:ResourceI rdfs:ClassI rdfs:subClassOfI . . .

I OWLI owl:disjointOfI owl:sameAsI owl:allValuesFromI . . .

What is missing?

Gerd Groner, Matthias Thimm Semantic Web 2 / 37

RDF, RDFS, OWL vocabularies

I RDFI rdf:StatementI rdf:typeI rdf:subject, rdf:predicate, rdf:objectI . . .

I RDFSI rdfs:ResourceI rdfs:ClassI rdfs:subClassOfI . . .

I OWLI owl:disjointOfI owl:sameAsI owl:allValuesFromI . . .

What is missing?

Gerd Groner, Matthias Thimm Semantic Web 2 / 37

RDF, RDFS, OWL vocabularies

I RDFI rdf:StatementI rdf:typeI rdf:subject, rdf:predicate, rdf:objectI . . .

I RDFSI rdfs:ResourceI rdfs:ClassI rdfs:subClassOfI . . .

I OWLI owl:disjointOfI owl:sameAsI owl:allValuesFromI . . .

What is missing?

Gerd Groner, Matthias Thimm Semantic Web 2 / 37

RDF, RDFS, OWL vocabularies

I RDFI rdf:StatementI rdf:typeI rdf:subject, rdf:predicate, rdf:objectI . . .

I RDFSI rdfs:ResourceI rdfs:ClassI rdfs:subClassOfI . . .

I OWLI owl:disjointOfI owl:sameAsI owl:allValuesFromI . . .

What is missing?

Gerd Groner, Matthias Thimm Semantic Web 2 / 37

Linked Open Data

→ everyone has to use it! Properly!

Gerd Groner, Matthias Thimm Semantic Web 3 / 37

Linked Open Data

I RDF, RDFS, OWL can be used to describe a local isolateddata set

I linking resources to other data sets openly enables linked opendata

Gerd Groner, Matthias Thimm Semantic Web 4 / 37

Linked Open Data

I RDF encourages to use URIs to denote things

I Linked Open Data demands it

Tim Berners-Lee:“Don’t say “colour” say <http://example.com/2002/std6#col>”

Gerd Groner, Matthias Thimm Semantic Web 5 / 37

Linked Open Data

I RDF encourages to use URIs to denote things

I Linked Open Data demands it

Tim Berners-Lee:“Don’t say “colour” say <http://example.com/2002/std6#col>”

Gerd Groner, Matthias Thimm Semantic Web 5 / 37

Linked Open Data

I RDF encourages to use URIs to denote things

I Linked Open Data demands it

Tim Berners-Lee:“Don’t say “colour” say <http://example.com/2002/std6#col>”

Gerd Groner, Matthias Thimm Semantic Web 5 / 37

Outline

1 Linked Open Data Principles

2 Some examples

3 Applications

4 Summary and Exercises

Gerd Groner, Matthias Thimm Semantic Web 6 / 37

Outline

1 Linked Open Data Principles

2 Some examples

3 Applications

4 Summary and Exercises

Gerd Groner, Matthias Thimm Semantic Web 7 / 37

Linked Open Data Principles

I Linked Open Data PrinciplesI use URIs as names for thingsI use HTTP URIs so those names can be dereferencedI return useful information upon lookup of those URIs (e.g.

RDF)I include links by using URIs that dereference to remote

documents

I Linked Open Data SchemeI Publish under an open licenseI Publish structured dataI Use non-proprietary formatsI Use URIs to identify thingsI Link you data to other data

Gerd Groner, Matthias Thimm Semantic Web 8 / 37

Linked Open Data Principles

I Linked Open Data PrinciplesI use URIs as names for thingsI use HTTP URIs so those names can be dereferencedI return useful information upon lookup of those URIs (e.g.

RDF)I include links by using URIs that dereference to remote

documents

I Linked Open Data SchemeI Publish under an open licenseI Publish structured dataI Use non-proprietary formatsI Use URIs to identify thingsI Link you data to other data

Gerd Groner, Matthias Thimm Semantic Web 8 / 37

Five stars of Linked Open Data conformance

5stardata.info

Gerd Groner, Matthias Thimm Semantic Web 9 / 37

Publish under an open license

As a consumer

I You can store it locally (on your hard drive or on an USBstick).

I You can enter the data into any other system.

I You can change the data as you wish.

I You can share the data with anyone you like.

As a publisher

I It’s simple to publish.

I You do not have explain repeatedly to others that they canuse your data.

5stardata.info

Gerd Groner, Matthias Thimm Semantic Web 10 / 37

Publish structured data

As a consumer

I You can directly process it with proprietary software toaggregate it, perform calculations, visualise it, etc.

I You can export it into another (structured) format.

As a publisher

I It’s still simple to publish.

5stardata.info

Gerd Groner, Matthias Thimm Semantic Web 11 / 37

Use non-propriety formats

As a consumer

I You can manipulate the data in any way you like, withoutbeing confined by the capabilities of any particular software.

As a publisher

I You might need converters or plug-ins to export the data fromthe proprietary format.

I It’s still rather simple to publish

5stardata.info

Gerd Groner, Matthias Thimm Semantic Web 12 / 37

Use URIs to identify things

As a consumer

I You can link to it from any other place (on the Web orlocally).

I You may be able to reuse existing tools and libraries, even ifthey only understand parts of the pattern the publisher used.

I Understanding the structure of an RDF Graph of data can bemore effort

I You can combine the data safely with other data.

As a publisher

I Other data publishers can now link into your data

I You typically invest some time slicing and dicing your data.

I You’ll need to assign URIs to data items and think about howto represent the data.

5stardata.info

Gerd Groner, Matthias Thimm Semantic Web 13 / 37

Link your data to other data

As a consumerI You can discover more data while consuming the data.I You can directly learn about the data schema.I You now have to deal with broken data links.I Presenting data from an arbitrary link as fact is as risky as

letting people include content from any website in your pages.Caution, trust and common sense are all still necessary.

As a publisherI You make your data discoverable.I You increase the value of your data.I You own organisation will gain the same benefits from the

links as the consumers.I You’ll need to invest resources to link your data to other data

on the Web.I You may need to repair broken or incorrect links.

5stardata.info

Gerd Groner, Matthias Thimm Semantic Web 14 / 37

Outline

1 Linked Open Data Principles

2 Some examples

3 Applications

4 Summary and Exercises

Gerd Groner, Matthias Thimm Semantic Web 15 / 37

DBpedia 1/2

I “DBpedia is a crowd-sourced community effort toextract structured information from Wikipedia andmake this information available on the Web.”

I “Semantification” of Wikipedia:I 764,000 personsI 573,000 placesI 333,000 creative worksI 192,000 organizationsI 202,000 speciesI 5,500 diseases.

Gerd Groner, Matthias Thimm Semantic Web 16 / 37

DBpedia 2/2

Excerpt from the DBpedia ontology:

I owl:ThingI dbpedia:Agent

I dbpedia:FamilyI dbpedia:OrganisationI dbpedia:Person

I dbpedia:EventI dbpedia:ConventionI dbpedia:FilmFestivalI dbpedia:SportsEvent

I dbpedia:LanguageI dbpedia:NameI dbpedia:Place

I dbpedia:MonumentI dbpedia:PopulatedPlaceI dbpedia:WineRegion

Gerd Groner, Matthias Thimm Semantic Web 17 / 37

FOAF

I FOAF=Friend of a FriendI Vocabulary for describing people, links between them, things

they create and do

Gerd Groner, Matthias Thimm Semantic Web 18 / 37

Dublin Core

I Vocabulary for describing the creation and publishing ofgeneral works

I Namespace:xmlns:dc="http://purl.org/dc/elements/1.1"

I ExamplesI dc:creatorI dc:titleI dc:rightsI dc:publisherI dc:dateI dc:descriptionI dc:sourceI dc:language

Gerd Groner, Matthias Thimm Semantic Web 19 / 37

Dublin Core

I Vocabulary for describing the creation and publishing ofgeneral works

I Namespace:xmlns:dc="http://purl.org/dc/elements/1.1"

I ExamplesI dc:creatorI dc:titleI dc:rightsI dc:publisherI dc:dateI dc:descriptionI dc:sourceI dc:language

Gerd Groner, Matthias Thimm Semantic Web 19 / 37

Dublin Core

I Vocabulary for describing the creation and publishing ofgeneral works

I Namespace:xmlns:dc="http://purl.org/dc/elements/1.1"

I ExamplesI dc:creatorI dc:titleI dc:rightsI dc:publisherI dc:dateI dc:descriptionI dc:sourceI dc:language

Gerd Groner, Matthias Thimm Semantic Web 19 / 37

Dublin Core

I Vocabulary for describing the creation and publishing ofgeneral works

I Namespace:xmlns:dc="http://purl.org/dc/elements/1.1"

I ExamplesI dc:creatorI dc:titleI dc:rightsI dc:publisherI dc:dateI dc:descriptionI dc:sourceI dc:language

Gerd Groner, Matthias Thimm Semantic Web 19 / 37

Dublin Core

I Vocabulary for describing the creation and publishing ofgeneral works

I Namespace:xmlns:dc="http://purl.org/dc/elements/1.1"

I ExamplesI dc:creatorI dc:titleI dc:rightsI dc:publisherI dc:dateI dc:descriptionI dc:sourceI dc:language

Gerd Groner, Matthias Thimm Semantic Web 19 / 37

Dublin Core

I Vocabulary for describing the creation and publishing ofgeneral works

I Namespace:xmlns:dc="http://purl.org/dc/elements/1.1"

I ExamplesI dc:creatorI dc:titleI dc:rightsI dc:publisherI dc:dateI dc:descriptionI dc:sourceI dc:language

Gerd Groner, Matthias Thimm Semantic Web 19 / 37

Dublin Core

I Vocabulary for describing the creation and publishing ofgeneral works

I Namespace:xmlns:dc="http://purl.org/dc/elements/1.1"

I ExamplesI dc:creatorI dc:titleI dc:rightsI dc:publisherI dc:dateI dc:descriptionI dc:sourceI dc:language

Gerd Groner, Matthias Thimm Semantic Web 19 / 37

Dublin Core

I Vocabulary for describing the creation and publishing ofgeneral works

I Namespace:xmlns:dc="http://purl.org/dc/elements/1.1"

I ExamplesI dc:creatorI dc:titleI dc:rightsI dc:publisherI dc:dateI dc:descriptionI dc:sourceI dc:language

Gerd Groner, Matthias Thimm Semantic Web 19 / 37

Dublin Core

I Vocabulary for describing the creation and publishing ofgeneral works

I Namespace:xmlns:dc="http://purl.org/dc/elements/1.1"

I ExamplesI dc:creatorI dc:titleI dc:rightsI dc:publisherI dc:dateI dc:descriptionI dc:sourceI dc:language

Gerd Groner, Matthias Thimm Semantic Web 19 / 37

Outline

1 Linked Open Data Principles

2 Some examples

3 Applications

4 Summary and Exercises

Gerd Groner, Matthias Thimm Semantic Web 20 / 37

Applications

I What can I do with this?

Gerd Groner, Matthias Thimm Semantic Web 21 / 37

Use case example: SchemEx

courtesy of Ansgar Scherp

Gerd Groner, Matthias Thimm Semantic Web 22 / 37

Use case example: SchemEx (cont’d)

Gerd Groner, Matthias Thimm Semantic Web 23 / 37

Use case example: SchemEx (cont’d)

Naive Approach:

I Download the entire LOD cloud

I Put it into a (really) large triple store

I Process the data and extract schema

I Provide lookup

Drawbacks:

I Big machinery

I Late in processing the data

I High effort to scale with LOD cloud

Can we do it smarter?

Gerd Groner, Matthias Thimm Semantic Web 24 / 37

Use case example: SchemEx (cont’d)

Naive Approach:

I Download the entire LOD cloud

I Put it into a (really) large triple store

I Process the data and extract schema

I Provide lookup

Drawbacks:

I Big machinery

I Late in processing the data

I High effort to scale with LOD cloud

Can we do it smarter?

Gerd Groner, Matthias Thimm Semantic Web 24 / 37

Use case example: SchemEx (cont’d)

Naive Approach:

I Download the entire LOD cloud

I Put it into a (really) large triple store

I Process the data and extract schema

I Provide lookup

Drawbacks:

I Big machinery

I Late in processing the data

I High effort to scale with LOD cloud

Can we do it smarter?

Gerd Groner, Matthias Thimm Semantic Web 24 / 37

Use case example: SchemEx (cont’d)

Idea:I Schema-level index

I Define families of graph patternsI Assign instances to graph patternsI Map graph patterns to context (source URI)

I ConstructionI Stream-based for scalabilityI Little loss of accuracy

I NoteI Index defined over instancesI But stores the context

Gerd Groner, Matthias Thimm Semantic Web 25 / 37

Use case example: SchemEx (cont’d)

Idea:I Schema-level index

I Define families of graph patternsI Assign instances to graph patternsI Map graph patterns to context (source URI)

I ConstructionI Stream-based for scalabilityI Little loss of accuracy

I NoteI Index defined over instancesI But stores the context

Gerd Groner, Matthias Thimm Semantic Web 25 / 37

Use case example: SchemEx (cont’d)

Idea:I Schema-level index

I Define families of graph patternsI Assign instances to graph patternsI Map graph patterns to context (source URI)

I ConstructionI Stream-based for scalabilityI Little loss of accuracy

I NoteI Index defined over instancesI But stores the context

Gerd Groner, Matthias Thimm Semantic Web 25 / 37

Use case example: SchemEx (cont’d)

Gerd Groner, Matthias Thimm Semantic Web 26 / 37

Use case example: SchemEx (cont’d)

Gerd Groner, Matthias Thimm Semantic Web 27 / 37

Use case example: SchemEx (cont’d)

Gerd Groner, Matthias Thimm Semantic Web 28 / 37

Use case example: SchemEx (cont’d)

Gerd Groner, Matthias Thimm Semantic Web 29 / 37

Use case example: SchemEx (cont’d)

Gerd Groner, Matthias Thimm Semantic Web 30 / 37

Use case example: SchemEx (cont’d)

Gerd Groner, Matthias Thimm Semantic Web 31 / 37

Use case example: SchemEx (cont’d)

Gerd Groner, Matthias Thimm Semantic Web 32 / 37

Use case example: SchemEx (cont’d)

Gerd Groner, Matthias Thimm Semantic Web 33 / 37

Outline

1 Linked Open Data Principles

2 Some examples

3 Applications

4 Summary and Exercises

Gerd Groner, Matthias Thimm Semantic Web 34 / 37

Summary

I Linked Open Data as the application area of RDF, RDFS,OWL

I Linked Open Data SchemeI Publish under an open licenseI Publish structured dataI Use non-proprietary formatsI Use URIs to identify thingsI Link you data to other data

I Examples: DBpedia, FOAF, Dublin Core

I Applications: SchemEx

Gerd Groner, Matthias Thimm Semantic Web 35 / 37

Pointers to further reading

I 5stardata.info The five stars of Linked Open DataConformance

I A. Hogan, J. Umbrich, A. Hart, R. Cyganiak, A. Polleres, S.Decker. An empirical survey of Linked Data conformance. InJournal of Web Semantics, 14:14–44, 2012.

I M. Konrath, T. Gottron, S. Staab and A. Scherp: SchemEX ?Efficient Construction of a Data Catalogue by Stream-basedIndexing of Linked Data, Journal of Web Semantics: Science,Services and Agents on the World Wide Web, Special Issue onSemantic Web Callenge 2011, 2012.

Gerd Groner, Matthias Thimm Semantic Web 36 / 37

Exercises

No exercises today

Have a nice weekend!

Gerd Groner, Matthias Thimm Semantic Web 37 / 37