31
Describing Linked Datasets On the Design and Usage of voiD, the ‘Vocabulary Of Interlinked Datasets’ Linked Data Workshop at WWW09, 2009‐04‐20, Madrid, Spain Keith Alexander (Talis), Richard Cyganiak (DERI), Michael Hausenblas (DERI) and Jun Zhao (University of Oxford)

Describing Linked Datasets

Embed Size (px)

Citation preview

Page 1: Describing Linked Datasets

DescribingLinkedDatasetsOntheDesignandUsageofvoiD,

the‘VocabularyOfInterlinkedDatasets’LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain

KeithAlexander(Talis),RichardCyganiak(DERI),MichaelHausenblas(DERI)andJunZhao(UniversityofOxford)

Page 2: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 2

Agenda

•  TheProblem•  OurProposal–voiD•  ApplicaNons•  NextSteps

Page 3: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 3

TheProblem

2008

2007

Page 4: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 4

TheProblem

2009

2008

Page 5: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 5

TheProblem

•  TheLinkingOpenData(LOD)cloudgatherscurrentlyroughlythesamemomentumastheWebintheearly1990s

•  Howdidpeopledealwiththeconsequencesofhavingadecentralizedsystem,backthen?

Page 6: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 6

TheProblem

Page 7: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 7

TheProblem

•  From2007on,wehavebeendoingitintheYahoo!‐catalog‐style:manuallycollec>ngandrepresen>ngdataabouttheLinkingOpenDatacloud:–  IntheLODclouddiagram,wegiveaqualitaNveviewinformofavisualgraph

–  InvariousESWWikipageswecreateHTMLtables:•  h`p://esw.w3.org/topic/TaskForces/CommunityProjects/LinkingOpenData/DataSets/StaNsNcs

•  h`p://esw.w3.org/topic/TaskForces/CommunityProjects/LinkingOpenData/DataSets/LinkStaNsNcs

Page 8: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 8

TheProblem

h`p://esw.w3.org/topic/TaskForces/CommunityProjects/LinkingOpenData/DataSets/StaNsNcs

h`p://esw.w3.org/topic/TaskForces/CommunityProjects/LinkingOpenData/DataSets/LinkStaNsNcs

Page 9: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 9

TheProblem

•  Currently,onlyhumancomprehensibledescrip>ons(theLODcloud,Wikipages)available

•  Wecan’tautomatetasks,suchas– Efficient&effecNvesearch

– SelecNonofdataset(forapps,interlinkingtargets)– GeneraNonofmaps,etc.

Page 10: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 10

TheProblem

•  Wecan’tapplyourtoolsandmethodswehaveexperienceswith,suchaseditors,engines,stores,etc.

•  Evenworse,itdoesn’tscale– We’dneedaGoogle‐styleapproachthatscaleslikehellandispowerfulenoughtoenabletheabovemenNoned

– ProvidingmetadataabouttheLODcloudinamachine‐comprehensibleway

Page 11: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 11

Agenda

 TheProblem•  OurProposal–voiD•  ApplicaNons•  NextSteps

Page 12: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 12

OurProposal‐voiD

•  SoluNon:providingaformaldescripNonof– Whatadatasetisabout(topic,technicaldetails)

– HowandunderwhichcondiNonstoaccessit– Howthedatasetisinterlinkedwithotherdatasets•  QualitaNvelevel:typeofinterlinking•  QuanNtaNvelevel:numberoflinks,resources,etc.

– Howtodiscoverthemetadata

•  voiD,the“VocabularyofInterlinkedDatasets”providespreciselythis

Page 13: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 13

OurProposal‐voiD

•  AdatasetisasetofRDFtriplesthatarepublished,maintainedoraggregatedbyasingleprovider.

•  Adatasetisauthorita>vewithrespecttoacertainURInamespaceifitcontainsinformaNonaboutresourcesnamedbyURIsinthisnamespace,andispublishedbytheURIowner(URIownershipasoftheAWWW1)

Page 14: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 14

OurProposal‐voiD

•  AlinksetLSisasetofRDFtripleswhereforalltriplesti=⟨si,pi,oi⟩∈LS,thesubjectisinonedataset,i.e.allsiaredescribedinDS1,andtheobjectisinanotherdataset,i.e.alloiaredescribedinDS2.

Page 15: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 15

OurProposal‐voiD

Page 16: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 16

OurProposal‐voiD

voiDofferstwoorthogonalinterlinkingtypes:•  classicLODvs.3rd‐party,differinginwheretheinterlinkingstatementsare

kept.Inthefirstcasetheinterlinkingtriples,i.e.alinkset,arehostedinoneofthetwoinvolveddatasets,whileinthela`ercasethereisathirddatasetinvolvedthatcontainstheinterlinkingtriples,i.e.thelinkset;

•  non‐directedvs.directed,whichaddressestheissueifsomeoneisinterestedinstaNngthedirecNonoftheinterlinkingornot(forexamplewithowl:sameAs)

classicLOD,non‐directed

3rd‐party,non‐directed

classicLOD,directed

3rd‐party,directed

Page 17: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 17

OurProposal‐voiD

classicLOD,non‐directed

Page 18: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 18

classicLOD,directed

OurProposal‐voiD

Page 19: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 19

3rd‐party,non‐directed

OurProposal‐voiD

Page 20: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 20

3rd‐party,directed

OurProposal‐voiD

Page 21: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 21

OurProposal‐voiD

•  Reusingtermsfromothervocabularies–  foaf:homepage/IFP

– dcterms:subjectalongwithDBpediaURIsh`p://dbpedia.org/resource/XXX

– SCOVOforstaNsNcsabouttriples,links,etc

Page 22: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 22

OurProposal‐voiD

•  PublicaNon&discoveryviasitemapsand/orbacklinks(dcterms:isPartOf)

Page 23: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 23

OurProposal‐voiD

•  OncedatasetprovidershavepublishedtheirvoiDdescripNoninRDFalongwiththeirdataset,onecanaddressthefollowingissues:– Howtofindsomedatasets?

– Howtoefficientlyfindaspecificdataset?– Howtoeffec>velyfinddatasets?– Howtodynamicallyselectdatasets?– Howtoselectdatasetsbasedoncertainpreferences?

Page 24: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 24

Agenda

 TheProblem OurProposal–voiD•  ApplicaNons•  NextSteps

Page 25: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 25

Applica>ons

•  GeneraNon(ve,lipSSM,NXparser)•  VocabularyManagement(Talis)

•  Explorer(RKB,LDE)•  QueryFederaNon(Clarck‐Parsia,OpenLink)•  Datasetranking(DING!talk)

•  PotenNalApplicaNons– Mapofdata(Sindice)– DynamicMeshupsforApplicaNon

Page 26: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 26

Applica>ons

h`p://ld2sd.deri.org/ve

Page 27: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 27

Applica>ons

h`p://ld2sd.deri.org/lde

Page 28: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 28

Applica>ons

h`p://db

lp.rkbexplorer.com

/mod

els/void.`l

Page 29: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 29

Applica>ons

h`p://linkeddata.uriburner.com/

Page 30: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 30

Agenda

 TheProblem OurProposal–voiD ApplicaNons•  NextSteps

Page 31: Describing Linked Datasets

DescribingLinkedDatasets–OntheDesignandUsageofvoiD,the“VocabularyOfInterlinkedDatasets”,LinkedDataWorkshopatWWW09,2009‐04‐20,Madrid,Spain 31

NextSteps

•  voiD2.0seeissuesath`p://code.google.com/p/void‐impl/issues/list

•  staNsNcsmodule(fix/extendreSCOVO)

•  SPARQLendpoints•  provenance,trust(?)•  AssistpeopleinpublishingvoiD