Linked (Open) Data - But what does it buy me?

1. Linked (Open) Data - But what does it buy me?
  Rinke Hoekstra, VU University Amsterdam / University of Amsterdam, rinke.hoekstra@vu.nl
  "Linked (Open) Data - But what does it buy me?" by Rinke Hoekstra. Licensed under a Creative Commons Attribution-ShareAlike 3.0 Unported License.
  Monday, 11 March 2013

3. http://www.youtube.com/watch?v=ga1aSJXCFe0

5. http://www.ted.com/talks/tim_berners_lee_the_year_open_data_went_worldwide.html

6-7. Linked Open Data
  Texts taken from http://5stardata.info

8-11. Why people go Meh
  • Data needs to be converted to RDF
  • Data needs to be published on the Web
  • An open license is required even for a single star
  • What if people, or journalists, draw incorrect conclusions from my data?
  • What if combining data results in privacy infringement?
  Image: Pacific Barreleye, http://imgur.com/gallery/Mzyb5 (can rotate its eyes forwards or upwards to look through the transparent head to prey above)

12. ... but LOD is just asking for more!

13. ... how can I sell this internally?

15. Linked Open Data

16. Linked Data: Six Ingredients
  • The missing
  • Repeatable Transformation
  • Choose your Grain Size
  • Mix 'n' Mash
  • Contextualize!
  • Lower the Threshold

17-19. Ingredient 1: The missing
  • Version information
  • Guessable
  • http://give.everything/a/URI
  • Version agnostic
  • HTTPs URIs only please! (or resolver + URN)

20. Messy Data
  http://wetten.overheid.nl/BWBIdService/BWBIdList.xml.zip
  NB: The problem with the XML processing instruction was reported and fixed, but returned some weeks later.

21. Example: Juriconnect
  1.0:c:BWBR0005416&artikel=6
  vs. http://wetten.overheid.nl/cgi-bin/deeplink/law1/bwbid=BWBR0005416/article=6/date=2005-01-14
  vs. http://wetten.overheid.nl/BWBR0005416/TitelII698946/HoofdstukII/Artikel16/geldigheidsdatum_14-01-2005
  • Existing identification standard: Juriconnect
  • URN-like... but no naming server (cf. Document Object Identifiers)
  • Named elements do not carry an identifier
  • No explicit version information, only contextual

22. Levels of Identification
  IFLA FRBR levels of a Bibliographic Entity:
  • Work: Regulation
  • Expression (realizes Work): Version of regulation
  • Manifestation (embodies Expression): XML version of regulation
  • Item (exemplifies Manifestation): XML version of regulation on my harddisk

23. Transparent = Guessable
  • Hierarchical information (work):
    http://doc.metalex.eu/id/BWBR0011823/hoofdstuk/1/artikel/1
    http://doc.metalex.eu/id/BWBR0011823/artikel/1
  • Version and language (expression):
    http://doc.metalex.eu/id/BWBR0011823/hoofdstuk/1/artikel/1/nl/2010-09-01
  • Format information (manifestation):
    http://doc.metalex.eu/doc/BWBR0011823/hoofdstuk/1/artikel/1/nl/2010-09-01/data.xml

24. Versioning Issues
  • URIs don't carry semantics...
  • Detect changes: which element versions are the same... and which versions are different?
  Example: Art. 44, lid 4 (2011-03-26) and Art. 44, lid 4 (2011-04-05). From: Besluit prudentiële regels Wft, BWBR0020420
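The layered URIs on the "Transparent = Guessable" slide and the change-detection question under "Versioning Issues" can be illustrated with a small sketch. This is not the MetaLex implementation. The function names are hypothetical, the URI patterns are read off the example URIs on the slide, and hashing the text with SHA-1 follows the "Unique SHA1 Hash of text" label on the opaque identifier slides. Collapsing whitespace before hashing is an assumption made here for illustration only.

```python
import hashlib

BASE = "http://doc.metalex.eu"  # host as it appears on the slides


def work_uri(bwb_id: str, path: str) -> str:
    """Work level: regulation id plus hierarchical path, no version."""
    return f"{BASE}/id/{bwb_id}/{path}"


def expression_uri(bwb_id: str, path: str, lang: str, date: str) -> str:
    """Expression level: the work URI extended with language and version date."""
    return f"{work_uri(bwb_id, path)}/{lang}/{date}"


def manifestation_uri(bwb_id: str, path: str, lang: str, date: str,
                      filename: str = "data.xml") -> str:
    """Manifestation level: a concrete file, served under /doc/ instead of /id/."""
    return f"{BASE}/doc/{bwb_id}/{path}/{lang}/{date}/{filename}"


def content_hash(text: str) -> str:
    """SHA-1 hex digest of the text.

    Collapsing whitespace first is an assumption of this sketch, so that
    layout-only differences do not count as a new version.
    """
    normalized = " ".join(text.split())
    return hashlib.sha1(normalized.encode("utf-8")).hexdigest()


def same_content(text_a: str, text_b: str) -> bool:
    """True when two element versions carry the same text."""
    return content_hash(text_a) == content_hash(text_b)


if __name__ == "__main__":
    path = "hoofdstuk/1/artikel/1"
    print(work_uri("BWBR0011823", path))
    print(expression_uri("BWBR0011823", path, "nl", "2010-09-01"))
    print(manifestation_uri("BWBR0011823", path, "nl", "2010-09-01"))
    # Placeholder strings, not the actual article text.
    print(same_content("Version  one of the text", "Version one of the text"))
```

A guessable URI lets a reader move between levels by editing the path, while the hash answers a different question: whether two dated versions of the same element actually differ in content.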
Opaque Identifiers (slides 25 to 28, one incremental build)
  • Content information: unique SHA1 hash of text
  • http://doc.metalex.eu/BWBR0011823/hoofdstuk/1/artikel/34b0cee26ee5138c74aa2c62caf2c117d3c616e9
  Diagram labels: vermogen van de erater; dcterms:subject; SW (twice); Hoofdstuk I, Artikel 10 (twice); 2011-01-01; 2011-10-12; owl:sameAs; SHA1 8738ef273ea4dbc73; SHA1 a433f53273c78a56f2

29. Network Analysis

30-31. Ingredient 2: Repeatable Transformation
  • Linked Data will not be the official source anytime soon
  • Provenance is key
  • Transformation should be part of routine ... manageable and scalable ... repeatable ...
  http://www.w3.org/TR/prov-overview/

33. LODStats
  http://stats.lod2.eu

34-35. 40.745.554.078 Triples! (1.6 Billion)
  (I tried to check the latest figures, but http://stats.lod2.eu was down)

36. Ingredient 3: Choose your Grain Size
  • The document is the traditional grain size (Dublin Core)
  • Linked data allows for deep links into data
  • Cost versus usefulness
  • Are you the right party to provide detailed descriptions?
  Image: http://creatingandeducating.blogspot.nl/2011/11/blog-post.html

37. RDF Report Card
  • Report Card Categories: Structure, Metadata, Scope, Internals
  • Scale: Low Detail to High Detail
  RDF Report Card by Leigh Dodds, talk at Semtech Biz London, 2011, http://slideshare.net/ldodds

38-39. Ingredient 4: Mix 'n' Mash
  • Multiple vocabularies won't bite
  • Multiple identifiers won't bite
  • Choose what's useful for you... then map to others!
  • Good News: the bulk has already been done for you!
  Image © David Sykes 2009. All rights reserved.

40-41. Semantically-Interlinked Online Communities

42. Example: Provenance
  Diagram nodes:
  • http://doc.metalex.eu/id/BWBR0017869/2009-10-23: The expression (version) URI of a regulation
  • http://doc.metalex.eu/id/process/BWBR0017869/2009-10-23: The process that generated the expression
  • http://doc.metalex.eu/id/event/BWBR0017869/2009-10-23: The creation event of the regulation
  • http://doc.metalex.eu/id/date/2009-10-23: The date at which the expression was created ("2009-10-23"^^xsd:date)
  Types shown: ml:BibliographicExpression, opmv:Artifact, opmv:Process, sem:Event, ml:LegislativeModification, ml:Date, sem:Time, time:Instant
  Properties shown: rdf:type, rdf:value, opmv:wasGeneratedBy, opmv:wasGeneratedAt, ml:resultOf, ml:date, sem:hasTime, sem:hasTimeStamp, sem:eventType, sem:timeType, time:hasEnd, time:inXSDDateTime

43. Ingredient 5: Contextualize!
  • Information is not always compatible
  • Make explicit in which context the information holds ...
  • ... and who stated the information, why and how.
  Flat Earth and Square Earth idea courtesy of Szymon Klarman

44. (named graphs and provenance)
  • Namespaces don't mean anything
  • Use named graphs to compartmentalize metadata
  • Add provenance information about groups of statements
  Diagram labels: provo:Activity; rdf:type; :curation20120126; provo:wasGeneratedBy; provo:hadAgent; :RinkeHoekstra; provo:startedAt; provo:endedAt; time:inXSDDateTime (twice); "20120126T08:30:00"; "20120126T09:00:00"; _:x; _:a; _:b; d2s:populationSize (twice); "1"^^xsd:int; "11"^^xsd:int; d2s:censusYear; "1889"^^xsd:int; d2s:birthYears; :1875--1874; d2s:gemeente; :Assendelft; d2s:dimension; :14--15_1875--1874; d2s:ageGroup; :14-15

45-50. Compliance
  • Regulation A: Art 12; Art 14, lid 3, 2e volzin
  • State diagram labels: start; State Name; entry/action; do/activity; action State; exit/action; event/action(arguments); end

51. Compliance startState Name entry/acti