Linked (Open) Data - But what does it buy me?

  • Published on
    11-May-2015

Transcript

1. Linked (Open) Data - But what does it buy me?
    Rinke Hoekstra, VU University Amsterdam / University of Amsterdam, rinke.hoekstra@vu.nl
    "Linked (Open) Data - But what does it buy me?" by Rinke Hoekstra. Licensed under a Creative Commons Attribution-ShareAlike 3.0 Unported License.
    Monday 11 March 2013

2.

3. http://www.youtube.com/watch?v=ga1aSJXCFe0

4.

5. http://www.ted.com/talks/tim_berners_lee_the_year_open_data_went_worldwide.html

6. Linked Open Data

7. Linked Open Data
    Texts taken from http://5stardata.info

8. Why people go "Meh"
    • Data needs to be converted to RDF
    • Data needs to be published on the Web
    • An open license is required even for a single [star]
    Image: Pacific Barreleye, http://imgur.com/gallery/Mzyb5 (can rotate its eyes forwards or upwards to look through the transparent head to prey above)

9. Why people go "Meh"
    • What if people draw incorrect conclusions from my data?
    • Data needs to be converted to RDF
    • Data needs to be published on the Web
    • An open license is required even for a single [star]
    Image: Pacific Barreleye, http://imgur.com/gallery/Mzyb5

10. Why people go "Meh"
    • What if people, or journalists, draw incorrect conclusions from my data?
    • Data needs to be converted to RDF
    • Data needs to be published on the Web
    • An open license is required even for a single [star]
    Image: Pacific Barreleye, http://imgur.com/gallery/Mzyb5

11. Why people go "Meh"
    • What if people, or journalists, draw incorrect conclusions from my data?
    • Data needs to be converted to RDF
    • Data needs to be published on the Web
    • An open license is required even for a single [star]
    • What if combining data results in privacy infringement?
    Image: Pacific Barreleye, http://imgur.com/gallery/Mzyb5

12. ... but LOD is just asking for more!

13. ... how can I sell this internally?

14.

15. Linked Open Data

16. Linked Data: Six Ingredients
    • 1 The missing [...]
    • 2 Repeatable Transformation
    • 3 Choose your Grain Size
    • 4 Mix 'n' Mash
    • 5 Contextualize!
    • 6 Lower the Threshold

17. 1 The missing [...]

18. 1 The missing [...]

19. 1 The missing [...]
    • Version information
    • Guessable: http://give.everything/a/URI
    • Version agnostic
    • HTTPs URIs only please! (or resolver + URN)

20. Messy Data
    http://wetten.overheid.nl/BWBIdService/BWBIdList.xml.zip
    NB: The problem with the XML processing instruction was reported and fixed, but returned some weeks later.

21. Example: Juriconnect
    1.0:c:BWBR0005416&artikel=6
    vs
    http://wetten.overheid.nl/cgi-bin/deeplink/law1/bwbid=BWBR0005416/article=6/date=2005-01-14
    vs
    http://wetten.overheid.nl/BWBR0005416/TitelII698946/HoofdstukII/Artikel16/geldigheidsdatum_14-01-2005
    • Existing identification standard: Juriconnect
    • URN-like... but no naming server (cf. Document Object Identifiers)
    • Named elements do not carry an identifier
    • No explicit version information, only contextual

22. Levels of Identification (IFLA FRBR levels, bibliographic entities)
    • Work: Regulation
    • Expression (realizes Work): Version of regulation
    • Manifestation (embodies Expression): XML version of regulation
    • Item (exemplifies Manifestation): XML version of regulation on my harddisk

23. Transparent = Guessable
    • Hierarchical information (work):
      http://doc.metalex.eu/id/BWBR0011823/hoofdstuk/1/artikel/1
      http://doc.metalex.eu/id/BWBR0011823/artikel/1
    • Version and language (expression):
      http://doc.metalex.eu/id/BWBR0011823/hoofdstuk/1/artikel/1/nl/2010-09-01
    • Format information (manifestation):
      http://doc.metalex.eu/doc/BWBR0011823/hoofdstuk/1/artikel/1/nl/2010-09-01/data.xml

24. Versioning Issues
    • URIs don't carry semantics...
    • Detect changes: which element versions are the same... and which versions are different?
    Example: Art. 44, lid 4 (2011-03-26) and Art. 44, lid 4 (2011-04-05). From: Besluit prudentiële regels Wft, BWBR0020420

25-27. Opaque Identifiers (three builds of the same diagram)
    http://doc.metalex.eu/BWBR0011823/hoofdstuk/1/artikel/34b0cee26ee5138c74aa2c62caf2c117d3c616e9
    Diagram labels:
    • SW Hoofdstuk I, Artikel 10 (2011-01-01)
    • SW Hoofdstuk I, Artikel 10 (2011-10-12)
    • owl:sameAs
    • dcterms:subject: "vermogen van de erflater" (estate of the testator)
    • SHA1: 8738ef273ea4dbc73
    • Content information: unique SHA1 hash of text

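The URI patterns on slide 23 and the content hashes on slides 25-27 can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the MetaLex implementation: the class ElementRef and the functions content_hash and same_as_pairs are hypothetical names, the whitespace normalization before hashing is an assumption, and only the host name and path layout are copied from the slide examples.

```python
import hashlib
from dataclasses import dataclass

BASE = "http://doc.metalex.eu"  # host copied from the slide examples


@dataclass(frozen=True)
class ElementRef:
    """Hypothetical helper: a regulation element addressed by its hierarchy."""
    bwb_id: str   # regulation identifier, e.g. "BWBR0011823"
    path: tuple   # hierarchical path, e.g. ("hoofdstuk", "1", "artikel", "1")

    def work_uri(self) -> str:
        # Work level: hierarchy only, no version information.
        return "/".join((BASE, "id", self.bwb_id) + self.path)

    def expression_uri(self, lang: str, date: str) -> str:
        # Expression level: the work URI plus language and version date.
        return "/".join((self.work_uri(), lang, date))

    def manifestation_uri(self, lang: str, date: str, filename: str = "data.xml") -> str:
        # Manifestation level: a concrete file, under /doc/ instead of /id/.
        return "/".join((BASE, "doc", self.bwb_id) + self.path + (lang, date, filename))


def content_hash(text: str) -> str:
    """SHA-1 of the text; collapsing whitespace first is an assumption."""
    normalized = " ".join(text.split())
    return hashlib.sha1(normalized.encode("utf-8")).hexdigest()


def same_as_pairs(versions: dict) -> list:
    """Pair version URIs whose text hashes match (candidates for owl:sameAs)."""
    by_hash = {}
    for uri, text in versions.items():
        by_hash.setdefault(content_hash(text), []).append(uri)
    pairs = []
    for uris in by_hash.values():
        ordered = sorted(uris)
        pairs.extend((ordered[0], other) for other in ordered[1:])
    return sorted(pairs)


ref = ElementRef("BWBR0011823", ("hoofdstuk", "1", "artikel", "1"))
print(ref.work_uri())
print(ref.expression_uri("nl", "2010-09-01"))
print(ref.manifestation_uri("nl", "2010-09-01"))
```

The transparent URIs stay guessable because each level only appends to the one above it, while the hash gives a version-independent handle on the text itself: two dated versions with identical text end up with the same hash and can be linked.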
28. Opaque Identifiers
    http://doc.metalex.eu/BWBR0011823/hoofdstuk/1/artikel/34b0cee26ee5138c74aa2c62caf2c117d3c616e9
    Diagram labels:
    • SW Hoofdstuk I, Artikel 10 (2011-01-01)
    • SW Hoofdstuk I, Artikel 10 (2011-10-12)
    • owl:sameAs
    • dcterms:subject: "vermogen van de erflater"
    • SHA1: 8738ef273ea4dbc73
    • SHA1: a433f53273c78a56f2
    • Content information: unique SHA1 hash of text

29. Network Analysis

30. 2 Repeatable Transformation
    • Transformation should be part of routine ...
    • ... manageable and scalable ...
    • ... repeatable ...
    http://www.w3.org/TR/prov-overview/

31. 2 Repeatable Transformation
    • Linked Data will not be the official source anytime soon
    • Provenance is key
    • Transformation should be part of routine ...
    • ... manageable and scalable ...
    • ... repeatable ...
    http://www.w3.org/TR/prov-overview/

32.

33. LODStats
    http://stats.lod2.eu

34. 40,745,554,078 Triples!

35. 40,745,554,078 Triples! (1.6 Billion)
    (I tried to check the latest figures, but http://stats.lod2.eu was down)

36. 3 Choose your Grain Size
    • The document is the traditional grain size (Dublin Core)
    • Linked data allows for deep links into data
    • Cost versus usefulness
    • Are you the right party to provide detailed descriptions?
    http://creatingandeducating.blogspot.nl/2011/11/blog-post.html

37. RDF Report Card
    • Report card categories: Structure, Metadata, Scope, Internals
    • Scale: Low Detail to High Detail
    RDF Report Card by Leigh Dodds, talk at Semtech Biz London, 2011, http://slideshare.net/ldodds

38. 4 Mix 'n' Mash
    • Multiple vocabularies won't bite
    • Multiple identifiers won't bite
    • Choose what's useful for you...
    • ... then map to others!
    Image: David Sykes, 2009. All rights reserved.

39. 4 Mix 'n' Mash
    • Multiple vocabularies won't bite
    • Multiple identifiers won't bite
    • Choose what's useful for you...
    • ... then map to others!
    • Good News: the bulk has already been done for you!
    Image: David Sykes, 2009. All rights reserved.

40. Semantically-Interlinked Online Communities

41. Semantically-Interlinked Online Communities

42. Example: Provenance
    Diagram nodes:
    • "2009-10-23"^^xsd:date: the date at which the expression was created
    • http://doc.metalex.eu/id/date/2009-10-23
    • http://doc.metalex.eu/id/event/BWBR0017869/2009-10-23: the creation event of the regulation
    • http://doc.metalex.eu/id/process/BWBR0017869/2009-10-23: the process that generated the expression
    • http://doc.metalex.eu/id/BWBR0017869/2009-10-23: the expression (version) URI of a regulation
    Classes shown: time:Instant, ml:Date, sem:Time, sem:Event, ml:LegislativeModification, opmv:Process, opmv:Artifact, ml:BibliographicExpression
    Properties shown: rdf:type, rdf:value, sem:hasTimeStamp, sem:timeType, time:inXSDDateTime, sem:hasTime, time:hasEnd, ml:date, sem:eventType, opmv:wasGeneratedAt, opmv:wasGeneratedBy, ml:resultOf

43. 5 Contextualize!
    • Information is not always compatible
    • Make explicit in which context the information holds ...
    • ... and who stated the information, why and how.
    Flat Earth and Square Earth idea courtesy of Szymon Klarman

44. (Provenance of a census observation)
    • Namespaces don't mean anything
    • Use named graphs to compartmentalize metadata
    • Add provenance information about groups of statements
    Diagram nodes: :curation20120126, :RinkeHoekstra, :Assendelft, :1875--1874, :14-15, :14--15_1875--1874, _:x, _:a, _:b, "1"^^xsd:int, "11"^^xsd:int, "1889"^^xsd:int, "20120126T08:30:00", "20120126T09:00:00"
    Classes shown: provo:Activity
    Properties shown: rdf:type, provo:wasGeneratedBy, provo:hadAgent, provo:startedAt, provo:endedAt, d2s:populationSize, d2s:censusYear, d2s:birthYears, d2s:gemeente, d2s:dimension, d2s:ageGroup, time:inXSDDateTime

45. Compliance
    Regulation A: Art 12; Art 14, lid 3, 2e volzin

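Slide 44's advice (put statements in a named graph, then attach provenance to the graph as a whole) can be illustrated with plain N-Quads strings. This is a sketch under loud assumptions: the example.org namespaces stand in for the ':' and 'd2s:' prefixes, whose real namespaces the slide does not give; the graph name census1889 and the helper names are hypothetical; the two observation triples are one plausible reading of the diagram; and it uses the W3C PROV-O property prov:wasAssociatedWith where the slide shows provo:hadAgent.

```python
EX = "http://example.org/"           # hypothetical namespace for ':' names
D2S = "http://example.org/d2s#"      # hypothetical; the slide does not give it
PROV = "http://www.w3.org/ns/prov#"  # W3C PROV-O namespace
XSD_INT = "http://www.w3.org/2001/XMLSchema#int"


def iri(value: str) -> str:
    return f"<{value}>"


def int_literal(number: int) -> str:
    return f'"{number}"^^<{XSD_INT}>'


def quad(subject: str, predicate: str, obj: str, graph: str) -> str:
    # One N-Quads statement: subject, predicate, object, graph label.
    return f"{subject} {predicate} {obj} {graph} ."


def graph_with_provenance(graph_name, triples, activity, agent):
    """Put the triples in one named graph and describe that graph in another."""
    data_graph = iri(EX + graph_name)
    prov_graph = iri(EX + graph_name + "/provenance")
    quads = [quad(s, p, o, data_graph) for (s, p, o) in triples]
    # Provenance is stated once, about the whole group of statements.
    quads.append(quad(data_graph, iri(PROV + "wasGeneratedBy"),
                      iri(EX + activity), prov_graph))
    quads.append(quad(iri(EX + activity), iri(PROV + "wasAssociatedWith"),
                      iri(EX + agent), prov_graph))
    return quads


# Assumed reading of the diagram: one observation about Assendelft.
observation = [
    ("_:a", iri(D2S + "gemeente"), iri(EX + "Assendelft")),
    ("_:a", iri(D2S + "populationSize"), int_literal(11)),
]
quads = graph_with_provenance("census1889", observation,
                              "curation20120126", "RinkeHoekstra")
print("\n".join(quads))
```

Keeping the provenance in its own graph means the data graph can be replaced by a newer curation run without touching, or losing, the record of who produced the older one.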
46-49. Compliance (four identical builds)
    State diagram labels: start, State Name, entry/action, do/activity, action State, exit/action, event/action(arguments), end
    Regulation A: Art 12; Art 14, lid 3, 2e volzin (para. 3, 2nd sentence)

50. Compliance
    State diagram labels: start, State Name, entry/action, do/activity, action State, exit/action, event/action(arguments), end
    Regulation A: Art 12; Art 14, lid 3, 2e volzin; Art 14, lid 3, 2e volzin

51. Compliance
    State diagram labels: start, State Name, entry/action, do/activity, action State, exit/action, event/action(arguments), end
    Regulation A: Art 12; Art 14, lid 3, 2e volzin; Art 14, lid 3, 2e volzin
    Dates shown: (01-01-2011), (04-02-2011), (11-06-2008), (01-07-2011)

52. Contextual Annotation
    Each level carries dcterms:subject "vermogen van de erflater" (estate of the testator):
    • Successiewet (SW)
    • SW Hoofdstuk I
    • SW Hoofdstuk I, Artikel 10
    • SW Art. 10, zin 1
    No nice background because Google Image search only returned boring images.

53. 6 Lower the Threshold
    • Integrate Linked Data production into everyday tools
    • Allow tools to do the work for you
    • Use a built-in reward model
    Image courtesy of http://themaisonette.net

54. 6 Lower the Threshold
    • Linked Data allows you to trace usage!
    • Integrate Linked Data production into everyday tools
    • Allow tools to do the work for you
    • Use a built-in reward model
    Image courtesy of http://themaisonette.net

55. Wrap Legacy Systems
    http://www.w3.org/TR/r2rml/

56.

57. Idea: use reward mechanisms of Web 2.0

58. Lightweight Web Application
    • Interface to API of existing data repositories
    • Enrich metadata by linking to Linked Data resources
    • Provide annotation services for data files
    • Plugin based architecture
    • Publish RDF metadata as new data publication
    http://linkitup.data2semantics.org

59. recoprov
    Reconstruct provenance using Dropbox file edit history
    Sara Magliacane and Paul Groth

60. plsheet
    How are results calculated (1)?
    • Analyse dependencies between cells in complex spreadsheets
    • Automatic analysis of workflow in spreadsheets
    Martine de Vos, Jan Wielemaker and Willem van Hage

61. plsheet
    Reconstruct and explain the workflow of computations
    Martine de Vos, Jan Wielemaker and Willem van Hage

62. TabLinker
    Semi-automatic RDF converter for eccentric spreadsheets
    Albert Meroño-Peñuela, Rinke Hoekstra, Laurens Rietveld, Christophe Guéret
    http://www.cedar-project.nl

63. TabLinker
    Semi-automatic RDF converter for eccentric spreadsheets
    Albert Meroño-Peñuela, Rinke Hoekstra, Laurens Rietveld, Christophe Guéret
    http://www.cedar-project.nl

64. Linked Data: Six Ingredients
    • The missing [...]
    • Repeatable Transformation
    • Choose your Grain Size
    • Mix 'n' Mash
    • Contextualize!
    • Lower the Threshold

65. Linked Open Data... be sure to use it internally too!
    • The missing [...]
    • Repeatable Transformation
    • Choose your Grain Size
    • Mix 'n' Mash
    • Contextualize!
    • Lower the Threshold
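The "Repeatable Transformation" ingredient recapped on the closing slides (routine, repeatable, with provenance as the key) can be sketched as a deterministic conversion that also emits a small provenance record. This is an illustration only, not TabLinker or any tool named in the talk: the example.org base URI, the function names, and the record fields are all hypothetical, and the sketch assumes column names and key values are safe to place in an IRI.

```python
import csv
import hashlib
import io

BASE = "http://example.org/"  # hypothetical base URI


def rows_to_ntriples(csv_text: str, key_column: str) -> list:
    """Turn each CSV row into N-Triples lines, one per non-empty cell."""
    lines = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        subject = f"<{BASE}id/{row[key_column]}>"
        for column, value in row.items():
            if column == key_column or value == "":
                continue
            escaped = (value.replace("\\", "\\\\")
                            .replace('"', '\\"')
                            .replace("\n", "\\n"))
            lines.append(f'{subject} <{BASE}def/{column}> "{escaped}" .')
    # Sorting makes the output independent of row order quirks.
    return sorted(lines)


def transform_with_provenance(csv_text: str, key_column: str, script_version: str):
    """Return the converted data plus a record tying output to its input."""
    triples = rows_to_ntriples(csv_text, key_column)
    output = "\n".join(triples) + "\n"
    record = {
        "source_sha1": hashlib.sha1(csv_text.encode("utf-8")).hexdigest(),
        "output_sha1": hashlib.sha1(output.encode("utf-8")).hexdigest(),
        "script_version": script_version,
        "triple_count": len(triples),
    }
    return output, record


source = "gemeente,populationSize\nAssendelft,11\n"
output, record = transform_with_provenance(source, "gemeente", "v1")
print(output, end="")
print(record["triple_count"])
```

Because the record contains no timestamps or random identifiers, rerunning the same script version on the same source yields byte-identical output and an identical record, which is what makes the step safe to run routinely.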