Methodological Guidelines for Publishing Linked Data

Embed Size (px)

DESCRIPTION

Methodological Guidelines for Publishing Linked Data presented in Bolivia. UPB, UCB, UMSS, Jalasoft

Text of Methodological Guidelines for Publishing Linked Data

  • 1.Methodological Guidelines for Publishing Linked Data Boris Villazn-Terrazas, Asuncin Gmez-Prez, and scar Corcho Facultad de Informtica, Universidad Politcnica de Madrid Campus de Montegancedo sn, 28660 Boadilla del Monte, Madrid http://www oeg upm nethttp://www.oeg-upm.net {bvillazon,asun,ocorcho}@fi.upm.es Phone: 34.91.3366605, Fax: 34.91.3524819 Cochabamba, Bolivia May, 2011

2. ToC Ontology Engineering Group Introduction to Linked Data Guidelines for Publishing Linked Data Demo 2 3. ToC Ontology Engineering Group Introduction to Linked Data Guidelines for Publishing Linked Data Demo 3 4. People Director: A. Gmez-Prez Research Group (38 people) 2 Full Professors 6 Associate Professors 1 Assistant Professor 6 Postdocs 14 PhD Students 6 MSc Students 4 Software Engineers Management (5) 3 Project Managers 3 Project Managers 1 System Administrator 1 Secretary 80+ Past Collaborators 80+ Past Collaborators 15+ visitors 4 http://www.oeg-upm.net 5. Students from... France G Lithuania Italy France Serbia Macedonia Germany MalaysiaColombia India Ecuador Cameroon Bolivia Ecuador 5 6. > 30 Research projects 1999 20022000 2001 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 20131999 20022000 2001 2003 2004 2005 2006 2007 Katalyx Group IGN/RAE/AMPER/XMEDIA 2008 2009 2010 E Vi t l/ IO!/B diPLATA WHO/IGN/BNE/FAO 2011 2012 2013 ContentWeb 20 Ac Especiales/Complementarias Servicios Semnticos REIMDOC (FIT) Red/Gis4Gov/11811/UPnP/UpGrid/Autores3.0/WEBn+1 GeoBuddies Espaa Virtual/mIO!/BuscamediaPLATA BabelData / myBigData HA98-0002 MKBEEM O t W b HF02-0013 20 Ac. Especiales/Complementarias SEEMP NeOnOntoWeb Esperonto PIKON Knowledge Web NeO ADMIRE DynaLearn OntoGrid Marie Curie SemSorGrid4Env SEALS MONNET Company EU Project Coordinators SCALUS PlanetData Wf4Ever 6 Company EU Project Coordinators Spanish Projects EU Project Participation 7. Collaboration with international companies 7 8. Collaboration with other research groups Univ. of Amsterdam i f d DFKI Univ. of Augsburg Univ. of Karlsruhe U i f K bl KSL. Stanford Univ. Univ. of Wien Univ. of NR & ALS Univ. of Innsbruck Free Univ. of Amsterdam Univ. of Koblenz Univ. of Hannover Univ. of Mannheim Univ of Bielefeld Univ. of Brasilia Univ. of Bielefeld Forschungszentrum Informatik Univ. of Galway (DERI) Free Univ. of Brussels niv. of Zurich Open University O f d U i it stav Informatiky Oxford University Univ. of Manchester Univ. of Liverpool Univ of Sheffield Academy of Sciences Univ. of Sheffield Univ. of Aberdeen Univ. of Edinburgh Univ. of Southampton CNR Univ of Trento Univ. of Tel Aviv 8 Univ. of Southampton Univ. of Hull Univ. of Trento Univ. of Bolzano INRIA Univ. of Athens TUC 9. Research Areas 2004 2008 Internet of Things Semantic e-Science (Data Integration, Semantic Grid) Ontological Engineering 1995Semantic Grid) 1995 (S i l) N l(Social) Semantic Web Natural Language Processingg 19972000 9 10. Linked Data in OEG GeoLinkedData is an open initiative whose aim is to enrich the Web of Data with Spanish geospatial data.p g p http://geo.linkeddata.es El Viajero Linked Data is project that focuses on the integration of the contents produced by newspapers and digital platforms belonging to Prisa Groupand digital platforms belonging to Prisa Group. http://webenemasuno.linkeddata.es/ A project with the Biblioteca Nacional to publish the library information as Linked Data.y http://cultura.linkeddata.es/visualizer/ 10 11. Linked Data in OEG Tools for generating and cosuming Linked Data, e.g., geometry2rdf http://www oeg upm net/index php/downloads/151 geometry2rdf geometry2rdf http://www.oeg-upm.net/index.php/downloads/151-geometry2rdf map4rdf http://oegdev.dia.fi.upm.es/projects/map4rdf/ Spanish Thematic Network of Linked Data http://red.linkeddata.esp Group leader: Ontology Engineering Group 19 Research Groups 4 companies 4 companies 11 12. ToC Ontology Engineering Group Introduction to Linked Data Guidelines for Publishing Linked Data Demo 12 13. Classic Web MovieDB Data exposed to the Web viathe Web via HTML, pdf, etc. CIA World FactBook 13 Slide adapted from 5min Introduction to Linked Data- Olaf Hartig 14. Classic Web Information from single pagesComplex queries l i l s g e pages can be found via search engines over multiple pages / data ? sea c e g es sources? 14 Slide adapted from 5min Introduction to Linked Data- Olaf Hartig 15. What do we actually want? Use the Web like a single global database MovieDB CIA World FactBook 15 Slide adapted from 5min Introduction to Linked Data- Olaf Hartig 16. Linked Data enables such Web of Data Global Identifier: URI (Uniform Resource Identifier) which is a string of characters usedGlobal Identifier: URI (Uniform Resource Identifier), which is a string of characters used to identify a name or a resource on the Internet. Data Model: RDF (Resource Description Framework), which is a standard model for data interchange on the Web Access Mechanism: HTTP Connection: Typed Links 8000000 Even the Rain http://cia.../Bolivia http://imdb.../TLLuvia http://.../population http://.../name http://.../filming_location p MovieDB CIA World FactBook 16 Slide adapted from 5min Introduction to Linked Data- Olaf Hartig 17. In a nutshell An extension of the current An extension of the current Web where information and servicesdata are given well-defined and explicitly represented meaning, so that it can be shared and used by humans and machinesby humans and machines, ... ... better enabling them to work in cooperation How? Promoting information exchange by tagging web content with machine processable descriptions of its meaning. A d t h l i d i f t t And technologies and infrastructure to do this And clear principles on how to publish data 17 publish data 18. The four principles (Tim Berners Lee, 2006) 1. Use URIs as names for things http://www.w3.org/D esignIssues/Linkedfor things 2. Use HTTP URIs so that people can look esignIssues/Linked Data.html that people can look up those names. 3. When someone looks http://www.ted.com/talks/tim_berners_lee_on_the_next_web.htmlhttp://www.ted.com/talks/tim_berners_lee_on_the_next_web.html up a URI, provide useful information, i th t d dusing the standards (RDF*, SPARQL) 4 Include links to other4. Include links to other URIs, so that they can discover more things.discover more things. 18 19. RDF Resource Description Framework W3C d ti W3C recommendation RDF is a basic KR language based on semantic networks RDF is a basic KR language based on semantic networks Useful to represent metadata and describe any type of Useful to represent metadata and describe any type of information in a machine-accesible way (aka data model) Resources are described in terms of properties and property values using RDF statement St t t t d t i l i ti f Statements are represented as triples, consisting of a subject, predicate, and object [S,P,O] Subject Object property 19 Statement Slide adapted from RDF and RDF Schema- Ral Garca et al. 20. RDF - Example Alex VillaznAlex Villazn hasName hasColleague http://upb.edu/Alex http://upb.edu/Hugo htt // b d /B t hasColleague M l http://upb.edu/Beto Male For practical purposes, specially if handwritten, URIs are shortened using XML namespaces xmlns:upb=http://upb edu/ hasSex xmlns:upb http://upb.edu/ upb:Alex is equivalent to http://upb.edu/Alex RDF serializations: XML, N3, N-Triple Alex Villazn person:hasName upb:Alex upb:Hugo person:hasColleague person:hasColleague 20 upb:Beto Male person:hasSex Slide adapted from RDF and RDF Schema- Ral Garca et al. 21. RDF - SPARQL Alex Villazn person:hasName person hasColleag e upb:Alex upb:Hugo person:hasColleague person:hasColleague Query: Tell me who are the persons who have Hugo as colleague upb:Beto Male person:hasSex upb:Hugo? person:hasColleague Result: upb:Alex and upb:Beto SPARQL l f RDF W3C d i SPARQL query language for RDF. W3C recommendation SELECT ?s WHERE { ?s person:hasColleague upb:Hugo.} 21 Slide adapted from RDF and RDF Schema- Ral Garca et al. 22. So does that mean I have to publish my data as Linked Data, now? But, why? What was your incentive to publish an HTML page in 1990? Share data in documents and because your neighbor was doing itwas doing it So, why should we publish Linked Data in 2011?, y p Share data as data and because your neighbor is doing it 22 Slide adapted from Introduction to Linked Data- Juan Sequeda 23. And guess who is starting to publish Linked Data now? UK Government UK Government US Government BBC Open Calais Freebase NY Times CNET Dbpedia Dbpedia . 23 24. Linked Open Data evolution 2007 2008 2009 2424 25. Linked Open Data 2010 25 http://richard.cyganiak.de/2007/10/lod/ 26. ToC Ontology Engineering Group Introduction to Linked Data Guidelines for Publishing Linked Data Demo 26 27. Guidelines for Publishing Linked Data 27 28. Guidelines for Publishing Linked Data 28 29. Identification of the data sources Guidelines based on the Open Data Manual 1 Two possibilities To find the data sources already available in a public data catalog, e.g., Aporta project 2 To get an agreement with a particular government body to publish its data sources, e.g., GeoLinkedData - IGNp g 29 1 http://opendatamanual.org/ 2 http://aporta.es 30. GeoLinkedData Identification of the data sources IGN National Geographic Institute of Spain Agreement with the IGN g p p Oracle & MySQL Data sources available in a public data catalog INE National Statistic Institute of Spain in a public data catalog 30 31. IGN & INE Identification of the data sources Year Industry Production IndexProvince 31 32. Guidelines for Publishing Linked Data 32 33. Ontology Vocabulary Modelling An ontology is an engineering artifact, which provides: A set of terms A set of explicit assumptions regarding the intended meaning of the terms. Almost always including concepts and their classification Almost always including