43
Linked (Open) Data Opportunities and challenges Makx Dekkers [email protected]

Linked (Open) Data

  • Upload
    halen

  • View
    37

  • Download
    0

Embed Size (px)

DESCRIPTION

Linked (Open) Data. Opportunities and challenges Makx Dekkers [email protected]. Outline. Basic notions Recent developments Comparing objectives Opportunities and risks Conclusions. Basic notions. The idea and its history. - PowerPoint PPT Presentation

Citation preview

Page 1: Linked (Open) Data

Linked (Open) Data

Opportunities and challenges

Makx [email protected]

Page 2: Linked (Open) Data

Outline

• Basic notions• Recent developments• Comparing objectives• Opportunities and risks• Conclusions

© 2011 Makx Dekkers Journeés ABES 2011 2

Page 3: Linked (Open) Data

BASIC NOTIONS

© 2011 Makx Dekkers Journeés ABES 2011 3

Page 4: Linked (Open) Data

The idea and its history

• 1989: Tim Berners-Lee already talked about linking documents and data together (http://www.w3.org/History/1989/proposal.html)

• 2001: Tim Berners-Lee and Ora Lassila introduced the “Semantic Web” (http://www.scientificamerican.com/article.cfm?id=the-semantic-web)

• 2006: Tim Berners-Lee presented the initial design issues (rules) for Linked Data (http://www.w3.org/DesignIssues/LinkedData.html)

© 2011 Makx Dekkers Journeés ABES 2011 4

Page 5: Linked (Open) Data

W3C Semantic Web initiative

• Objective– to create a universal medium for the exchange of

data […] to smoothly interconnect personal information management, enterprise application integration, and the global sharing of commercial, scientific and cultural data

• Main results– Resource Description Framework (RDF), RDFa

(RDF-in-HTML), SPARQL Query Language

© 2011 Makx Dekkers Journeés ABES 2011 5

Page 6: Linked (Open) Data

Core Linked Data Specifications

• Transport– HTTP Hypertext Transfer Protocol

• Identification– URI Uniform Resource Identifier

• Description and linking– RDF Resource Description Framework

• Search and access– SPARQL Query Language for RDF

© 2011 Makx Dekkers Journeés ABES 2011 6

Page 7: Linked (Open) Data

The four rules of Linked Data

• TBL’s recommendations:1. Use URIs as names for things2. Use HTTP URIs so that people can look up those

names3. When someone looks up a URI, provide useful

information, using the standards (RDF*, SPARQL)4. Include links to other URIs so that they can

discover more things

© 2011 Makx Dekkers Journeés ABES 2011 7

Page 8: Linked (Open) Data

The basic model of RDF

• Resource Description Framework “triple”:– Subject: the “thing” (resource) described– Predicate: the characteristic of the resource– Object: the value of the characteristic

Subject ObjectPredicate

© 2011 Makx Dekkers Journeés ABES 2011 8

Page 9: Linked (Open) Data

Complex structures in RDF

This presentation

Makx Dekkers Barcelona

Journées ABES

ABES

Montpellier

17-18 May 2011

presenter

partOf organizer location

hometown

date

location

© 2011 Makx Dekkers Journeés ABES 2011 9

Page 10: Linked (Open) Data

Linked (Open / Enterprise) Data

• Commonalities– Using Semantic Web technologies (RDF)– Linking information resources, people, places

• Differences– Open Data with open licenses; Enterprise Data

mostly for closed, controlled environments– Open Data links to other Open Data, available for

external use; Enterprise Data may link to external data but not openly available for external use

© 2011 Makx Dekkers Journeés ABES 2011 10

Page 11: Linked (Open) Data

Linked Data -- Open Data

• Linked Data: focus on technology– Semantic Web: Resource Description Framework,

and other Web standards– Final solutions still under development

• Open Data: focus on strategy– Based on notion that sharing is important and

benefits all– Technology is secondary

© 2011 Makx Dekkers Journeés ABES 2011 11

Page 12: Linked (Open) Data

The five-star system

Source: http://inkdroid.org/journal/2010/06/04/the-5-stars-of-open-linked-data/

© 2011 Makx Dekkers Journeés ABES 2011 12

Page 13: Linked (Open) Data

The LOD diagram: 2007

Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/

© 2011 Makx Dekkers Journeés ABES 2011 13

25 datasets

Page 14: Linked (Open) Data

The LOD diagram: 2008

Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/

© 2011 Makx Dekkers Journeés ABES 2011 14

45 datasets

Page 15: Linked (Open) Data

The LOD diagram: 2009

Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/

© 2011 Makx Dekkers Journeés ABES 2011 15

95 datasets

Page 16: Linked (Open) Data

The LOD diagram: 2010

Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/

© 2011 Makx Dekkers Journeés ABES 2011 16

203 datasets

Page 17: Linked (Open) Data

RECENT DEVELOPMENTS

© 2011 Makx Dekkers Journeés ABES 2011 17

Page 18: Linked (Open) Data

W3C communities

• LinkingOpenData SWEO Community Project– Goal: to extend the Web with a data commons by

publishing various open data sets as RDF on the Web and by setting RDF links between data items from different data sources (http://www.w3.org/wiki/SweoIG/TaskForces/CommunityProjects/LinkingOpenData)

• Library Linked Data Incubator Group– to help increase global interoperability of library

data on the Web (http://www.w3.org/2005/Incubator/lld/)

© 2011 Makx Dekkers Journeés ABES 2011 18

Page 19: Linked (Open) Data

More W3C communities

• Government Linking Data Working Group– to provide standards and other information which

help governments around the world publish their data as effective and usable linked data (http://www.w3.org/2011/gld/charter)

• Semantic Web Health Care and Life Sciences (HCLS) Interest Group– to develop, advocate for, and support the use of

Semantic Web technologies for health care and life science (e.g. biology, medicine) (http://www.w3.org/2001/sw/hcls/)

© 2011 Makx Dekkers Journeés ABES 2011 19

Page 20: Linked (Open) Data

Open Knowledge Foundation, okfn.org

• not-for-profit organization promoting open knowledge: any kind of data and content that can be freely used, reused, and redistributed

• Working and Interest Groups, e.g.– Open Data in Science, Open Government Data,

Open Bibliographic Data, Cultural Heritage etc.

• CKAN.net: registry of open datasets and other “knowledge resources”

© 2011 Makx Dekkers Journeés ABES 2011 20

Page 21: Linked (Open) Data

Linked Data initiativesPredicate vocabularies (descriptors)

Research Description and Access (RDA) http://metadataregistry.org/rdabrowse.htm

The Bibliographic Ontology (BIBO) http://bibliontology.com/

Dublin Core http://dublincore.org/

Object vocabularies (values)

Virtual International Authority File (VIAF) http://viaf.org/

Library of Congress authorities http://id.loc.gov/authorities/

AgroVOC (agricultural terminology) e.g. http://aims.fao.org/aos/agrovoc/c_550

DBPedia (based on Wikipedia) e.g. http://dbpedia.org/page/Montpellier

Bibliographic data

LIBRIS Sweden e.g. http://libris.kb.se/library/S

British Library http://www.bl.uk/bibliographic/datasamples.html

CrossRef (DOI metadata) http://www.crossref.org/CrossTech/linked_data/

© 2011 Makx Dekkers Journeés ABES 2011 21

Page 22: Linked (Open) Data

More Linked Data initiativesBroadcasting, publishing

BBC http://www.bbc.co.uk/blogs/bbcinternet/linked_data/

New York Times http://data.nytimes.com/

Governments (small sample)

USA http://data.gov/

France http://opendata.paris.fr/

Finland http://data.suomi.fi/

UK http://data.gov.uk/

Spain (Cataluña) http://dadesobertes.gencat.cat/

Norway http://data.norge.no

Netherlands http://www.overheid.nl/opendata

Australia http://data.gov.au/

© 2011 Makx Dekkers Journeés ABES 2011 22

Page 23: Linked (Open) Data

COMPARING OBJECTIVES

© 2011 Makx Dekkers Journeés ABES 2011 23

Page 24: Linked (Open) Data

Strategic aspects Linked Data

• Achieving global interoperability with minimal coordination

• Aggregating human knowledge• Supporting democracy, transparency and

accountability• Enhancing and enriching information• Enabling user-driven and user-generated

applications

© 2011 Makx Dekkers Journeés ABES 2011 24

Page 25: Linked (Open) Data

Strategic aspects libraries

• Organizing information for use by specific users for specific goals

• Ensuring and maintaining quality• Sustaining services economically• Preserving information for the long term• Providing trusted services

© 2011 Makx Dekkers Journeés ABES 2011 25

Page 26: Linked (Open) Data

Functional aspects Linked Data

• Searching distributed collections • “Following your nose” – navigating links

between pieces of content• Distributing responsibility for making

statements about things• Leaving to the user whom and what to trust• Leaving development of products and services

to an open market (apps)

© 2011 Makx Dekkers Journeés ABES 2011 26

Page 27: Linked (Open) Data

Functional aspects libraries

• Describing information by professionals• Bringing together and managing aggregations

of information• Selecting relevant information• Mixing analogue and digital resources

© 2011 Makx Dekkers Journeés ABES 2011 27

Page 28: Linked (Open) Data

Technical aspects Linked Data

• Publishing and using machine-readable statements (“data that speak for themselves”)

• Focusing on Semantic Web technology• Enabling inferences across large distributed

data sets• (Still to be done) Solving issues around

harvesting, caching and real-time updating

© 2011 Makx Dekkers Journeés ABES 2011 28

Page 29: Linked (Open) Data

Technical aspects libraries

• Using proven technology to provide high-quality services

• Managing production systems and services• Guaranteeing performance, uptime,

consistency across data

© 2011 Makx Dekkers Journeés ABES 2011 29

Page 30: Linked (Open) Data

Agility versus sustainability

• In the Linked Data space:– Things move fast– Trial-and-error– Lots of development by volunteers (hackers)

• In the library domain:– Operational systems need to evolve– Need to handle legacy data– Development by professionals in managed

projects

© 2011 Makx Dekkers Journeés ABES 2011 30

Page 31: Linked (Open) Data

Data versus services

• In the Linked Data space:– Focus on availability of “raw data”– Quality is secondary– Data and technology should lead to useful results

• In the library domain:– Focus on services– Quality is essential– Data and technology in support of the service

© 2011 Makx Dekkers Journeés ABES 2011 31

Page 32: Linked (Open) Data

Economic aspects

• In the Linked Data space:– “Information wants to be free” – a human right?– Short-term thinking: today is hot, yesterday is not – Focus on applications to create value out of data

• In the library domain:– Long-term view: sustainability is crucial– Public money to provide community services– Expected to do more with less money

© 2011 Makx Dekkers Journeés ABES 2011 32

Page 33: Linked (Open) Data

OPPORTUNITIES AND RISKS

© 2011 Makx Dekkers Journeés ABES 2011 33

Page 34: Linked (Open) Data

Strong points Linked Data

• Attempt to create a common technical platform for machine-readable data

• Lots of enthusiasm in publishing open data• Promise of global interoperability• Mix of researchers, user communities,

hackers, professional data providers• High visibility on political level

© 2011 Makx Dekkers Journeés ABES 2011 34

Page 35: Linked (Open) Data

Risks Linked Data

• Driven by technology, not by requirements• Technology may not (yet) be stable – RDF 2.0?• Operational issues far from solved (reliability,

performance, quality, security, trust) • Hope for general agreement across domains

may not be realistic• Promise may turn into disappointment

© 2011 Makx Dekkers Journeés ABES 2011 35

Page 36: Linked (Open) Data

Strong points libraries

• Long time operational experience in managing information

• Professional intermediaries between users and information needs

• Sustainable business models (albeit with eternally shrinking budgets)

• Long-term perspective: the past (legacy data) as well as the future (preservation)

© 2011 Makx Dekkers Journeés ABES 2011 36

Page 37: Linked (Open) Data

Risks libraries

• Technologies change rapidly• New skills difficult to spread through the

organization• Some people see libraries as a thing of the

past (“the book museum”)• Underestimation of information handling skills• Information overload, human intervention

does not scale, need for better tools

© 2011 Makx Dekkers Journeés ABES 2011 37

Page 38: Linked (Open) Data

Meeting both worlds

• An example: Europeana.eu– Started out with domain perspectives (libraries,

archives, museums, audiovisual archives)– “Traditional” approach (metadata mappings)

works but insufficient– Using Linked Data approach preserves domain

specifics but allows for generalization to support common services

– Cross-domain (but co-ordinated) interoperability

© 2011 Makx Dekkers Journeés ABES 2011 38

Page 39: Linked (Open) Data

Europeana Data ModelClasses Properties

Simple example

Complex example

Source at:http://version1.europeana.eu/web/europeana-project/technicaldocuments/

© 2011 Makx Dekkers Journeés ABES 2011 39

Page 40: Linked (Open) Data

CONCLUSION

© 2011 Makx Dekkers Journeés ABES 2011 40

Page 41: Linked (Open) Data

Libraries and Linked Data

• Using Linked Data technology as the next step in connecting services

• Offering information management skills to the technology domain

• Creating a quality hub in the Linked Data space

© 2011 Makx Dekkers Journeés ABES 2011 41

Page 42: Linked (Open) Data

Best of both worlds

• Libraries providing stability and sustainability to Linked Data spaces

• Library professionals helping to manage the distributed collections

• Libraries delivering high-quality linked data to the Web

• Technologists to provide the next generation of systems and tools

© 2011 Makx Dekkers Journeés ABES 2011 42

Page 43: Linked (Open) Data

Linked (Open) Data: opportunity for libraries!

Thank you!

Makx [email protected]