27
Linked Data Publishing Ruben Taelman - @rubensworks imec - Ghent University 1

EKAW - Linked Data Publishing

Embed Size (px)

Citation preview

Page 1: EKAW - Linked Data Publishing

Linked Data Publishing Ruben Taelman - @rubensworks

imec - Ghent University

1

Page 2: EKAW - Linked Data Publishing

Publishing as part of any Linked Data life cycle

Generate Publish

Validate

QueryEnhance2

Publish Query... ...

Page 3: EKAW - Linked Data Publishing

Linked Data PublishingLinked Data Interfaces

Storage

Non-technical tasks

3

Page 4: EKAW - Linked Data Publishing

Linked Data PublishingLinked Data Interfaces

Storage

Non-technical tasks

4

Page 5: EKAW - Linked Data Publishing

Linked Data provides machine-accessible data Machines can retrieve and discover data through HTTP interfaces

Machines can understand the data

5

Page 6: EKAW - Linked Data Publishing

Ways for publishing Linked Data on the Web

Data dump

1 RDF document

Linked Data document

RDF document per topic

SPARQL endpoint

Expressive query interface

6

Page 7: EKAW - Linked Data Publishing

Data dumpSimple for data publisher

Data dumps can be large (~gigabytes)

Querying only possible after downloading entire dataset

7

Page 8: EKAW - Linked Data Publishing

Linked Data documentData is available in smaller fragments, according to subject

Linked Data principle of dereferencing

“3. When someone looks up a URI, provide useful information, using the open Web standards such as RDF, SPARQL” (Hyland 2013)

Querying only possible by traversing links

8

Page 9: EKAW - Linked Data Publishing

SPARQL endpointRequires higher computational effort from server

Single point to get and expose data

Easily queryable by clients

9

Page 10: EKAW - Linked Data Publishing

How do the different interfaces relate to each other?

10

Page 11: EKAW - Linked Data Publishing

Linked Data Fragments (LDF)A uniform view on Linked Data interfaces

high client effort high server effort

Data du

mp

LD do

cumen

t

SPARQL res

ult

11

(Verborgh 2016)

Page 12: EKAW - Linked Data Publishing

A big unexplored area on the LDF axis

high client effort high server effort

Data du

mp

LD do

cumen

t

SPARQL res

ult

?

12

(Verborgh 2016)

Page 13: EKAW - Linked Data Publishing

Triple Pattern Fragments (TPF),a trade-off between server and client effort

high client effort high server effort

Data du

mp

LD do

cumen

t

SPARQL res

ult

TPF

13

(Verborgh 2016)

Page 14: EKAW - Linked Data Publishing

Triple Pattern FragmentsLow-cost server interface

Fragmentation of a dataset by triple patterns

Client-side SPARQL query evaluation using a TPF interface

14

Page 15: EKAW - Linked Data Publishing

Choosing an LD interface as trade-off between server and client effort

15

Page 16: EKAW - Linked Data Publishing

URI policies for interfacesLinked Data uses URI’s as a global identification system

URI design principles also apply to interface URI’s:Persistent URI’s and redirectionDomain authority (e.g. government domain)Machine and human-readable representations through content negotiation...

16

Page 17: EKAW - Linked Data Publishing

Linked Data PublishingLinked Data Interfaces

Storage

Non-technical tasks

17

Page 18: EKAW - Linked Data Publishing

Interface and storage solution influence each other

18

Page 19: EKAW - Linked Data Publishing

Start with most restrictive elementStorage is fixed → storage, interface

Machine limitations → interface, storage

19

Page 20: EKAW - Linked Data Publishing

Storage solutions for Linked Data interfacesData dump

Linked Data document

Triple Pattern Fragments

SPARQL endpoint

RDF file, HDT, ...

Static or dynamic RDF files

RDF file, HDT, SPARQL engine, ...

SPARQL engine

20

Page 21: EKAW - Linked Data Publishing

Linked Data PublishingLinked Data Interfaces

Storage

Non-technical tasks

LicensingPublication announcementMaintenance

21

Page 22: EKAW - Linked Data Publishing

Linked Open Data requires an open licenseAll published data should have a connected license

Features of openness: (Open Knowledge Foundation)

Availability and accessReuse and redistributionUniversal participation

Popular open license: CC0

Mention license in dataset listings and in metadata

Confidential data might require restrictive license and security

22

Page 23: EKAW - Linked Data Publishing

Announcing to the publicCommunication channels: mailing lists, blogs, newsletters, …

Feedback channel: form or contact address for any issues

Centralized repositories (e.g. https://datahub.io)

Automated discovery with metadata (e.g. DCAT, VOID)

23

Page 24: EKAW - Linked Data Publishing

Linked Data Publication is a continuous processSocial contract with data consumers

Avoid dataset / interface removal

Data can change

Movement of dataset to new location → URI persistence!

Responsible entity behind feedback channel

24

Page 25: EKAW - Linked Data Publishing

Linked Data PublishingLinked Data Interfaces

Storage

Non-technical tasks

25

Page 26: EKAW - Linked Data Publishing

ConclusionsDifferent Linked Data interfaces exist for publishing Linked Data

Trade-off between server and client effort

Interface and storage solution influence each other

Properly license, announce and maintain your data

26

Page 27: EKAW - Linked Data Publishing

SourcesR Verborgh “Linked Data Publishing” http://rubenverborgh.github.io/WebFundamentals/linked-data-publishing/

Hyland B, Atemezing G, Villazón-Terrazas B. “Best Practises for Publishing Linked Data”https://www.w3.org/TR/ld-bp/

Berners-Lee, Tim. "Linked data, 2006." (2006). https://www.w3.org/DesignIssues/LinkedData.html

Villazón-Terrazas, Boris, et al. "Methodological guidelines for publishing government Linked Data." Linking government data. Springer New York, 2011. 27-49.

27