LOD case study: WW2 underground newspapers on Wikipedia Digital Access to Cultural Heritage, Leiden University, 3-3-2016 Olaf Janssen (Koninklijke Bibliotheek) [email protected] - @ookgezellig

Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016


LOD case study WW2 underground newspapers on Wikipedia

Digital Access to Cultural Heritage Leiden University 3-3-2016

Olaf Janssen (Koninklijke Bibliotheek)

olaf.janssen@kb.nl - @ookgezellig

What I hope you'll learn today

1. How to give a new life to an old paper book

2. How to get 1300 newspapers from WW2 on Wikipedia

While doing 1 and 2:

3. The advantages of linked open data (= downsides of unconnected data sources)

Olaf Janssen

Wikipedia & Open data coordinator

National Library of the Netherlands


During WW2, ± 1300 Dutch underground newspaper titles were issued in the Netherlands

In every shape & form…


http://resolver.kb.nl/resolve?urn=ddd:010436323

http://resolver.kb.nl/resolve?urn=ddd:010442948

http://resolver.kb.nl/resolve?urn=ddd:010447825

http://resolver.kb.nl/resolve?urn=ddd:010450508

From well-known big titles (among others Parool, Vrij Nederland, Trouw, De Waarheid) to very small, home-made, pamphlet-like issues

After the war, many titles were (physically) preserved at the NIOD, the national Institute for War, Holocaust and Genocide Studies in Amsterdam

Image credits:
• By Romaine - Own work, CC0, https://commons.wikimedia.org/w/index.php?curid=37072767
• https://commons.wikimedia.org/wiki/File:Verzetskrant_in_archiefdozen_bij_het_NIOD.jpg – CC-BY-SA - OlafJanssen
• By Romaine - Own work, CC0, https://commons.wikimedia.org/w/index.php?curid=37072734

These 1300 newspapers have been described in the KB-catalogue…

Bibliographic metadata

Like this one: De Geus onder studenten

PPN = unique ID of this title in the KB-catalogue

http://opc4.kb.nl/DB=1/PPN?PPN=107123223

These newspapers were digitised page by page… resulting in… full texts in Delpher

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

• Scans
• Full-text OCR

Again De Geus onder studenten

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

PPN = unique ID of this title in Delpher (same as for the KB-catalogue)

On Delpher you can read and (word)search this title

Say I want to know more about this newspaper:
• What sort/style of underground paper was De Geus?
• What is the history of this newspaper?
• Who worked on it?
• Where was this newspaper printed?
• How was De Geus distributed and financed?
• Were there any relations with other illegal newspapers or resistance groups?
• Etc…

Under "Details" perhaps?

OK, ok, some metadata…

but I want to know móóór…

Maybe in the catalogue record?

Problem with Delpher (and KB-catalogue)

Véry little contextual information about the newspaper (title)s

Question: Where would most people start searching for contextual information about De Geus onder studenten?

Probably Wikipedia (via Google)

http://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

https://www.youtube.com/watch?v=VREJV--VHSw

Report on interest in WW2 among the Dutch population (May 2015)

http://www.oorlogsbronnen.nl/gebruikersonderzoek2015

• "Many of us use the internet to search for information […] We often mention Wikipedia…"
• "Everything is of course on Wikipedia. Just type in a name and you can read entire essays" (man, 70s)
• "Over half of us think that Wikipedia and Google contribute to our knowledge and understanding of history"
• "When we have to find information about WW2 outside the class setting, we fully concentrate on digital resources like Google and Wikipedia" (school kids)

Context about De Geus onder studenten

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

is given in

http://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

But now… we have another problem…

https://nl.wikipedia.org/wiki/Categorie:Illegale_pers_in_de_Tweede_Wereldoorlog

… De Geus on Wikipedia is an exception

1. Very few underground newspapers have their own WP articles

2. The overview of these newspapers on WP is far from complete

Good news


There is a 1-fix solution to this contextual problem

De Ondergrondse Pers 1940-1945

By Lydia E. Winkel & H. de Vries

1989, ISBN 9021837463, Veen Uitgevers

This book ("De Winkel") contains contextual articles about (nearly) all ± 1300 illegal WW2 newspapers

"De Winkel" – nr. 199

Every article has a unique ID ("Winkel-ID")

Every article has metadata:
• Title, subtitle, motto
• Place of publication
• Period of publication
• Publication frequency (daily, weekly, one-off, irregular)
• Reproduction method (stencilled, printed, typed, handwritten)
• Contents (news, opinions, poems, illustrations, humor)
• Print run (min – max)
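The metadata fields listed above map naturally onto one structured record per article. A minimal sketch in Python: the class and field names (`WinkelArticle`, `print_run_min`, and so on) are hypothetical and do not come from the book or from any KB system. The example values are those given for De Geus onder studenten in the Wikipedia snippet shown near the end of this lecture; that entry nr. 199 belongs to this title is an assumption.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class WinkelArticle:
    """One article from De Winkel, reduced to its metadata fields (hypothetical layout)."""
    winkel_id: int            # unique ID of the article in the book
    title: str
    place: str                # place of publication
    period: str               # period of publication
    frequency: str            # daily, weekly, one-off, irregular, ...
    reproduction: List[str]   # stencilled, printed, typed, handwritten
    contents: List[str]       # news, opinions, poems, illustrations, humor
    print_run_min: int
    print_run_max: int


de_geus = WinkelArticle(
    winkel_id=199,  # assumption: the entry number shown on the slide
    title="De Geus onder studenten",
    place="Den Haag",
    period="4 October 1940 - 13 July 1944",
    frequency="monthly in 1940, 1941 and 1943, otherwise irregular",
    reproduction=["stencilled", "printed"],
    contents=["opinions"],
    print_run_min=250,
    print_run_max=8000,
)

print(de_geus.title, de_geus.print_run_min, de_geus.print_run_max)
```

Keeping the minimum and maximum print run as separate integers, rather than one text field, is what later makes the data easy to sort, filter and convert.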

Relation 1/3: Newspaper - Placename (semantics, linked data)

Contextual information

Nice material for a Wikipedia article

Very often persons related to this newspaper are mentioned

Relation 2/3: Newspaper - Persons (semantics, linked data)

Many articles also contain references to other newspapers:
• 106 = Cereales Vadeness (students' resistance newspaper, Wageningen)
• 360 = Leidsche Brief (students' resistance newspaper, Leiden)
• 748 = Sol Justitiae (students' resistance newspaper, Utrecht)

Relation 3/3: Newspaper - Other newspapers (semantics, linked data)
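The three relations can be written down as subject-predicate-object statements ("triples"), which is the basic idea behind linked data. A minimal sketch in plain Python, assuming the article shown is the one for De Geus onder studenten; the predicate names (`placeOfPublication`, `relatedPerson`, `relatedNewspaper`) are made up for illustration and are not taken from any real vocabulary.

```python
from typing import List, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)

SUBJECT = "De Geus onder studenten"  # assumption: the article shown on the slide

triples: List[Triple] = [
    # Relation 1/3: newspaper - placename
    (SUBJECT, "placeOfPublication", "Den Haag"),
    # Relation 2/3: newspaper - persons
    (SUBJECT, "relatedPerson", "Jan Drion"),
    (SUBJECT, "relatedPerson", "Huib Drion"),
    # Relation 3/3: newspaper - other newspapers (by Winkel-ID)
    (SUBJECT, "relatedNewspaper", "Winkel-ID 106"),
    (SUBJECT, "relatedNewspaper", "Winkel-ID 360"),
    (SUBJECT, "relatedNewspaper", "Winkel-ID 748"),
]


def objects(subject: str, predicate: str, data: List[Triple]) -> List[str]:
    """Return every object linked to `subject` through `predicate`."""
    return [o for s, p, o in data if s == subject and p == predicate]


print(objects(SUBJECT, "relatedPerson", triples))
```

Once the statements are in this shape, a question such as "which persons are related to this newspaper?" becomes a simple lookup instead of a read through the article text.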


Too bad it's a paper book: hard to find, multiply, distribute and build upon

We need it digital


1. Clear copyright with the copyright holder (NIOD): open CC-BY-SA license

2. Scan & OCR

3. Convert into PDF

4. Put online: NIOD site & Wikimedia Commons

De Winkel as PDF on the NIOD website (CC-BY-SA): http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons: https://commons.wikimedia.org/wiki/File:PDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989.pdf

Saved us €13330: http://www.brill.com/dutch-underground-press-1940-1945

Wikipedia article about De Winkel: http://nl.wikipedia.org/wiki/De_ondergrondse_pers_1940-1945

Wikipedia article about the author: http://nl.wikipedia.org/wiki/Lydia_Winkel

Winkel: the plusses

• Available online (PDF flat file)
• Open license (CC-BY-SA)
• Contextual information
• Relations:
  • Titles - Places
  • Titles - Persons
  • Titles - Other titles


Winkel: the minuses

• Unstructured data (PDF flat file)
• Not very machine-readable (unlike CSV, XML, JSON, RDF)
• PDF is no (real) open standard (unlike CSV, XML, JSON, RDF)
• No links between titles and Delpher & KB-cat
• No links between titles, places & persons and external sources (like Wikipedia)

To summarize: a lot of information is available about these WW2 underground newspapers

1. Metadata (KB-cat)
2. Content (full-text, Delpher)
3. Context (Winkel PDF)
4. Relations: titles - places, persons, other titles (Winkel PDF)
5. External resources about titles, places and persons

but the data sources are unconnected (and for 3+4 unstructured & not machine-readable), making discovery, understanding & research of these newspapers (and related places & persons) more difficult than necessary

Good news: we can solve all these issues

Wikiproject Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

tinyurl.com/verzetskranten (in Dutch)

About Wikiprojects: https://nl.wikipedia.org/wiki/Wikipedia:Wikiproject, https://en.wikipedia.org/wiki/Wikipedia:Wikiproject

From 14 to 1300 titles: https://nl.wikipedia.org/wiki/Categorie:Illegale_pers_in_de_Tweede_Wereldoorlog

We need a database

Build central database

https://www.youtube.com/watch?v=GVDGuCjog_0

Step 1: Create Excel sheet with
• Metadata about newspaper (from Winkel PDF)
• Unique Wikipedia article title
• Contextual info incl. related persons & titles (from Winkel PDF)
• PPN: unique ID linking newspaper to KB-catalogue & Delpher

Columns in the sheet:
• Winkel-ID
• Title
• Place of publication
• Other metadata
• Unique Wikipedia article title = <Newspaper title> (verzetsblad, <Place of publication>)
• Contextual info
• Related persons
• Related newspaper titles
• PPN (107123223), linking to
  http://opc4.kb.nl/DB=1/PPN?PPN=107123223
  http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223
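Because the PPN is the same in the KB-catalogue and in Delpher, both links can be derived from that single value. A minimal sketch, assuming the URL patterns below, which are reconstructed from the example links for De Geus onder studenten and are not a documented KB or Delpher API; the function names are hypothetical, and the article-title pattern follows the naming rule from the sheet.

```python
def catalogue_url(ppn: str) -> str:
    """Link to the KB-catalogue record for a PPN (assumed URL pattern)."""
    return f"http://opc4.kb.nl/DB=1/PPN?PPN={ppn}"


def delpher_url(ppn: str) -> str:
    """Link to the Delpher search results for a PPN (assumed URL pattern)."""
    return (
        "http://www.delpher.nl/nl/kranten/results"
        f"?coll=dddtitel&cql[]=ppn+any+({ppn})"
    )


def article_title(title: str, place: str) -> str:
    """Unique Wikipedia article title: <Newspaper title> (verzetsblad, <Place>)."""
    return f"{title} (verzetsblad, {place})"


ppn = "107123223"  # PPN of De Geus onder studenten
print(catalogue_url(ppn))
print(delpher_url(ppn))
print(article_title("De Geus onder studenten", "Den Haag"))
```

Storing only the PPN in the sheet and generating the links keeps the 1300 rows consistent if a URL pattern ever changes.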

Build central database

Step 2: Convert Excel into RDF triplestore (= special kind of online database anybody can access)
• Steps 1-4 from http://linda-project.eu/linked-data-primer-2
• Step 4: Vocabulary used = Bibframe (http://bibframe.org/vocab)
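To give a feel for what "converting a sheet into triples" means, here is a minimal sketch that reads a CSV export of such a sheet and writes one N-Triples statement per cell. It is not the conversion used in the project: the column names, the `example.org` base URI and the vocabulary namespace are placeholders (the project itself used Bibframe terms, which are not reproduced here), and 199 as the Winkel-ID of this title is an assumption.

```python
import csv
import io
from typing import Dict, List

BASE = "http://example.org/verzetskranten/"  # hypothetical base URI for newspapers
VOCAB = "http://example.org/vocab/"          # placeholder namespace, NOT Bibframe

# Stand-in for a CSV export of the Excel sheet (one row per newspaper).
SHEET = """winkel_id,title,place,ppn
199,De Geus onder studenten,Den Haag,107123223
"""


def escape(value: str) -> str:
    """Escape backslashes and double quotes for an N-Triples string literal."""
    return value.replace("\\", "\\\\").replace('"', '\\"')


def row_to_ntriples(row: Dict[str, str]) -> List[str]:
    """Turn one sheet row into N-Triples lines, one per data column."""
    subject = f"<{BASE}{row['winkel_id']}>"
    return [
        f'{subject} <{VOCAB}{column}> "{escape(row[column])}" .'
        for column in ("title", "place", "ppn")
    ]


def sheet_to_ntriples(text: str) -> List[str]:
    """Convert the whole sheet into a flat list of N-Triples lines."""
    lines: List[str] = []
    for row in csv.DictReader(io.StringIO(text)):
        lines.extend(row_to_ntriples(row))
    return lines


ntriples = sheet_to_ntriples(SHEET)
print("\n".join(ntriples))
```

The resulting lines can be loaded into any triplestore that accepts N-Triples; choosing a shared vocabulary such as Bibframe instead of the placeholder namespace is what makes the data understandable to others.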

Build central database

Step 3: Link to external resources
• Step 5 from http://linda-project.eu/linked-data-primer-2
• DBpedia = machine-readable, structured version of Wikipedia. DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data
• DBpedia = hub for linking different data sets on the Web to each other (Linked Open Data cloud: http://lod-cloud.net/versions/2010-09-22/lod-cloud_colored.png)
• We use DBpedia to connect persons & places in our newspaper database to information in other databases

Example: Huib Drion
• http://nl.dbpedia.org/page/Huib_Drion
• https://nl.wikipedia.org/wiki/Huib_Drion
• http://www.dbnl.org/auteurs/auteur.php?id=drio001
• http://www.biografischportaal.nl/persoon/41181342
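Linking a person in the newspaper database to DBpedia comes down to recording that the local entry and the DBpedia resource describe the same thing. A minimal sketch, assuming the common DBpedia convention that a Wikipedia article title maps to a resource URI under `/resource/`; the local URI under `example.org` is hypothetical, and the function name is made up for this example.

```python
from urllib.parse import urlparse

OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


def dbpedia_uri(wikipedia_url: str) -> str:
    """Derive a DBpedia resource URI from a Wikipedia article URL (assumed convention)."""
    parsed = urlparse(wikipedia_url)
    language = parsed.netloc.split(".")[0]        # "nl" for nl.wikipedia.org
    title = parsed.path.rsplit("/wiki/", 1)[-1]   # "Huib_Drion"
    host = "dbpedia.org" if language == "en" else f"{language}.dbpedia.org"
    return f"http://{host}/resource/{title}"


local_person = "http://example.org/verzetskranten/person/huib-drion"  # hypothetical
target = dbpedia_uri("https://nl.wikipedia.org/wiki/Huib_Drion")

# One N-Triples statement saying both URIs identify the same person.
same_as_triple = f"<{local_person}> <{OWL_SAME_AS}> <{target}> ."
print(same_as_triple)
```

With this one statement in place, software that starts at the newspaper database can follow the link to DBpedia and from there to other data sets.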

Build central database

Added value of Linked Open Data & DBpedia: software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in
• KB-catalogue
• Delpher
• De Winkel
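"Automatically query" in practice means sending a SPARQL query to a public endpoint. The sketch below only builds the query and the request URL and does not contact any server; the endpoint address and the resource URI are assumptions based on common DBpedia conventions, and the function name is hypothetical.

```python
from urllib.parse import urlencode

ENDPOINT = "http://nl.dbpedia.org/sparql"              # assumed endpoint address
RESOURCE = "http://nl.dbpedia.org/resource/Huib_Drion"  # assumed resource URI


def describe_query(resource_uri: str, limit: int = 50) -> str:
    """Build a SPARQL query listing the properties and values of one resource."""
    return (
        "SELECT ?property ?value WHERE {\n"
        f"  <{resource_uri}> ?property ?value .\n"
        "}\n"
        f"LIMIT {limit}"
    )


query = describe_query(RESOURCE)
request_url = ENDPOINT + "?" + urlencode({"query": query})

print(query)
print(request_url)
```

Running the same query for every person and place in the database is how details that are missing from the catalogue, Delpher and De Winkel could be collected without manual lookups.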

Summary: data about 1300 newspapers

• Available online
• Open license (CC-BY-SA)
• Contextual information
• Relations:
  • Titles - Places
  • Titles - Persons
  • Titles - Other titles
• Structured data (RDF triples)
• Open standard (RDF)
• Links between titles and Delpher & KB-cat (via PPNs)
• Links between titles, places & persons and external sources (via DBpedia)

http://5stardata.info/en/

Wikiproject Verzetskranten

We need a template

Using an article template we can generate 1300 uniform and interlinked Wikipedia articles from the LOD-database


LOD-database + article template = Wikipedia article

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

Grey = from database or predefined fixed strings:
• Titles (from database)
• Metadata (from database)
• Related persons (from database)
• Related newspapers (from database)
• Winkel-ID (from database)
• Link to full-texts in Delpher (from database)
• Link to KB-catalogue record (from database)
• Fixed categories on WP and Commons (predefined fixed strings)

Uniformity between articles guaranteed

Everything else: all that WP-writers need to add manually
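The idea of "database + template = article" can be illustrated with Python's standard `string.Template`. This is a sketch of the principle only, not the template used in the Wikiproject: the wording of the template, the field names and the record layout are made up, and the links follow URL patterns assumed from the example links in this lecture.

```python
from string import Template

# Fixed strings live in the template; $placeholders are filled from the database.
ARTICLE_TEMPLATE = Template(
    "$title was a resistance newspaper from the Second World War, "
    "published in $place from $start to $end.\n"
    "Related persons: $persons\n"
    "Winkel-ID: $winkel_id\n"
    "Full texts in Delpher: $delpher_url\n"
    "KB-catalogue record: $catalogue_url\n"
)


def render_article(record: dict) -> str:
    """Fill the article template with the values of one database record."""
    values = dict(record)
    values["persons"] = ", ".join(record["persons"])  # list -> readable string
    return ARTICLE_TEMPLATE.substitute(values)


record = {
    "title": "De Geus onder studenten",
    "place": "Den Haag",
    "start": "4 October 1940",
    "end": "13 July 1944",
    "persons": ["Jan Drion", "Huib Drion"],
    "winkel_id": 199,  # assumption: the entry number shown on the slide
    "delpher_url": "http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)",
    "catalogue_url": "http://opc4.kb.nl/DB=1/PPN?PPN=107123223",
}

article = render_article(record)
print(article)
```

Because every article is produced by the same template, the structure and the fixed wording are identical across all titles, which is where the guaranteed uniformity comes from.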

Problem with Delpher (and KB-catalogue)

Véry little contextual information about the newspaper (title)s

The KB can re-use (embed) the Wikipedia content in its own websites to tackle this problem

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

Delpher - search results

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

Embedded contextual snippet from Wikipedia:

"De Geus (onder studenten) was a resistance newspaper from the Second World War that, from 4 October 1940 up to and including 13 July 1944 … Read more on Wikipedia"

Delpher - object presentation

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

Embedded contextual snippet from Wikipedia:

About De Geus onder studenten

"De Geus (onder studenten) was a resistance newspaper from the Second World War, published in The Hague from 4 October 1940 up to and including 13 July 1944. The paper appeared monthly in 1940, 1941 and 1943 and otherwise irregularly, with a print run of between 250 and 8000 copies. It was initially stencilled and, from November 1942, printed; its content consisted mainly of opinion articles.

The paper was published by Jan Drion and Huib Drion, two Leiden… Read more on Wikipedia"
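An embedded snippet of this kind is essentially the start of the Wikipedia article, cut off at a sensible length, followed by a link back. A minimal sketch of that cutting step, assuming the article text has already been retrieved; the function name, the length limit and the wording of the link are choices made for this example, not the way Delpher implements it.

```python
def make_snippet(extract: str, article_url: str, max_chars: int = 160) -> str:
    """Shorten an article extract at a word boundary and append a read-more link."""
    if len(extract) <= max_chars:
        body = extract
    else:
        # Cut at the limit, then drop the last (possibly partial) word.
        body = extract[:max_chars].rsplit(" ", 1)[0] + " …"
    return f"{body} Read more on Wikipedia: {article_url}"


extract = (
    "De Geus (onder studenten) was a resistance newspaper from the Second "
    "World War, published in Den Haag from 4 October 1940 up to and "
    "including 13 July 1944. The paper appeared monthly in 1940, 1941 and "
    "1943 and otherwise irregularly."
)
url = "https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)"

snippet = make_snippet(extract, url, max_chars=80)
print(snippet)
```

A short limit suits the search-results list, while a longer one suits the object presentation page, where there is room for a full paragraph.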

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web
  20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together.
• https://en.wikipedia.org/wiki/Linked_data
  Wikipedia article related to the above video
• http://5stardata.info/en/
  The 5 stars of Linked Open Data (Tim Berners-Lee)
• http://linda-project.eu/linked-data-primer-2
  Short primer about creating LOD in practice, starting from an Excel sheet
• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31
  The figure near the bottom of the first page is a good illustration of the concept of (linked) triples
• https://en.wikipedia.org/wiki/DBpedia
• https://en.wikipedia.org/wiki/Semantic_network


Questions?

olaf.janssen@kb.nl - @ookgezellig

tinyurl.com/verzetskranten

htt

p

ww

wg

etty

imag

esn

ld

etai

ln

ieu

wsf

oto

st

hre

e-w

om

en-o

f-th

e-at

s-lig

ht-

up

-to

geth

er-a

ts-r

egu

lati

on

s-n

ieu

wsf

oto

s3

09

42

65

Page 2: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

What I hope yoursquoll learn today

1 How to give a new life to an old paper book

2 How to get 1300 newspapers from WW2 on Wikipedia

While doing 1 and 2 3 The advantages of linked open data

(= downsides of unconnected data sources)

Olaf Janssen

Wikipedia amp Open data coordinator

National library of the

Netherlands

htt

p

ww

w4

en5

mei

amst

erd

amn

lat

tach

men

t4

74

54

During WW2 plusmn 1300 Dutch underground newspaper titles have been

issued in NL

In every shape amp formhellip

htt

p

ww

w4

en5

mei

amst

erd

amn

lat

tach

men

t4

74

54

httpresolverkbnlresolveurn=ddd010436323

httpresolverkbnlresolveurn=ddd010442948

httpresolverkbnlresolveurn=ddd010447825 httpresolverkbnlresolveurn=ddd010450508

From well-known big titles

(oa Parool Vrij Nederland Trouw de Waarheid)

To very small home-made pamphlet-like

issues

After the war many titles have

been (physically) preserved at the NIOD hellip

The national Institute for War Holocaust and Genocide Studies in Amsterdam

By Romaine - Own work CC0 httpscommonswikimediaorgwindexphpcurid=37072767

httpscommonswikimediaorgwikiFileVerzetskrant_in_archiefdozen_bij_het_NIODjpg ndash CC-BY-SA - OlafJanssen

By Romaine - Own work CC0 httpscommonswikimediaorgwindexphpcurid=37072734

These 1300 newspapers have been described in the KB-cataloguehellip

Bibliographic metadata

Like this one De Geus onder studenten

PPN = unique ID of this title

in KB-catalogue

httpopc4kbnlDB=1PPNPPN=107123223

These newspapers were digitised page by pagehellip resulting in hellip

full texts in Delpher

httpresolverkbnlresolveurn=ddd010424553mpeg21p001

bull Scans bull Full-text OCR

Again De Geus onder studenten

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

PPN = unique ID of this title in Delpher

(same at for KB-catalogue)

Again De Geus onder studenten

On Delpher you can read and (word)search this title

Say I want to know more about this newspaper bull What sortstyle of underground paper was De Geus bull What is the history of this newspaper bull Who were working on it bull Where was this newpaper printed bull How was De Geus distributed and financed bull Were there any relations with other illegal newspapers or resistance

groups bull Etchellip

Under ldquoDetailsrdquo perhaps

OK ok some metadatahellip

but I want to know moacuteoacuteoacuterhellip

Maybe in the catalogue record

Problem with Delpher (and KB-catalogue)

Veacutery little contextual information about the newspaper(title)s

httpsthejungleisneutralfileswordpresscom201311lostjpg

Question Where would most people start searching contextual

information about De Geus onder studenten

Probably Wikipedia (via Google)

httpnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

Question Where would most people start searching contextual

information about De Geus onder studenten

Probably Wikipedia (via Google)

httpswwwyoutubecomwatchv=VREJV--VHSw

Report on interest in WW2 among Dutch population

httpwwwoorlogsbronnennlgebruikersonderzoek2015 May 2015

Many of us use the internet to search for information [] We often mention Wikipediahellip

Everything is of course on Wikipedia Just type in a name and you can read entire essays (man 70s)

Over half of us think that Wikipedia and Google contribute to our knowledge and understanding of history

When we have to find information about WW2 outside the class setting we fully concentrate on digital resources like Google and Wikipedia (school kids)

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

httpnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

is given in

Context about De Geus onder studenten

But nowhellip

We have another problemhellip

httpsnlwikipediaorgwikiCategorieIllegale_pers_in_de_Tweede_Wereldoorlog

hellip De Geus on Wikipedia is an exception

1 Very few underground newspapers have their own WP articles

2 The overview of these newspapers on WP is far from complete

Good news

htt

p

ww

wa

rch

ives

go

vre

sear

chm

ilita

ryw

w2

ph

oto

sim

ages

ww

2-1

94

jpg

There is a 1-fix solution to this contextual problem

De Ondergrondse Pers 1940-1945

By Lydia E Winkel amp H de Vries

1989 ISBN 9021837463 Veen Uitgevers

This book (ldquoDe Winkelrdquo) contains

contextual articles about

(nearly) all plusmn 1300 illegal WW2 newspapers

ldquoDe Winkelrdquo ndash nr 199

De Ondergrondse Pers 1940-1945 Lydia E Winkel H de Vries 1989

ISBN 9021837463 Veen Uitgevers

Every article has a unique ID

(ldquoWinkel-IDrdquo)

Every article has metadata

bull Title subtitle motto bull Place of publication bull Period of publication bull Publication frequency (daily weekly one-off irregular)

bull Multiplication (stenciled printed typed handwritten)

bull Contents (news opinions poems illustrations humor)

bull Number of prints (min ndash max)

Relation 13

Newspaper Placename

semantics linked data

Relation 13

Newspaper Placename

semantics linked data

Contextual information

Nice material

for a Wikipedia article

Very often persons related to this newspaper are mentioned

Relation 23

Newspaper Persons

semantics linked data

Many articles also contain references to other newspapers

bull 106 = Cereales Vadeness (students resistance newspaper Wageningen) bull 360 = Leidsche Brief (students resistance newspaper Leiden) bull 748 = Sol Justitiae (students resistance newspaper Utrecht)

Relation 33

Newspaper Other newspapers

semantics linked data

htt

p

htt

ps

kn

ow

led

geu

top

iaf

iles

wo

rdp

ress

co

m2

01

40

1h

olla

nd

ho

use

libra

ryb

litz1

94

0j

pg

Too bad itrsquos a paper book hard to find multiply distribute and build upon

We need it digital

htt

p

htt

ps

kn

ow

led

geu

top

iaf

iles

wo

rdp

ress

co

m2

01

40

1h

olla

nd

ho

use

libra

ryb

litz1

94

0j

pg

We need it digital

1 Clear copyright with copyright holder (NIOD) Open CC-BY-SA license

2 Scan amp OCR

3 Convert into PDF

4 Put online NIOD site amp Wikimedia Commons

De Winkel as PDF on NIOD website (CC-BY-SA) httpwwwniodnlnlde-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons httpscommonswikimediaorgwikiFilePDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989pdf

De Winkel as PDF on NIOD website (CC-BY-SA) httpwwwniodnlnlde-ondergrondse-pers-1940-1945

Saved us euro13330 httpwwwbrillcomdutch-underground-press-1940-1945

Wikipedia article about De Winkel httpnlwikipediaorgwikiDe_ondergrondse_pers_1940-1945

Wikipedia article about the author httpnlwikipediaorgwikiLydia_Winkel

Winkel the plusses

Available online (PDF flat file)

Open license (CC-BY-SA) Contextual information Relations

bull Titles Places bull Titles Persons

bull Titles Other titles

htt

p

ww

wa

rch

ives

go

vre

sear

chm

ilita

ryw

w2

ph

oto

sim

ages

ww

2-1

94

jpg

htt

p

2b

pb

logsp

ot

com

_BW

zuYw

iS6-I

TM

geR

sFd3m

IAAAAAAAAElw

3cv

gbZSPW

css

1600d

oct

or+

macr

o+

judy+

scare

djpg

Winkel the minusses

Unstructured data (PDF flat file)

Not very machine readable (unlike CSV XML JSON RDF)

PDF is no (real) open standard (unlike CSV XML JSON RDF)

No links between titles Delpher amp KB-cat No links between titles places amp persons

external sources (like Wikipedia)

but the data sources are

unconnected (and for 3+4 unstructured amp not machine-readable)

To summarize

a lot of information is available about these WW2 underground newspapers

1 Metadata (KB-cat)

2 Content (full-text Delpher)

3 Context (Winkel PDF)

4 Relations titles places persons other titles (Winkel PDF)

5 External resources about titles places and persons

htt

p

2b

pb

logsp

ot

com

_BW

zuYw

iS6-I

TM

geR

sFd3m

IAAAAAAAAElw

3cv

gbZSPW

css

1600d

oct

or+

macr

o+

judy+

scare

djpg

making discovery understanding amp research

of these newspapers (and related places amp persons) more difficult than necessary

We can solve all these issues

Good news

Wikiproject() Verzetskranten

Systematically and uniformly describe amp link all 1300 Dutch underground newspapers from WW2

on Dutch Wikipedia

tinyurlcomverzetskranten (in Dutch)

httpsnlwikipediaorgwikiWikipediaWikiproject httpsenwikipediaorgwikiWikipediaWikiproject

From 14 1300 titles

htt

ps

n

lwik

iped

iao

rgw

iki

Cat

ego

rie

Illeg

ale_

per

s_in

_de_

Twee

de_

Wer

eld

oo

rlo

g

Wikiproject() Verzetskranten

Systematically and uniformly describe amp link all 1300 Dutch underground newspapers from WW2

on Dutch Wikipedia

tinyurlcomverzetskranten (in Dutch)

We need a database

Build central database

httpswwwyoutubecomwatchv=GVDGuCjog_0

Build central database

httpswwwyoutubecomwatchv=GVDGuCjog_0

Step 1 Create Excel-sheet with bull Metadata about newspaper (from Winkel PDF)

bull Unique Wikipedia article title

bull Contextual info incl related persons amp titles (from Winkel PDF)

bull PPN unique ID linking newspaper to

KB-catalogue amp Delpher

Winkel-ID

Place of publication

Other metadata

Title

Unique Wikipedia article title =

ltNewspaper titlegt (verzetsblad ltPlace of publicationgt)

bull Contextual info bull Related persons

bull Related newspaper titles

PPN (107123223)

httpopc4kbnlDB=1PPNPPN=107123223

httpopc4kbnlDB=1PPNPPN=107123223

PPN (107123223)

PPN (107123223)

httpwwwdelphernlnlkrantenresultsindexcoll=dddtitelampcql[]=ppn3D107123223

httpwwwdelphernlnlkrantenresultsindexcoll=dddtitelampcql[]=ppn3D107123223

PPN (107123223)

Build central database

Step 2 Convert Excel into RDF triplestore (=special kind of online database anybody can access)

bull Steps 1-4 from httplinda-projecteulinked-

data-primer-2

bull Step 4 Vocubulary used = Bibframe (httpbibframeorgvocab)

Build central database

Step 3 Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

bull DBpedia = hub for linking different data sets on the Web to each

other Linked Open Data cloud

bull Connect persons amp places in newspaper database to external resources via DBpedia

Step 1c Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data

bull Connect persons amp places in newspaper database to external

resources via DBpedia

httplod-cloudnetversions2010-09-22lod-cloud_coloredpng

Linked Open Data cloud

Build central database

Step 3 Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

bull DBpedia = hub for linking different data sets on the Web to each

other Linked Open Data cloud

bull We use DBpdia to connect persons amp places in our newspaper database to information in other databases

httpnldbpediaorgpageHuib_Drion

httpsnlwikipediaorgwikiHuib_Drion

httpwwwdbnlorgauteursauteurphpid=drio001

httpwwwbiografischportaalnlpersoon41181342

Build central database

Added value of Linked Open Data amp DBpedia Software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in bull KB-catalogue bull Delpher bull De Winkel

Summary data about 1300 newspapers

Available online Structured data (RDF-triples) Open license (CC-BY-SA) Open standard (RDF) Contextual information Links between titles Delpher amp KB-cat Relations Links between titles places

bull Titles Places amp persons external bull Titles Persons sources (via DBpedia) bull Titles Other titles

Summary data about 1300 newspapers

Available online Structured data (RDF-triples) Open license (CC-BY-SA) Open standard (RDF) Contextual information Links between titles Delpher amp KB-cat Relations Links between titles places

bull Titles Places amp persons external bull Titles Persons sources (via DBpedia) bull Titles Other titles

Summary data about 1300 newspapers

Available online Structured data (RDF-triples)

Open license (CC-BY-SA) Open standard (RDF)

Contextual information Links between titles Delpher amp KB-cat (via PPNs)

Relations Links between places

bull Titles Places amp persons external bull Titles Persons sources (via DBpedia)

bull Titles Other titles

(PPNs)

http5stardatainfoen

Wikiproject Verzetskranten

Systematically and uniformly describe amp link all 1300 Dutch underground newspapers from WW2

on Dutch Wikipedia

We need a template

Using an article template we can generate 1300 uniform and interlinked Wikipedia articles

from the LOD-database

htt

ps

c1

sta

ticf

lickr

co

m9

82

81

76

99

23

19

18

_11

a73

56

c38

_bjp

g

LOD-database + article template = Wikipedia article

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

Titles from database

Metadata from database

Related persons from database

Related newspapers from database

Winkel-ID from database

Link to full-texts in Delpher from database

Link to KB-catalogue record from database

Fixed categories on WP and Commons

Predefined fixed strings

Grey = bull From database bull Predefined fixed strings

Uniformity between articles guaranteed

All that WP-writers need to add manually

Problem with Delpher (and KB-catalogue)

Veacutery little contextual information about the newspaper(title)s

httpsthejungleisneutralfileswordpresscom201311lostjpg

Problem with Delpher (and KB-catalogue)

Veacutery little contextual information about the newspaper(title)s

httpsthejungleisneutralfileswordpresscom201311lostjpg

The KB can re-use (embed) the Wikipedia content in its own

websites to tackle this problem

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_28verzetsblad29

Delpher - search results

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog

dat vanaf 4 oktober 1940 tot en met 13 juli 1944 hellip Lees verder op Wikipedia

Embedded contextual snippet from Wikipedia

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

Delpher - search results

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog

dat vanaf 4 oktober 1940 tot en met 13 juli 1944 hellip Lees verder op Wikipedia

Delpher - object presentation

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

Embedded contextual snippet from Wikipedia:

About De Geus onder studenten

"De geus (onder studenten) was a resistance paper from the Second World War, published in The Hague from 4 October 1940 up to and including 13 July 1944. The paper appeared monthly in 1940, 1941 and 1943 and irregularly otherwise, in a print run of between 250 and 8000 copies. It was initially stencilled and, from November 1942, printed; its content consisted mainly of opinion articles.

The paper was published by Jan Drion and Huib Drion, two Leiden…

Read more on Wikipedia"
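How such a snippet could be produced is sketched below. The slides do not say how the KB fetches the text, so this is an assumption: it uses the MediaWiki `action=query` API with `prop=extracts` (the TextExtracts module) to request the plain-text introduction, and a hypothetical helper `make_snippet` to shorten it. No network call is made in the sketch itself.

```python
from urllib.parse import urlencode

API_ENDPOINT = "https://nl.wikipedia.org/w/api.php"


def intro_extract_url(article_title):
    """URL asking the MediaWiki API for the plain-text intro of an article."""
    params = {
        "action": "query",
        "prop": "extracts",
        "exintro": 1,       # only the section before the first heading
        "explaintext": 1,   # plain text instead of HTML
        "format": "json",
        "titles": article_title,
    }
    return API_ENDPOINT + "?" + urlencode(params)


def make_snippet(text, article_url, max_chars=150):
    """Cut text at a word boundary and append a 'read more' pointer."""
    if len(text) > max_chars:
        text = text[:max_chars].rsplit(" ", 1)[0] + " …"
    return f"{text} Read more on Wikipedia: {article_url}"


url = intro_extract_url("De Geus onder studenten (verzetsblad)")
short = make_snippet("a b c", "U", max_chars=3)
```

A short limit suits the search-results view, a longer one the object presentation.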

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web : 20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together.

• https://en.wikipedia.org/wiki/Linked_data : Wikipedia article related to the above video

• http://5stardata.info/en/ : The 5 stars of Linked Open Data (Tim Berners-Lee)

• http://linda-project.eu/linked-data-primer-2 : Short primer about creating LOD in practice, starting from an Excel sheet

• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31 : The figure near the bottom of the first page is a good illustration of the concept of (linked) triples

• https://en.wikipedia.org/wiki/DBpedia

• https://en.wikipedia.org/wiki/Semantic_network


Questions?

olaf.janssen@kb.nl - @ookgezellig

tinyurl.com/verzetskranten


Page 3: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

During WW2, ± 1300 Dutch underground newspaper titles were issued in NL.

In every shape & form…

http://resolver.kb.nl/resolve?urn=ddd:010436323

http://resolver.kb.nl/resolve?urn=ddd:010442948

http://resolver.kb.nl/resolve?urn=ddd:010447825

http://resolver.kb.nl/resolve?urn=ddd:010450508

From well-known big titles (among others Parool, Vrij Nederland, Trouw, de Waarheid) to very small, home-made, pamphlet-like issues.

After the war, many titles were (physically) preserved at the NIOD, the national Institute for War, Holocaust and Genocide Studies in Amsterdam.

By Romaine - Own work, CC0, https://commons.wikimedia.org/w/index.php?curid=37072767

https://commons.wikimedia.org/wiki/File:Verzetskrant_in_archiefdozen_bij_het_NIOD.jpg – CC-BY-SA - OlafJanssen

By Romaine - Own work, CC0, https://commons.wikimedia.org/w/index.php?curid=37072734

These 1300 newspapers have been described in the KB-catalogue…

Bibliographic metadata, like this one: De Geus onder studenten.

PPN = unique ID of this title in the KB-catalogue

http://opc4.kb.nl/DB=1/PPN?PPN=107123223

These newspapers were digitised page by page… resulting in… full texts in Delpher:

• Scans
• Full-text OCR

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

Again: De Geus onder studenten

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

PPN = unique ID of this title in Delpher (same as for the KB-catalogue).

On Delpher you can read and (word)search this title.
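Because the same PPN identifies a title in both systems, both links can be derived from that one value. The sketch below assumes the URL patterns are exactly those of the two example links on the slides; the function names are hypothetical.

```python
# URL patterns copied from the De Geus example links (assumed to hold
# for every title; the slides only show this one PPN).
CATALOGUE_PATTERN = "http://opc4.kb.nl/DB=1/PPN?PPN={ppn}"
DELPHER_PATTERN = (
    "http://www.delpher.nl/nl/kranten/results"
    "?coll=dddtitel&cql[]=ppn+any+({ppn})"
)


def catalogue_url(ppn):
    """Link to the KB-catalogue record of the title with this PPN."""
    return CATALOGUE_PATTERN.format(ppn=ppn)


def delpher_url(ppn):
    """Link to the Delpher search results for all issues of this title."""
    return DELPHER_PATTERN.format(ppn=ppn)


ppn = "107123223"  # De Geus onder studenten
links = (catalogue_url(ppn), delpher_url(ppn))
```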

Say I want to know more about this newspaper:

• What sort/style of underground paper was De Geus?
• What is the history of this newspaper?
• Who worked on it?
• Where was this newspaper printed?
• How was De Geus distributed and financed?
• Were there any relations with other illegal newspapers or resistance groups?
• Etc…

Under “Details” perhaps?

OK, ok, some metadata… but I want to know móóór…

Maybe in the catalogue record?

Problem with Delpher (and KB-catalogue): véry little contextual information about the newspaper (title)s.

Question: where would most people start searching for contextual information about De Geus onder studenten?

Probably Wikipedia (via Google).

http://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

https://www.youtube.com/watch?v=VREJV--VHSw

Report on interest in WW2 among the Dutch population, May 2015

http://www.oorlogsbronnen.nl/gebruikersonderzoek2015

“Many of us use the internet to search for information. [...] We often mention Wikipedia…”

“Everything is of course on Wikipedia. Just type in a name and you can read entire essays.” (man, 70s)

“Over half of us think that Wikipedia and Google contribute to our knowledge and understanding of history.”

“When we have to find information about WW2 outside the class setting, we fully concentrate on digital resources like Google and Wikipedia.” (school kids)

Context about De Geus onder studenten

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

is given in

http://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

But now… we have another problem…

https://nl.wikipedia.org/wiki/Categorie:Illegale_pers_in_de_Tweede_Wereldoorlog

… De Geus on Wikipedia is an exception:

1. Very few underground newspapers have their own WP articles
2. The overview of these newspapers on WP is far from complete

Good news: there is a 1-fix solution to this contextual problem.

De Ondergrondse Pers 1940-1945

By Lydia E. Winkel & H. de Vries

1989, ISBN 9021837463, Veen Uitgevers

This book (“De Winkel”) contains contextual articles about (nearly) all ± 1300 illegal WW2 newspapers.

“De Winkel” – nr. 199

Every article has a unique ID (“Winkel-ID”).

Every article has metadata:

• Title, subtitle, motto
• Place of publication
• Period of publication
• Publication frequency (daily, weekly, one-off, irregular)
• Multiplication (stenciled, printed, typed, handwritten)
• Contents (news, opinions, poems, illustrations, humor)
• Number of prints (min – max)
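These metadata fields map naturally onto one record per Winkel article. The sketch below is a possible shape, not the project's actual data model: the class and field names are hypothetical. The example values come from the Wikipedia snippet about De Geus quoted in the slides, and 199 is the article number shown on the slide, assumed here to be the De Geus entry.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class WinkelArticle:
    """One article from De Winkel, keyed by its Winkel-ID."""
    winkel_id: int
    title: str
    place: str
    period: str                      # free text, e.g. "1940-10-04/1944-07-13"
    frequency: str                   # daily, weekly, one-off, irregular, ...
    multiplication: List[str] = field(default_factory=list)
    contents: List[str] = field(default_factory=list)
    prints_min: Optional[int] = None
    prints_max: Optional[int] = None
    subtitle: Optional[str] = None   # not every title has one
    motto: Optional[str] = None


de_geus = WinkelArticle(
    winkel_id=199,
    title="De Geus onder studenten",
    place="Den Haag",
    period="1940-10-04/1944-07-13",
    frequency="monthly, otherwise irregular",
    multiplication=["stenciled", "printed"],
    contents=["opinions"],
    prints_min=250,
    prints_max=8000,
)
```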

Relation 1/3: Newspaper ↔ Placename (semantics, linked data)

Contextual information: nice material for a Wikipedia article.

Very often, persons related to this newspaper are mentioned.

Relation 2/3: Newspaper ↔ Persons (semantics, linked data)

Many articles also contain references to other newspapers:

• 106 = Cereales Vadeness (student resistance newspaper, Wageningen)
• 360 = Leidsche Brief (student resistance newspaper, Leiden)
• 748 = Sol Justitiae (student resistance newspaper, Utrecht)

Relation 3/3: Newspaper ↔ Other newspapers (semantics, linked data)
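The three relations can each be written as a subject-predicate-object statement, which is the basic idea behind linked-data triples. A minimal sketch follows; the predicate names and the `winkel:` prefix are hypothetical, and the sketch assumes the three referenced numbers belong to the De Geus article (nr. 199).

```python
# Each relation is one (subject, predicate, object) triple.
TRIPLES = [
    ("winkel:199", "publishedIn", "Den Haag"),          # relation 1/3
    ("winkel:199", "relatedPerson", "Jan Drion"),       # relation 2/3
    ("winkel:199", "relatedPerson", "Huib Drion"),
    ("winkel:199", "relatedNewspaper", "winkel:106"),   # relation 3/3
    ("winkel:199", "relatedNewspaper", "winkel:360"),
    ("winkel:199", "relatedNewspaper", "winkel:748"),
]


def objects(subject, predicate, triples=TRIPLES):
    """All objects linked to a subject through one kind of relation."""
    return [o for s, p, o in triples if s == subject and p == predicate]


persons = objects("winkel:199", "relatedPerson")
papers = objects("winkel:199", "relatedNewspaper")
```

Because related newspapers are referenced by their own Winkel-ID, following a `relatedNewspaper` triple leads straight to another record.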

Too bad it’s a paper book: hard to find, multiply, distribute and build upon.

We need it digital

1. Clear copyright with copyright holder (NIOD): open CC-BY-SA license
2. Scan & OCR
3. Convert into PDF
4. Put online: NIOD site & Wikimedia Commons

De Winkel as PDF on NIOD website (CC-BY-SA): http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons: https://commons.wikimedia.org/wiki/File:PDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989.pdf

Saved us €13330: http://www.brill.com/dutch-underground-press-1940-1945

Wikipedia article about De Winkel: http://nl.wikipedia.org/wiki/De_ondergrondse_pers_1940-1945

Wikipedia article about the author: http://nl.wikipedia.org/wiki/Lydia_Winkel

Winkel, the plusses:

• Available online (PDF, flat file)
• Open license (CC-BY-SA)
• Contextual information
• Relations: Titles ↔ Places, Titles ↔ Persons, Titles ↔ Other titles

Winkel, the minusses:

• Unstructured data (PDF, flat file)
• Not very machine-readable (unlike CSV, XML, JSON, RDF)
• PDF is no (real) open standard (unlike CSV, XML, JSON, RDF)
• No links between titles ↔ Delpher & KB-cat
• No links between titles, places & persons ↔ external sources (like Wikipedia)

To summarize: a lot of information is available about these WW2 underground newspapers:

1. Metadata (KB-cat)
2. Content (full-text Delpher)
3. Context (Winkel PDF)
4. Relations: titles ↔ places, persons, other titles (Winkel PDF)
5. External resources about titles, places and persons

But the data sources are unconnected (and for 3+4 unstructured & not machine-readable), making discovery, understanding & research of these newspapers (and related places & persons) more difficult than necessary.

Good news: we can solve all these issues.

Wikiproject(*) Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia: from 14 to 1300 titles.

tinyurl.com/verzetskranten (in Dutch)

(*) https://nl.wikipedia.org/wiki/Wikipedia:Wikiproject, https://en.wikipedia.org/wiki/Wikipedia:Wikiproject

We need a database.

Build central database

https://www.youtube.com/watch?v=GVDGuCjog_0

Step 1: Create Excel sheet with:

• Metadata about newspaper (from Winkel PDF): Winkel-ID, title, place of publication, other metadata
• Unique Wikipedia article title = <Newspaper title> (verzetsblad, <Place of publication>)
• Contextual info, incl. related persons & related newspaper titles (from Winkel PDF)
• PPN (e.g. 107123223): unique ID linking newspaper to KB-catalogue & Delpher

http://opc4.kb.nl/DB=1/PPN?PPN=107123223

http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223
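A spreadsheet row and the derived article title can be sketched as follows. The column names are hypothetical, and CSV stands in for the Excel sheet. The comma inside the parentheses and the optional place are assumptions: the De Geus URL on the slides carries no place name, so the sketch treats the place as an optional disambiguator.

```python
import csv
import io

# Stand-in for the Excel sheet: one row per newspaper title.
SHEET = (
    "winkel_id,title,place,ppn\n"
    "199,De Geus onder studenten,Den Haag,107123223\n"
)


def article_title(newspaper_title, place=None):
    """Unique Wikipedia article title following the slide's pattern."""
    if place:
        return f"{newspaper_title} (verzetsblad, {place})"
    return f"{newspaper_title} (verzetsblad)"


def article_url(title):
    """Dutch Wikipedia URL for an article title (spaces become underscores)."""
    return "https://nl.wikipedia.org/wiki/" + title.replace(" ", "_")


rows = list(csv.DictReader(io.StringIO(SHEET)))
with_place = article_title(rows[0]["title"], rows[0]["place"])
without_place = article_title(rows[0]["title"])
```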

Build central database

Step 2: Convert Excel into RDF triplestore (= special kind of online database anybody can access)

• Steps 1-4 from http://linda-project.eu/linked-data-primer-2
• Step 4: Vocabulary used = Bibframe (http://bibframe.org/vocab)
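What converting a row into triples amounts to can be sketched without any RDF library by writing N-Triples lines by hand. This is not the LinDA tooling the slides refer to, and the namespaces below are `example.org` placeholders rather than real Bibframe terms; a real conversion would map each column to a property from the chosen vocabulary.

```python
# Placeholder namespaces (assumptions, not the project's real URIs).
BASE = "http://example.org/verzetskranten/"
VOCAB = "http://example.org/vocab/"


def literal(value):
    """Quote a value as an N-Triples string literal."""
    escaped = (
        str(value)
        .replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
    )
    return f'"{escaped}"'


def row_to_ntriples(row):
    """Turn one spreadsheet row into N-Triples lines.

    The Winkel-ID identifies the subject; every other non-empty
    column becomes one predicate/object pair.
    """
    subject = f"<{BASE}{row['winkel_id']}>"
    lines = []
    for column, value in row.items():
        if column == "winkel_id" or value in ("", None):
            continue
        lines.append(f"{subject} <{VOCAB}{column}> {literal(value)} .")
    return lines


row = {
    "winkel_id": "199",
    "title": "De Geus onder studenten",
    "place": "Den Haag",
    "ppn": "107123223",
}
ntriples = row_to_ntriples(row)
```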

Build central database

Step 3: Link to external resources

• Step 5 from http://linda-project.eu/linked-data-primer-2
• DBpedia = machine-readable, structured version of Wikipedia. DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data.
• DBpedia = hub for linking different data sets on the Web to each other: the Linked Open Data cloud (http://lod-cloud.net/versions/2010-09-22/lod-cloud_colored.png)
• We use DBpedia to connect persons & places in our newspaper database to information in other databases

Example: Huib Drion

http://nl.dbpedia.org/page/Huib_Drion

https://nl.wikipedia.org/wiki/Huib_Drion

http://www.dbnl.org/auteurs/auteur.php?id=drio001

http://www.biografischportaal.nl/persoon/41181342
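Linking a person in the newspaper database to DBpedia comes down to stating that the local record and the DBpedia resource are the same thing. The sketch below derives a DBpedia resource URI from a Wikipedia URL and emits an `owl:sameAs` triple. It assumes a language chapter such as nl.dbpedia.org that mirrors Wikipedia article names, and the local person URI is a hypothetical placeholder.

```python
from urllib.parse import urlparse

OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


def dbpedia_resource(wikipedia_url):
    """Map a Wikipedia article URL to the matching DBpedia resource URI."""
    parts = urlparse(wikipedia_url)
    if not parts.netloc.endswith(".wikipedia.org"):
        raise ValueError("not a Wikipedia URL: " + wikipedia_url)
    if not parts.path.startswith("/wiki/"):
        raise ValueError("not an article URL: " + wikipedia_url)
    language = parts.netloc.split(".")[0]        # "nl" from nl.wikipedia.org
    title = parts.path[len("/wiki/"):]
    return f"http://{language}.dbpedia.org/resource/{title}"


def same_as_triple(local_uri, external_uri):
    """N-Triples line stating that two URIs denote the same thing."""
    return f"<{local_uri}> <{OWL_SAME_AS}> <{external_uri}> ."


drion = dbpedia_resource("https://nl.wikipedia.org/wiki/Huib_Drion")
link = same_as_triple("http://example.org/verzetskranten/person/huib-drion", drion)
```

Once that link exists, anything DBpedia in turn links to becomes reachable from the newspaper record.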

Build central database

Added value of Linked Open Data & DBpedia: software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in:

• KB-catalogue
• Delpher
• De Winkel
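Such an automatic query could be a SPARQL request that asks for everything known about one resource. The sketch only builds the query and the request URL; it does not contact any server. The endpoint address and the `format` parameter are assumptions about how the Dutch DBpedia chapter is served and are not taken from the slides.

```python
from urllib.parse import urlencode

# Assumed SPARQL endpoint of the Dutch DBpedia chapter.
SPARQL_ENDPOINT = "http://nl.dbpedia.org/sparql"


def describe_query(resource_uri, limit=50):
    """SPARQL query listing property/value pairs of one resource."""
    return (
        "SELECT ?property ?value WHERE { "
        f"<{resource_uri}> ?property ?value "
        f"}} LIMIT {int(limit)}"
    )


def request_url(query):
    """GET URL that submits the query and asks for JSON results."""
    params = {"query": query, "format": "application/sparql-results+json"}
    return SPARQL_ENDPOINT + "?" + urlencode(params)


query = describe_query("http://nl.dbpedia.org/resource/Huib_Drion", limit=5)
url = request_url(query)
```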

Summary: data about 1300 newspapers

• Available online
• Structured data (RDF triples)
• Open license (CC-BY-SA)
• Open standard (RDF)
• Contextual information
• Relations: Titles ↔ Places, Titles ↔ Persons, Titles ↔ Other titles
• Links between titles ↔ Delpher & KB-cat (via PPNs)
• Links between titles, places & persons ↔ external sources (via DBpedia)

http://5stardata.info/en/

Wikiproject Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia.

We need a template.

Using an article template, we can generate 1300 uniform and interlinked Wikipedia articles from the LOD-database.

LOD-database + article template = Wikipedia article

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)



wsf

oto

s3

09

42

65

Page 5: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

http://resolver.kb.nl/resolve?urn=ddd:010436323

http://resolver.kb.nl/resolve?urn=ddd:010442948

http://resolver.kb.nl/resolve?urn=ddd:010447825 http://resolver.kb.nl/resolve?urn=ddd:010450508

From well-known big titles (among others Parool, Vrij Nederland, Trouw, de Waarheid)

to very small, home-made, pamphlet-like issues

After the war, many titles were (physically) preserved at the NIOD…

The national Institute for War, Holocaust and Genocide Studies in Amsterdam

By Romaine - Own work, CC0, https://commons.wikimedia.org/w/index.php?curid=37072767

https://commons.wikimedia.org/wiki/File:Verzetskrant_in_archiefdozen_bij_het_NIOD.jpg – CC-BY-SA - OlafJanssen

By Romaine - Own work, CC0, https://commons.wikimedia.org/w/index.php?curid=37072734

These 1300 newspapers have been described in the KB-catalogue…

Bibliographic metadata

Like this one: De Geus onder studenten

PPN = unique ID of this title in the KB-catalogue

http://opc4.kb.nl/DB=1/PPN?PPN=107123223

These newspapers were digitised page by page… resulting in… full texts in Delpher

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

• Scans
• Full-text OCR

Again: De Geus onder studenten

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

PPN = unique ID of this title in Delpher (same as for the KB-catalogue)

On Delpher you can read and (word)search this title

Say I want to know more about this newspaper:
• What sort/style of underground paper was De Geus?
• What is the history of this newspaper?
• Who worked on it?
• Where was this newspaper printed?
• How was De Geus distributed and financed?
• Were there any relations with other illegal newspapers or resistance groups?
• Etc…

Under "Details", perhaps?

OK, OK, some metadata…

but I want to know móóór…

Maybe in the catalogue record?

Problem with Delpher (and the KB-catalogue):

Véry little contextual information about the newspaper (title)s

https://thejungleisneutral.files.wordpress.com/2013/11/lost.jpg

Question: Where would most people start searching for contextual information about De Geus onder studenten?

Probably Wikipedia (via Google)

http://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

https://www.youtube.com/watch?v=VREJV--VHSw

Report on interest in WW2 among the Dutch population

http://www.oorlogsbronnen.nl/gebruikersonderzoek2015, May 2015

Many of us use the internet to search for information [...] We often mention Wikipedia…

Everything is, of course, on Wikipedia. Just type in a name and you can read entire essays. (man, 70s)

Over half of us think that Wikipedia and Google contribute to our knowledge and understanding of history.

When we have to find information about WW2 outside the class setting, we fully concentrate on digital resources like Google and Wikipedia. (school kids)

Context about De Geus onder studenten

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

is given in

http://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

But now…

We have another problem…

https://nl.wikipedia.org/wiki/Categorie:Illegale_pers_in_de_Tweede_Wereldoorlog

… De Geus on Wikipedia is an exception:

1. Very few underground newspapers have their own WP articles

2. The overview of these newspapers on WP is far from complete

Good news

http://www.archives.gov/research/military/ww2/photos/images/ww2-194.jpg

There is a 1-fix solution to this contextual problem

De Ondergrondse Pers 1940-1945

By Lydia E. Winkel & H. de Vries

1989, ISBN 9021837463, Veen Uitgevers

This book ("De Winkel") contains contextual articles about (nearly) all ± 1300 illegal WW2 newspapers

"De Winkel" – nr. 199

Every article has a unique ID ("Winkel-ID")

Every article has metadata:
• Title, subtitle, motto
• Place of publication
• Period of publication
• Publication frequency (daily, weekly, one-off, irregular)
• Reproduction method (stencilled, printed, typed, handwritten)
• Contents (news, opinions, poems, illustrations, humor)
• Print run (min – max)

Relation 1/3: Newspaper ↔ Placename (semantics, linked data)

Contextual information: nice material for a Wikipedia article

Very often, persons related to this newspaper are mentioned

Relation 2/3: Newspaper ↔ Persons (semantics, linked data)

Many articles also contain references to other newspapers:
• 106 = Cereales Vadeness (student resistance newspaper, Wageningen)
• 360 = Leidsche Brief (student resistance newspaper, Leiden)
• 748 = Sol Justitiae (student resistance newspaper, Utrecht)

Relation 3/3: Newspaper ↔ Other newspapers (semantics, linked data)
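The three relations can be pictured as a small data structure. The sketch below is only an illustration, not the project's actual data model: the class name, field names and predicate names are hypothetical, and it assumes that Winkel nr. 199 is De Geus onder studenten and that 106, 360 and 748 are the newspapers its article refers to.

```python
from dataclasses import dataclass, field


@dataclass
class WinkelArticle:
    """One article from De Winkel: some metadata plus the three kinds of relations."""
    winkel_id: int
    title: str
    place: str                                        # relation 1/3: newspaper -> place name
    persons: list = field(default_factory=list)       # relation 2/3: newspaper -> persons
    related_ids: list = field(default_factory=list)   # relation 3/3: newspaper -> other newspapers

    def triples(self):
        """Return the relations as (subject, predicate, object) tuples."""
        subject = f"winkel:{self.winkel_id}"
        result = [(subject, "publishedIn", self.place)]
        result += [(subject, "relatedPerson", person) for person in self.persons]
        result += [(subject, "relatedNewspaper", f"winkel:{i}") for i in self.related_ids]
        return result


# Assumed example values (see the note above).
de_geus = WinkelArticle(
    winkel_id=199,
    title="De Geus onder studenten",
    place="Den Haag",
    persons=["Jan Drion", "Huib Drion"],
    related_ids=[106, 360, 748],
)

for triple in de_geus.triples():
    print(triple)
```

Each printed tuple is one statement of the form subject, predicate, object, which is the shape that linked data builds on.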

https://knowledgeutopia.files.wordpress.com/2014/01/hollandhouselibraryblitz1940.jpg

Too bad it's a paper book: hard to find, multiply, distribute and build upon

We need it digital

1. Clear copyright with the copyright holder (NIOD): open CC-BY-SA license

2. Scan & OCR

3. Convert into PDF

4. Put online: NIOD site & Wikimedia Commons

De Winkel as PDF on the NIOD website (CC-BY-SA): http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons: https://commons.wikimedia.org/wiki/File:PDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989.pdf

Saved us €13330: http://www.brill.com/dutch-underground-press-1940-1945

Wikipedia article about De Winkel: http://nl.wikipedia.org/wiki/De_ondergrondse_pers_1940-1945

Wikipedia article about the author: http://nl.wikipedia.org/wiki/Lydia_Winkel

Winkel: the pluses

• Available online (PDF, flat file)
• Open license (CC-BY-SA)
• Contextual information
• Relations:
  • Titles ↔ Places
  • Titles ↔ Persons
  • Titles ↔ Other titles

http://www.archives.gov/research/military/ww2/photos/images/ww2-194.jpg

Winkel: the minuses

• Unstructured data (PDF, flat file)
• Not very machine-readable (unlike CSV, XML, JSON, RDF)
• PDF is no (real) open standard (unlike CSV, XML, JSON, RDF)
• No links between titles ↔ Delpher & KB-cat
• No links between titles, places & persons ↔ external sources (like Wikipedia)

http://2.bp.blogspot.com/_BWzuYwiS6-I/TMgeRsFd3mI/AAAAAAAAElw/3cvgbZSPWcs/s1600/doctor+macro+judy+scared.jpg
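To make "machine-readable" concrete, here is a minimal sketch of one newspaper record written as JSON and as CSV and then read back by software. The field names are hypothetical and the values are taken from the De Geus example; nothing here reflects the project's real file layout.

```python
import csv
import io
import json

# One record with hypothetical field names.
record = {
    "winkel_id": 199,
    "title": "De Geus onder studenten",
    "place": "Den Haag",
    "ppn": "107123223",
}

# JSON: one self-describing object per newspaper.
as_json = json.dumps(record, ensure_ascii=False)

# CSV: a header row plus one row per newspaper.
buffer = io.StringIO()
writer = csv.DictWriter(buffer, fieldnames=list(record))
writer.writeheader()
writer.writerow(record)
as_csv = buffer.getvalue()

# Both forms can be parsed again without guessing where a field starts or ends,
# which is what running text in a PDF does not offer.
from_json = json.loads(as_json)
from_csv = next(csv.DictReader(io.StringIO(as_csv)))

print(from_json["title"])
print(from_csv["ppn"])
```

Note that CSV returns every value as text, while JSON keeps the number 199 a number; that difference is one reason to choose a format deliberately.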

To summarize: a lot of information is available about these WW2 underground newspapers

1. Metadata (KB-cat)
2. Content (full-text, Delpher)
3. Context (Winkel PDF)
4. Relations: titles ↔ places, persons, other titles (Winkel PDF)
5. External resources about titles, places and persons

but the data sources are unconnected (and, for 3+4, unstructured & not machine-readable), making discovery, understanding & research of these newspapers (and related places & persons) more difficult than necessary

Good news: we can solve all these issues

Wikiproject(*) Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

tinyurl.com/verzetskranten (in Dutch)

(*) https://nl.wikipedia.org/wiki/Wikipedia:Wikiproject, https://en.wikipedia.org/wiki/Wikipedia:Wikiproject

From 14 → 1300 titles

https://nl.wikipedia.org/wiki/Categorie:Illegale_pers_in_de_Tweede_Wereldoorlog

We need a database

Build central database

https://www.youtube.com/watch?v=GVDGuCjog_0

Step 1: Create Excel-sheet with
• Metadata about newspaper (from Winkel PDF)
• Unique Wikipedia article title
• Contextual info, incl. related persons & titles (from Winkel PDF)
• PPN: unique ID linking newspaper to KB-catalogue & Delpher

Columns in the sheet: Winkel-ID, Title, Place of publication, Other metadata

Unique Wikipedia article title = <Newspaper title> (verzetsblad, <Place of publication>)

• Contextual info
• Related persons
• Related newspaper titles
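The article-title rule from Step 1 is simple enough to sketch as a function. This is an illustration under assumptions: the comma before the place name, and leaving the place out when it is not given (as in the De Geus article URL shown in these slides), are guesses about the convention, and "Voorbeeldkrant" is a made-up title.

```python
WIKI_BASE = "https://nl.wikipedia.org/wiki/"


def article_title(newspaper_title, place=None):
    """Build the article title; the place is appended only when it is supplied."""
    if place:
        return f"{newspaper_title} (verzetsblad, {place})"
    return f"{newspaper_title} (verzetsblad)"


def article_url(title):
    """Wikipedia page URLs use underscores where the title has spaces."""
    return WIKI_BASE + title.replace(" ", "_")


geus_title = article_title("De Geus onder studenten")
print(geus_title)
print(article_url(geus_title))

# Hypothetical title, only to show the variant with a place name.
print(article_title("Voorbeeldkrant", "Leiden"))
```

Generating the title from the sheet, rather than typing it by hand, is what keeps the 1300 article names uniform.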

PPN (107123223) → KB-catalogue: http://opc4.kb.nl/DB=1/PPN?PPN=107123223

PPN (107123223) → Delpher: http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223
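Because the PPN is the shared key, both links can be derived from it. The sketch below assumes the URL patterns of the two example links in these slides (with their punctuation restored); the function names are hypothetical and the patterns may have changed since the lecture.

```python
from urllib.parse import quote

PPN = "107123223"  # De Geus onder studenten


def catalogue_url(ppn):
    """Link to the KB-catalogue record for this PPN (pattern assumed from the slides)."""
    return f"http://opc4.kb.nl/DB=1/PPN?PPN={ppn}"


def delpher_url(ppn):
    """Link to the Delpher search results for this PPN (pattern assumed from the slides)."""
    query = quote(f"ppn={ppn}", safe="")  # percent-encodes "=" as "%3D"
    return f"http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]={query}"


print(catalogue_url(PPN))
print(delpher_url(PPN))
```

Storing only the PPN in the sheet and deriving the links avoids keeping two long URLs per newspaper in sync by hand.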

Build central database

Step 2: Convert Excel into RDF triplestore (= special kind of online database anybody can access)
• Steps 1-4 from http://linda-project.eu/linked-data-primer-2
• Step 4: Vocabulary used = Bibframe (http://bibframe.org/vocab)
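As a rough picture of what "converting the sheet into triples" means, the sketch below turns rows of a CSV export into N-Triples lines using only the standard library. It is not the procedure from the primer: the column names are hypothetical, and both namespaces are placeholders (the vocabulary URI in particular stands in for real Bibframe terms, which are not reproduced here).

```python
import csv
import io

BASE = "http://example.org/verzetskranten/"  # hypothetical namespace for the newspapers
VOCAB = "http://example.org/vocab/"          # placeholder, NOT actual Bibframe properties

# Assumed CSV export of the Excel sheet, with hypothetical column names.
SHEET = """winkel_id,title,place,ppn
199,De Geus onder studenten,Den Haag,107123223
"""


def literal(value):
    """Quote a string as an N-Triples literal, escaping backslashes and double quotes."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def row_to_ntriples(row):
    """Emit one triple per non-empty column, all about the same newspaper."""
    subject = f"<{BASE}{row['winkel_id']}>"
    lines = []
    for column in ("title", "place", "ppn"):
        if row[column]:
            lines.append(f"{subject} <{VOCAB}{column}> {literal(row[column])} .")
    return lines


triples = []
for row in csv.DictReader(io.StringIO(SHEET)):
    triples.extend(row_to_ntriples(row))

print("\n".join(triples))
```

The resulting lines could be loaded into a triplestore; in practice a dedicated RDF library and an agreed vocabulary would replace the hand-built strings.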

Build central database

Step 3: Link to external resources
• Step 5 from http://linda-project.eu/linked-data-primer-2
• DBpedia = machine-readable, structured version of Wikipedia
• DBpedia = hub for linking different data sets on the Web to each other → Linked Open Data cloud
• We use DBpedia to connect persons & places in our newspaper database to information in other databases

DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data

Linked Open Data cloud: http://lod-cloud.net/versions/2010-09-22/lod-cloud_colored.png

http://nl.dbpedia.org/page/Huib_Drion

https://nl.wikipedia.org/wiki/Huib_Drion

http://www.dbnl.org/auteurs/auteur.php?id=drio001

http://www.biografischportaal.nl/persoon/41181342
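The Huib Drion example shows the idea: a Wikipedia page has a DBpedia counterpart, and DBpedia in turn points to other data sets. The sketch below only builds the identifier and a query string; it does not contact any server. It relies on the DBpedia convention that resource names mirror Wikipedia page names; the function names are hypothetical, and whether a given endpoint returns links for this person is not checked here.

```python
from urllib.parse import urlsplit

OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


def dbpedia_resource(wikipedia_url):
    """Map a Wikipedia page URL to the matching DBpedia resource URI."""
    parts = urlsplit(wikipedia_url)
    language = parts.netloc.split(".")[0]        # "nl" for nl.wikipedia.org
    page = parts.path.split("/wiki/", 1)[1]      # "Huib_Drion"
    host = "dbpedia.org" if language == "en" else f"{language}.dbpedia.org"
    return f"http://{host}/resource/{page}"


def same_as_query(resource):
    """SPARQL query asking which other identifiers denote the same thing."""
    return f"SELECT ?other WHERE {{ <{resource}> <{OWL_SAME_AS}> ?other }}"


resource = dbpedia_resource("https://nl.wikipedia.org/wiki/Huib_Drion")
print(resource)
print(same_as_query(resource))
```

Sent to a SPARQL endpoint, a query of this kind is how software could follow links from a person in the newspaper database to records elsewhere.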

Build central database

Added value of Linked Open Data & DBpedia: software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in
• KB-catalogue
• Delpher
• De Winkel

Summary: data about 1300 newspapers

• Available online
• Structured data (RDF triples)
• Open license (CC-BY-SA)
• Open standard (RDF)
• Contextual information
• Links between titles ↔ Delpher & KB-cat (via PPNs)
• Relations:
  • Titles ↔ Places
  • Titles ↔ Persons
  • Titles ↔ Other titles
• Links between titles, places & persons ↔ external sources (via DBpedia)

http://5stardata.info/en

Wikiproject Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

We need a template

Using an article template, we can generate 1300 uniform and interlinked Wikipedia articles from the LOD-database

https://c1.staticflickr.com/9/8281/7699231918_11a7356c38_b.jpg

LOD-database + article template = Wikipedia article

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

In the generated article:
• Titles: from database
• Metadata: from database
• Related persons: from database
• Related newspapers: from database
• Winkel-ID: from database
• Link to full-texts in Delpher: from database
• Link to KB-catalogue record: from database
• Fixed categories on WP and Commons
• Predefined fixed strings

Grey = from database or predefined fixed strings: uniformity between articles guaranteed

The rest is all that WP-writers need to add manually
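"Database + template = article" can be sketched with a plain string template. This is an illustration, not the project's template: the Dutch wording follows the De Geus snippet quoted in these slides, the link labels and placeholder names are invented, and the category name is taken from the category URL shown earlier.

```python
from string import Template

# Fixed strings live in the template; $placeholders are filled from the database.
ARTICLE_TEMPLATE = Template(
    "'''$title''' was een verzetsblad uit de Tweede Wereldoorlog "
    "dat vanaf $start tot en met $end in $place werd uitgegeven.\n"
    "\n"
    "== Externe links ==\n"
    "* [$delpher_url Volledige tekst in Delpher]\n"
    "* [$catalogue_url Record in de KB-catalogus]\n"
    "\n"
    "[[Categorie:Illegale pers in de Tweede Wereldoorlog]]\n"
)

# One database row, with values from the De Geus example.
row = {
    "title": "De Geus onder studenten",
    "start": "4 oktober 1940",
    "end": "13 juli 1944",
    "place": "Den Haag",
    "delpher_url": "http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223",
    "catalogue_url": "http://opc4.kb.nl/DB=1/PPN?PPN=107123223",
}

wikitext = ARTICLE_TEMPLATE.substitute(row)
print(wikitext)
```

Running the same template over every row yields article skeletons with identical structure; the free-text context is what writers then add by hand.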

Problem with Delpher (and the KB-catalogue):

Véry little contextual information about the newspaper (title)s

https://thejungleisneutral.files.wordpress.com/2013/11/lost.jpg

The KB can re-use (embed) the Wikipedia content in its own websites to tackle this problem

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_%28verzetsblad%29

Delpher - search results

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

Embedded contextual snippet from Wikipedia (translated from Dutch):

De Geus (onder studenten) was a resistance paper from the Second World War that, from 4 October 1940 up to and including 13 July 1944, … Read more on Wikipedia

Delpher - object presentation

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

Embedded contextual snippet from Wikipedia (translated from Dutch):

About De Geus onder studenten

De Geus (onder studenten) was a resistance paper from the Second World War, published in The Hague from 4 October 1940 up to and including 13 July 1944. The paper appeared monthly in 1940, 1941 and 1943, and otherwise irregularly, with a print run of between 250 and 8000 copies. It was initially stencilled and, from November 1942, printed; its content consisted mainly of opinion articles.

The paper was published by Jan Drion and Huib Drion, two Leiden…

Read more on Wikipedia
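How such a snippet could be produced from an article's opening text is sketched below. This is a guess at the mechanics, not how Delpher does it: the function name and the length limit are hypothetical, and fetching the intro text from Wikipedia is left out. The link label "Lees verder op Wikipedia" is the Dutch for "Read more on Wikipedia".

```python
import html

ARTICLE_URL = "https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)"
INTRO = (
    "De Geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog "
    "dat vanaf 4 oktober 1940 tot en met 13 juli 1944 in Den Haag werd uitgegeven."
)


def make_snippet(intro, article_url, max_chars=120):
    """Shorten the intro at a word boundary and append a 'read more' link as HTML."""
    if len(intro) <= max_chars:
        text = intro
    else:
        # Cut at the limit, then drop the last (possibly broken) word.
        text = intro[:max_chars].rsplit(" ", 1)[0] + " …"
    link = f'<a href="{html.escape(article_url, quote=True)}">Lees verder op Wikipedia</a>'
    return f"{html.escape(text)} {link}"


print(make_snippet(INTRO, ARTICLE_URL, max_chars=60))
print(make_snippet("Kort.", ARTICLE_URL))
```

A short limit suits the search-results view, a longer one the object presentation; both views link back to the full article, which also credits the source.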

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web - 20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together.

• https://en.wikipedia.org/wiki/Linked_data - Wikipedia article related to the above video

• http://5stardata.info/en - The 5 stars of Linked Open Data (Tim Berners-Lee)

• http://linda-project.eu/linked-data-primer-2 - Short primer about creating LOD in practice, starting from an Excel sheet

• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31 - The figure near the bottom of the first page is a good illustration of the concept of (linked) triples

• https://en.wikipedia.org/wiki/DBpedia

• https://en.wikipedia.org/wiki/Semantic_network

http://www.gettyimages.nl/detail/nieuwsfoto's/three-women-of-the-ats-light-up-together-ats-regulations-nieuwsfotos/3094265

Questions?

olaf.janssen@kb.nl - @ookgezellig

tinyurl.com/verzetskranten

Page 6: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016


Page 7: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

After the war, many titles were (physically) preserved at the NIOD…

The national Institute for War, Holocaust and Genocide Studies in Amsterdam

By Romaine - Own work, CC0, https://commons.wikimedia.org/w/index.php?curid=37072767

https://commons.wikimedia.org/wiki/File:Verzetskrant_in_archiefdozen_bij_het_NIOD.jpg – CC-BY-SA - OlafJanssen

By Romaine - Own work, CC0, https://commons.wikimedia.org/w/index.php?curid=37072734

These 1300 newspapers have been described in the KB catalogue…

Bibliographic metadata

Like this one: De Geus onder studenten

PPN = unique ID of this title in the KB catalogue

http://opc4.kb.nl/DB=1/PPN?PPN=107123223

These newspapers were digitised page by page… resulting in full texts in Delpher

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

• Scans
• Full-text OCR

Again: De Geus onder studenten

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

PPN = unique ID of this title in Delpher (same as for the KB catalogue)

On Delpher you can read and (word-)search this title

Say I want to know more about this newspaper:

• What sort/style of underground paper was De Geus?
• What is the history of this newspaper?
• Who worked on it?
• Where was this newspaper printed?
• How was De Geus distributed and financed?
• Were there any relations with other illegal newspapers or resistance groups?
• Etc.…

Under “Details”, perhaps?

OK, OK, some metadata…

but I want to know móóór…

Maybe in the catalogue record?

Problem with Delpher (and the KB catalogue):

Véry little contextual information about the newspaper (title)s

Image: https://thejungleisneutral.files.wordpress.com/2013/11/lost.jpg

Question: Where would most people start searching for contextual information about De Geus onder studenten?

Probably Wikipedia (via Google)

http://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

https://www.youtube.com/watch?v=VREJV--VHSw

Report on interest in WW2 among the Dutch population

http://www.oorlogsbronnen.nl/gebruikersonderzoek2015 (May 2015)

Many of us use the internet to search for information. […] We often mention Wikipedia…

Everything is, of course, on Wikipedia. Just type in a name and you can read entire essays. (man, 70s)

Over half of us think that Wikipedia and Google contribute to our knowledge and understanding of history.

When we have to find information about WW2 outside the class setting, we fully concentrate on digital resources like Google and Wikipedia. (school kids)

Context about De Geus onder studenten

(Delpher: http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223))

is given in

Wikipedia: http://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

But now… we have another problem…

https://nl.wikipedia.org/wiki/Categorie:Illegale_pers_in_de_Tweede_Wereldoorlog

… De Geus on Wikipedia is an exception:

1. Very few underground newspapers have their own WP articles
2. The overview of these newspapers on WP is far from complete

Good news!

There is a one-fix solution to this contextual problem:

De Ondergrondse Pers 1940-1945
by Lydia E. Winkel & H. de Vries
1989, ISBN 9021837463, Veen Uitgevers

This book (“De Winkel”) contains contextual articles about (nearly) all ± 1300 illegal WW2 newspapers

“De Winkel” – nr. 199

Every article has a unique ID (“Winkel-ID”)

Every article has metadata:

• Title, subtitle, motto
• Place of publication
• Period of publication
• Publication frequency (daily, weekly, one-off, irregular)
• Reproduction method (stencilled, printed, typed, handwritten)
• Contents (news, opinions, poems, illustrations, humor)
• Number of copies printed (min – max)

Relation 1/3: Newspaper – Place name (semantics, linked data)

Contextual information: nice material for a Wikipedia article

Very often, persons related to the newspaper are mentioned

Relation 2/3: Newspaper – Persons (semantics, linked data)

Many articles also contain references to other newspapers:

• 106 = Cereales Vadeness (student resistance newspaper, Wageningen)
• 360 = Leidsche Brief (student resistance newspaper, Leiden)
• 748 = Sol Justitiae (student resistance newspaper, Utrecht)

Relation 3/3: Newspaper – Other newspapers (semantics, linked data)
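The three relations above can be pictured as subject-predicate-object triples. The sketch below is only an illustration: the predicate names are hypothetical, and it assumes that Winkel nr. 199 is the entry for De Geus onder studenten and that the three listed student newspapers are the ones it refers to. The place and person names are taken from the Wikipedia snippet quoted elsewhere in these slides.

```python
# Each relation is stored as a (subject, predicate, object) triple.
# Predicate names are hypothetical; subjects are Winkel-IDs.
TRIPLES = [
    (199, "placeOfPublication", "Den Haag"),   # relation 1/3: newspaper - place
    (199, "relatedPerson", "Jan Drion"),       # relation 2/3: newspaper - persons
    (199, "relatedPerson", "Huib Drion"),
    (199, "relatedNewspaper", 106),            # relation 3/3: newspaper - newspapers
    (199, "relatedNewspaper", 360),
    (199, "relatedNewspaper", 748),
]

# Titles for the Winkel-IDs listed on the slide.
TITLES = {
    106: "Cereales Vadeness",
    360: "Leidsche Brief",
    748: "Sol Justitiae",
}


def objects(triples, subject, predicate):
    """Return all objects linked to `subject` through `predicate`."""
    return [o for s, p, o in triples if s == subject and p == predicate]


def related_titles(triples, subject):
    """Resolve related Winkel-IDs to newspaper titles."""
    return [TITLES[i] for i in objects(triples, subject, "relatedNewspaper")]
```

Once the relations are stored this way, a question such as "which newspapers does entry 199 refer to?" becomes a simple lookup instead of a search through running text.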

Too bad it’s a paper book: hard to find, multiply, distribute and build upon

We need it digital!

1. Clear copyright with the copyright holder (NIOD): open CC-BY-SA license
2. Scan & OCR
3. Convert into PDF
4. Put online: NIOD site & Wikimedia Commons

De Winkel as PDF on the NIOD website (CC-BY-SA): http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons: https://commons.wikimedia.org/wiki/File:PDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989.pdf

Saved us €13330: http://www.brill.com/dutch-underground-press-1940-1945

Wikipedia article about De Winkel: http://nl.wikipedia.org/wiki/De_ondergrondse_pers_1940-1945

Wikipedia article about the author: http://nl.wikipedia.org/wiki/Lydia_Winkel

Winkel: the pluses

• Available online (PDF, flat file)
• Open license (CC-BY-SA)
• Contextual information
• Relations: Titles – Places, Titles – Persons, Titles – Other titles

Winkel: the minuses

• Unstructured data (PDF, flat file)
• Not very machine-readable (unlike CSV, XML, JSON, RDF)
• PDF is no (real) open standard (unlike CSV, XML, JSON, RDF)
• No links between titles, Delpher & KB-cat
• No links between titles, places & persons and external sources (like Wikipedia)

To summarize: a lot of information is available about these WW2 underground newspapers

1. Metadata (KB-cat)
2. Content (full text, Delpher)
3. Context (Winkel PDF)
4. Relations: titles – places, persons, other titles (Winkel PDF)
5. External resources about titles, places and persons

but the data sources are unconnected (and for 3 + 4: unstructured & not machine-readable),

making discovery, understanding & research of these newspapers (and related places & persons) more difficult than necessary.

Good news: we can solve all these issues!

Wikiproject Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

tinyurl.com/verzetskranten (in Dutch)

https://nl.wikipedia.org/wiki/Wikipedia:Wikiproject
https://en.wikipedia.org/wiki/Wikipedia:Wikiproject

From 14 to 1300 titles

We need a database

Build central database

https://www.youtube.com/watch?v=GVDGuCjog_0

Step 1: Create an Excel sheet with

• Metadata about the newspaper (from Winkel PDF)
• Unique Wikipedia article title
• Contextual info, incl. related persons & titles (from Winkel PDF)
• PPN: unique ID linking the newspaper to the KB catalogue & Delpher

Sheet columns: Winkel-ID, Title, Place of publication, Other metadata

Unique Wikipedia article title = <Newspaper title> (verzetsblad, <Place of publication>)
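As a rough illustration of the title rule above, a small helper could derive the article title from two sheet columns. This is a sketch under assumptions: the comma before the place name is a guess at the punctuation, and the fallback without a place is inferred from the De Geus article title, which carries only "(verzetsblad)".

```python
def wikipedia_article_title(newspaper_title, place=None):
    """Build a unique article title: '<title> (verzetsblad, <place>)'.

    The place is optional; without it the disambiguator is just
    '(verzetsblad)', as in 'De Geus onder studenten (verzetsblad)'.
    """
    title = newspaper_title.strip()
    if place and place.strip():
        return f"{title} (verzetsblad, {place.strip()})"
    return f"{title} (verzetsblad)"


def wiki_path(article_title):
    """Wikipedia page names use underscores instead of spaces."""
    return article_title.replace(" ", "_")
```

Deriving the title from the data instead of typing it by hand keeps the naming uniform across all articles and makes it easy to link the articles to each other.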

• Contextual info
• Related persons
• Related newspaper titles

PPN (107123223) links to the KB catalogue record:

http://opc4.kb.nl/DB=1/PPN?PPN=107123223

and to the full texts in Delpher:

http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223
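Because the PPN is the shared key, both links can be generated from it. The sketch below assumes the two URL patterns shown on these slides (reconstructed from the slide text) and treats them as plain string templates; the function name and the loose PPN check are illustrative only.

```python
from urllib.parse import quote

# URL patterns as they appear on the slides (assumed to be stable).
KB_CATALOGUE = "http://opc4.kb.nl/DB=1/PPN?PPN={ppn}"
DELPHER = (
    "http://www.delpher.nl/nl/kranten/results/index"
    "?coll=dddtitel&cql[]={query}"
)


def links_for_ppn(ppn):
    """Return catalogue and Delpher links for one newspaper title."""
    ppn = ppn.strip()
    if not ppn.isalnum():
        # Loose sanity check only; the real PPN format is not specified here.
        raise ValueError(f"unexpected PPN: {ppn!r}")
    return {
        "catalogue": KB_CATALOGUE.format(ppn=ppn),
        # 'ppn=<id>' is percent-encoded, so '=' becomes '%3D'.
        "delpher": DELPHER.format(query=quote(f"ppn={ppn}", safe="")),
    }
```

For example, `links_for_ppn("107123223")` yields the two De Geus onder studenten links shown above.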

Build central database

Step 2: Convert the Excel sheet into an RDF triplestore (= a special kind of online database that anybody can access)

• Steps 1-4 from http://linda-project.eu/linked-data-primer-2
• Step 4: Vocabulary used = Bibframe (http://bibframe.org/vocab)
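To give a feel for what this conversion produces, the sketch below turns one row of a CSV export of the sheet into N-Triples lines using only the Python standard library. It is not the project's actual pipeline: the column names, the `example.org` namespaces and the predicate names are all hypothetical stand-ins (the project itself used the Bibframe vocabulary), and the sample row assumes Winkel nr. 199 is De Geus onder studenten.

```python
import csv
import io

BASE = "http://example.org/verzetskranten/winkel/"  # hypothetical namespace
VOCAB = "http://example.org/vocab/"                  # hypothetical predicates

# A CSV export of the sheet; column names are assumptions.
SHEET = """winkel_id,title,place,ppn
199,De Geus onder studenten,Den Haag,107123223
"""


def literal(value):
    """Quote a string as an N-Triples literal (escaping \\ and ")."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def row_to_triples(row):
    """One sheet row becomes one triple per filled-in column."""
    subject = f"<{BASE}{row['winkel_id']}>"
    columns = {"title": "title", "place": "placeOfPublication", "ppn": "ppn"}
    return [
        f"{subject} <{VOCAB}{predicate}> {literal(row[column])} ."
        for column, predicate in columns.items()
        if row.get(column)
    ]


def sheet_to_ntriples(text):
    """Convert the whole sheet to N-Triples text."""
    lines = []
    for row in csv.DictReader(io.StringIO(text)):
        lines.extend(row_to_triples(row))
    return "\n".join(lines)
```

Each cell becomes a statement about the newspaper it belongs to, which is what lets a triplestore answer questions across all 1300 rows at once.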

Build central database

Step 3: Link to external resources

• Step 5 from http://linda-project.eu/linked-data-primer-2
• DBpedia = machine-readable, structured version of Wikipedia
• DBpedia = hub for linking different data sets on the Web to each other (Linked Open Data cloud)
• We use DBpedia to connect persons & places in our newspaper database to information in other databases

DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data

Linked Open Data cloud: http://lod-cloud.net/versions/2010-09-22/lod-cloud_colored.png

http://nl.dbpedia.org/page/Huib_Drion

https://nl.wikipedia.org/wiki/Huib_Drion

http://www.dbnl.org/auteurs/auteur.php?id=drio001

http://www.biografischportaal.nl/persoon/41181342

Build central database

Added value of Linked Open Data & DBpedia: software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in

• KB catalogue
• Delpher
• De Winkel
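What "automatically query" could look like in practice: the sketch below builds a SPARQL query asking DBpedia which other identifiers describe the same person, and wraps it in a request URL. It only constructs the request and does not send it. The endpoint address, the `format` parameter and the resource URI (derived from the `/page/` address by the usual DBpedia `/resource/` convention) are assumptions, not details from the slides.

```python
from urllib.parse import urlencode

# Assumed SPARQL endpoint of the Dutch DBpedia chapter.
ENDPOINT = "http://nl.dbpedia.org/sparql"


def resource_uri(name):
    """DBpedia resource URI for a Wikipedia page name (assumed convention)."""
    return f"http://nl.dbpedia.org/resource/{name}"


def same_as_query(uri):
    """SPARQL query listing everything declared identical to `uri`."""
    return (
        "PREFIX owl: <http://www.w3.org/2002/07/owl#>\n"
        f"SELECT ?same WHERE {{ <{uri}> owl:sameAs ?same }}"
    )


def query_url(uri):
    """Full GET URL for the query; JSON results are requested."""
    params = {
        "query": same_as_query(uri),
        "format": "application/sparql-results+json",
    }
    return f"{ENDPOINT}?{urlencode(params)}"
```

A call such as `query_url(resource_uri("Huib_Drion"))` gives a URL that software could fetch to discover matching records in other databases, which is the kind of lookup the slide describes.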

Summary: data about 1300 newspapers

• Available online
• Structured data (RDF triples)
• Open license (CC-BY-SA)
• Open standard (RDF)
• Contextual information
• Links between titles, Delpher & KB-cat (via PPNs)
• Relations: Titles – Places, Titles – Persons, Titles – Other titles
• Links between titles, places & persons and external sources (via DBpedia)

http://5stardata.info/en/

We need a template

Using an article template, we can generate 1300 uniform and interlinked Wikipedia articles from the LOD database

LOD database + article template = Wikipedia article

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

Filled in by the template:

• Titles (from database)
• Metadata (from database)
• Related persons (from database)
• Related newspapers (from database)
• Winkel-ID (from database)
• Link to full texts in Delpher (from database)
• Link to KB catalogue record (from database)
• Fixed categories on WP and Commons
• Predefined fixed strings

Grey = from database or predefined fixed strings

Uniformity between articles guaranteed

The rest is all that WP writers need to add manually
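The "database + template = article" idea can be sketched with Python's built-in `string.Template`. The wikitext below is a made-up miniature, not the project's real template; only the category name and the sample values come from these slides, and the field names are hypothetical.

```python
from string import Template

# Hypothetical mini-template: fixed strings plus $placeholders
# that are filled from one database record.
ARTICLE_TEMPLATE = Template(
    "'''$title''' was een verzetsblad uit de Tweede Wereldoorlog "
    "dat in $place werd uitgegeven.\n\n"
    "== Externe links ==\n"
    "* [$catalogue_url Record in de KB-catalogus]\n"
    "* De Winkel, nr. $winkel_id\n\n"
    "[[Categorie:Illegale pers in de Tweede Wereldoorlog]]\n"
)

# One record as it might come out of the database (values from the slides;
# that nr. 199 belongs to this title is an assumption).
SAMPLE = {
    "title": "De Geus onder studenten",
    "place": "Den Haag",
    "winkel_id": "199",
    "catalogue_url": "http://opc4.kb.nl/DB=1/PPN?PPN=107123223",
}


def render_article(record):
    """Fill the template; raises KeyError if a field is missing."""
    return ARTICLE_TEMPLATE.substitute(record)
```

Because `substitute` refuses to render when a field is missing, incomplete records surface as errors instead of as half-filled articles, which supports the uniformity goal mentioned above.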

Problem with Delpher (and the KB catalogue): véry little contextual information about the newspaper (title)s

The KB can re-use (embed) the Wikipedia content in its own websites to tackle this problem

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_%28verzetsblad%29

Delpher - search results

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

Embedded contextual snippet from Wikipedia:

De Geus (onder studenten) was a resistance paper from the Second World War that was published from 4 October 1940 up to and including 13 July 1944 … Read more on Wikipedia

Delpher - object presentation

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

Embedded contextual snippet from Wikipedia:

About De Geus onder studenten

De Geus (onder studenten) was a resistance paper from the Second World War that was published in The Hague from 4 October 1940 up to and including 13 July 1944. The paper appeared monthly in 1940, 1941 and 1943, and otherwise irregularly, with a print run of between 250 and 8000 copies. It was initially stencilled and, from November 1942, printed; the content consisted mainly of opinion articles.

The paper was published by Jan Drion and Huib Drion, two Leiden…

Read more on Wikipedia
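The slides do not say how the snippets are fetched, so the following is only one possible approach: ask the MediaWiki API for the plain-text introduction of the article (the `prop=extracts` module with `exintro` and `explaintext`), then shorten it at a word boundary and append the "read more" label. The function names, the character limit and the label text are illustrative choices.

```python
from urllib.parse import urlencode

API = "https://nl.wikipedia.org/w/api.php"


def extract_api_url(article_title):
    """URL that requests the plain-text intro of one article as JSON."""
    params = {
        "action": "query",
        "prop": "extracts",
        "exintro": 1,       # intro section only
        "explaintext": 1,   # plain text instead of HTML
        "titles": article_title,
        "format": "json",
    }
    return f"{API}?{urlencode(params)}"


def make_snippet(extract, max_chars=120, more_label="Read more on Wikipedia"):
    """Shorten an extract at a word boundary and append the link label."""
    text = " ".join(extract.split())  # collapse whitespace and newlines
    if len(text) <= max_chars:
        return f"{text} {more_label}"
    cut = text[:max_chars].rsplit(" ", 1)[0]
    return f"{cut} … {more_label}"
```

A short limit would suit the search-results view and a longer one the object presentation, matching the two snippet lengths shown above.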

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web : 20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures and video: unlock our data and reframe the way we use it together.

• https://en.wikipedia.org/wiki/Linked_data : Wikipedia article related to the above video

• http://5stardata.info/en/ : The 5 stars of Linked Open Data (Tim Berners-Lee)

• http://linda-project.eu/linked-data-primer-2 : Short primer about creating LOD in practice, starting from an Excel sheet

• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31 : The figure near the bottom of the first page is a good illustration of the concept of (linked) triples

• https://en.wikipedia.org/wiki/DBpedia

• https://en.wikipedia.org/wiki/Semantic_network

Questions?

olaf.janssen@kb.nl - @ookgezellig

tinyurl.com/verzetskranten

Page 8: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

httpscommonswikimediaorgwikiFileVerzetskrant_in_archiefdozen_bij_het_NIODjpg ndash CC-BY-SA - OlafJanssen

By Romaine - Own work CC0 httpscommonswikimediaorgwindexphpcurid=37072734

These 1300 newspapers have been described in the KB-cataloguehellip

Bibliographic metadata

Like this one De Geus onder studenten

PPN = unique ID of this title

in KB-catalogue

httpopc4kbnlDB=1PPNPPN=107123223

These newspapers were digitised page by pagehellip resulting in hellip

full texts in Delpher

httpresolverkbnlresolveurn=ddd010424553mpeg21p001

bull Scans bull Full-text OCR

Again De Geus onder studenten

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

PPN = unique ID of this title in Delpher

(same at for KB-catalogue)

Again De Geus onder studenten

On Delpher you can read and (word)search this title

Say I want to know more about this newspaper bull What sortstyle of underground paper was De Geus bull What is the history of this newspaper bull Who were working on it bull Where was this newpaper printed bull How was De Geus distributed and financed bull Were there any relations with other illegal newspapers or resistance

groups bull Etchellip

Under ldquoDetailsrdquo perhaps

OK ok some metadatahellip

but I want to know moacuteoacuteoacuterhellip

Maybe in the catalogue record

Problem with Delpher (and KB-catalogue)

Veacutery little contextual information about the newspaper(title)s

httpsthejungleisneutralfileswordpresscom201311lostjpg

Question Where would most people start searching contextual

information about De Geus onder studenten

Probably Wikipedia (via Google)

httpnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

Question Where would most people start searching contextual

information about De Geus onder studenten

Probably Wikipedia (via Google)

httpswwwyoutubecomwatchv=VREJV--VHSw

Report on interest in WW2 among Dutch population

httpwwwoorlogsbronnennlgebruikersonderzoek2015 May 2015

Many of us use the internet to search for information [] We often mention Wikipediahellip

Everything is of course on Wikipedia Just type in a name and you can read entire essays (man 70s)

Over half of us think that Wikipedia and Google contribute to our knowledge and understanding of history

When we have to find information about WW2 outside the class setting we fully concentrate on digital resources like Google and Wikipedia (school kids)

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

httpnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

is given in

Context about De Geus onder studenten

But nowhellip

We have another problemhellip

httpsnlwikipediaorgwikiCategorieIllegale_pers_in_de_Tweede_Wereldoorlog

hellip De Geus on Wikipedia is an exception

1 Very few underground newspapers have their own WP articles

2 The overview of these newspapers on WP is far from complete

Good news

htt

p

ww

wa

rch

ives

go

vre

sear

chm

ilita

ryw

w2

ph

oto

sim

ages

ww

2-1

94

jpg

There is a 1-fix solution to this contextual problem

De Ondergrondse Pers 1940-1945

By Lydia E Winkel amp H de Vries

1989 ISBN 9021837463 Veen Uitgevers

This book (ldquoDe Winkelrdquo) contains

contextual articles about

(nearly) all plusmn 1300 illegal WW2 newspapers

ldquoDe Winkelrdquo ndash nr 199

De Ondergrondse Pers 1940-1945 Lydia E Winkel H de Vries 1989

ISBN 9021837463 Veen Uitgevers

Every article has a unique ID

(ldquoWinkel-IDrdquo)

Every article has metadata

bull Title subtitle motto bull Place of publication bull Period of publication bull Publication frequency (daily weekly one-off irregular)

bull Multiplication (stenciled printed typed handwritten)

bull Contents (news opinions poems illustrations humor)

bull Number of prints (min ndash max)

Relation 13

Newspaper Placename

semantics linked data

Relation 13

Newspaper Placename

semantics linked data

Contextual information

Nice material

for a Wikipedia article

Very often persons related to this newspaper are mentioned

Relation 23

Newspaper Persons

semantics linked data

Many articles also contain references to other newspapers

bull 106 = Cereales Vadeness (students resistance newspaper Wageningen) bull 360 = Leidsche Brief (students resistance newspaper Leiden) bull 748 = Sol Justitiae (students resistance newspaper Utrecht)

Relation 33

Newspaper Other newspapers

semantics linked data

htt

p

htt

ps

kn

ow

led

geu

top

iaf

iles

wo

rdp

ress

co

m2

01

40

1h

olla

nd

ho

use

libra

ryb

litz1

94

0j

pg

Too bad itrsquos a paper book hard to find multiply distribute and build upon

We need it digital

htt

p

htt

ps

kn

ow

led

geu

top

iaf

iles

wo

rdp

ress

co

m2

01

40

1h

olla

nd

ho

use

libra

ryb

litz1

94

0j

pg

We need it digital

1 Clear copyright with copyright holder (NIOD) Open CC-BY-SA license

2 Scan amp OCR

3 Convert into PDF

4 Put online NIOD site amp Wikimedia Commons

De Winkel as PDF on NIOD website (CC-BY-SA) httpwwwniodnlnlde-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons httpscommonswikimediaorgwikiFilePDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989pdf

De Winkel as PDF on NIOD website (CC-BY-SA) httpwwwniodnlnlde-ondergrondse-pers-1940-1945

Saved us euro13330 httpwwwbrillcomdutch-underground-press-1940-1945

Wikipedia article about De Winkel httpnlwikipediaorgwikiDe_ondergrondse_pers_1940-1945

Wikipedia article about the author httpnlwikipediaorgwikiLydia_Winkel

Winkel the plusses

Available online (PDF flat file)

Open license (CC-BY-SA) Contextual information Relations

bull Titles Places bull Titles Persons

bull Titles Other titles

htt

p

ww

wa

rch

ives

go

vre

sear

chm

ilita

ryw

w2

ph

oto

sim

ages

ww

2-1

94

jpg

htt

p

2b

pb

logsp

ot

com

_BW

zuYw

iS6-I

TM

geR

sFd3m

IAAAAAAAAElw

3cv

gbZSPW

css

1600d

oct

or+

macr

o+

judy+

scare

djpg

Winkel the minusses

Unstructured data (PDF flat file)

Not very machine readable (unlike CSV XML JSON RDF)

PDF is no (real) open standard (unlike CSV XML JSON RDF)

No links between titles Delpher amp KB-cat No links between titles places amp persons

external sources (like Wikipedia)

but the data sources are

unconnected (and for 3+4 unstructured amp not machine-readable)

To summarize

a lot of information is available about these WW2 underground newspapers

1 Metadata (KB-cat)

2 Content (full-text Delpher)

3 Context (Winkel PDF)

4 Relations titles places persons other titles (Winkel PDF)

5 External resources about titles places and persons

htt

p

2b

pb

logsp

ot

com

_BW

zuYw

iS6-I

TM

geR

sFd3m

IAAAAAAAAElw

3cv

gbZSPW

css

1600d

oct

or+

macr

o+

judy+

scare

djpg

making discovery understanding amp research

of these newspapers (and related places amp persons) more difficult than necessary

We can solve all these issues

Good news

Wikiproject() Verzetskranten

Systematically and uniformly describe amp link all 1300 Dutch underground newspapers from WW2

on Dutch Wikipedia

tinyurlcomverzetskranten (in Dutch)

httpsnlwikipediaorgwikiWikipediaWikiproject httpsenwikipediaorgwikiWikipediaWikiproject

From 14 1300 titles

htt

ps

n

lwik

iped

iao

rgw

iki

Cat

ego

rie

Illeg

ale_

per

s_in

_de_

Twee

de_

Wer

eld

oo

rlo

g

Wikiproject() Verzetskranten

Systematically and uniformly describe amp link all 1300 Dutch underground newspapers from WW2

on Dutch Wikipedia

tinyurlcomverzetskranten (in Dutch)

We need a database

Build central database

httpswwwyoutubecomwatchv=GVDGuCjog_0

Build central database

httpswwwyoutubecomwatchv=GVDGuCjog_0

Step 1 Create Excel-sheet with bull Metadata about newspaper (from Winkel PDF)

bull Unique Wikipedia article title

bull Contextual info incl related persons amp titles (from Winkel PDF)

bull PPN unique ID linking newspaper to

KB-catalogue amp Delpher

Winkel-ID

Place of publication

Other metadata

Title

Unique Wikipedia article title =

ltNewspaper titlegt (verzetsblad ltPlace of publicationgt)

bull Contextual info bull Related persons

bull Related newspaper titles

PPN (107123223)

httpopc4kbnlDB=1PPNPPN=107123223

httpopc4kbnlDB=1PPNPPN=107123223

PPN (107123223)

PPN (107123223)

httpwwwdelphernlnlkrantenresultsindexcoll=dddtitelampcql[]=ppn3D107123223

httpwwwdelphernlnlkrantenresultsindexcoll=dddtitelampcql[]=ppn3D107123223

PPN (107123223)

Build central database

Step 2 Convert Excel into RDF triplestore (=special kind of online database anybody can access)

bull Steps 1-4 from httplinda-projecteulinked-

data-primer-2

bull Step 4 Vocubulary used = Bibframe (httpbibframeorgvocab)

Build central database

Step 3 Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

bull DBpedia = hub for linking different data sets on the Web to each

other Linked Open Data cloud

bull Connect persons amp places in newspaper database to external resources via DBpedia

Step 1c Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data

bull Connect persons amp places in newspaper database to external

resources via DBpedia

httplod-cloudnetversions2010-09-22lod-cloud_coloredpng

Linked Open Data cloud

Build central database

Step 3 Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

bull DBpedia = hub for linking different data sets on the Web to each

other Linked Open Data cloud

bull We use DBpdia to connect persons amp places in our newspaper database to information in other databases

httpnldbpediaorgpageHuib_Drion

httpsnlwikipediaorgwikiHuib_Drion

httpwwwdbnlorgauteursauteurphpid=drio001

httpwwwbiografischportaalnlpersoon41181342

Build central database

Added value of Linked Open Data amp DBpedia Software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in bull KB-catalogue bull Delpher bull De Winkel

Summary data about 1300 newspapers

Available online Structured data (RDF-triples) Open license (CC-BY-SA) Open standard (RDF) Contextual information Links between titles Delpher amp KB-cat Relations Links between titles places

bull Titles Places amp persons external bull Titles Persons sources (via DBpedia) bull Titles Other titles

Summary data about 1300 newspapers

Available online Structured data (RDF-triples) Open license (CC-BY-SA) Open standard (RDF) Contextual information Links between titles Delpher amp KB-cat Relations Links between titles places

bull Titles Places amp persons external bull Titles Persons sources (via DBpedia) bull Titles Other titles

Summary data about 1300 newspapers

Available online Structured data (RDF-triples)

Open license (CC-BY-SA) Open standard (RDF)

Contextual information Links between titles Delpher amp KB-cat (via PPNs)

Relations Links between places

bull Titles Places amp persons external bull Titles Persons sources (via DBpedia)

bull Titles Other titles

(PPNs)

http5stardatainfoen

Wikiproject Verzetskranten

Systematically and uniformly describe amp link all 1300 Dutch underground newspapers from WW2

on Dutch Wikipedia

We need a template

Using an article template we can generate 1300 uniform and interlinked Wikipedia articles

from the LOD-database

htt

ps

c1

sta

ticf

lickr

co

m9

82

81

76

99

23

19

18

_11

a73

56

c38

_bjp

g

LOD-database + article template = Wikipedia article

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

Titles from database

Metadata from database

Related persons from database

Related newspapers from database

Winkel-ID from database

Link to full-texts in Delpher from database

Link to KB-catalogue record from database

Fixed categories on WP and Commons

Predefined fixed strings

Grey = bull From database bull Predefined fixed strings

Uniformity between articles guaranteed

All that WP-writers need to add manually

Problem with Delpher (and KB-catalogue)

Veacutery little contextual information about the newspaper(title)s

httpsthejungleisneutralfileswordpresscom201311lostjpg

Problem with Delpher (and KB-catalogue)

Veacutery little contextual information about the newspaper(title)s

httpsthejungleisneutralfileswordpresscom201311lostjpg

The KB can re-use (embed) the Wikipedia content in its own

websites to tackle this problem

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_28verzetsblad29

Delpher - search results

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog

dat vanaf 4 oktober 1940 tot en met 13 juli 1944 hellip Lees verder op Wikipedia

Embedded contextual snippet from Wikipedia

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

Delpher - search results

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog

dat vanaf 4 oktober 1940 tot en met 13 juli 1944 hellip Lees verder op Wikipedia

Delpher - object presentation

httpresolverkbnlresolveurn=ddd010424553mpeg21p001

Over De Geus onder studenten

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog dat vanaf 4 oktober 1940 tot en met 13 juli 1944 in Den Haag werd uitgegeven Het blad verscheen in 1940 1941 en 1943 maandelijks verder onregelmatig in een oplage tussen de 250 en 8000 exemplaren Het werd aanvankelijk gestencild en vanaf november 1942 gedrukt en de inhoud bestond voornamelijk uit opinie-artikelen

Het blad werd uitgegeven door Jan Drion en Huib Drion twee Leidsehellip

Lees verder op Wikipedia

Embedded contextual snippet from Wikipedia

httpresolverkbnlresolveurn=ddd010424553mpeg21p001

Delpher - object presentation

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web 20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together.

• https://en.wikipedia.org/wiki/Linked_data Wikipedia article related to the above video

• http://5stardata.info/en/ The 5 stars of Linked Open Data (Tim Berners-Lee)

• http://linda-project.eu/linked-data-primer-2 Short primer about creating LOD in practice, starting from an Excel sheet

• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31 The figure near the bottom of the first page is a good illustration of the concept of (linked) triples

• https://en.wikipedia.org/wiki/DBpedia

• https://en.wikipedia.org/wiki/Semantic_network


Questions?

olaf.janssen@kb.nl - @ookgezellig

tinyurl.com/verzetskranten


By Romaine - Own work, CC0, https://commons.wikimedia.org/w/index.php?curid=37072734

These 1300 newspapers have been described in the KB catalogue…

Bibliographic metadata

Like this one: De Geus onder studenten

PPN = unique ID of this title in the KB catalogue

http://opc4.kb.nl/DB=1/PPN?PPN=107123223

These newspapers were digitised page by page… resulting in full texts in Delpher

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

• Scans
• Full-text OCR

Again: De Geus onder studenten

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

PPN = unique ID of this title in Delpher (same as for the KB catalogue)

On Delpher you can read and (word-)search this title

Say I want to know more about this newspaper:

• What sort/style of underground paper was De Geus?
• What is the history of this newspaper?
• Who worked on it?
• Where was this newspaper printed?
• How was De Geus distributed and financed?
• Were there any relations with other illegal newspapers or resistance groups?
• Etc.…

Under "Details" perhaps?

OK, OK, some metadata…

but I want to know móóór…

Maybe in the catalogue record?

Problem with Delpher (and KB-catalogue)

Véry little contextual information about the newspaper (title)s

https://thejungleisneutral.files.wordpress.com/2013/11/lost.jpg

Question: where would most people start searching for contextual information about De Geus onder studenten?

Probably Wikipedia (via Google)

http://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

https://www.youtube.com/watch?v=VREJV--VHSw

Report on interest in WW2 among Dutch population

httpwwwoorlogsbronnennlgebruikersonderzoek2015 May 2015

Many of us use the internet to search for information […] We often mention Wikipedia…

Everything is of course on Wikipedia. Just type in a name and you can read entire essays. (man, 70s)

Over half of us think that Wikipedia and Google contribute to our knowledge and understanding of history.

When we have to find information about WW2 outside the class setting, we fully concentrate on digital resources like Google and Wikipedia. (school kids)

Context about De Geus onder studenten

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

is given in

http://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

But now…

We have another problem…

https://nl.wikipedia.org/wiki/Categorie:Illegale_pers_in_de_Tweede_Wereldoorlog

… De Geus on Wikipedia is an exception

1. Very few underground newspapers have their own WP articles

2. The overview of these newspapers on WP is far from complete

Good news


There is a 1-fix solution to this contextual problem

De Ondergrondse Pers 1940-1945

By Lydia E. Winkel & H. de Vries

1989, ISBN 9021837463, Veen Uitgevers

This book ("De Winkel") contains contextual articles about (nearly) all ± 1300 illegal WW2 newspapers

"De Winkel" – nr. 199

Every article has a unique ID ("Winkel-ID")

Every article has metadata:

• Title, subtitle, motto
• Place of publication
• Period of publication
• Publication frequency (daily, weekly, one-off, irregular)
• Reproduction method (stencilled, printed, typed, handwritten)
• Contents (news, opinions, poems, illustrations, humor)
• Print run (min – max)

Relation 1/3: Newspaper - Placename (semantics, linked data)

Contextual information: nice material for a Wikipedia article

Very often, persons related to this newspaper are mentioned

Relation 2/3: Newspaper - Persons (semantics, linked data)

Many articles also contain references to other newspapers:

• 106 = Cereales Vadeness (students' resistance newspaper, Wageningen)
• 360 = Leidsche Brief (students' resistance newspaper, Leiden)
• 748 = Sol Justitiae (students' resistance newspaper, Utrecht)

Relation 3/3: Newspaper - Other newspapers (semantics, linked data)


Too bad it's a paper book: hard to find, multiply, distribute and build upon

We need it digital


1. Clear copyright with the copyright holder (NIOD): open CC-BY-SA license
2. Scan & OCR
3. Convert into PDF
4. Put online: NIOD site & Wikimedia Commons

De Winkel as PDF on the NIOD website (CC-BY-SA): http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons: https://commons.wikimedia.org/wiki/File:PDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989.pdf

Saved us €13330: http://www.brill.com/dutch-underground-press-1940-1945

Wikipedia article about De Winkel: http://nl.wikipedia.org/wiki/De_ondergrondse_pers_1940-1945

Wikipedia article about the author: http://nl.wikipedia.org/wiki/Lydia_Winkel

Winkel: the plusses

• Available online (PDF, flat file)
• Open license (CC-BY-SA)
• Contextual information
• Relations: Titles - Places, Titles - Persons, Titles - Other titles


Winkel: the minuses

• Unstructured data (PDF, flat file)
• Not very machine-readable (unlike CSV, XML, JSON, RDF)
• PDF is not a (real) open standard (unlike CSV, XML, JSON, RDF)
• No links between titles, Delpher & KB-cat
• No links between titles, places & persons and external sources (like Wikipedia)

To summarize: a lot of information is available about these WW2 underground newspapers

1. Metadata (KB-cat)
2. Content (full text, Delpher)
3. Context (Winkel PDF)
4. Relations: titles, places, persons, other titles (Winkel PDF)
5. External resources about titles, places and persons

but the data sources are unconnected (and, for 3+4, unstructured & not machine-readable)


making discovery, understanding & research of these newspapers (and related places & persons) more difficult than necessary

Good news

We can solve all these issues

Wikiproject Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

tinyurl.com/verzetskranten (in Dutch)

https://nl.wikipedia.org/wiki/Wikipedia:Wikiproject https://en.wikipedia.org/wiki/Wikipedia:Wikiproject

From 14 to 1300 titles


We need a database

Build central database

https://www.youtube.com/watch?v=GVDGuCjog_0

Step 1: Create Excel sheet with

• Metadata about the newspaper (from Winkel PDF): Winkel-ID, title, place of publication, other metadata
• Unique Wikipedia article title = <Newspaper title> (verzetsblad, <Place of publication>)
• Contextual info, incl. related persons & related newspaper titles (from Winkel PDF)
• PPN: unique ID linking the newspaper to KB catalogue & Delpher

For example, PPN 107123223 links to

http://opc4.kb.nl/DB=1/PPN?PPN=107123223 (KB catalogue)

http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223 (Delpher)
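The naming rule and the PPN-based links from Step 1 can be sketched in Python. This is an illustration only, not the project's actual tooling: the function names and the dictionary keys are hypothetical, and the two URL patterns are taken from the De Geus onder studenten example on the slides.

```python
from urllib.parse import quote


def wikipedia_title(newspaper_title, place):
    """Apply the naming rule: <Newspaper title> (verzetsblad, <Place of publication>)."""
    return f"{newspaper_title} (verzetsblad, {place})"


def catalogue_url(ppn):
    """Link to the KB catalogue record for a PPN (URL pattern as on the slide)."""
    return f"http://opc4.kb.nl/DB=1/PPN?PPN={ppn}"


def delpher_url(ppn):
    """Link to the Delpher search results for a PPN (URL pattern as on the slide)."""
    query = quote(f"ppn={ppn}", safe="")  # percent-encodes "=" as "%3D"
    return f"http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]={query}"


# One spreadsheet row, reduced to the columns used here (keys are hypothetical).
row = {"title": "De Geus onder studenten", "ppn": "107123223"}
print(catalogue_url(row["ppn"]))
print(delpher_url(row["ppn"]))
```

Because both links are derived from the PPN alone, the spreadsheet only needs to store the PPN, not the URLs themselves.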

Build central database

Step 2: Convert Excel into RDF triplestore (= a database that stores RDF triples, published online so anybody can access it)

• Steps 1-4 from http://linda-project.eu/linked-data-primer-2
• Step 4: Vocabulary used = Bibframe (http://bibframe.org/vocab)
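What "converting a spreadsheet row into triples" means can be shown with a minimal Python sketch that writes N-Triples by hand. It is not the conversion route of the primer, and the `http://example.org/...` namespace and predicate names are placeholders invented for this example; the project itself used the Bibframe vocabulary, whose terms are not reproduced here.

```python
EX = "http://example.org/verzetskranten/"  # hypothetical namespace, for illustration only


def literal(value):
    """Escape a Python string as an N-Triples literal."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")
    return f'"{escaped}"'


def row_to_ntriples(row):
    """Turn one spreadsheet row into subject-predicate-object statements."""
    subject = f"<{EX}title/{row['ppn']}>"
    triples = [
        (subject, f"<{EX}vocab/title>", literal(row["title"])),
        (subject, f"<{EX}vocab/placeOfPublication>", literal(row["place"])),
        # The object is a URI here, so this triple is a link rather than a string.
        (subject, f"<{EX}vocab/catalogueRecord>",
         f"<http://opc4.kb.nl/DB=1/PPN?PPN={row['ppn']}>"),
    ]
    return "\n".join(f"{s} {p} {o} ." for s, p, o in triples)


row = {"title": "De Geus onder studenten", "place": "Den Haag", "ppn": "107123223"}
print(row_to_ntriples(row))
```

Each spreadsheet cell becomes one statement about the newspaper, and cells that hold identifiers become links to other resources.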

Build central database

Step 3: Link to external resources

• Step 5 from http://linda-project.eu/linked-data-primer-2
• DBpedia = machine-readable, structured version of Wikipedia
• DBpedia = hub for linking different data sets on the Web to each other (Linked Open Data cloud)
• DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data
• We use DBpedia to connect persons & places in our newspaper database to information in other databases

Linked Open Data cloud: http://lod-cloud.net/versions/2010-09-22/lod-cloud_colored.png

Example: Huib Drion

http://nl.dbpedia.org/page/Huib_Drion

https://nl.wikipedia.org/wiki/Huib_Drion

http://www.dbnl.org/auteurs/auteur.php?id=drio001

http://www.biografischportaal.nl/persoon/41181342
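A sketch of how software could ask DBpedia which other databases describe the same person. It only builds the query and the request URL and sends nothing over the network. The assumptions: the Dutch DBpedia names its resources `http://nl.dbpedia.org/resource/<Article_title>`, it offers a SPARQL endpoint at `http://nl.dbpedia.org/sparql`, and that endpoint accepts a `format` parameter; all function names are hypothetical.

```python
from urllib.parse import urlencode


def dbpedia_resource(article_title):
    """Resource URI for a Dutch Wikipedia article title (assumed URI pattern)."""
    return "http://nl.dbpedia.org/resource/" + article_title.replace(" ", "_")


def same_as_query(resource_uri):
    """SPARQL query listing the URIs that other data sets use for the same thing."""
    return (
        "PREFIX owl: <http://www.w3.org/2002/07/owl#>\n"
        f"SELECT ?other WHERE {{ <{resource_uri}> owl:sameAs ?other }}"
    )


def request_url(query, endpoint="http://nl.dbpedia.org/sparql"):
    """GET URL for the query; endpoint and 'format' parameter are assumptions."""
    params = {"query": query, "format": "application/sparql-results+json"}
    return endpoint + "?" + urlencode(params)


query = same_as_query(dbpedia_resource("Huib Drion"))
print(query)
print(request_url(query))
```

The `owl:sameAs` links are what turn DBpedia into a hub: starting from one person in the newspaper database, software can follow them to records about the same person elsewhere.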

Build central database

Added value of Linked Open Data & DBpedia: software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in

• KB catalogue
• Delpher
• De Winkel

Summary: data about 1300 newspapers

• Available online
• Structured data (RDF triples)
• Open license (CC-BY-SA)
• Open standard (RDF)
• Contextual information
• Links between titles, Delpher & KB-cat (via PPNs)
• Relations: Titles - Places, Titles - Persons, Titles - Other titles
• Links between titles, places & persons and external sources (via DBpedia)

http://5stardata.info/en/

Wikiproject Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

We need a template

Using an article template we can generate 1300 uniform and interlinked Wikipedia articles from the LOD-database


LOD-database + article template = Wikipedia article

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

From the database:

• Titles
• Metadata
• Related persons
• Related newspapers
• Winkel-ID
• Link to full texts in Delpher
• Link to KB catalogue record

Predefined fixed strings:

• Fixed categories on WP and Commons

Grey = from database or predefined fixed strings

Uniformity between articles guaranteed

The remaining (non-grey) text is all that WP-writers need to add manually
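The idea "database + template = article" can be sketched with Python's standard `string.Template`. This is a simplified illustration, not the template actually used in the project: the wikitext wording and the field names are hypothetical; only the category name and the two URL patterns come from the slides.

```python
from string import Template

# Fixed strings live in the template; $placeholders are filled from the database.
ARTICLE = Template(
    "'''$title''' was een verzetsblad uit de Tweede Wereldoorlog, "
    "uitgegeven in $place.\n\n"
    "== Externe links ==\n"
    "* [$delpher Volledige tekst in Delpher]\n"
    "* [$catalogue Record in de KB-catalogus]\n\n"
    "[[Categorie:Illegale pers in de Tweede Wereldoorlog]]\n"
)


def render(row):
    """Fill the article template with one database row (a plain dict here)."""
    ppn = row["ppn"]
    return ARTICLE.substitute(
        title=row["title"],
        place=row["place"],
        delpher=f"http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D{ppn}",
        catalogue=f"http://opc4.kb.nl/DB=1/PPN?PPN={ppn}",
    )


print(render({"title": "De Geus onder studenten", "place": "Den Haag", "ppn": "107123223"}))
```

Since every article is rendered from the same template, the structure, links and categories are identical across all titles, and writers only add the free-text context.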

Problem with Delpher (and KB-catalogue)

Véry little contextual information about the newspaper (title)s

https://thejungleisneutral.files.wordpress.com/2013/11/lost.jpg



Page 11: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

PPN = unique ID of this title

in KB-catalogue

httpopc4kbnlDB=1PPNPPN=107123223

These newspapers were digitised page by pagehellip resulting in hellip

full texts in Delpher

httpresolverkbnlresolveurn=ddd010424553mpeg21p001

bull Scans bull Full-text OCR

Again De Geus onder studenten

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

PPN = unique ID of this title in Delpher (the same as in the KB catalogue)

On Delpher you can read and (word)search this title

Say I want to know more about this newspaper:

• What sort/style of underground paper was De Geus?
• What is the history of this newspaper?
• Who worked on it?
• Where was this newspaper printed?
• How was De Geus distributed and financed?
• Were there any relations with other illegal newspapers or resistance groups?
• Etc…

Under “Details” perhaps?

OK, OK, some metadata…

but I want to know more…

Maybe in the catalogue record?

Problem with Delpher (and the KB catalogue)

Very little contextual information about the newspapers (titles)

Image: https://thejungleisneutral.files.wordpress.com/2013/11/lost.jpg

Question: where would most people start searching for contextual information about De Geus onder studenten?

Probably Wikipedia (via Google)

http://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

https://www.youtube.com/watch?v=VREJV--VHSw

Report on interest in WW2 among Dutch population

http://www.oorlogsbronnen.nl/gebruikersonderzoek2015 (May 2015)

“Many of us use the internet to search for information [...]. We often mention Wikipedia…”

“Everything is of course on Wikipedia. Just type in a name and you can read entire essays.” (man, 70s)

“Over half of us think that Wikipedia and Google contribute to our knowledge and understanding of history.”

“When we have to find information about WW2 outside the class setting, we fully concentrate on digital resources like Google and Wikipedia.” (school kids)

Context about De Geus onder studenten

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

is given in

http://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

But now…

We have another problem…

https://nl.wikipedia.org/wiki/Categorie:Illegale_pers_in_de_Tweede_Wereldoorlog

… De Geus on Wikipedia is an exception

1. Very few underground newspapers have their own WP articles

2. The overview of these newspapers on WP is far from complete

Good news

Image: http://www.archives.gov/research/military/ww2/photos/images/ww2-194.jpg

There is a 1-fix solution to this contextual problem

De Ondergrondse Pers 1940-1945

By Lydia E. Winkel & H. de Vries

1989, ISBN 9021837463, Veen Uitgevers

This book (“De Winkel”) contains contextual articles about (nearly) all ± 1300 illegal WW2 newspapers

“De Winkel” – nr. 199

Every article has a unique ID (“Winkel-ID”)

Every article has metadata

• Title, subtitle, motto
• Place of publication
• Period of publication
• Publication frequency (daily, weekly, one-off, irregular)
• Reproduction method (stencilled, printed, typed, handwritten)
• Contents (news, opinions, poems, illustrations, humor)
• Number of prints (min – max)

Relation 1/3

Newspaper ↔ Place name

semantics, linked data

Contextual information: nice material for a Wikipedia article

Very often, persons related to this newspaper are mentioned

Relation 2/3

Newspaper ↔ Persons

semantics, linked data

Many articles also contain references to other newspapers

• 106 = Cereales Vadeness (student resistance newspaper, Wageningen)
• 360 = Leidsche Brief (student resistance newspaper, Leiden)
• 748 = Sol Justitiae (student resistance newspaper, Utrecht)

Relation 3/3

Newspaper ↔ Other newspapers

semantics, linked data
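The three relation types (newspaper to place, newspaper to persons, newspaper to other newspapers) can be written down as subject-predicate-object triples. The following is only a minimal Python sketch. It assumes that Winkel entry 199 is De Geus onder studenten and that the three listed titles are referenced from that entry; the `winkel:` prefix and the predicate names are made up for illustration and do not come from the project.

```python
# Each triple is (subject, predicate, object).
# The "winkel:" prefix and all predicate names are hypothetical.
TRIPLES = [
    ("winkel:199", "placeOfPublication", "Den Haag"),
    ("winkel:199", "relatedPerson", "Jan Drion"),
    ("winkel:199", "relatedPerson", "Huib Drion"),
    ("winkel:199", "relatedNewspaper", "winkel:106"),  # Cereales Vadeness
    ("winkel:199", "relatedNewspaper", "winkel:360"),  # Leidsche Brief
    ("winkel:199", "relatedNewspaper", "winkel:748"),  # Sol Justitiae
]


def objects_of(triples, subject, predicate):
    """Return all objects linked to `subject` through `predicate`."""
    return [o for s, p, o in triples if s == subject and p == predicate]


print(objects_of(TRIPLES, "winkel:199", "relatedPerson"))
print(objects_of(TRIPLES, "winkel:199", "relatedNewspaper"))
```

Because every statement has the same three-part shape, a question such as "which persons are related to this title?" becomes a simple filter instead of a search through running text.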

Image: https://knowledgeutopia.files.wordpress.com/2014/01/hollandhouselibraryblitz1940.jpg

Too bad it’s a paper book: hard to find, multiply, distribute and build upon

We need it digital

1. Clear copyright with the copyright holder (NIOD): open CC-BY-SA license

2. Scan & OCR

3. Convert into PDF

4. Put online: NIOD site & Wikimedia Commons

De Winkel as PDF on the NIOD website (CC-BY-SA): http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons: https://commons.wikimedia.org/wiki/File:PDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989.pdf

Saved us €13330: http://www.brill.com/dutch-underground-press-1940-1945

Wikipedia article about De Winkel: http://nl.wikipedia.org/wiki/De_ondergrondse_pers_1940-1945

Wikipedia article about the author: http://nl.wikipedia.org/wiki/Lydia_Winkel

Winkel: the plusses

• Available online (PDF, flat file)
• Open license (CC-BY-SA)
• Contextual information
• Relations: titles ↔ places, titles ↔ persons, titles ↔ other titles

Image: http://2.bp.blogspot.com/_BWzuYwiS6-I/TMgeRsFd3mI/AAAAAAAAElw/3cvgbZSPWcs/s1600/doctor+macro+judy+scared.jpg

Winkel: the minuses

• Unstructured data (PDF, flat file)
• Not very machine-readable (unlike CSV, XML, JSON, RDF)
• PDF is no (real) open standard (unlike CSV, XML, JSON, RDF)
• No links between titles ↔ Delpher & KB-cat
• No links between titles, places & persons ↔ external sources (like Wikipedia)

To summarize: a lot of information is available about these WW2 underground newspapers

1. Metadata (KB-cat)

2. Content (full text, Delpher)

3. Context (Winkel PDF)

4. Relations: titles ↔ places, persons, other titles (Winkel PDF)

5. External resources about titles, places and persons

but the data sources are unconnected (and for 3+4 unstructured & not machine-readable), making discovery, understanding & research of these newspapers (and related places & persons) more difficult than necessary

Good news: we can solve all these issues

Wikiproject(*) Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

tinyurl.com/verzetskranten (in Dutch)

(*) https://nl.wikipedia.org/wiki/Wikipedia:Wikiproject, https://en.wikipedia.org/wiki/Wikipedia:Wikiproject

From 14 to 1300 titles

We need a database

Build central database

https://www.youtube.com/watch?v=GVDGuCjog_0

Step 1: Create an Excel sheet with

• Metadata about the newspaper (from Winkel PDF)
• Unique Wikipedia article title
• Contextual info, incl. related persons & titles (from Winkel PDF)
• PPN: unique ID linking the newspaper to the KB catalogue & Delpher

Winkel-ID

Title

Place of publication

Other metadata

Unique Wikipedia article title = <Newspaper title> (verzetsblad, <Place of publication>)

• Contextual info
• Related persons
• Related newspaper titles
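The naming rule for the unique Wikipedia article title can be expressed as a small function. This is a hedged sketch, not the project's own tooling: the function name is hypothetical, and the comma between "verzetsblad" and the place name is an assumption, since punctuation was lost in the slide text.

```python
from typing import Optional


def wikipedia_article_title(title: str, place: Optional[str] = None) -> str:
    """Build '<Newspaper title> (verzetsblad, <Place of publication>)'.

    The place is left out when it is unknown or not needed to make
    the title unique.
    """
    qualifier = f"verzetsblad, {place.strip()}" if place else "verzetsblad"
    return f"{title.strip()} ({qualifier})"


print(wikipedia_article_title("Leidsche Brief", "Leiden"))
print(wikipedia_article_title("De Geus onder studenten"))
```

Generating the title from the sheet, rather than typing it by hand, keeps the 1300 article names uniform and makes links between related titles predictable.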

PPN (107123223) → KB catalogue: http://opc4.kb.nl/DB=1/PPN?PPN=107123223

PPN (107123223) → Delpher: http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223
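Since the PPN is the only variable part of both links, the catalogue and Delpher URLs can be derived from it. A minimal sketch, assuming the two URL patterns shown on the slides (as they looked in 2016; they may have changed since); the function names are hypothetical.

```python
from urllib.parse import urlencode

# URL patterns as shown on the slides; treat them as assumptions.
KB_CATALOGUE = "http://opc4.kb.nl/DB=1/PPN"
DELPHER_RESULTS = "http://www.delpher.nl/nl/kranten/results/index"


def kb_catalogue_url(ppn: str) -> str:
    """Link to the catalogue record of the title with this PPN."""
    return f"{KB_CATALOGUE}?{urlencode({'PPN': ppn})}"


def delpher_url(ppn: str) -> str:
    """Link to the Delpher search results for the title with this PPN."""
    # safe="[]" keeps the brackets in the parameter name "cql[]" unescaped.
    query = urlencode({"coll": "dddtitel", "cql[]": f"ppn={ppn}"}, safe="[]")
    return f"{DELPHER_RESULTS}?{query}"


print(kb_catalogue_url("107123223"))
print(delpher_url("107123223"))
```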

Step 2: Convert the Excel sheet into an RDF triplestore (= special kind of online database anybody can access)

• Steps 1-4 from http://linda-project.eu/linked-data-primer-2
• Step 4: vocabulary used = Bibframe (http://bibframe.org/vocab)
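To make the conversion step concrete, here is a hedged sketch of how one spreadsheet row could be turned into RDF triples in N-Triples syntax. It is not the project's actual conversion: the slides name Bibframe as the vocabulary, whereas this sketch uses the Dublin Core `title` property plus a made-up `example.org` namespace, and the row keys are hypothetical column names.

```python
DCTERMS_TITLE = "http://purl.org/dc/terms/title"
EX = "http://example.org/vocab/"              # hypothetical vocabulary
BASE = "http://example.org/verzetskranten/"   # hypothetical base URI


def literal(value) -> str:
    """Format a value as an N-Triples string literal."""
    escaped = str(value).replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def row_to_ntriples(row: dict) -> list:
    """Turn one spreadsheet row into a list of N-Triples lines."""
    subject = f"<{BASE}{row['winkel_id']}>"
    return [
        f"{subject} <{DCTERMS_TITLE}> {literal(row['title'])} .",
        f"{subject} <{EX}placeOfPublication> {literal(row['place'])} .",
        f"{subject} <{EX}ppn> {literal(row['ppn'])} .",
    ]


row = {"winkel_id": 199, "title": "De Geus onder studenten",
       "place": "Den Haag", "ppn": "107123223"}
for line in row_to_ntriples(row):
    print(line)
```

Each spreadsheet cell becomes one statement about the newspaper, which is what allows a triplestore to load and query the data.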

Step 3: Link to external resources

• Step 5 from http://linda-project.eu/linked-data-primer-2
• DBpedia = machine-readable, structured version of Wikipedia
• DBpedia = hub for linking different data sets on the Web to each other (Linked Open Data cloud: http://lod-cloud.net/versions/2010-09-22/lod-cloud_colored.png)
• “DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data”
• We use DBpedia to connect persons & places in our newspaper database to information in other databases

http://nl.dbpedia.org/page/Huib_Drion

https://nl.wikipedia.org/wiki/Huib_Drion

http://www.dbnl.org/auteurs/auteur.php?id=drio001

http://www.biografischportaal.nl/persoon/41181342
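Linking a person or place to DBpedia starts from the matching Wikipedia article, because DBpedia resource URIs reuse the Wikipedia article name. A minimal sketch of that mapping, with a hypothetical function name; it only rewrites the URL and does not check that the DBpedia resource actually exists.

```python
from urllib.parse import urlsplit


def wikipedia_to_dbpedia(article_url: str) -> str:
    """Map a Wikipedia article URL to the corresponding DBpedia resource URI."""
    parts = urlsplit(article_url)
    if not parts.netloc.endswith(".wikipedia.org") or not parts.path.startswith("/wiki/"):
        raise ValueError(f"not a Wikipedia article URL: {article_url}")
    lang = parts.netloc.split(".")[0]
    title = parts.path[len("/wiki/"):]
    # English DBpedia has no language prefix; other chapters do (e.g. nl.dbpedia.org).
    host = "dbpedia.org" if lang == "en" else f"{lang}.dbpedia.org"
    return f"http://{host}/resource/{title}"


print(wikipedia_to_dbpedia("https://nl.wikipedia.org/wiki/Huib_Drion"))
```

The `/resource/` URI identifies the thing itself; the `/page/` URL shown on the slide is the human-readable view of the same resource.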

Added value of Linked Open Data & DBpedia

Software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in

• KB catalogue
• Delpher
• De Winkel

Summary: data about 1300 newspapers

• Available online
• Structured data (RDF triples)
• Open license (CC-BY-SA)
• Open standard (RDF)
• Contextual information
• Links between titles ↔ Delpher & KB-cat (via PPNs)
• Relations: titles ↔ places, titles ↔ persons, titles ↔ other titles
• Links between titles, places & persons ↔ external sources (via DBpedia)

http://5stardata.info/en

Wikiproject Verzetskranten: we need a template

Using an article template, we can generate 1300 uniform and interlinked Wikipedia articles from the LOD database

Image: https://c1.staticflickr.com/9/8281/7699231918_11a7356c38_b.jpg

LOD database + article template = Wikipedia article

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

• Titles: from database
• Metadata: from database
• Related persons: from database
• Related newspapers: from database
• Winkel-ID: from database
• Link to full texts in Delpher: from database
• Link to KB catalogue record: from database
• Fixed categories on WP and Commons
• Predefined fixed strings

Grey = from database or predefined fixed strings: uniformity between articles guaranteed

The rest is all that WP writers need to add manually
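The combination of database fields and predefined fixed strings can be illustrated with a simple text template. This is a sketch under assumptions, not the template the project used: the fixed string is modelled on the Dutch opening sentence of the De Geus article, and the field names (`title`, `start`, `end`, `place`) are hypothetical.

```python
from string import Template

# Fixed string with placeholders; modelled on the opening sentence of the
# Dutch Wikipedia article about De Geus. Field names are hypothetical.
INTRO = Template(
    "$title was een verzetsblad uit de Tweede Wereldoorlog dat vanaf "
    "$start tot en met $end in $place werd uitgegeven."
)


def render_intro(record: dict) -> str:
    """Fill the fixed intro sentence with values from one database record."""
    return INTRO.substitute(record)


record = {
    "title": "De geus (onder studenten)",
    "start": "4 oktober 1940",
    "end": "13 juli 1944",
    "place": "Den Haag",
}
print(render_intro(record))
```

Because the wording lives in one template, every generated article opens in the same way, and only the free-text context has to be written by hand.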

Problem with Delpher (and the KB catalogue): very little contextual information about the newspapers (titles)

The KB can re-use (embed) the Wikipedia content in its own websites to tackle this problem

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_%28verzetsblad%29

Delpher - search results, with an embedded contextual snippet from Wikipedia

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

“De geus (onder studenten) was a resistance newspaper from the Second World War that, from 4 October 1940 until 13 July 1944, … Read more on Wikipedia”

Delpher - object presentation, with an embedded contextual snippet from Wikipedia

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

About De Geus onder studenten

De geus (onder studenten) was a resistance newspaper from the Second World War, published in The Hague from 4 October 1940 until 13 July 1944. The paper appeared monthly in 1940, 1941 and 1943 and otherwise irregularly, with a print run between 250 and 8000 copies. It was initially stencilled and, from November 1942, printed; its content consisted mainly of opinion articles.

The paper was published by Jan Drion and Huib Drion, two Leiden…

Read more on Wikipedia
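How such an embedded snippet could be produced is sketched below: cut the article introduction at a word boundary and append a link back to Wikipedia. This is an illustration only; how Delpher would fetch the introduction is left out, and the function name, the length limit and the HTML shape are assumptions.

```python
from html import escape


def make_snippet(intro: str, article_url: str, max_chars: int = 120,
                 label: str = "Lees verder op Wikipedia") -> str:
    """Shorten an article introduction and append a 'read more' link."""
    if len(intro) > max_chars:
        # Cut at the last space before the limit so no word is split.
        intro = intro[:max_chars].rsplit(" ", 1)[0] + " …"
    link = f'<a href="{escape(article_url, quote=True)}">{escape(label)}</a>'
    return f"{escape(intro)} {link}"


print(make_snippet(
    "De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog",
    "https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)",
    max_chars=40,
))
```

Keeping the link in the snippet matters for the CC-BY-SA license as well as for the reader: it credits Wikipedia and leads to the full article.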

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web: “20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together.”

• https://en.wikipedia.org/wiki/Linked_data: Wikipedia article related to the above video

• http://5stardata.info/en: The 5 stars of Linked Open Data (Tim Berners-Lee)

• http://linda-project.eu/linked-data-primer-2: Short primer about creating LOD in practice, starting from an Excel sheet

• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31: The figure near the bottom of the first page is a good illustration of the concept of (linked) triples

• https://en.wikipedia.org/wiki/DBpedia

• https://en.wikipedia.org/wiki/Semantic_network

Image: http://www.gettyimages.nl/detail/nieuwsfotos/three-women-of-the-ats-light-up-together-ats-regulations-nieuwsfotos/3094265

Questions?

olaf.janssen@kb.nl - @ookgezellig

tinyurl.com/verzetskranten

Page 13: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

full texts in Delpher

httpresolverkbnlresolveurn=ddd010424553mpeg21p001

bull Scans bull Full-text OCR

Again De Geus onder studenten

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

PPN = unique ID of this title in Delpher

(same at for KB-catalogue)

Again De Geus onder studenten

On Delpher you can read and (word)search this title

Say I want to know more about this newspaper bull What sortstyle of underground paper was De Geus bull What is the history of this newspaper bull Who were working on it bull Where was this newpaper printed bull How was De Geus distributed and financed bull Were there any relations with other illegal newspapers or resistance

groups bull Etchellip

Under ldquoDetailsrdquo perhaps

OK ok some metadatahellip

but I want to know moacuteoacuteoacuterhellip

Maybe in the catalogue record

Problem with Delpher (and KB-catalogue)

Veacutery little contextual information about the newspaper(title)s

httpsthejungleisneutralfileswordpresscom201311lostjpg

Question Where would most people start searching contextual

information about De Geus onder studenten

Probably Wikipedia (via Google)

httpnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

Question Where would most people start searching contextual

information about De Geus onder studenten

Probably Wikipedia (via Google)

httpswwwyoutubecomwatchv=VREJV--VHSw

Report on interest in WW2 among Dutch population

httpwwwoorlogsbronnennlgebruikersonderzoek2015 May 2015

Many of us use the internet to search for information [] We often mention Wikipediahellip

Everything is of course on Wikipedia Just type in a name and you can read entire essays (man 70s)

Over half of us think that Wikipedia and Google contribute to our knowledge and understanding of history

When we have to find information about WW2 outside the class setting we fully concentrate on digital resources like Google and Wikipedia (school kids)

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

httpnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

is given in

Context about De Geus onder studenten

But now…

We have another problem…

https://nl.wikipedia.org/wiki/Categorie:Illegale_pers_in_de_Tweede_Wereldoorlog

… De Geus on Wikipedia is an exception

1. Very few underground newspapers have their own WP articles

2. The overview of these newspapers on WP is far from complete

Good news

http://www.archives.gov/research/military/ww2/photos/images/ww2-194.jpg

There is a one-fix solution to this contextual problem

De Ondergrondse Pers 1940-1945

By Lydia E. Winkel & H. de Vries

1989, ISBN 9021837463, Veen Uitgevers

This book (“De Winkel”) contains contextual articles about (nearly) all ± 1300 illegal WW2 newspapers

“De Winkel” – nr. 199

De Ondergrondse Pers 1940-1945, Lydia E. Winkel, H. de Vries, 1989

ISBN 9021837463, Veen Uitgevers

Every article has a unique ID (“Winkel-ID”)

Every article has metadata:

• Title, subtitle, motto
• Place of publication
• Period of publication
• Publication frequency (daily, weekly, one-off, irregular)
• Reproduction method (stencilled, printed, typed, handwritten)
• Contents (news, opinions, poems, illustrations, humor)
• Print run (min – max)

Relation 1/3: Newspaper – Place name (semantics, linked data)

Contextual information: nice material for a Wikipedia article

Very often, persons related to this newspaper are mentioned

Relation 2/3: Newspaper – Persons (semantics, linked data)

Many articles also contain references to other newspapers

• 106 = Cereales Vadeness (students' resistance newspaper, Wageningen)
• 360 = Leidsche Brief (students' resistance newspaper, Leiden)
• 748 = Sol Justitiae (students' resistance newspaper, Utrecht)

Relation 3/3: Newspaper – Other newspapers (semantics, linked data)

https://knowledgeutopia.files.wordpress.com/2014/01/hollandhouselibraryblitz1940.jpg

Too bad it’s a paper book: hard to find, copy, distribute and build upon

We need it digital

1. Clear copyright with the copyright holder (NIOD): open CC-BY-SA license

2. Scan & OCR

3. Convert into PDF

4. Put online: NIOD site & Wikimedia Commons

De Winkel as PDF on NIOD website (CC-BY-SA): http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons: https://commons.wikimedia.org/wiki/File:PDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989.pdf

Saved us €13330: http://www.brill.com/dutch-underground-press-1940-1945

Wikipedia article about De Winkel: http://nl.wikipedia.org/wiki/De_ondergrondse_pers_1940-1945

Wikipedia article about the author: http://nl.wikipedia.org/wiki/Lydia_Winkel

Winkel: the pluses

• Available online (PDF, flat file)
• Open license (CC-BY-SA)
• Contextual information
• Relations:
  • Titles – Places
  • Titles – Persons
  • Titles – Other titles

http://2.bp.blogspot.com/_BWzuYwiS6-I/TMgeRsFd3mI/AAAAAAAAElw/3cvgbZSPWcs/s1600/doctor+macro+judy+scared.jpg

Winkel: the minuses

• Unstructured data (PDF, flat file)
• Not very machine-readable (unlike CSV, XML, JSON, RDF)
• PDF is no (real) open standard (unlike CSV, XML, JSON, RDF)
• No links between titles and Delpher & KB-cat
• No links between titles, places & persons and external sources (like Wikipedia)

To summarize: a lot of information is available about these WW2 underground newspapers

1. Metadata (KB-cat)
2. Content (full text, Delpher)
3. Context (Winkel PDF)
4. Relations: titles, places, persons, other titles (Winkel PDF)
5. External resources about titles, places and persons

but the data sources are unconnected (and for 3+4 unstructured & not machine-readable), making discovery, understanding & research of these newspapers (and related places & persons) more difficult than necessary.

Good news: we can solve all these issues

Wikiproject (*) Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

tinyurl.com/verzetskranten (in Dutch)

(*) https://nl.wikipedia.org/wiki/Wikipedia:Wikiproject, https://en.wikipedia.org/wiki/Wikipedia:Wikiproject

From 14 to 1300 titles

https://nl.wikipedia.org/wiki/Categorie:Illegale_pers_in_de_Tweede_Wereldoorlog

We need a database

Build central database

https://www.youtube.com/watch?v=GVDGuCjog_0

Step 1: Create Excel sheet with

• Metadata about newspaper (from Winkel PDF)
• Unique Wikipedia article title
• Contextual info, incl. related persons & titles (from Winkel PDF)
• PPN: unique ID linking newspaper to KB-catalogue & Delpher

Winkel-ID

Place of publication

Other metadata

Title

Unique Wikipedia article title = <Newspaper title> (verzetsblad, <Place of publication>)

• Contextual info
• Related persons
• Related newspaper titles

PPN (107123223)

KB-catalogue: http://opc4.kb.nl/DB=1/PPN?PPN=107123223

Delpher: http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223
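Because the PPN is the shared key, both links can be derived from one spreadsheet column. A minimal sketch in Python, assuming the two URL patterns are the ones shown on the slides (reconstructed here, and possibly changed since 2016); the function names are hypothetical:

```python
from urllib.parse import quote

def catalogue_url(ppn: str) -> str:
    """KB-catalogue record URL for a PPN (pattern taken from the slide example)."""
    return f"http://opc4.kb.nl/DB=1/PPN?PPN={ppn}"

def delpher_url(ppn: str) -> str:
    """Delpher full-text search URL for a PPN (pattern taken from the slide example)."""
    # The query "ppn=<PPN>" is percent-encoded, so "=" becomes "%3D".
    query = quote(f"ppn={ppn}", safe="")
    return ("http://www.delpher.nl/nl/kranten/results/index"
            f"?coll=dddtitel&cql[]={query}")

ppn = "107123223"  # De Geus onder studenten
links = {"catalogue": catalogue_url(ppn), "delpher": delpher_url(ppn)}
```

Storing only the PPN and deriving the links keeps the sheet small and makes it easy to regenerate every link if a URL pattern changes.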

Build central database

Step 2: Convert Excel into RDF triplestore (= special kind of online database anybody can access)

• Steps 1-4 from http://linda-project.eu/linked-data-primer-2
• Step 4: Vocabulary used = Bibframe (http://bibframe.org/vocab)
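To make the Excel-to-RDF step concrete, here is a rough sketch of turning one spreadsheet row into triples in N-Triples syntax. It is not the project's actual conversion: the slides name Bibframe as the vocabulary, but to avoid guessing Bibframe property names this sketch uses two Dublin Core terms plus a hypothetical `example.org` namespace, and the column names are assumptions.

```python
from typing import Dict, List

DCTERMS = "http://purl.org/dc/terms/"
EX = "http://example.org/verzetskranten/"  # hypothetical namespace, for illustration only

def literal(value: str) -> str:
    """Format a plain string as an N-Triples literal (escape backslashes and quotes)."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'

def row_to_ntriples(row: Dict) -> List[str]:
    """Turn one spreadsheet row into subject-predicate-object lines."""
    subject = f"<{EX}title/{row['ppn']}>"
    triples = [
        (subject, f"<{DCTERMS}title>", literal(row["title"])),
        (subject, f"<{DCTERMS}identifier>", literal(row["ppn"])),
        (subject, f"<{EX}placeOfPublication>", literal(row["place"])),
    ]
    # One triple per related person: this is the newspaper-person relation.
    triples += [(subject, f"<{EX}relatedPerson>", literal(p)) for p in row["persons"]]
    return [f"{s} {p} {o} ." for s, p, o in triples]

row = {
    "ppn": "107123223",
    "title": "De Geus onder studenten",
    "place": "Den Haag",
    "persons": ["Jan Drion", "Huib Drion"],
}
lines = row_to_ntriples(row)
```

Each line is one statement about the newspaper, which is what lets a triplestore answer questions such as "which titles are related to this person?".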

Build central database

Step 3: Link to external resources

• Step 5 from http://linda-project.eu/linked-data-primer-2
• DBpedia = machine-readable, structured version of Wikipedia
• DBpedia = hub for linking different data sets on the Web to each other: the Linked Open Data cloud (http://lod-cloud.net/versions/2010-09-22/lod-cloud_colored.png)
• DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data
• We use DBpedia to connect persons & places in our newspaper database to information in other databases

http://nl.dbpedia.org/page/Huib_Drion

https://nl.wikipedia.org/wiki/Huib_Drion

http://www.dbnl.org/auteurs/auteur.php?id=drio001

http://www.biografischportaal.nl/persoon/41181342
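How such linking could be queried is sketched below: a SPARQL query asking DBpedia which other resources are declared identical (`owl:sameAs`) to a person. The endpoint address and the resource URI are assumptions based on the usual DBpedia conventions, and the sketch only builds the request URL; it does not contact the server.

```python
from urllib.parse import urlencode

ENDPOINT = "http://nl.dbpedia.org/sparql"  # assumed address of the Dutch DBpedia endpoint

def same_as_query(resource_uri: str) -> str:
    """SPARQL query listing resources linked to resource_uri via owl:sameAs."""
    return (
        "PREFIX owl: <http://www.w3.org/2002/07/owl#>\n"
        f"SELECT ?other WHERE {{ <{resource_uri}> owl:sameAs ?other }}"
    )

def query_url(resource_uri: str) -> str:
    """GET URL for the query, asking for results in SPARQL JSON format."""
    params = {
        "query": same_as_query(resource_uri),
        "format": "application/sparql-results+json",
    }
    return f"{ENDPOINT}?{urlencode(params)}"

# Assumed resource URI for Huib Drion, following the /resource/<Article_title> convention.
url = query_url("http://nl.dbpedia.org/resource/Huib_Drion")
```

Fetching `url` with any HTTP client would return the linked identifiers, which software can then follow to other databases without human intervention.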

Build central database

Added value of Linked Open Data & DBpedia: software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in

• KB-catalogue
• Delpher
• De Winkel

Summary: data about 1300 newspapers

• Available online
• Open license (CC-BY-SA)
• Contextual information
• Relations:
  • Titles – Places
  • Titles – Persons
  • Titles – Other titles
• Structured data (RDF triples)
• Open standard (RDF)
• Links between titles and Delpher & KB-cat (via PPNs)
• Links between titles, places & persons and external sources (via DBpedia)

http://5stardata.info/en/

We need a template

Using an article template we can generate 1300 uniform and interlinked Wikipedia articles from the LOD-database

https://c1.staticflickr.com/9/8281/7699231918_11a7356c38_b.jpg

LOD-database + article template = Wikipedia article

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

Titles from database

Metadata from database

Related persons from database

Related newspapers from database

Winkel-ID from database

Link to full-texts in Delpher from database

Link to KB-catalogue record from database

Fixed categories on WP and Commons

Predefined fixed strings

Grey =

• From database
• Predefined fixed strings

Uniformity between articles guaranteed

All that WP-writers need to add manually
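The "database + template = article" idea can be sketched with a plain string template. This is an illustration, not the project's real template: the fixed Dutch wording is borrowed from the opening sentence of the De Geus article quoted in these slides, and the field names are hypothetical.

```python
from string import Template

# Fixed strings live in the template; $fields are filled from the database.
# ''' is wikitext markup for bold.
OPENING_SENTENCE = Template(
    "'''$title''' was een verzetsblad uit de Tweede Wereldoorlog "
    "dat vanaf $start tot en met $end in $place werd uitgegeven."
)

def render_opening(record: dict) -> str:
    """Fill the template with one database record."""
    return OPENING_SENTENCE.substitute(record)

record = {
    "title": "De geus (onder studenten)",
    "start": "4 oktober 1940",
    "end": "13 juli 1944",
    "place": "Den Haag",
}
wikitext = render_opening(record)
```

Because every article is rendered from the same template, wording and structure stay uniform across all titles; only the free-text context has to be written by hand.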

Problem with Delpher (and KB-catalogue)

Véry little contextual information about the newspaper (title)s

https://thejungleisneutral.files.wordpress.com/2013/11/lost.jpg

The KB can re-use (embed) the Wikipedia content in its own websites to tackle this problem

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_%28verzetsblad%29

Delpher - search results

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

Embedded contextual snippet from Wikipedia:

De geus (onder studenten) was a resistance newspaper from the Second World War that, from 4 October 1940 until 13 July 1944, … Read more on Wikipedia

Delpher - object presentation

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

Embedded contextual snippet from Wikipedia:

About De Geus onder studenten

De geus (onder studenten) was a resistance newspaper from the Second World War, published in The Hague from 4 October 1940 until 13 July 1944. The paper appeared monthly in 1940, 1941 and 1943, and irregularly otherwise, with a print run of between 250 and 8000 copies. It was initially stencilled and, from November 1942, printed; its content consisted mainly of opinion articles.

The paper was published by Jan Drion and Huib Drion, two Leiden…

Read more on Wikipedia
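One way such an embedded snippet could be produced is sketched below. It assumes the article's introduction is requested as plain text through the MediaWiki API (`prop=extracts`) and then shortened at a word boundary; whether the KB would build it this way is not stated in the slides, and the function names are hypothetical. The sketch only builds the request URL and does not call the API.

```python
from urllib.parse import urlencode

def extract_api_url(title: str, lang: str = "nl") -> str:
    """MediaWiki API URL asking for the plain-text introduction of an article."""
    params = {
        "action": "query",
        "prop": "extracts",
        "exintro": 1,       # introduction only
        "explaintext": 1,   # plain text instead of HTML
        "titles": title,
        "format": "json",
    }
    return f"https://{lang}.wikipedia.org/w/api.php?{urlencode(params)}"

def make_snippet(extract: str, max_chars: int = 120) -> str:
    """Shorten an extract at a word boundary and mark the cut with an ellipsis."""
    if len(extract) <= max_chars:
        return extract
    return extract[:max_chars].rsplit(" ", 1)[0] + " …"

api_url = extract_api_url("De Geus onder studenten (verzetsblad)")
```

The search-results view would use a short `max_chars`, the object presentation a longer one, each followed by a "Read more on Wikipedia" link to the article.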

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web : 20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together.

• https://en.wikipedia.org/wiki/Linked_data : Wikipedia article related to the above video

• http://5stardata.info/en/ : The 5 stars of Linked Open Data (Tim Berners-Lee)

• http://linda-project.eu/linked-data-primer-2 : Short primer about creating LOD in practice, starting from an Excel sheet

• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31 : The figure near the bottom of the first page is a good illustration of the concept of (linked) triples

• https://en.wikipedia.org/wiki/DBpedia

• https://en.wikipedia.org/wiki/Semantic_network

http://www.gettyimages.nl/detail/nieuwsfoto's/three-women-of-the-ats-light-up-together-ats-regulations-nieuwsfotos/3094265

Questions?

olaf.janssen@kb.nl - @ookgezellig

tinyurl.com/verzetskranten

Page 15: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

PPN = unique ID of this title in Delpher

(same at for KB-catalogue)

Again De Geus onder studenten

On Delpher you can read and (word)search this title

Say I want to know more about this newspaper bull What sortstyle of underground paper was De Geus bull What is the history of this newspaper bull Who were working on it bull Where was this newpaper printed bull How was De Geus distributed and financed bull Were there any relations with other illegal newspapers or resistance

groups bull Etchellip

Under ldquoDetailsrdquo perhaps

OK ok some metadatahellip

but I want to know moacuteoacuteoacuterhellip

Maybe in the catalogue record

Problem with Delpher (and KB-catalogue)

Veacutery little contextual information about the newspaper(title)s

httpsthejungleisneutralfileswordpresscom201311lostjpg

Question Where would most people start searching contextual

information about De Geus onder studenten

Probably Wikipedia (via Google)

httpnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

Question Where would most people start searching contextual

information about De Geus onder studenten

Probably Wikipedia (via Google)

httpswwwyoutubecomwatchv=VREJV--VHSw

Report on interest in WW2 among Dutch population

httpwwwoorlogsbronnennlgebruikersonderzoek2015 May 2015

Many of us use the internet to search for information [] We often mention Wikipediahellip

Everything is of course on Wikipedia Just type in a name and you can read entire essays (man 70s)

Over half of us think that Wikipedia and Google contribute to our knowledge and understanding of history

When we have to find information about WW2 outside the class setting we fully concentrate on digital resources like Google and Wikipedia (school kids)

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

httpnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

is given in

Context about De Geus onder studenten

But nowhellip

We have another problemhellip

httpsnlwikipediaorgwikiCategorieIllegale_pers_in_de_Tweede_Wereldoorlog

hellip De Geus on Wikipedia is an exception

1 Very few underground newspapers have their own WP articles

2 The overview of these newspapers on WP is far from complete

Good news

htt

p

ww

wa

rch

ives

go

vre

sear

chm

ilita

ryw

w2

ph

oto

sim

ages

ww

2-1

94

jpg

There is a 1-fix solution to this contextual problem

De Ondergrondse Pers 1940-1945

By Lydia E Winkel amp H de Vries

1989 ISBN 9021837463 Veen Uitgevers

This book (ldquoDe Winkelrdquo) contains

contextual articles about

(nearly) all plusmn 1300 illegal WW2 newspapers

ldquoDe Winkelrdquo ndash nr 199

De Ondergrondse Pers 1940-1945 Lydia E Winkel H de Vries 1989

ISBN 9021837463 Veen Uitgevers

Every article has a unique ID

(ldquoWinkel-IDrdquo)

Every article has metadata

bull Title subtitle motto bull Place of publication bull Period of publication bull Publication frequency (daily weekly one-off irregular)

bull Multiplication (stenciled printed typed handwritten)

bull Contents (news opinions poems illustrations humor)

bull Number of prints (min ndash max)

Relation 13

Newspaper Placename

semantics linked data

Relation 13

Newspaper Placename

semantics linked data

Contextual information

Nice material

for a Wikipedia article

Very often persons related to this newspaper are mentioned

Relation 23

Newspaper Persons

semantics linked data

Many articles also contain references to other newspapers

bull 106 = Cereales Vadeness (students resistance newspaper Wageningen) bull 360 = Leidsche Brief (students resistance newspaper Leiden) bull 748 = Sol Justitiae (students resistance newspaper Utrecht)

Relation 33

Newspaper Other newspapers

semantics linked data

htt

p

htt

ps

kn

ow

led

geu

top

iaf

iles

wo

rdp

ress

co

m2

01

40

1h

olla

nd

ho

use

libra

ryb

litz1

94

0j

pg

Too bad itrsquos a paper book hard to find multiply distribute and build upon

We need it digital

htt

p

htt

ps

kn

ow

led

geu

top

iaf

iles

wo

rdp

ress

co

m2

01

40

1h

olla

nd

ho

use

libra

ryb

litz1

94

0j

pg

We need it digital

1. Clear copyright with copyright holder (NIOD): open CC-BY-SA license

2. Scan & OCR

3. Convert into PDF

4. Put online: NIOD site & Wikimedia Commons

De Winkel as PDF on NIOD website (CC-BY-SA): http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons: https://commons.wikimedia.org/wiki/File:PDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989.pdf

Saved us €13330: http://www.brill.com/dutch-underground-press-1940-1945

Wikipedia article about De Winkel: http://nl.wikipedia.org/wiki/De_ondergrondse_pers_1940-1945

Wikipedia article about the author: http://nl.wikipedia.org/wiki/Lydia_Winkel

Winkel: the pluses

• Available online (PDF, flat file)
• Open license (CC-BY-SA)
• Contextual information
• Relations: Titles ↔ Places, Titles ↔ Persons, Titles ↔ Other titles


Winkel: the minuses

• Unstructured data (PDF, flat file)
• Not very machine-readable (unlike CSV, XML, JSON, RDF)
• PDF is not a (real) open standard (unlike CSV, XML, JSON, RDF)
• No links between titles ↔ Delpher & KB-cat
• No links between titles, places & persons ↔ external sources (like Wikipedia)

To summarize: a lot of information is available about these WW2 underground newspapers

1. Metadata (KB-cat)
2. Content (full text, Delpher)
3. Context (Winkel PDF)
4. Relations: titles ↔ places, persons, other titles (Winkel PDF)
5. External resources about titles, places and persons

but the data sources are unconnected (and for 3+4 unstructured & not machine-readable), making discovery, understanding & research of these newspapers (and related places & persons) more difficult than necessary

We can solve all these issues

Good news

Wikiproject(*) Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

tinyurl.com/verzetskranten (in Dutch)

(*) https://nl.wikipedia.org/wiki/Wikipedia:Wikiproject, https://en.wikipedia.org/wiki/Wikipedia:Wikiproject

From 14 to 1300 titles


We need a database

Build central database

https://www.youtube.com/watch?v=GVDGuCjog_0

Step 1: Create Excel sheet with

• Metadata about newspaper (from Winkel PDF)
• Unique Wikipedia article title
• Contextual info, incl. related persons & titles (from Winkel PDF)
• PPN: unique ID linking newspaper to KB-catalogue & Delpher

Columns in the sheet:

• Winkel-ID
• Title
• Place of publication
• Other metadata
• Unique Wikipedia article title = <Newspaper title> (verzetsblad, <Place of publication>)
• Contextual info
• Related persons
• Related newspaper titles
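The article-title rule above can be sketched in a few lines of Python. This is only an illustration: the function name is hypothetical, and the exact punctuation of the disambiguation suffix (a comma before the place, and no place when the title is already unique) is an assumption based on the example article title used elsewhere in these slides.

```python
from typing import Optional


def article_title(newspaper_title: str, place: Optional[str] = None) -> str:
    """Build a disambiguated Wikipedia article title for one newspaper.

    Assumption: the place of publication is only appended when it is given,
    and it is separated from "verzetsblad" by a comma.
    """
    suffix = "verzetsblad" if not place else f"verzetsblad, {place}"
    return f"{newspaper_title.strip()} ({suffix})"


if __name__ == "__main__":
    print(article_title("De Geus onder studenten"))
    print(article_title("De Geus onder studenten", "Den Haag"))
```

Keeping this rule in one function means every row in the sheet gets its title built the same way, which is what makes the later interlinking predictable.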

PPN (107123223)

• KB-catalogue: http://opc4.kb.nl/DB=1/PPN?PPN=107123223
• Delpher: http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223
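Because the PPN is the only variable part of both links, they can be derived from the spreadsheet column. The sketch below assumes the two URL patterns shown on the slides (they may have changed since 2016); the function and constant names are made up for illustration.

```python
from urllib.parse import urlencode

# Assumed URL patterns, reconstructed from the slides.
CATALOGUE_BASE = "http://opc4.kb.nl/DB=1/PPN"
DELPHER_BASE = "http://www.delpher.nl/nl/kranten/results/index"


def catalogue_url(ppn: str) -> str:
    """Link to the KB-catalogue record for a newspaper title."""
    return f"{CATALOGUE_BASE}?{urlencode({'PPN': ppn})}"


def delpher_url(ppn: str) -> str:
    """Link to the full-text issues of a newspaper title in Delpher."""
    params = {"coll": "dddtitel", "cql[]": f"ppn={ppn}"}
    return f"{DELPHER_BASE}?{urlencode(params)}"


if __name__ == "__main__":
    print(catalogue_url("107123223"))
    print(delpher_url("107123223"))
```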

Step 2: Convert Excel into RDF triplestore (= special kind of online database anybody can access)

• Steps 1-4 from http://linda-project.eu/linked-data-primer-2
• Step 4: Vocabulary used = Bibframe (http://bibframe.org/vocab)
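To give an idea of what "converting a sheet into triples" means, here is a minimal Python sketch that turns one spreadsheet row into N-Triples lines. It is not the tooling from the primer: the `example.org` namespaces, the column names and the sample row (including the pairing of Winkel-ID 199 with this title) are placeholders for illustration, and a real conversion would use Bibframe property URIs instead of the made-up vocabulary.

```python
import csv
import io

# Placeholder namespaces; the project itself used the Bibframe vocabulary.
BASE = "http://example.org/verzetskranten/"
VOCAB = "http://example.org/vocab/"

# Illustrative sheet exported as CSV (column names are assumptions).
SHEET = """winkel_id,title,place,ppn
199,De Geus onder studenten,Den Haag,107123223
"""


def literal(value: str) -> str:
    """Serialise a plain string as an N-Triples literal."""
    escaped = (
        value.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")
    )
    return f'"{escaped}"'


def row_to_triples(row: dict) -> list:
    """One subject per newspaper, one triple per non-empty column."""
    subject = f"<{BASE}{row['winkel_id']}>"
    triples = []
    for column, value in row.items():
        if column == "winkel_id" or not value:
            continue
        triples.append(f"{subject} <{VOCAB}{column}> {literal(value)} .")
    return triples


if __name__ == "__main__":
    for row in csv.DictReader(io.StringIO(SHEET)):
        for triple in row_to_triples(row):
            print(triple)
```

Each printed line is one subject, predicate, object statement; loading those lines into a triplestore is what makes the data queryable.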

Step 3: Link to external resources

• Step 5 from http://linda-project.eu/linked-data-primer-2
• DBpedia = machine-readable, structured version of Wikipedia
• DBpedia = hub for linking different data sets on the Web to each other (Linked Open Data cloud: http://lod-cloud.net/versions/2010-09-22/lod-cloud_colored.png)
• DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data
• We use DBpedia to connect persons & places in our newspaper database to information in other databases

Example: Huib Drion

http://nl.dbpedia.org/page/Huib_Drion

https://nl.wikipedia.org/wiki/Huib_Drion

http://www.dbnl.org/auteurs/auteur.php?id=drio001

http://www.biografischportaal.nl/persoon/41181342
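A query for "which other databases describe the same person" could look roughly like the Python sketch below. It only builds the SPARQL text and the request URL and sends nothing. The endpoint address, the `format` parameter and the use of `owl:sameAs` as the linking property are assumptions about how the Dutch DBpedia is set up, not something stated on the slides.

```python
from urllib.parse import urlencode

# Assumed SPARQL endpoint of the Dutch DBpedia chapter.
ENDPOINT = "http://nl.dbpedia.org/sparql"


def same_as_query(resource_uri: str) -> str:
    """SPARQL query asking for resources declared identical to this one."""
    return (
        "PREFIX owl: <http://www.w3.org/2002/07/owl#>\n"
        f"SELECT ?same WHERE {{ <{resource_uri}> owl:sameAs ?same }}"
    )


def request_url(query: str) -> str:
    """GET URL for the query; the 'format' parameter is an assumption."""
    params = {"query": query, "format": "application/sparql-results+json"}
    return f"{ENDPOINT}?{urlencode(params)}"


if __name__ == "__main__":
    query = same_as_query("http://nl.dbpedia.org/resource/Huib_Drion")
    print(query)
    print(request_url(query))
```

The resource URI uses `/resource/` rather than the `/page/` address shown above, since `/page/` is DBpedia's human-readable view of the same entry.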

Added value of Linked Open Data & DBpedia

Software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in

• KB-catalogue
• Delpher
• De Winkel

Summary: data about 1300 newspapers

• Available online
• Structured data (RDF triples)
• Open license (CC-BY-SA)
• Open standard (RDF)
• Contextual information
• Links between titles ↔ Delpher & KB-cat (via PPNs)
• Relations: Titles ↔ Places, Titles ↔ Persons, Titles ↔ Other titles
• Links between titles, places & persons ↔ external sources (via DBpedia)

http://5stardata.info/en


We need a template

Using an article template we can generate 1300 uniform and interlinked Wikipedia articles from the LOD-database


LOD-database + article template = Wikipedia article

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

In the generated article:

• Titles: from database
• Metadata: from database
• Related persons: from database
• Related newspapers: from database
• Winkel-ID: from database
• Link to full texts in Delpher: from database
• Link to KB-catalogue record: from database
• Fixed categories on WP and Commons
• Predefined fixed strings

Grey = from database or predefined fixed strings: uniformity between articles guaranteed

The remaining (non-grey) text is all that WP-writers need to add manually
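The idea of "database + template = article" can be illustrated with Python's standard `string.Template`. The wikitext below is a deliberately tiny, made-up template, not the one used by the Wikiproject; the field names and the sample record are assumptions, and only the category name is taken from the Dutch Wikipedia category shown earlier in the lecture.

```python
from string import Template

# Hypothetical mini-template: fixed strings plus placeholders for database fields.
ARTICLE = Template(
    "'''$title''' was een verzetsblad uit de Tweede Wereldoorlog "
    "dat in $place werd uitgegeven.\n\n"
    "== Externe links ==\n"
    "* [$delpher_url Volledige tekst in Delpher]\n"
    "* [$catalogue_url Record in de KB-catalogus]\n\n"
    "[[Categorie:Illegale pers in de Tweede Wereldoorlog]]\n"
)


def render_article(record: dict) -> str:
    """Fill the template; raises KeyError if a database field is missing."""
    return ARTICLE.substitute(record)


if __name__ == "__main__":
    record = {
        "title": "De Geus onder studenten",
        "place": "Den Haag",
        "delpher_url": "http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel",
        "catalogue_url": "http://opc4.kb.nl/DB=1/PPN?PPN=107123223",
    }
    print(render_article(record))
```

Using `substitute` rather than `safe_substitute` is a deliberate choice here: an incomplete database row fails loudly instead of producing an article with a stray placeholder.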

Problem with Delpher (and KB-catalogue)

Very little contextual information about the newspaper (title)s

The KB can re-use (embed) the Wikipedia content in its own websites to tackle this problem

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

Delpher - search results

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

Embedded contextual snippet from Wikipedia:

De Geus (onder studenten) was a resistance newspaper from the Second World War that, from 4 October 1940 up to and including 13 July 1944, … Read more on Wikipedia

Delpher - object presentation

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

Embedded contextual snippet from Wikipedia:

About De Geus onder studenten

De Geus (onder studenten) was a resistance newspaper from the Second World War that was published in The Hague from 4 October 1940 up to and including 13 July 1944. The paper appeared monthly in 1940, 1941 and 1943 and irregularly otherwise, with a print run of between 250 and 8000 copies. It was initially stencilled and, from November 1942, printed; its content consisted mainly of opinion articles.

The paper was published by Jan Drion and Huib Drion, two Leiden…

Read more on Wikipedia
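How such an embedded snippet might be cut from the article's opening text is sketched below in Python. Fetching the text from Wikipedia is left out; the function name, the character limit and the exact wording of the link line are assumptions for illustration.

```python
def wikipedia_snippet(extract: str, article_url: str, max_chars: int = 120) -> str:
    """Shorten an article extract to a teaser and append a link to the full article.

    The text is cut at the last word boundary before max_chars so that
    no word is broken in the middle.
    """
    text = " ".join(extract.split())  # collapse line breaks and double spaces
    if len(text) > max_chars:
        text = text[:max_chars].rsplit(" ", 1)[0] + "…"
    return f"{text} Read more on Wikipedia: {article_url}"


if __name__ == "__main__":
    intro = (
        "De Geus (onder studenten) was a resistance newspaper from the "
        "Second World War that was published in The Hague from 4 October 1940 "
        "up to and including 13 July 1944."
    )
    url = "https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)"
    print(wikipedia_snippet(intro, url))
```

A short limit suits the search-results view, while a larger `max_chars` gives the longer "About" box of the object presentation.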

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web : 20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together
• https://en.wikipedia.org/wiki/Linked_data : Wikipedia article related to the above video
• http://5stardata.info/en : The 5 stars of Linked Open Data (Tim Berners-Lee)
• http://linda-project.eu/linked-data-primer-2 : Short primer about creating LOD in practice, starting from an Excel sheet
• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31 : The figure near the bottom of the first page is a good illustration of the concept of (linked) triples
• https://en.wikipedia.org/wiki/DBpedia
• https://en.wikipedia.org/wiki/Semantic_network


Questions?

olaf.janssen@kb.nl - @ookgezellig

tinyurl.com/verzetskranten





1 Very few underground newspapers have their own WP articles

2 The overview of these newspapers on WP is far from complete

Good news

htt

p

ww

wa

rch

ives

go

vre

sear

chm

ilita

ryw

w2

ph

oto

sim

ages

ww

2-1

94

jpg

There is a 1-fix solution to this contextual problem

De Ondergrondse Pers 1940-1945

By Lydia E Winkel amp H de Vries

1989 ISBN 9021837463 Veen Uitgevers

This book (ldquoDe Winkelrdquo) contains

contextual articles about

(nearly) all plusmn 1300 illegal WW2 newspapers

ldquoDe Winkelrdquo ndash nr 199

De Ondergrondse Pers 1940-1945 Lydia E Winkel H de Vries 1989

ISBN 9021837463 Veen Uitgevers

Every article has a unique ID

(ldquoWinkel-IDrdquo)

Every article has metadata

bull Title subtitle motto bull Place of publication bull Period of publication bull Publication frequency (daily weekly one-off irregular)

bull Multiplication (stenciled printed typed handwritten)

bull Contents (news opinions poems illustrations humor)

bull Number of prints (min ndash max)

Relation 13

Newspaper Placename

semantics linked data

Relation 13

Newspaper Placename

semantics linked data

Contextual information

Nice material

for a Wikipedia article

Very often persons related to this newspaper are mentioned

Relation 23

Newspaper Persons

semantics linked data

Many articles also contain references to other newspapers

bull 106 = Cereales Vadeness (students resistance newspaper Wageningen) bull 360 = Leidsche Brief (students resistance newspaper Leiden) bull 748 = Sol Justitiae (students resistance newspaper Utrecht)

Relation 33

Newspaper Other newspapers

semantics linked data

htt

p

htt

ps

kn

ow

led

geu

top

iaf

iles

wo

rdp

ress

co

m2

01

40

1h

olla

nd

ho

use

libra

ryb

litz1

94

0j

pg

Too bad itrsquos a paper book hard to find multiply distribute and build upon

We need it digital

htt

p

htt

ps

kn

ow

led

geu

top

iaf

iles

wo

rdp

ress

co

m2

01

40

1h

olla

nd

ho

use

libra

ryb

litz1

94

0j

pg

We need it digital

1 Clear copyright with copyright holder (NIOD) Open CC-BY-SA license

2 Scan amp OCR

3 Convert into PDF

4 Put online NIOD site amp Wikimedia Commons

De Winkel as PDF on NIOD website (CC-BY-SA) httpwwwniodnlnlde-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons httpscommonswikimediaorgwikiFilePDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989pdf

De Winkel as PDF on NIOD website (CC-BY-SA) httpwwwniodnlnlde-ondergrondse-pers-1940-1945

Saved us euro13330 httpwwwbrillcomdutch-underground-press-1940-1945

Wikipedia article about De Winkel httpnlwikipediaorgwikiDe_ondergrondse_pers_1940-1945

Wikipedia article about the author httpnlwikipediaorgwikiLydia_Winkel

Winkel the plusses

Available online (PDF flat file)

Open license (CC-BY-SA) Contextual information Relations

bull Titles Places bull Titles Persons

bull Titles Other titles

htt

p

ww

wa

rch

ives

go

vre

sear

chm

ilita

ryw

w2

ph

oto

sim

ages

ww

2-1

94

jpg

htt

p

2b

pb

logsp

ot

com

_BW

zuYw

iS6-I

TM

geR

sFd3m

IAAAAAAAAElw

3cv

gbZSPW

css

1600d

oct

or+

macr

o+

judy+

scare

djpg

Winkel the minusses

Unstructured data (PDF flat file)

Not very machine readable (unlike CSV XML JSON RDF)

PDF is no (real) open standard (unlike CSV XML JSON RDF)

No links between titles Delpher amp KB-cat No links between titles places amp persons

external sources (like Wikipedia)

but the data sources are

unconnected (and for 3+4 unstructured amp not machine-readable)

To summarize

a lot of information is available about these WW2 underground newspapers

1 Metadata (KB-cat)

2 Content (full-text Delpher)

3 Context (Winkel PDF)

4 Relations titles places persons other titles (Winkel PDF)

5 External resources about titles places and persons

htt

p

2b

pb

logsp

ot

com

_BW

zuYw

iS6-I

TM

geR

sFd3m

IAAAAAAAAElw

3cv

gbZSPW

css

1600d

oct

or+

macr

o+

judy+

scare

djpg

making discovery understanding amp research

of these newspapers (and related places amp persons) more difficult than necessary

We can solve all these issues

Good news

Wikiproject() Verzetskranten

Systematically and uniformly describe amp link all 1300 Dutch underground newspapers from WW2

on Dutch Wikipedia

tinyurlcomverzetskranten (in Dutch)

httpsnlwikipediaorgwikiWikipediaWikiproject httpsenwikipediaorgwikiWikipediaWikiproject

From 14 1300 titles


We need a database

Build central database

https://www.youtube.com/watch?v=GVDGuCjog_0

Step 1: Create an Excel sheet with

• Metadata about the newspaper (from Winkel PDF): Winkel-ID, title, place of publication, other metadata
• Unique Wikipedia article title = <Newspaper title> (verzetsblad, <Place of publication>)
• Contextual info, incl. related persons & related newspaper titles (from Winkel PDF)
• PPN: unique ID linking the newspaper to KB-catalogue & Delpher
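The title pattern from the spreadsheet can be sketched as a small helper. This is only an illustration, not the project's actual tooling: the function names are hypothetical, the comma between "verzetsblad" and the place is an assumption (punctuation was lost in the slide text), and leaving out the place when none is given is inferred from the De Geus onder studenten article URL used elsewhere in this deck.

```python
def wikipedia_article_title(newspaper_title: str, place: str = "") -> str:
    """Build a unique article title: '<title> (verzetsblad, <place>)'.

    Assumption: the place is omitted when it is not needed to disambiguate,
    as in 'De Geus onder studenten (verzetsblad)'.
    """
    title = " ".join(newspaper_title.split())   # normalise whitespace
    place = " ".join(place.split())
    qualifier = f"verzetsblad, {place}" if place else "verzetsblad"
    return f"{title} ({qualifier})"


def wikipedia_url(article_title: str) -> str:
    """Turn an article title into a Dutch Wikipedia URL (spaces become underscores)."""
    return "https://nl.wikipedia.org/wiki/" + article_title.replace(" ", "_")


title = wikipedia_article_title("De Geus onder studenten")
print(title)
print(wikipedia_url(title))
```

Generating the title in one place means that every row of the spreadsheet gets a title of the same shape.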

PPN (107123223) in the KB-catalogue: http://opc4.kb.nl/DB=1/PPN?PPN=107123223

PPN (107123223) in Delpher: http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223
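Because the PPN is the shared key, both links can be derived from it. The sketch below assumes the two URL patterns shown on the slides (reconstructed here from text that had lost its punctuation); the function names are hypothetical and the patterns may not match the current KB or Delpher sites.

```python
from urllib.parse import quote


def catalogue_url(ppn: str) -> str:
    """Link to the KB-catalogue record for a PPN (pattern assumed from the slide)."""
    return f"http://opc4.kb.nl/DB=1/PPN?PPN={quote(ppn)}"


def delpher_url(ppn: str) -> str:
    """Link to the Delpher full-text results for a PPN (pattern assumed from the slide)."""
    query = quote(f"ppn={ppn}", safe="")   # '=' is percent-encoded as %3D
    return ("http://www.delpher.nl/nl/kranten/results/index"
            f"?coll=dddtitel&cql[]={query}")


print(catalogue_url("107123223"))
print(delpher_url("107123223"))
```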

Build central database

Step 2: Convert Excel into an RDF triplestore (= a special kind of online database anybody can access)

• Steps 1-4 from http://linda-project.eu/linked-data-primer-2
• Step 4: vocabulary used = Bibframe (http://bibframe.org/vocab)
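To make the idea of "spreadsheet rows become triples" concrete, here is a minimal sketch that turns one row into N-Triples lines using only the Python standard library. It is not the conversion used in the project: the base URI and the property names are placeholders under example.org rather than real Bibframe terms, the column names are invented, and pairing Winkel-ID 199 with this title in the sample row is an assumption.

```python
import csv
import io

BASE = "http://example.org/verzetskranten/"   # hypothetical base URI for newspapers
VOCAB = "http://example.org/vocab/"           # placeholder vocabulary, NOT Bibframe

# One sample row; a real sheet would be exported from Excel as CSV.
SHEET = """winkel_id,title,place,ppn
199,De Geus onder studenten,Den Haag,107123223
"""


def literal(value: str) -> str:
    """Format a plain string as an N-Triples literal."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def row_to_triples(row: dict) -> list:
    """One subject per newspaper, one triple per non-empty column."""
    subject = f"<{BASE}{row['winkel_id']}>"
    triples = []
    for column in ("title", "place", "ppn"):
        if row[column]:
            triples.append(f"{subject} <{VOCAB}{column}> {literal(row[column])} .")
    return triples


def sheet_to_ntriples(text: str) -> list:
    """Convert CSV text into a list of N-Triples lines."""
    triples = []
    for row in csv.DictReader(io.StringIO(text)):
        triples.extend(row_to_triples(row))
    return triples


for line in sheet_to_ntriples(SHEET):
    print(line)
```

Each printed line is one subject, predicate, object statement; loading such lines into a triplestore is what makes the data queryable.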

Build central database

Step 3: Link to external resources

• Step 5 from http://linda-project.eu/linked-data-primer-2
• DBpedia = machine-readable, structured version of Wikipedia
• DBpedia = hub for linking different data sets on the Web to each other (Linked Open Data cloud: http://lod-cloud.net/versions/2010-09-22/lod-cloud_colored.png)
• DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data
• We use DBpedia to connect persons & places in our newspaper database to information in other databases

Example (Huib Drion):

• http://nl.dbpedia.org/page/Huib_Drion
• https://nl.wikipedia.org/wiki/Huib_Drion
• http://www.dbnl.org/auteurs/auteur.php?id=drio001
• http://www.biografischportaal.nl/persoon/41181342

Build central database

Added value of Linked Open Data & DBpedia: software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in

• KB-catalogue
• Delpher
• De Winkel
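As a rough sketch of what "software can automatically query" might involve, the helper below maps a Wikipedia article URL to the corresponding DBpedia resource URI and builds the text of a SPARQL query asking for `owl:sameAs` links. The function names are hypothetical, the query is only constructed and not sent anywhere, and whether a given external database is actually linked from DBpedia in this way is not claimed here.

```python
from urllib.parse import urlsplit


def dbpedia_resource(wikipedia_url: str) -> str:
    """Map a Wikipedia article URL to the DBpedia resource URI for the same page name."""
    parts = urlsplit(wikipedia_url)
    prefix = "/wiki/"
    if not parts.netloc.endswith("wikipedia.org") or not parts.path.startswith(prefix):
        raise ValueError(f"not a Wikipedia article URL: {wikipedia_url}")
    language = parts.netloc.split(".")[0]      # 'nl', 'en', ...
    name = parts.path[len(prefix):]
    # English DBpedia lives at dbpedia.org, other chapters at <lang>.dbpedia.org.
    host = "dbpedia.org" if language == "en" else f"{language}.dbpedia.org"
    return f"http://{host}/resource/{name}"


def same_as_query(resource: str) -> str:
    """Build (but do not send) a SPARQL query for resources linked via owl:sameAs."""
    return (
        "PREFIX owl: <http://www.w3.org/2002/07/owl#>\n"
        f"SELECT ?other WHERE {{ <{resource}> owl:sameAs ?other }}"
    )


resource = dbpedia_resource("https://nl.wikipedia.org/wiki/Huib_Drion")
print(resource)
print(same_as_query(resource))
```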

Summary: data about 1300 newspapers

• Available online
• Open license (CC-BY-SA)
• Contextual information
• Relations:
  • Titles - Places
  • Titles - Persons
  • Titles - Other titles
• Structured data (RDF triples)
• Open standard (RDF)
• Links between titles, Delpher & KB-cat (via PPNs)
• Links between titles, places & persons and external sources (via DBpedia)

http://5stardata.info/en

Wikiproject Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

We need a template

Using an article template, we can generate 1300 uniform and interlinked Wikipedia articles from the LOD-database


LOD-database + article template = Wikipedia article

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

Grey = from database, or predefined fixed strings:

• Titles (from database)
• Metadata (from database)
• Related persons (from database)
• Related newspapers (from database)
• Winkel-ID (from database)
• Link to full texts in Delpher (from database)
• Link to KB-catalogue record (from database)
• Fixed categories on WP and Commons (predefined fixed strings)

Uniformity between articles is guaranteed.

The remainder (not grey) is all that WP-writers need to add manually.
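The "database + template = article" idea can be illustrated with Python's `string.Template`. This is a sketch under assumptions, not the template the project used: the fixed opening sentence is modelled on the De Geus onder studenten article quoted in this deck (it reads "was a resistance newspaper from the Second World War, published in ... from ... until ..."), the record layout is invented, and treating the two listed papers as related titles is assumed for the example.

```python
from string import Template

# Fixed strings live in the template; everything marked with $ comes from the database.
ARTICLE_TEMPLATE = Template(
    "'''$title''' was een verzetsblad uit de Tweede Wereldoorlog "
    "dat vanaf $start tot en met $end in $place werd uitgegeven.\n\n"
    "$related\n\n"
    "[[Categorie:Illegale pers in de Tweede Wereldoorlog]]"
)


def render_article(record: dict) -> str:
    """Fill the template with one database record (hypothetical field names)."""
    related = "\n".join(f"* [[{name}]]" for name in record["related_titles"])
    return ARTICLE_TEMPLATE.substitute(
        title=record["title"],
        start=record["start"],
        end=record["end"],
        place=record["place"],
        related=related,
    )


record = {
    "title": "De geus (onder studenten)",
    "start": "4 oktober 1940",
    "end": "13 juli 1944",
    "place": "Den Haag",
    "related_titles": ["Leidsche Brief", "Sol Justitiae"],
}
print(render_article(record))
```

Since every article comes out of the same template, wording and categories stay uniform, and writers only need to add the free text.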

Problem with Delpher (and KB-catalogue)

Very little contextual information about the newspaper (title)s

Image: https://thejungleisneutral.files.wordpress.com/2013/11/lost.jpg

The KB can re-use (embed) the Wikipedia content in its own websites to tackle this problem

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

Delpher - search results

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

Embedded contextual snippet from Wikipedia:

"De geus (onder studenten) was a resistance newspaper from the Second World War that, from 4 October 1940 until 13 July 1944 … Read more on Wikipedia"

Delpher - object presentation

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

Embedded contextual snippet from Wikipedia:

About De Geus onder studenten

"De geus (onder studenten) was a resistance newspaper from the Second World War that was published in The Hague from 4 October 1940 until 13 July 1944. The paper appeared monthly in 1940, 1941 and 1943, and otherwise irregularly, with a print run of between 250 and 8000 copies. It was initially stencilled and, from November 1942, printed; its content consisted mainly of opinion articles.

The paper was published by Jan Drion and Huib Drion, two Leiden… Read more on Wikipedia"
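How such an embedded snippet could be produced is sketched below: the article's opening text is cut at a word boundary and followed by a "Lees verder op Wikipedia" ("Read more on Wikipedia") link. The function names and the HTML shape are assumptions for illustration; fetching the article text from Wikipedia is left out, and this is not a description of how Delpher is actually implemented.

```python
from html import escape


def make_snippet(text: str, max_chars: int = 120) -> str:
    """Shorten text to at most max_chars characters, cutting at a word boundary."""
    text = " ".join(text.split())                 # collapse whitespace and newlines
    if len(text) <= max_chars:
        return text
    cut = text.rfind(" ", 0, max_chars + 1)       # last space within the limit
    if cut <= 0:
        cut = max_chars                           # no space found: hard cut
    return text[:cut].rstrip(" ,;:") + "…"


def embed_html(text: str, article_url: str, max_chars: int = 120) -> str:
    """Wrap the snippet and a link to the full article in a small HTML fragment."""
    snippet = escape(make_snippet(text, max_chars))
    link = escape(article_url, quote=True)
    return f'<p>{snippet} <a href="{link}">Lees verder op Wikipedia</a></p>'


intro = ("De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog "
         "dat vanaf 4 oktober 1940 tot en met 13 juli 1944 in Den Haag werd uitgegeven.")
print(embed_html(
    intro,
    "https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)",
))
```

A shorter `max_chars` would suit the search-results view, a longer one the object presentation.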

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web : "20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together."
• https://en.wikipedia.org/wiki/Linked_data : Wikipedia article related to the above video
• http://5stardata.info/en : The 5 stars of Linked Open Data (Tim Berners-Lee)
• http://linda-project.eu/linked-data-primer-2 : Short primer about creating LOD in practice, starting from an Excel sheet
• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31 : The figure near the bottom of the first page is a good illustration of the concept of (linked) triples
• https://en.wikipedia.org/wiki/DBpedia
• https://en.wikipedia.org/wiki/Semantic_network


Questions?

olaf.janssen@kb.nl - @ookgezellig

tinyurl.com/verzetskranten


Page 18: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

OK, OK, some metadata…

but I want to know mooore…

Maybe in the catalogue record?

Problem with Delpher (and KB-catalogue)

Very little contextual information about the newspaper (title)s

Image: https://thejungleisneutral.files.wordpress.com/2013/11/lost.jpg

Question: Where would most people start searching for contextual information about De Geus onder studenten?

Probably Wikipedia (via Google)

http://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

https://www.youtube.com/watch?v=VREJV--VHSw

Report on interest in WW2 among the Dutch population

http://www.oorlogsbronnen.nl/gebruikersonderzoek2015 (May 2015)

• "Many of us use the internet to search for information [...] We often mention Wikipedia…"
• "Everything is of course on Wikipedia. Just type in a name and you can read entire essays." (man, 70s)
• "Over half of us think that Wikipedia and Google contribute to our knowledge and understanding of history."
• "When we have to find information about WW2 outside the class setting, we fully concentrate on digital resources like Google and Wikipedia." (school kids)

Context about De Geus onder studenten is given in Wikipedia:

• Delpher: http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)
• Wikipedia: http://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

But now… we have another problem…

https://nl.wikipedia.org/wiki/Categorie:Illegale_pers_in_de_Tweede_Wereldoorlog

… De Geus on Wikipedia is an exception:

1. Very few underground newspapers have their own WP articles
2. The overview of these newspapers on WP is far from complete

Good news


There is a 1-fix solution to this contextual problem

De Ondergrondse Pers 1940-1945
By Lydia E. Winkel & H. de Vries
1989, ISBN 9021837463, Veen Uitgevers

This book ("De Winkel") contains contextual articles about (nearly) all ± 1300 illegal WW2 newspapers

"De Winkel" - nr. 199

Every article has a unique ID ("Winkel-ID")

Every article has metadata

• Title, subtitle, motto
• Place of publication
• Period of publication
• Publication frequency (daily, weekly, one-off, irregular)
• Reproduction method (stencilled, printed, typed, handwritten)
• Contents (news, opinions, poems, illustrations, humor)
• Print run (min - max)

Relation 1/3: Newspaper - Place name (semantics, linked data)

Contextual information: nice material for a Wikipedia article

Very often, persons related to this newspaper are mentioned

Relation 2/3: Newspaper - Persons (semantics, linked data)

Many articles also contain references to other newspapers

• 106 = Cereales Vadeness (students' resistance newspaper, Wageningen)
• 360 = Leidsche Brief (students' resistance newspaper, Leiden)
• 748 = Sol Justitiae (students' resistance newspaper, Utrecht)

Relation 3/3: Newspaper - Other newspapers (semantics, linked data)


Too bad it's a paper book: hard to find, multiply, distribute and build upon

We need it digital


1. Clear copyright with the copyright holder (NIOD): open CC-BY-SA license
2. Scan & OCR
3. Convert into PDF
4. Put online: NIOD site & Wikimedia Commons

De Winkel as PDF on NIOD website (CC-BY-SA): http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons: https://commons.wikimedia.org/wiki/File:PDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989.pdf

Saved us €13330: http://www.brill.com/dutch-underground-press-1940-1945

Wikipedia article about De Winkel: http://nl.wikipedia.org/wiki/De_ondergrondse_pers_1940-1945


Page 19: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

Maybe in the catalogue record

Problem with Delpher (and KB-catalogue)

Veacutery little contextual information about the newspaper(title)s

httpsthejungleisneutralfileswordpresscom201311lostjpg

Question Where would most people start searching contextual

information about De Geus onder studenten

Probably Wikipedia (via Google)

httpnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

Question Where would most people start searching contextual

information about De Geus onder studenten

Probably Wikipedia (via Google)

httpswwwyoutubecomwatchv=VREJV--VHSw

Report on interest in WW2 among Dutch population

httpwwwoorlogsbronnennlgebruikersonderzoek2015 May 2015

Many of us use the internet to search for information [] We often mention Wikipediahellip

Everything is of course on Wikipedia Just type in a name and you can read entire essays (man 70s)

Over half of us think that Wikipedia and Google contribute to our knowledge and understanding of history

When we have to find information about WW2 outside the class setting we fully concentrate on digital resources like Google and Wikipedia (school kids)

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

httpnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

is given in

Context about De Geus onder studenten

But nowhellip

We have another problemhellip

httpsnlwikipediaorgwikiCategorieIllegale_pers_in_de_Tweede_Wereldoorlog

hellip De Geus on Wikipedia is an exception

1 Very few underground newspapers have their own WP articles

2 The overview of these newspapers on WP is far from complete

Good news

htt

p

ww

wa

rch

ives

go

vre

sear

chm

ilita

ryw

w2

ph

oto

sim

ages

ww

2-1

94

jpg

There is a 1-fix solution to this contextual problem

De Ondergrondse Pers 1940-1945

By Lydia E Winkel amp H de Vries

1989 ISBN 9021837463 Veen Uitgevers

This book (ldquoDe Winkelrdquo) contains

contextual articles about

(nearly) all plusmn 1300 illegal WW2 newspapers

ldquoDe Winkelrdquo ndash nr 199

De Ondergrondse Pers 1940-1945 Lydia E Winkel H de Vries 1989

ISBN 9021837463 Veen Uitgevers

Every article has a unique ID

(ldquoWinkel-IDrdquo)

Every article has metadata

bull Title subtitle motto bull Place of publication bull Period of publication bull Publication frequency (daily weekly one-off irregular)

bull Multiplication (stenciled printed typed handwritten)

bull Contents (news opinions poems illustrations humor)

bull Number of prints (min ndash max)

Relation 13

Newspaper Placename

semantics linked data

Relation 13

Newspaper Placename

semantics linked data

Contextual information

Nice material

for a Wikipedia article

Very often persons related to this newspaper are mentioned

Relation 23

Newspaper Persons

semantics linked data

Many articles also contain references to other newspapers

bull 106 = Cereales Vadeness (students resistance newspaper Wageningen) bull 360 = Leidsche Brief (students resistance newspaper Leiden) bull 748 = Sol Justitiae (students resistance newspaper Utrecht)

Relation 33

Newspaper Other newspapers

semantics linked data

htt

p

htt

ps

kn

ow

led

geu

top

iaf

iles

wo

rdp

ress

co

m2

01

40

1h

olla

nd

ho

use

libra

ryb

litz1

94

0j

pg

Too bad itrsquos a paper book hard to find multiply distribute and build upon

We need it digital

htt

p

htt

ps

kn

ow

led

geu

top

iaf

iles

wo

rdp

ress

co

m2

01

40

1h

olla

nd

ho

use

libra

ryb

litz1

94

0j

pg

We need it digital

1 Clear copyright with copyright holder (NIOD) Open CC-BY-SA license

2 Scan amp OCR

3 Convert into PDF

4 Put online NIOD site amp Wikimedia Commons

De Winkel as PDF on NIOD website (CC-BY-SA) httpwwwniodnlnlde-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons httpscommonswikimediaorgwikiFilePDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989pdf

De Winkel as PDF on NIOD website (CC-BY-SA) httpwwwniodnlnlde-ondergrondse-pers-1940-1945

Saved us euro13330 httpwwwbrillcomdutch-underground-press-1940-1945

Wikipedia article about De Winkel httpnlwikipediaorgwikiDe_ondergrondse_pers_1940-1945

Wikipedia article about the author httpnlwikipediaorgwikiLydia_Winkel

Winkel the plusses

Available online (PDF flat file)

Open license (CC-BY-SA) Contextual information Relations

bull Titles Places bull Titles Persons

bull Titles Other titles

htt

p

ww

wa

rch

ives

go

vre

sear

chm

ilita

ryw

w2

ph

oto

sim

ages

ww

2-1

94

jpg

htt

p

2b

pb

logsp

ot

com

_BW

zuYw

iS6-I

TM

geR

sFd3m

IAAAAAAAAElw

3cv

gbZSPW

css

1600d

oct

or+

macr

o+

judy+

scare

djpg

Winkel the minusses

Unstructured data (PDF flat file)

Not very machine readable (unlike CSV XML JSON RDF)

PDF is no (real) open standard (unlike CSV XML JSON RDF)

No links between titles Delpher amp KB-cat No links between titles places amp persons

external sources (like Wikipedia)

but the data sources are

unconnected (and for 3+4 unstructured amp not machine-readable)

To summarize

a lot of information is available about these WW2 underground newspapers

1 Metadata (KB-cat)

2 Content (full-text Delpher)

3 Context (Winkel PDF)

4 Relations titles places persons other titles (Winkel PDF)

5 External resources about titles places and persons

htt

p

2b

pb

logsp

ot

com

_BW

zuYw

iS6-I

TM

geR

sFd3m

IAAAAAAAAElw

3cv

gbZSPW

css

1600d

oct

or+

macr

o+

judy+

scare

djpg

making discovery understanding amp research

of these newspapers (and related places amp persons) more difficult than necessary

We can solve all these issues

Good news

Wikiproject() Verzetskranten

Systematically and uniformly describe amp link all 1300 Dutch underground newspapers from WW2

on Dutch Wikipedia

tinyurlcomverzetskranten (in Dutch)

httpsnlwikipediaorgwikiWikipediaWikiproject httpsenwikipediaorgwikiWikipediaWikiproject

From 14 1300 titles

htt

ps

n

lwik

iped

iao

rgw

iki

Cat

ego

rie

Illeg

ale_

per

s_in

_de_

Twee

de_

Wer

eld

oo

rlo

g

Wikiproject() Verzetskranten

Systematically and uniformly describe amp link all 1300 Dutch underground newspapers from WW2

on Dutch Wikipedia

tinyurlcomverzetskranten (in Dutch)

We need a database

Build central database

httpswwwyoutubecomwatchv=GVDGuCjog_0

Build central database

httpswwwyoutubecomwatchv=GVDGuCjog_0

Step 1 Create Excel-sheet with bull Metadata about newspaper (from Winkel PDF)

bull Unique Wikipedia article title

bull Contextual info incl related persons amp titles (from Winkel PDF)

bull PPN unique ID linking newspaper to

KB-catalogue amp Delpher

Winkel-ID

Place of publication

Other metadata

Title

Unique Wikipedia article title =

ltNewspaper titlegt (verzetsblad ltPlace of publicationgt)

bull Contextual info bull Related persons

bull Related newspaper titles

PPN (107123223)

httpopc4kbnlDB=1PPNPPN=107123223

httpopc4kbnlDB=1PPNPPN=107123223

PPN (107123223)

PPN (107123223)

httpwwwdelphernlnlkrantenresultsindexcoll=dddtitelampcql[]=ppn3D107123223

httpwwwdelphernlnlkrantenresultsindexcoll=dddtitelampcql[]=ppn3D107123223

PPN (107123223)

Build central database

Step 2 Convert Excel into RDF triplestore (=special kind of online database anybody can access)

bull Steps 1-4 from httplinda-projecteulinked-

data-primer-2

bull Step 4 Vocubulary used = Bibframe (httpbibframeorgvocab)

Build central database

Step 3 Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

bull DBpedia = hub for linking different data sets on the Web to each

other Linked Open Data cloud

bull Connect persons amp places in newspaper database to external resources via DBpedia

Step 1c Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data

bull Connect persons amp places in newspaper database to external

resources via DBpedia

httplod-cloudnetversions2010-09-22lod-cloud_coloredpng

Linked Open Data cloud

Build central database

Step 3 Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

bull DBpedia = hub for linking different data sets on the Web to each

other Linked Open Data cloud

bull We use DBpdia to connect persons amp places in our newspaper database to information in other databases

httpnldbpediaorgpageHuib_Drion

httpsnlwikipediaorgwikiHuib_Drion

httpwwwdbnlorgauteursauteurphpid=drio001

httpwwwbiografischportaalnlpersoon41181342

Build central database

Added value of Linked Open Data amp DBpedia Software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in bull KB-catalogue bull Delpher bull De Winkel

Summary: data about 1300 newspapers

• Available online
• Structured data (RDF triples)
• Open license (CC-BY-SA)
• Open standard (RDF)
• Contextual information
• Relations
  • Titles ↔ Places
  • Titles ↔ Persons
  • Titles ↔ Other titles
• Links between titles, Delpher & KB-cat (via PPNs)
• Links between titles, places & persons and external sources (via DBpedia)

http://5stardata.info/en/

Wikiproject Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

We need a template

Using an article template we can generate 1300 uniform and interlinked Wikipedia articles from the LOD-database

https://c1.staticflickr.com/9/8281/7699231918_11a7356c38_b.jpg

LOD-database + article template = Wikipedia article

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

• Titles: from database
• Metadata: from database
• Related persons: from database
• Related newspapers: from database
• Winkel-ID: from database
• Link to full-texts in Delpher: from database
• Link to KB-catalogue record: from database
• Fixed categories on WP and Commons
• Predefined fixed strings

Grey =
• From database
• Predefined fixed strings

Uniformity between articles guaranteed

All that WP-writers need to add manually
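The "database + template = article" idea can be sketched in a few lines. This is an illustration only: the wikitext skeleton, the field names and the sample record are hypothetical, since the slides do not show the project's real template.

```python
from string import Template

# Hypothetical wikitext skeleton: fixed strings plus placeholders filled from the database.
ARTICLE_TEMPLATE = Template(
    "'''$title''' was een verzetsblad uit de Tweede Wereldoorlog "
    "dat in $place werd uitgegeven.\n\n"
    "== Gerelateerde kranten ==\n"
    "$related\n\n"
    "[[Categorie:Illegale pers in de Tweede Wereldoorlog]]\n"
)


def render_article(record: dict) -> str:
    """Fill the fixed template with one database record."""
    related = "\n".join(f"* [[{name}]]" for name in record["related_titles"])
    return ARTICLE_TEMPLATE.substitute(
        title=record["title"], place=record["place"], related=related
    )


# Illustrative record; the related titles are sample values, not project data.
record = {
    "title": "De Geus onder studenten",
    "place": "Den Haag",
    "related_titles": ["Leidsche Brief", "Sol Justitiae"],
}
article = render_article(record)
print(article)
```

Because every article comes out of the same skeleton, the uniformity is a property of the process rather than something each writer has to maintain by hand.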

Problem with Delpher (and KB-catalogue)

Véry little contextual information about the newspaper (title)s

https://thejungleisneutral.files.wordpress.com/2013/11/lost.jpg

The KB can re-use (embed) the Wikipedia content in its own websites to tackle this problem

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_%28verzetsblad%29

Delpher - search results

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

Embedded contextual snippet from Wikipedia:

"De Geus (onder studenten) was a resistance newspaper from the Second World War that was published from 4 October 1940 up to and including 13 July 1944 … Read more on Wikipedia"

Delpher - object presentation

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

Embedded contextual snippet from Wikipedia:

About De Geus onder studenten

"De Geus (onder studenten) was a resistance newspaper from the Second World War, published in The Hague from 4 October 1940 up to and including 13 July 1944. It appeared monthly in 1940, 1941 and 1943 and irregularly otherwise, with a print run of between 250 and 8000 copies. It was initially stencilled and, from November 1942, printed; its content consisted mainly of opinion articles.

The paper was published by Jan Drion and Huib Drion, two Leiden…

Read more on Wikipedia"
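How a site could obtain and shorten such a snippet, as a hedged sketch: it assumes the article intro is requested through the MediaWiki API's `extracts` property on nl.wikipedia.org. The slides do not say how the KB would actually embed the text, and the helper names and the 120-character limit are hypothetical. No request is sent; the code builds the URL and shows the truncation step.

```python
from urllib.parse import urlencode

API = "https://nl.wikipedia.org/w/api.php"


def extract_url(title: str) -> str:
    """URL asking the MediaWiki API for the plain-text intro of one article."""
    params = {
        "action": "query",
        "prop": "extracts",
        "exintro": 1,
        "explaintext": 1,
        "titles": title,
        "format": "json",
    }
    return API + "?" + urlencode(params)


def snippet(text: str, max_chars: int = 120) -> str:
    """Cut text at the last word boundary within max_chars and add an ellipsis."""
    if len(text) <= max_chars:
        return text
    cut = text[:max_chars].rsplit(" ", 1)[0]
    return cut + " …"


url = extract_url("De_Geus_onder_studenten_(verzetsblad)")
print(url)
print(snippet("De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog", 40))
```

The short form suits a search-results list; the object page can simply pass a larger `max_chars`.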

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web: 20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together.

• https://en.wikipedia.org/wiki/Linked_data: Wikipedia article related to the above video

• http://5stardata.info/en/: The 5 stars of Linked Open Data (Tim Berners-Lee)

• http://linda-project.eu/linked-data-primer-2: Short primer about creating LOD in practice, starting from an Excel sheet

• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31: The figure near the bottom of the first page is a good illustration of the concept of (linked) triples

• https://en.wikipedia.org/wiki/DBpedia

• https://en.wikipedia.org/wiki/Semantic_network

http://www.gettyimages.nl/detail/nieuwsfoto's/three-women-of-the-ats-light-up-together-ats-regulations-nieuwsfoto's/3094265

Questions

olafjanssenkbnl - @ookgezellig

tinyurl.com/verzetskranten


Problem with Delpher (and KB-catalogue)

Véry little contextual information about the newspaper (title)s

https://thejungleisneutral.files.wordpress.com/2013/11/lost.jpg

Question: Where would most people start searching for contextual information about De Geus onder studenten?

Probably Wikipedia (via Google)

http://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

https://www.youtube.com/watch?v=VREJV--VHSw

Report on interest in WW2 among Dutch population

http://www.oorlogsbronnen.nl/gebruikersonderzoek2015 (May 2015)

"Many of us use the internet to search for information [...] We often mention Wikipedia…"

"Everything is of course on Wikipedia. Just type in a name and you can read entire essays" (man, 70s)

"Over half of us think that Wikipedia and Google contribute to our knowledge and understanding of history"

"When we have to find information about WW2 outside the class setting, we fully concentrate on digital resources like Google and Wikipedia" (school kids)

Context about De Geus onder studenten

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

is given in

http://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

But now… we have another problem…

https://nl.wikipedia.org/wiki/Categorie:Illegale_pers_in_de_Tweede_Wereldoorlog

… De Geus on Wikipedia is an exception

1. Very few underground newspapers have their own WP articles

2. The overview of these newspapers on WP is far from complete

Good news

http://www.archives.gov/research/military/ww2/photos/images/ww2-194.jpg

There is a 1-fix solution to this contextual problem

De Ondergrondse Pers 1940-1945

By Lydia E. Winkel & H. de Vries

1989, ISBN 9021837463, Veen Uitgevers

This book ("De Winkel") contains contextual articles about (nearly) all ± 1300 illegal WW2 newspapers

"De Winkel" – nr. 199

Every article has a unique ID ("Winkel-ID")

Every article has metadata

• Title, subtitle, motto
• Place of publication
• Period of publication
• Publication frequency (daily, weekly, one-off, irregular)
• Reproduction method (stencilled, printed, typed, handwritten)
• Contents (news, opinions, poems, illustrations, humor)
• Print run (min – max)
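The metadata fields listed above map naturally onto one structured record per article. A minimal sketch follows; the class and field names are hypothetical, and the sample values are taken from the De Geus onder studenten description quoted elsewhere in this deck, with the Winkel-ID 199 assumed to belong to that title.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class WinkelRecord:
    """One article from De Winkel, reduced to its metadata fields."""
    winkel_id: int
    title: str
    place: str
    period: str
    frequency: str
    reproduction: str
    contents: list[str] = field(default_factory=list)
    prints_min: Optional[int] = None
    prints_max: Optional[int] = None
    subtitle: Optional[str] = None
    motto: Optional[str] = None


# Sample values; the ID-to-title pairing is an assumption.
geus = WinkelRecord(
    winkel_id=199,
    title="De Geus onder studenten",
    place="Den Haag",
    period="1940-10-04/1944-07-13",
    frequency="monthly, later irregular",
    reproduction="stencilled, printed from November 1942",
    contents=["opinions"],
    prints_min=250,
    prints_max=8000,
)
print(geus)
```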

Relation 1/3: Newspaper ↔ Placename (semantics, linked data)

Contextual information: nice material for a Wikipedia article

Very often persons related to this newspaper are mentioned

Relation 2/3: Newspaper ↔ Persons (semantics, linked data)

Many articles also contain references to other newspapers

• 106 = Cereales Vadeness (student resistance newspaper, Wageningen)
• 360 = Leidsche Brief (student resistance newspaper, Leiden)
• 748 = Sol Justitiae (student resistance newspaper, Utrecht)

Relation 3/3: Newspaper ↔ Other newspapers (semantics, linked data)
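The three relation types can be written down as subject-predicate-object triples, which is the shape linked data uses. The sketch below is illustrative: the `winkel:` identifiers and predicate names are hypothetical, and the sample links for nr. 199 are assumptions rather than data from the project.

```python
Triple = tuple[str, str, str]

# Hypothetical identifiers and predicates; sample data for illustration only.
triples: list[Triple] = [
    ("winkel:199", "placeOfPublication", "Den Haag"),
    ("winkel:199", "relatedPerson", "Huib Drion"),
    ("winkel:199", "relatedPerson", "Jan Drion"),
    ("winkel:199", "relatedTitle", "winkel:360"),
    ("winkel:360", "placeOfPublication", "Leiden"),
]


def objects(subject: str, predicate: str) -> list[str]:
    """Return every object linked to a subject by one predicate, in input order."""
    return [o for s, p, o in triples if s == subject and p == predicate]


persons = objects("winkel:199", "relatedPerson")
related = objects("winkel:199", "relatedTitle")
print(persons)
print(related)
```

Once the relations are explicit like this, following a link from one newspaper to a place, a person or another newspaper is a lookup rather than a reading exercise.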

https://knowledgeutopia.files.wordpress.com/2014/01/hollandhouselibraryblitz1940.jpg

Too bad it's a paper book: hard to find, multiply, distribute and build upon

We need it digital

1. Clear copyright with copyright holder (NIOD): open CC-BY-SA license

2. Scan & OCR

3. Convert into PDF

4. Put online: NIOD site & Wikimedia Commons

De Winkel as PDF on NIOD website (CC-BY-SA): http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons: https://commons.wikimedia.org/wiki/File:PDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989.pdf

Saved us €13330: http://www.brill.com/dutch-underground-press-1940-1945

Wikipedia article about De Winkel: http://nl.wikipedia.org/wiki/De_ondergrondse_pers_1940-1945

Wikipedia article about the author: http://nl.wikipedia.org/wiki/Lydia_Winkel

Winkel: the pluses

• Available online (PDF flat file)
• Open license (CC-BY-SA)
• Contextual information
• Relations
  • Titles ↔ Places
  • Titles ↔ Persons
  • Titles ↔ Other titles


http://2.bp.blogspot.com/_BWzuYwiS6-I/TMgeRsFd3mI/AAAAAAAAElw/3cvgbZSPWcs/s1600/doctor+macro+judy+scared.jpg

Winkel: the minuses

• Unstructured data (PDF flat file)
• Not very machine-readable (unlike CSV, XML, JSON, RDF)
• PDF is no (real) open standard (unlike CSV, XML, JSON, RDF)
• No links between titles, Delpher & KB-cat
• No links between titles, places & persons and external sources (like Wikipedia)

To summarize: a lot of information is available about these WW2 underground newspapers

1. Metadata (KB-cat)

2. Content (full-text, Delpher)

3. Context (Winkel PDF)

4. Relations: titles ↔ places, persons, other titles (Winkel PDF)

5. External resources about titles, places and persons

but the data sources are unconnected (and for 3+4: unstructured & not machine-readable), making discovery, understanding & research of these newspapers (and related places & persons) more difficult than necessary

We can solve all these issues

Good news

Wikiproject(*) Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

tinyurl.com/verzetskranten (in Dutch)

(*) https://nl.wikipedia.org/wiki/Wikipedia:Wikiproject, https://en.wikipedia.org/wiki/Wikipedia:Wikiproject

From 14 to 1300 titles


We need a database

Build central database

https://www.youtube.com/watch?v=GVDGuCjog_0

Step 1: Create Excel sheet with

• Metadata about newspaper (from Winkel PDF)
• Unique Wikipedia article title
• Contextual info, incl. related persons & titles (from Winkel PDF)
• PPN: unique ID linking newspaper to KB-catalogue & Delpher

Columns in the sheet:

• Winkel-ID
• Title
• Place of publication
• Other metadata
• Unique Wikipedia article title = <Newspaper title> (verzetsblad, <Place of publication>)
• Contextual info
• Related persons
• Related newspaper titles
• PPN (107123223)

PPN (107123223) in the KB-catalogue: http://opc4.kb.nl/DB=1/PPN?PPN=107123223

PPN (107123223) in Delpher: http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223
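Both the article title and the two PPN links are mechanical derivations from a spreadsheet row, so they can be generated rather than typed. A small sketch, with hypothetical function names; the URL patterns are reconstructed from the links on the slides (whose punctuation was lost in extraction) and should be read as assumptions, as should the comma in the title pattern.

```python
from typing import Optional


def article_title(title: str, place: Optional[str] = None) -> str:
    """Unique Wikipedia article title: '<title> (verzetsblad, <place>)'."""
    suffix = f"verzetsblad, {place}" if place else "verzetsblad"
    return f"{title} ({suffix})"


def catalogue_url(ppn: str) -> str:
    """Link to the KB-catalogue record for a PPN (assumed URL pattern)."""
    return f"http://opc4.kb.nl/DB=1/PPN?PPN={ppn}"


def delpher_url(ppn: str) -> str:
    """Link to the Delpher full-text results for a PPN (assumed URL pattern)."""
    return (
        "http://www.delpher.nl/nl/kranten/results/index"
        f"?coll=dddtitel&cql[]=ppn%3D{ppn}"
    )


print(article_title("De Geus onder studenten"))
print(article_title("De Geus onder studenten", "Den Haag"))
print(catalogue_url("107123223"))
print(delpher_url("107123223"))
```

The place is optional here because the example article on the slides, De Geus onder studenten (verzetsblad), carries no place in its title.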

Build central database

Step 2: Convert Excel into RDF triplestore (= special kind of online database anybody can access)

• Steps 1-4 from http://linda-project.eu/linked-data-primer-2
• Step 4: Vocabulary used = Bibframe (http://bibframe.org/vocab)
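To make the Excel-to-triples conversion concrete, here is a minimal sketch that turns one spreadsheet row into N-Triples lines. It is not the LinDA workflow referenced above: the base URI and the property URIs are placeholders under example.org rather than real Bibframe terms, and the column names are hypothetical.

```python
def literal(value: str) -> str:
    """Format a string as an N-Triples literal, escaping backslashes and quotes."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def row_to_ntriples(row: dict, base: str = "http://example.org/verzetskranten/") -> list[str]:
    """One triple per non-empty column; the Winkel-ID becomes the subject URI."""
    subject = f"<{base}{row['winkel_id']}>"
    lines = []
    for column, value in row.items():
        if column == "winkel_id" or value in ("", None):
            continue
        # Placeholder property URI; a real conversion would map columns to a vocabulary.
        predicate = f"<{base}vocab/{column}>"
        lines.append(f"{subject} {predicate} {literal(str(value))} .")
    return lines


row = {
    "winkel_id": 199,
    "title": "De Geus onder studenten",
    "place": "Den Haag",
    "subtitle": "",
    "ppn": "107123223",
}
ntriples = row_to_ntriples(row)
print("\n".join(ntriples))
```

Each output line is one statement about the newspaper, which is what a triplestore loads and what makes the data queryable by anyone.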


Page 21: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

Question Where would most people start searching contextual

information about De Geus onder studenten

Probably Wikipedia (via Google)

httpnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

Question Where would most people start searching contextual

information about De Geus onder studenten

Probably Wikipedia (via Google)

httpswwwyoutubecomwatchv=VREJV--VHSw

Report on interest in WW2 among Dutch population

httpwwwoorlogsbronnennlgebruikersonderzoek2015 May 2015

Many of us use the internet to search for information [] We often mention Wikipediahellip

Everything is of course on Wikipedia Just type in a name and you can read entire essays (man 70s)

Over half of us think that Wikipedia and Google contribute to our knowledge and understanding of history

When we have to find information about WW2 outside the class setting we fully concentrate on digital resources like Google and Wikipedia (school kids)

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

httpnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

is given in

Context about De Geus onder studenten

But nowhellip

We have another problemhellip

httpsnlwikipediaorgwikiCategorieIllegale_pers_in_de_Tweede_Wereldoorlog

hellip De Geus on Wikipedia is an exception

1 Very few underground newspapers have their own WP articles

2 The overview of these newspapers on WP is far from complete

Good news

htt

p

ww

wa

rch

ives

go

vre

sear

chm

ilita

ryw

w2

ph

oto

sim

ages

ww

2-1

94

jpg

There is a 1-fix solution to this contextual problem

De Ondergrondse Pers 1940-1945

By Lydia E Winkel amp H de Vries

1989 ISBN 9021837463 Veen Uitgevers

This book (ldquoDe Winkelrdquo) contains

contextual articles about

(nearly) all plusmn 1300 illegal WW2 newspapers

ldquoDe Winkelrdquo ndash nr 199

De Ondergrondse Pers 1940-1945 Lydia E Winkel H de Vries 1989

ISBN 9021837463 Veen Uitgevers

Every article has a unique ID

(ldquoWinkel-IDrdquo)

Every article has metadata

bull Title subtitle motto bull Place of publication bull Period of publication bull Publication frequency (daily weekly one-off irregular)

bull Multiplication (stenciled printed typed handwritten)

bull Contents (news opinions poems illustrations humor)

bull Number of prints (min ndash max)

Relation 13

Newspaper Placename

semantics linked data

Relation 13

Newspaper Placename

semantics linked data

Contextual information

Nice material

for a Wikipedia article

Very often persons related to this newspaper are mentioned

Relation 23

Newspaper Persons

semantics linked data

Many articles also contain references to other newspapers

bull 106 = Cereales Vadeness (students resistance newspaper Wageningen) bull 360 = Leidsche Brief (students resistance newspaper Leiden) bull 748 = Sol Justitiae (students resistance newspaper Utrecht)

Relation 33

Newspaper Other newspapers

semantics linked data

htt

p

htt

ps

kn

ow

led

geu

top

iaf

iles

wo

rdp

ress

co

m2

01

40

1h

olla

nd

ho

use

libra

ryb

litz1

94

0j

pg

Too bad itrsquos a paper book hard to find multiply distribute and build upon

We need it digital

htt

p

htt

ps

kn

ow

led

geu

top

iaf

iles

wo

rdp

ress

co

m2

01

40

1h

olla

nd

ho

use

libra

ryb

litz1

94

0j

pg

We need it digital

1 Clear copyright with copyright holder (NIOD) Open CC-BY-SA license

2 Scan amp OCR

3 Convert into PDF

4 Put online NIOD site amp Wikimedia Commons

De Winkel as PDF on NIOD website (CC-BY-SA) httpwwwniodnlnlde-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons httpscommonswikimediaorgwikiFilePDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989pdf

De Winkel as PDF on NIOD website (CC-BY-SA) httpwwwniodnlnlde-ondergrondse-pers-1940-1945

Saved us euro13330 httpwwwbrillcomdutch-underground-press-1940-1945

Wikipedia article about De Winkel httpnlwikipediaorgwikiDe_ondergrondse_pers_1940-1945

Wikipedia article about the author httpnlwikipediaorgwikiLydia_Winkel

Winkel the plusses

Available online (PDF flat file)

Open license (CC-BY-SA) Contextual information Relations

bull Titles Places bull Titles Persons

bull Titles Other titles

htt

p

ww

wa

rch

ives

go

vre

sear

chm

ilita

ryw

w2

ph

oto

sim

ages

ww

2-1

94

jpg

htt

p

2b

pb

logsp

ot

com

_BW

zuYw

iS6-I

TM

geR

sFd3m

IAAAAAAAAElw

3cv

gbZSPW

css

1600d

oct

or+

macr

o+

judy+

scare

djpg

Winkel the minusses

Unstructured data (PDF flat file)

Not very machine readable (unlike CSV XML JSON RDF)

PDF is no (real) open standard (unlike CSV XML JSON RDF)

No links between titles Delpher amp KB-cat No links between titles places amp persons

external sources (like Wikipedia)

but the data sources are

unconnected (and for 3+4 unstructured amp not machine-readable)

To summarize

a lot of information is available about these WW2 underground newspapers

1 Metadata (KB-cat)

2 Content (full-text Delpher)

3 Context (Winkel PDF)

4 Relations titles places persons other titles (Winkel PDF)

5 External resources about titles places and persons

htt

p

2b

pb

logsp

ot

com

_BW

zuYw

iS6-I

TM

geR

sFd3m

IAAAAAAAAElw

3cv

gbZSPW

css

1600d

oct

or+

macr

o+

judy+

scare

djpg

making discovery understanding amp research

of these newspapers (and related places amp persons) more difficult than necessary

We can solve all these issues

Good news

Wikiproject() Verzetskranten

Systematically and uniformly describe amp link all 1300 Dutch underground newspapers from WW2

on Dutch Wikipedia

tinyurlcomverzetskranten (in Dutch)

httpsnlwikipediaorgwikiWikipediaWikiproject httpsenwikipediaorgwikiWikipediaWikiproject

From 14 1300 titles

htt

ps

n

lwik

iped

iao

rgw

iki

Cat

ego

rie

Illeg

ale_

per

s_in

_de_

Twee

de_

Wer

eld

oo

rlo

g

Wikiproject() Verzetskranten

Systematically and uniformly describe amp link all 1300 Dutch underground newspapers from WW2

on Dutch Wikipedia

tinyurlcomverzetskranten (in Dutch)

We need a database

Build central database

httpswwwyoutubecomwatchv=GVDGuCjog_0

Build central database

httpswwwyoutubecomwatchv=GVDGuCjog_0

Step 1 Create Excel-sheet with bull Metadata about newspaper (from Winkel PDF)

bull Unique Wikipedia article title

bull Contextual info incl related persons amp titles (from Winkel PDF)

bull PPN unique ID linking newspaper to

KB-catalogue amp Delpher

Winkel-ID

Place of publication

Other metadata

Title

Unique Wikipedia article title =

ltNewspaper titlegt (verzetsblad ltPlace of publicationgt)

bull Contextual info bull Related persons

bull Related newspaper titles

PPN (107123223)

httpopc4kbnlDB=1PPNPPN=107123223

httpopc4kbnlDB=1PPNPPN=107123223

PPN (107123223)

PPN (107123223)

httpwwwdelphernlnlkrantenresultsindexcoll=dddtitelampcql[]=ppn3D107123223

httpwwwdelphernlnlkrantenresultsindexcoll=dddtitelampcql[]=ppn3D107123223

PPN (107123223)

Build central database

Step 2 Convert Excel into RDF triplestore (=special kind of online database anybody can access)

bull Steps 1-4 from httplinda-projecteulinked-

data-primer-2

bull Step 4 Vocubulary used = Bibframe (httpbibframeorgvocab)

Build central database

Step 3 Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

bull DBpedia = hub for linking different data sets on the Web to each

other Linked Open Data cloud

bull Connect persons amp places in newspaper database to external resources via DBpedia

Step 1c Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data

bull Connect persons amp places in newspaper database to external

resources via DBpedia

httplod-cloudnetversions2010-09-22lod-cloud_coloredpng

Linked Open Data cloud

Build central database

Step 3 Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

bull DBpedia = hub for linking different data sets on the Web to each

other Linked Open Data cloud

bull We use DBpdia to connect persons amp places in our newspaper database to information in other databases

httpnldbpediaorgpageHuib_Drion

httpsnlwikipediaorgwikiHuib_Drion

httpwwwdbnlorgauteursauteurphpid=drio001

httpwwwbiografischportaalnlpersoon41181342

Build central database

Added value of Linked Open Data amp DBpedia Software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in bull KB-catalogue bull Delpher bull De Winkel

Summary data about 1300 newspapers

Available online Structured data (RDF-triples) Open license (CC-BY-SA) Open standard (RDF) Contextual information Links between titles Delpher amp KB-cat Relations Links between titles places

bull Titles Places amp persons external bull Titles Persons sources (via DBpedia) bull Titles Other titles

Summary data about 1300 newspapers

Available online Structured data (RDF-triples) Open license (CC-BY-SA) Open standard (RDF) Contextual information Links between titles Delpher amp KB-cat Relations Links between titles places

bull Titles Places amp persons external bull Titles Persons sources (via DBpedia) bull Titles Other titles

Summary data about 1300 newspapers

Available online Structured data (RDF-triples)

Open license (CC-BY-SA) Open standard (RDF)

Contextual information Links between titles Delpher amp KB-cat (via PPNs)

Relations Links between places

bull Titles Places amp persons external bull Titles Persons sources (via DBpedia)

bull Titles Other titles

(PPNs)

http5stardatainfoen

Wikiproject Verzetskranten

Systematically and uniformly describe amp link all 1300 Dutch underground newspapers from WW2

on Dutch Wikipedia

We need a template

Using an article template we can generate 1300 uniform and interlinked Wikipedia articles

from the LOD-database

htt

ps

c1

sta

ticf

lickr

co

m9

82

81

76

99

23

19

18

_11

a73

56

c38

_bjp

g

LOD-database + article template = Wikipedia article

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

Titles from database

Metadata from database

Related persons from database

Related newspapers from database

Winkel-ID from database

Link to full-texts in Delpher from database

Link to KB-catalogue record from database

Fixed categories on WP and Commons

Predefined fixed strings

Grey = bull From database bull Predefined fixed strings

Uniformity between articles guaranteed

All that WP-writers need to add manually

Problem with Delpher (and KB-catalogue)

Veacutery little contextual information about the newspaper(title)s

httpsthejungleisneutralfileswordpresscom201311lostjpg

Problem with Delpher (and KB-catalogue)

Veacutery little contextual information about the newspaper(title)s

httpsthejungleisneutralfileswordpresscom201311lostjpg

The KB can re-use (embed) the Wikipedia content in its own

websites to tackle this problem

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_28verzetsblad29

Delpher - search results

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog

dat vanaf 4 oktober 1940 tot en met 13 juli 1944 hellip Lees verder op Wikipedia

Embedded contextual snippet from Wikipedia

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

Delpher - search results

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog

dat vanaf 4 oktober 1940 tot en met 13 juli 1944 hellip Lees verder op Wikipedia

Delpher - object presentation

httpresolverkbnlresolveurn=ddd010424553mpeg21p001

Over De Geus onder studenten

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog dat vanaf 4 oktober 1940 tot en met 13 juli 1944 in Den Haag werd uitgegeven Het blad verscheen in 1940 1941 en 1943 maandelijks verder onregelmatig in een oplage tussen de 250 en 8000 exemplaren Het werd aanvankelijk gestencild en vanaf november 1942 gedrukt en de inhoud bestond voornamelijk uit opinie-artikelen

Het blad werd uitgegeven door Jan Drion en Huib Drion twee Leidsehellip

Lees verder op Wikipedia

Embedded contextual snippet from Wikipedia

httpresolverkbnlresolveurn=ddd010424553mpeg21p001

Delpher - object presentation

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web 20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together.

• https://en.wikipedia.org/wiki/Linked_data Wikipedia article related to the above video

• http://5stardata.info/en/ The 5 stars of Linked Open Data (Tim Berners-Lee)

• http://linda-project.eu/linked-data-primer-2 Short primer about creating LOD in practice, starting from an Excel sheet

• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31 The figure near the bottom of the first page is a good illustration of the concept of (linked) triples

• https://en.wikipedia.org/wiki/DBpedia

• https://en.wikipedia.org/wiki/Semantic_network


Questions?

olaf.janssen@kb.nl - @ookgezellig

tinyurl.com/verzetskranten


Page 22: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

http://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

Question: Where would most people start searching for contextual information about De Geus onder studenten?

Probably Wikipedia (via Google)

https://www.youtube.com/watch?v=VREJV--VHSw

Report on interest in WW2 among the Dutch population, May 2015 (http://www.oorlogsbronnen.nl/gebruikersonderzoek2015)

"Many of us use the internet to search for information. […] We often mention Wikipedia…"

"Everything is, of course, on Wikipedia. Just type in a name and you can read entire essays." (man, 70s)

"Over half of us think that Wikipedia and Google contribute to our knowledge and understanding of history."

"When we have to find information about WW2 outside the class setting, we fully concentrate on digital resources like Google and Wikipedia." (school kids)

Context about De Geus onder studenten (http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)) is given in http://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

But now… we have another problem…

https://nl.wikipedia.org/wiki/Categorie:Illegale_pers_in_de_Tweede_Wereldoorlog

… De Geus on Wikipedia is an exception:

1. Very few underground newspapers have their own WP articles

2. The overview of these newspapers on WP is far from complete

Good news


There is a 1-fix solution to this contextual problem

De Ondergrondse Pers 1940-1945, by Lydia E. Winkel & H. de Vries, 1989, ISBN 9021837463, Veen Uitgevers

This book ("De Winkel") contains contextual articles about (nearly) all ± 1300 illegal WW2 newspapers

"De Winkel" – nr. 199

Every article has a unique ID ("Winkel-ID")

Every article has metadata:

• Title, subtitle, motto
• Place of publication
• Period of publication
• Publication frequency (daily, weekly, one-off, irregular)
• Multiplication (stenciled, printed, typed, handwritten)
• Contents (news, opinions, poems, illustrations, humor)
• Number of prints (min – max)
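The metadata fields listed above map naturally onto a small record type. What follows is a minimal sketch in Python, not the project's actual data model: the class name `WinkelArticle`, the field names and the `validate` helper are illustrative assumptions. The example values come from the De Geus onder studenten snippet quoted in this lecture; that entry nr. 199 is the De Geus entry is also an assumption.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class WinkelArticle:
    """One entry from De Winkel (hypothetical field names)."""
    winkel_id: int                 # unique ID of the entry in the book
    title: str
    place: str                     # place of publication
    period_start: str              # ISO date, e.g. "1940-10-04"
    period_end: str                # ISO date
    frequency: str                 # e.g. daily, weekly, one-off, irregular
    multiplication: List[str] = field(default_factory=list)  # stenciled, printed, ...
    contents: List[str] = field(default_factory=list)        # news, opinions, ...
    prints_min: Optional[int] = None
    prints_max: Optional[int] = None
    subtitle: Optional[str] = None
    motto: Optional[str] = None

    def validate(self) -> List[str]:
        """Return a list of human-readable problems (empty if none)."""
        problems = []
        # ISO dates compare correctly as plain strings.
        if self.period_start > self.period_end:
            problems.append("period_start is after period_end")
        if (self.prints_min is not None and self.prints_max is not None
                and self.prints_min > self.prints_max):
            problems.append("prints_min exceeds prints_max")
        return problems


# Example values taken from the Wikipedia snippet about De Geus.
de_geus = WinkelArticle(
    winkel_id=199,  # assumption: the entry number shown on the slide
    title="De Geus onder studenten",
    place="Den Haag",
    period_start="1940-10-04",
    period_end="1944-07-13",
    frequency="irregular",
    multiplication=["stenciled", "printed"],
    contents=["opinions"],
    prints_min=250,
    prints_max=8000,
)
```

Keeping the checks in one `validate` method makes it easy to run them over every row before the data is converted further.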

Relation 1/3: Newspaper – Placename (semantics, linked data)

Contextual information: nice material for a Wikipedia article

Very often, persons related to the newspaper are mentioned

Relation 2/3: Newspaper – Persons (semantics, linked data)

Many articles also contain references to other newspapers:

• 106 = Cereales Vadeness (students' resistance newspaper, Wageningen)
• 360 = Leidsche Brief (students' resistance newspaper, Leiden)
• 748 = Sol Justitiae (students' resistance newspaper, Utrecht)

Relation 3/3: Newspaper – Other newspapers (semantics, linked data)


Too bad it's a paper book: hard to find, multiply, distribute and build upon


We need it digital:

1. Clear copyright with copyright holder (NIOD): open CC-BY-SA license

2. Scan & OCR

3. Convert into PDF

4. Put online: NIOD site & Wikimedia Commons

De Winkel as PDF on NIOD website (CC-BY-SA): http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons: https://commons.wikimedia.org/wiki/File:PDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989.pdf

Saved us €13330: http://www.brill.com/dutch-underground-press-1940-1945

Wikipedia article about De Winkel: http://nl.wikipedia.org/wiki/De_ondergrondse_pers_1940-1945

Wikipedia article about the author: http://nl.wikipedia.org/wiki/Lydia_Winkel

Winkel: the plusses

• Available online (PDF, flat file)
• Open license (CC-BY-SA)
• Contextual information
• Relations: Titles – Places, Titles – Persons, Titles – Other titles


Winkel: the minuses

• Unstructured data (PDF, flat file)
• Not very machine-readable (unlike CSV, XML, JSON, RDF)
• PDF is no (real) open standard (unlike CSV, XML, JSON, RDF)
• No links between titles and Delpher & KB-cat
• No links between titles, places & persons and external sources (like Wikipedia)

To summarize: a lot of information is available about these WW2 underground newspapers

1. Metadata (KB-cat)

2. Content (full-text, Delpher)

3. Context (Winkel PDF)

4. Relations: titles – places, persons, other titles (Winkel PDF)

5. External resources about titles, places and persons

but the data sources are unconnected (and for 3+4, unstructured & not machine-readable), making discovery, understanding & research of these newspapers (and related places & persons) more difficult than necessary

Good news: we can solve all these issues

Wikiproject(*) Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia: from 14 to 1300 titles

tinyurl.com/verzetskranten (in Dutch)

(*) https://nl.wikipedia.org/wiki/Wikipedia:Wikiproject, https://en.wikipedia.org/wiki/Wikipedia:Wikiproject

We need a database

Build central database

https://www.youtube.com/watch?v=GVDGuCjog_0

Step 1: Create Excel sheet with

• Metadata about newspaper (from Winkel PDF)
• Unique Wikipedia article title
• Contextual info, incl. related persons & titles (from Winkel PDF)
• PPN: unique ID linking newspaper to KB-catalogue & Delpher

Spreadsheet columns: Winkel-ID, Title, Place of publication, other metadata, contextual info, related persons, related newspaper titles, PPN

Unique Wikipedia article title = <Newspaper title> (verzetsblad, <Place of publication>)

The PPN (e.g. 107123223) links to the KB-catalogue record, http://opc4.kb.nl/DB=1/PPN?PPN=107123223, and to the full texts in Delpher, http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223
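The derived columns in the sheet (article title, catalogue link, Delpher link) can be computed from the title, place and PPN. The sketch below is hedged: the function names are hypothetical, and the URL patterns are reconstructed from the links shown on the slides, whose punctuation was lost in extraction, so treat them as assumptions. The place is optional because the De Geus article title is simply "De Geus onder studenten (verzetsblad)".

```python
from typing import Optional
from urllib.parse import quote


def article_title(newspaper_title: str, place: Optional[str] = None) -> str:
    """Build the Wikipedia article title following the slide's naming pattern."""
    suffix = f"verzetsblad, {place}" if place else "verzetsblad"
    return f"{newspaper_title} ({suffix})"


def catalogue_url(ppn: str) -> str:
    """Link to the KB-catalogue record for a PPN (assumed URL pattern)."""
    return f"http://opc4.kb.nl/DB=1/PPN?PPN={ppn}"


def delpher_url(ppn: str) -> str:
    """Link to the Delpher full texts for a PPN (assumed URL pattern)."""
    # The query "ppn=<PPN>" is percent-encoded, so "=" becomes "%3D".
    query = quote(f"ppn={ppn}", safe="")
    return f"http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]={query}"


title = article_title("De Geus onder studenten")
kb_link = catalogue_url("107123223")
delpher_link = delpher_url("107123223")
```

Deriving the links from the PPN instead of typing them by hand keeps all 1300 rows consistent and makes a wrong PPN the only possible source of a broken link.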

Build central database

Step 2: Convert Excel into RDF triplestore (= special kind of online database anybody can access)

• Steps 1-4 from http://linda-project.eu/linked-data-primer-2
• Step 4: Vocabulary used = Bibframe (http://bibframe.org/vocab)
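To make "converting a spreadsheet into triples" concrete: every cell becomes one statement of the form subject, predicate, object. The sketch below writes N-Triples lines with plain Python. It is an illustration only, not the tooling from the primer: the base URI, the vocabulary namespace and the column names are hypothetical placeholders (real Bibframe property URIs would replace them).

```python
from typing import Dict, List

BASE = "http://example.org/verzetskranten/"   # hypothetical base URI for newspapers
VOCAB = "http://example.org/vocab/"           # placeholder, NOT actual Bibframe terms


def literal(value: str) -> str:
    """Quote a string as an N-Triples literal, escaping special characters."""
    escaped = (value.replace("\\", "\\\\")
                    .replace('"', '\\"')
                    .replace("\n", "\\n"))
    return f'"{escaped}"'


def row_to_triples(row: Dict[str, str]) -> List[str]:
    """Turn one spreadsheet row into N-Triples lines, one per non-empty cell."""
    subject = f"<{BASE}{row['winkel_id']}>"
    triples = []
    for column, value in row.items():
        if column == "winkel_id" or not value:
            continue  # the ID is used in the subject URI; empty cells yield no triple
        triples.append(f"{subject} <{VOCAB}{column}> {literal(value)} .")
    return triples


row = {
    "winkel_id": "199",  # assumption: entry number shown on the slide
    "title": "De Geus onder studenten",
    "place": "Den Haag",
    "ppn": "107123223",
}
triples = row_to_triples(row)
```

Each resulting line is a self-contained fact, which is what allows a triplestore to merge data from different sources without a shared table layout.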

Build central database

Step 3: Link to external resources

• Step 5 from http://linda-project.eu/linked-data-primer-2
• DBpedia = machine-readable, structured version of Wikipedia
• DBpedia = hub for linking different data sets on the Web to each other: the Linked Open Data cloud (http://lod-cloud.net/versions/2010-09-22/lod-cloud_colored.png)
• "DBpedia allows you to ask sophisticated queries against Wikipedia, and to link the different data sets on the Web to Wikipedia data"
• We use DBpedia to connect persons & places in our newspaper database to information in other databases

Example for Huib Drion:

http://nl.dbpedia.org/page/Huib_Drion

https://nl.wikipedia.org/wiki/Huib_Drion

http://www.dbnl.org/auteurs/auteur.php?id=drio001

http://www.biografischportaal.nl/persoon/41181342
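The linking step can be illustrated with a short sketch that derives a DBpedia resource URI from a Wikipedia article title and builds a SPARQL query for its `owl:sameAs` links, which is how one data set points at the same person in another. The function names are hypothetical, and the `/resource/` URI pattern is assumed from DBpedia's general convention (the slide shows the corresponding `/page/` address). The query is only constructed here, not sent to any endpoint.

```python
from urllib.parse import quote


def dbpedia_resource(article_title: str, lang: str = "nl") -> str:
    """Derive a DBpedia resource URI from a Wikipedia article title."""
    # Wikipedia titles use underscores instead of spaces in URLs.
    name = quote(article_title.replace(" ", "_"), safe="_(),'")
    return f"http://{lang}.dbpedia.org/resource/{name}"


def same_as_query(resource_uri: str) -> str:
    """Build a SPARQL query listing resources declared identical to this one."""
    return (
        "PREFIX owl: <http://www.w3.org/2002/07/owl#>\n"
        "SELECT ?other WHERE {\n"
        f"  <{resource_uri}> owl:sameAs ?other .\n"
        "}"
    )


drion_uri = dbpedia_resource("Huib Drion")
drion_query = same_as_query(drion_uri)
```

Storing only the DBpedia URI in the newspaper database is enough: everything else known about the person can be looked up at query time rather than copied.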

Build central database

Added value of Linked Open Data & DBpedia: software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in

• KB-catalogue
• Delpher
• De Winkel

Summary: data about 1300 newspapers

• Available online
• Structured data (RDF triples)
• Open license (CC-BY-SA)
• Open standard (RDF)
• Contextual information
• Relations: Titles – Places, Titles – Persons, Titles – Other titles
• Links between titles and Delpher & KB-cat (via PPNs)
• Links between titles, places & persons and external sources (via DBpedia)

http://5stardata.info/en/

Wikiproject Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

We need a template

Using an article template, we can generate 1300 uniform and interlinked Wikipedia articles from the LOD-database


LOD-database + article template = Wikipedia article

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

From the database:

• Titles
• Metadata
• Related persons
• Related newspapers
• Winkel-ID
• Link to full-texts in Delpher
• Link to KB-catalogue record

Predefined fixed strings, including fixed categories on WP and Commons

Grey = from database or predefined fixed strings: uniformity between articles guaranteed

Not grey = all that WP-writers need to add manually
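The idea "database + template = article" can be sketched with Python's standard `string.Template`. This is an illustration under stated assumptions, not the template used by the Wikiproject: the wikitext layout, the English wording and the record keys are hypothetical, and the record values are taken from the De Geus example in this lecture.

```python
from string import Template

# Fixed strings live in the template; $placeholders are filled from the database.
ARTICLE_TEMPLATE = Template(
    "'''$title''' was a resistance newspaper published in $place "
    "from $start to $end.\n\n"
    "== Related persons ==\n$persons\n\n"
    "== External links ==\n"
    "* Full texts in Delpher: $delpher_url\n"
    "* KB-catalogue record: $catalogue_url\n\n"
    "[[Categorie:Illegale pers in de Tweede Wereldoorlog]]"
)


def render_article(record: dict) -> str:
    """Fill the article template from one database record."""
    persons = "\n".join(f"* [[{name}]]" for name in record["persons"])
    fields = {key: value for key, value in record.items() if key != "persons"}
    # substitute() raises KeyError if a placeholder has no value,
    # so incomplete records are caught instead of published.
    return ARTICLE_TEMPLATE.substitute(fields, persons=persons)


record = {
    "title": "De Geus onder studenten",
    "place": "Den Haag",
    "start": "4 October 1940",
    "end": "13 July 1944",
    "persons": ["Jan Drion", "Huib Drion"],
    "delpher_url": "http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223",
    "catalogue_url": "http://opc4.kb.nl/DB=1/PPN?PPN=107123223",
}
wikitext = render_article(record)
```

Because every article is rendered from the same template, a change in layout or category only has to be made once and can then be regenerated for all titles.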

Problem with Delpher (and KB-catalogue):

Very little contextual information about the newspaper (title)s

The KB can re-use (embed) the Wikipedia content in its own websites to tackle this problem

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_%28verzetsblad%29
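One possible way to embed such content is to request the plain-text introduction of an article through the MediaWiki Action API (`prop=extracts`) and shorten it to a teaser with a "read more" link. The sketch below only builds the request URL and trims a text; it is an assumption about how this could be done, not a description of how the KB implemented it, and the function names are hypothetical.

```python
from urllib.parse import urlencode

API = "https://nl.wikipedia.org/w/api.php"


def extract_request_url(title: str) -> str:
    """Build an API URL asking for the plain-text intro of one article."""
    params = {
        "action": "query",
        "prop": "extracts",   # provided by the TextExtracts extension
        "exintro": 1,         # only the text before the first section heading
        "explaintext": 1,     # plain text instead of HTML
        "titles": title,
        "format": "json",
    }
    return f"{API}?{urlencode(params)}"


def teaser(text: str, max_chars: int = 80) -> str:
    """Cut a text at a word boundary and mark the cut with an ellipsis."""
    if len(text) <= max_chars:
        return text
    cut = text[:max_chars].rsplit(" ", 1)[0]
    return cut + " …"


request_url = extract_request_url("De Geus onder studenten (verzetsblad)")
short = teaser("De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog", 40)
```

Fetching the text at display time, rather than copying it, means corrections made on Wikipedia show up on the embedding site as well.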



Page 24: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

Report on interest in WW2 among Dutch population

httpwwwoorlogsbronnennlgebruikersonderzoek2015 May 2015

Many of us use the internet to search for information [] We often mention Wikipediahellip

Everything is of course on Wikipedia Just type in a name and you can read entire essays (man 70s)

Over half of us think that Wikipedia and Google contribute to our knowledge and understanding of history

When we have to find information about WW2 outside the class setting we fully concentrate on digital resources like Google and Wikipedia (school kids)

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

httpnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

is given in

Context about De Geus onder studenten

But nowhellip

We have another problemhellip

httpsnlwikipediaorgwikiCategorieIllegale_pers_in_de_Tweede_Wereldoorlog

hellip De Geus on Wikipedia is an exception

1 Very few underground newspapers have their own WP articles

2 The overview of these newspapers on WP is far from complete

Good news

htt

p

ww

wa

rch

ives

go

vre

sear

chm

ilita

ryw

w2

ph

oto

sim

ages

ww

2-1

94

jpg

There is a 1-fix solution to this contextual problem

De Ondergrondse Pers 1940-1945

By Lydia E Winkel amp H de Vries

1989 ISBN 9021837463 Veen Uitgevers

This book (ldquoDe Winkelrdquo) contains

contextual articles about

(nearly) all plusmn 1300 illegal WW2 newspapers

ldquoDe Winkelrdquo ndash nr 199

De Ondergrondse Pers 1940-1945 Lydia E Winkel H de Vries 1989

ISBN 9021837463 Veen Uitgevers

Every article has a unique ID

(ldquoWinkel-IDrdquo)

Every article has metadata

bull Title subtitle motto bull Place of publication bull Period of publication bull Publication frequency (daily weekly one-off irregular)

bull Multiplication (stenciled printed typed handwritten)

bull Contents (news opinions poems illustrations humor)

bull Number of prints (min ndash max)

Relation 13

Newspaper Placename

semantics linked data

Relation 13

Newspaper Placename

semantics linked data

Contextual information

Nice material

for a Wikipedia article

Very often persons related to this newspaper are mentioned

Relation 23

Newspaper Persons

semantics linked data

Many articles also contain references to other newspapers

bull 106 = Cereales Vadeness (students resistance newspaper Wageningen) bull 360 = Leidsche Brief (students resistance newspaper Leiden) bull 748 = Sol Justitiae (students resistance newspaper Utrecht)

Relation 33

Newspaper Other newspapers

semantics linked data

htt

p

htt

ps

kn

ow

led

geu

top

iaf

iles

wo

rdp

ress

co

m2

01

40

1h

olla

nd

ho

use

libra

ryb

litz1

94

0j

pg

Too bad itrsquos a paper book hard to find multiply distribute and build upon

We need it digital

htt

p

htt

ps

kn

ow

led

geu

top

iaf

iles

wo

rdp

ress

co

m2

01

40

1h

olla

nd

ho

use

libra

ryb

litz1

94

0j

pg

We need it digital

1 Clear copyright with copyright holder (NIOD) Open CC-BY-SA license

2 Scan amp OCR

3 Convert into PDF

4 Put online NIOD site amp Wikimedia Commons

De Winkel as PDF on NIOD website (CC-BY-SA) httpwwwniodnlnlde-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons httpscommonswikimediaorgwikiFilePDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989pdf

De Winkel as PDF on NIOD website (CC-BY-SA) httpwwwniodnlnlde-ondergrondse-pers-1940-1945

Saved us euro13330 httpwwwbrillcomdutch-underground-press-1940-1945

Wikipedia article about De Winkel httpnlwikipediaorgwikiDe_ondergrondse_pers_1940-1945

Wikipedia article about the author httpnlwikipediaorgwikiLydia_Winkel

Winkel the plusses

Available online (PDF flat file)

Open license (CC-BY-SA) Contextual information Relations

bull Titles Places bull Titles Persons

bull Titles Other titles

Image: http://www.archives.gov/research/military/ww2/photos/images/ww2-194.jpg

Image: http://2.bp.blogspot.com/_BWzuYwiS6-I/TMgeRsFd3mI/AAAAAAAAElw/3cvgbZSPWcs/s1600/doctor+macro+judy+scared.jpg

Winkel: the minuses

Unstructured data (PDF, flat file)

Not very machine-readable (unlike CSV, XML, JSON, RDF)

PDF is not a (real) open standard (unlike CSV, XML, JSON, RDF)

No links between titles, Delpher & KB-cat

No links between titles, places & persons and external sources (like Wikipedia)

To summarize

A lot of information is available about these WW2 underground newspapers:

1. Metadata (KB-cat)

2. Content (full text, Delpher)

3. Context (Winkel PDF)

4. Relations: titles – places, persons, other titles (Winkel PDF)

5. External resources about titles, places and persons

but the data sources are unconnected (and for 3+4 unstructured & not machine-readable), making discovery, understanding & research of these newspapers (and related places & persons) more difficult than necessary

Good news

We can solve all these issues

Wikiproject Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

tinyurl.com/verzetskranten (in Dutch)

Wikiproject: https://nl.wikipedia.org/wiki/Wikipedia:Wikiproject, https://en.wikipedia.org/wiki/Wikipedia:Wikiproject

From 14 to 1300 titles

https://nl.wikipedia.org/wiki/Categorie:Illegale_pers_in_de_Tweede_Wereldoorlog

We need a database

Build central database

https://www.youtube.com/watch?v=GVDGuCjog_0

Step 1: Create Excel sheet with

• Metadata about newspaper (from Winkel PDF)

• Unique Wikipedia article title

• Contextual info, incl. related persons & titles (from Winkel PDF)

• PPN: unique ID linking newspaper to KB-catalogue & Delpher

Winkel-ID

Place of publication

Other metadata

Title

Unique Wikipedia article title = <Newspaper title> (verzetsblad, <Place of publication>)

• Contextual info

• Related persons

• Related newspaper titles
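The article-title rule above can be sketched in a few lines of Python. This is only an illustration, not the project's actual tooling: the function name `wikipedia_article_title` is hypothetical, and the comma before the place name is an assumption, since the slide text does not preserve punctuation. The deck's own example article, De Geus onder studenten (verzetsblad), carries no place name, so the place is treated as optional here.

```python
def wikipedia_article_title(newspaper_title, place=None):
    """Build an article title of the form '<title> (verzetsblad, <place>)'.

    The place is optional; without it the qualifier is just '(verzetsblad)'.
    The comma separator is an assumption, not confirmed by the slides.
    """
    title = newspaper_title.strip()
    if place and place.strip():
        return f"{title} (verzetsblad, {place.strip()})"
    return f"{title} (verzetsblad)"


print(wikipedia_article_title("De Geus onder studenten"))
print(wikipedia_article_title("Leidsche Brief", "Leiden"))
```

Deriving the title from the sheet's columns, rather than typing it by hand, is what keeps the 1300 titles uniform and predictable enough to link to.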

PPN (107123223) in the KB-catalogue: http://opc4.kb.nl/DB=1/PPN?PPN=107123223

PPN (107123223) in Delpher: http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223
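Because the PPN is the shared key, both links can be derived from it mechanically. The sketch below assumes the two URL patterns reconstructed from the example links above (the separators in those links were lost in extraction, so treat the patterns as assumptions); the function name `ppn_links` is hypothetical.

```python
from urllib.parse import quote

# URL patterns reconstructed from the two example links; treat them as assumptions.
KB_CATALOGUE = "http://opc4.kb.nl/DB=1/PPN?PPN={ppn}"
DELPHER_SEARCH = "http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]={query}"


def ppn_links(ppn):
    """Return the KB-catalogue and Delpher links for one newspaper title."""
    ppn = str(ppn).strip()
    if not ppn.isalnum():
        raise ValueError(f"unexpected PPN: {ppn!r}")
    return {
        "kb_catalogue": KB_CATALOGUE.format(ppn=ppn),
        # 'ppn=<id>' is percent-encoded, so '=' becomes '%3D' as in the example link
        "delpher": DELPHER_SEARCH.format(query=quote(f"ppn={ppn}", safe="")),
    }


for name, url in ppn_links(107123223).items():
    print(name, url)
```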

Build central database

Step 2: Convert Excel into RDF triplestore (= special kind of online database anybody can access)

• Steps 1-4 from http://linda-project.eu/linked-data-primer-2

• Step 4: Vocabulary used = Bibframe (http://bibframe.org/vocab)
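To make the idea of "triples" concrete, here is a minimal sketch that turns one spreadsheet row into N-Triples lines using only the Python standard library. It is not the conversion used in the project: the `example.org` namespace and the `placeOfPublication` property are hypothetical, and two Dublin Core terms stand in for the Bibframe vocabulary named above. The row values are sample data taken from the slides, with the Winkel-ID chosen purely for illustration.

```python
BASE = "http://example.org/verzetskranten/"   # hypothetical namespace
DCTERMS = "http://purl.org/dc/terms/"         # Dublin Core terms, used here instead of Bibframe


def literal(value):
    """Serialise a value as an N-Triples string literal."""
    escaped = str(value).replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def row_to_ntriples(row):
    """Turn one spreadsheet row (a dict) into subject-predicate-object lines."""
    subject = f"<{BASE}{row['winkel_id']}>"
    triples = [
        (subject, f"<{DCTERMS}title>", literal(row["title"])),
        (subject, f"<{DCTERMS}identifier>", literal(row["ppn"])),
        # hypothetical property; a real dataset would take this from its vocabulary
        (subject, f"<{BASE}vocab/placeOfPublication>", literal(row["place"])),
    ]
    return [f"{s} {p} {o} ." for s, p, o in triples]


# Sample row; the winkel_id value is illustrative only.
row = {"winkel_id": "199", "title": "De Geus onder studenten",
       "place": "Den Haag", "ppn": "107123223"}
for line in row_to_ntriples(row):
    print(line)
```

Each printed line is one statement about the newspaper; a triplestore is essentially a large, queryable collection of such statements.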

Build central database

Step 3: Link to external resources

• Step 5 from http://linda-project.eu/linked-data-primer-2

• DBpedia = machine-readable, structured version of Wikipedia

• DBpedia = hub for linking different data sets on the Web to each other: the Linked Open Data cloud

DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data

Linked Open Data cloud: http://lod-cloud.net/versions/2010-09-22/lod-cloud_colored.png

• We use DBpedia to connect persons & places in our newspaper database to information in other databases

http://nl.dbpedia.org/page/Huib_Drion

https://nl.wikipedia.org/wiki/Huib_Drion

http://www.dbnl.org/auteurs/auteur.php?id=drio001

http://www.biografischportaal.nl/persoon/41181342

Build central database

Added value of Linked Open Data & DBpedia: software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in

• KB-catalogue

• Delpher

• De Winkel
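What "software can automatically query" might look like is sketched below: a small helper that builds a SPARQL request asking DBpedia for everything it states about one person. The endpoint address, the `/resource/` form of the URI and the `format` parameter are assumptions based on common DBpedia conventions, not details given in the lecture, and the function names are hypothetical. The code only constructs the request URL; it does not contact any server.

```python
from urllib.parse import urlencode

ENDPOINT = "http://nl.dbpedia.org/sparql"  # assumed address of the Dutch DBpedia endpoint


def describe_query(resource_uri, limit=50):
    """SPARQL query listing every property/value pair known for one resource."""
    return (
        "SELECT ?property ?value WHERE { "
        f"<{resource_uri}> ?property ?value "
        f"}} LIMIT {int(limit)}"
    )


def query_url(resource_uri, limit=50):
    """Build the GET request URL for the query, asking for JSON results."""
    params = {
        "query": describe_query(resource_uri, limit),
        "format": "application/sparql-results+json",  # assumed to be accepted by the endpoint
    }
    return f"{ENDPOINT}?{urlencode(params)}"


# Assumed resource URI for the person page shown on the slide.
print(query_url("http://nl.dbpedia.org/resource/Huib_Drion", limit=10))
```

The returned property/value pairs are where links to other databases (such as the DBNL and Biografisch Portaal pages for Huib Drion) would show up, which is how a newspaper record gains information that none of the three original sources holds.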

Summary: data about 1300 newspapers

Available online

Structured data (RDF triples)

Open license (CC-BY-SA)

Open standard (RDF)

Contextual information

Links between titles, Delpher & KB-cat (via PPNs)

Links between titles, places & persons and external sources (via DBpedia)

Relations

• Titles – Places

• Titles – Persons

• Titles – Other titles

http://5stardata.info/en

Wikiproject Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

We need a template

Using an article template, we can generate 1300 uniform and interlinked Wikipedia articles from the LOD-database

Image: https://c1.staticflickr.com/9/8281/7699231918_11a7356c38_b.jpg

LOD-database + article template = Wikipedia article

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

Titles from database

Metadata from database

Related persons from database

Related newspapers from database

Winkel-ID from database

Link to full-texts in Delpher from database

Link to KB-catalogue record from database

Fixed categories on WP and Commons

Predefined fixed strings

Grey =

• From database

• Predefined fixed strings

Uniformity between articles guaranteed

All that WP-writers need to add manually
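The "database + template = article" idea can be illustrated with Python's built-in `string.Template`. The wikitext skeleton below is hypothetical (the project's real template is not reproduced on the slides), as are the field names and section headings; it only shows how fixed strings and database fields combine into a draft that Wikipedia writers then extend by hand.

```python
from string import Template

# Hypothetical wikitext skeleton: fixed strings plus $placeholders for database fields.
ARTICLE_TEMPLATE = Template(
    "'''$title''' was een verzetsblad uit de Tweede Wereldoorlog "
    "dat in $place werd uitgegeven.\n"
    "\n"
    "== Gerelateerde kranten ==\n"
    "$related\n"
    "\n"
    "== Externe links ==\n"
    "* [$delpher_url Volledige tekst in Delpher]\n"
    "* [$catalogue_url Beschrijving in de KB-catalogus]\n"
)


def render_article(record):
    """Fill the fixed template with the variable fields of one database record."""
    related = "\n".join(f"* [[{name}]]" for name in record["related_titles"])
    return ARTICLE_TEMPLATE.substitute(
        title=record["title"],
        place=record["place"],
        related=related,
        delpher_url=record["delpher_url"],
        catalogue_url=record["catalogue_url"],
    )


# Sample record; square brackets in the Delpher URL are percent-encoded so they
# do not terminate the wikitext external link.
record = {
    "title": "De Geus onder studenten",
    "place": "Den Haag",
    "related_titles": ["Leidsche Brief", "Sol Justitiae"],
    "delpher_url": "http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql%5B%5D=ppn%3D107123223",
    "catalogue_url": "http://opc4.kb.nl/DB=1/PPN?PPN=107123223",
}
print(render_article(record))
```

Since every article is produced by the same template, uniformity follows from the process itself rather than from editorial discipline.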

Problem with Delpher (and KB-catalogue)

Very little contextual information about the newspapers (titles)

Image: https://thejungleisneutral.files.wordpress.com/2013/11/lost.jpg

The KB can re-use (embed) the Wikipedia content in its own websites to tackle this problem

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_%28verzetsblad%29

Delpher - search results

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

Embedded contextual snippet from Wikipedia (translated from Dutch):

De Geus (onder studenten) was a resistance newspaper from the Second World War that, from 4 October 1940 through 13 July 1944 … Lees verder op Wikipedia (Read more on Wikipedia)

Delpher - object presentation

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

Embedded contextual snippet from Wikipedia (translated from Dutch):

About De Geus onder studenten

De Geus (onder studenten) was a resistance newspaper from the Second World War that was published in The Hague from 4 October 1940 through 13 July 1944. In 1940, 1941 and 1943 the paper appeared monthly, otherwise irregularly, with a print run of between 250 and 8000 copies. It was initially stencilled and, from November 1942, printed; its content consisted mainly of opinion articles.

The paper was published by Jan Drion and Huib Drion, two Leiden…

Lees verder op Wikipedia (Read more on Wikipedia)
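The embedded snippets shown on these mock-ups amount to the first part of the article's introduction, cut off at a word boundary and followed by a link back to Wikipedia. A minimal sketch of that step follows; the function name `make_snippet` and the length limit are hypothetical, fetching the article text is left out, and the default link label is the Dutch one shown on the slides.

```python
def make_snippet(intro, article_url, max_chars=120,
                 link_text="Lees verder op Wikipedia"):
    """Shorten an article introduction and append a 'read more' link."""
    text = " ".join(intro.split())          # collapse line breaks and double spaces
    if len(text) > max_chars:
        # cut at the last space before the limit so no word is split
        text = text[:max_chars].rsplit(" ", 1)[0] + " …"
    return f"{text} {link_text}: {article_url}"


intro = ("De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog "
         "dat vanaf 4 oktober 1940 tot en met 13 juli 1944 in Den Haag werd uitgegeven.")
print(make_snippet(intro, "https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)"))
```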

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web – 20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures and video: unlock our data and reframe the way we use it together.

• https://en.wikipedia.org/wiki/Linked_data – Wikipedia article related to the above video

• http://5stardata.info/en – The 5 stars of Linked Open Data (Tim Berners-Lee)

• http://linda-project.eu/linked-data-primer-2 – Short primer about creating LOD in practice, starting from an Excel sheet

• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31 – The figure near the bottom of the first page is a good illustration of the concept of (linked) triples

• https://en.wikipedia.org/wiki/DBpedia

• https://en.wikipedia.org/wiki/Semantic_network

Image: http://www.gettyimages.nl/detail/nieuwsfoto's/three-women-of-the-ats-light-up-together-ats-regulations-nieuwsfotos/3094265

Questions?

olaf.janssen@kb.nl - @ookgezellig

tinyurl.com/verzetskranten



Page 26: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

Everything is of course on Wikipedia Just type in a name and you can read entire essays (man 70s)

Over half of us think that Wikipedia and Google contribute to our knowledge and understanding of history

When we have to find information about WW2 outside the class setting we fully concentrate on digital resources like Google and Wikipedia (school kids)

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

httpnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

is given in

Context about De Geus onder studenten

But nowhellip

We have another problemhellip

httpsnlwikipediaorgwikiCategorieIllegale_pers_in_de_Tweede_Wereldoorlog

hellip De Geus on Wikipedia is an exception

1 Very few underground newspapers have their own WP articles

2 The overview of these newspapers on WP is far from complete

Good news

htt

p

ww

wa

rch

ives

go

vre

sear

chm

ilita

ryw

w2

ph

oto

sim

ages

ww

2-1

94

jpg

There is a 1-fix solution to this contextual problem

De Ondergrondse Pers 1940-1945

By Lydia E Winkel amp H de Vries

1989 ISBN 9021837463 Veen Uitgevers

This book (ldquoDe Winkelrdquo) contains

contextual articles about

(nearly) all plusmn 1300 illegal WW2 newspapers

ldquoDe Winkelrdquo ndash nr 199

De Ondergrondse Pers 1940-1945 Lydia E Winkel H de Vries 1989

ISBN 9021837463 Veen Uitgevers

Every article has a unique ID

(ldquoWinkel-IDrdquo)

Every article has metadata

bull Title subtitle motto bull Place of publication bull Period of publication bull Publication frequency (daily weekly one-off irregular)

bull Multiplication (stenciled printed typed handwritten)

bull Contents (news opinions poems illustrations humor)

bull Number of prints (min ndash max)

Relation 13

Newspaper Placename

semantics linked data

Relation 13

Newspaper Placename

semantics linked data

Contextual information

Nice material

for a Wikipedia article

Very often persons related to this newspaper are mentioned

Relation 23

Newspaper Persons

semantics linked data

Many articles also contain references to other newspapers

bull 106 = Cereales Vadeness (students resistance newspaper Wageningen) bull 360 = Leidsche Brief (students resistance newspaper Leiden) bull 748 = Sol Justitiae (students resistance newspaper Utrecht)

Relation 33

Newspaper Other newspapers

semantics linked data

htt

p

htt

ps

kn

ow

led

geu

top

iaf

iles

wo

rdp

ress

co

m2

01

40

1h

olla

nd

ho

use

libra

ryb

litz1

94

0j

pg

Too bad itrsquos a paper book hard to find multiply distribute and build upon

We need it digital

htt

p

htt

ps

kn

ow

led

geu

top

iaf

iles

wo

rdp

ress

co

m2

01

40

1h

olla

nd

ho

use

libra

ryb

litz1

94

0j

pg

We need it digital

1 Clear copyright with copyright holder (NIOD) Open CC-BY-SA license

2 Scan amp OCR

3 Convert into PDF

4 Put online NIOD site amp Wikimedia Commons

De Winkel as PDF on NIOD website (CC-BY-SA) httpwwwniodnlnlde-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons httpscommonswikimediaorgwikiFilePDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989pdf

De Winkel as PDF on NIOD website (CC-BY-SA) httpwwwniodnlnlde-ondergrondse-pers-1940-1945

Saved us euro13330 httpwwwbrillcomdutch-underground-press-1940-1945

Wikipedia article about De Winkel httpnlwikipediaorgwikiDe_ondergrondse_pers_1940-1945

Wikipedia article about the author httpnlwikipediaorgwikiLydia_Winkel

Winkel the plusses

Available online (PDF flat file)

Open license (CC-BY-SA) Contextual information Relations

bull Titles Places bull Titles Persons

bull Titles Other titles

htt

p

ww

wa

rch

ives

go

vre

sear

chm

ilita

ryw

w2

ph

oto

sim

ages

ww

2-1

94

jpg

htt

p

2b

pb

logsp

ot

com

_BW

zuYw

iS6-I

TM

geR

sFd3m

IAAAAAAAAElw

3cv

gbZSPW

css

1600d

oct

or+

macr

o+

judy+

scare

djpg

Winkel the minusses

Unstructured data (PDF flat file)

Not very machine readable (unlike CSV XML JSON RDF)

PDF is no (real) open standard (unlike CSV XML JSON RDF)

No links between titles Delpher amp KB-cat No links between titles places amp persons

external sources (like Wikipedia)

but the data sources are

unconnected (and for 3+4 unstructured amp not machine-readable)

To summarize

a lot of information is available about these WW2 underground newspapers

1 Metadata (KB-cat)

2 Content (full-text Delpher)

3 Context (Winkel PDF)

4 Relations titles places persons other titles (Winkel PDF)

5 External resources about titles places and persons

htt

p

2b

pb

logsp

ot

com

_BW

zuYw

iS6-I

TM

geR

sFd3m

IAAAAAAAAElw

3cv

gbZSPW

css

1600d

oct

or+

macr

o+

judy+

scare

djpg

making discovery understanding amp research

of these newspapers (and related places amp persons) more difficult than necessary

We can solve all these issues

Good news

Wikiproject() Verzetskranten

Systematically and uniformly describe amp link all 1300 Dutch underground newspapers from WW2

on Dutch Wikipedia

tinyurlcomverzetskranten (in Dutch)

httpsnlwikipediaorgwikiWikipediaWikiproject httpsenwikipediaorgwikiWikipediaWikiproject

From 14 1300 titles

htt

ps

n

lwik

iped

iao

rgw

iki

Cat

ego

rie

Illeg

ale_

per

s_in

_de_

Twee

de_

Wer

eld

oo

rlo

g

Wikiproject() Verzetskranten

Systematically and uniformly describe amp link all 1300 Dutch underground newspapers from WW2

on Dutch Wikipedia

tinyurlcomverzetskranten (in Dutch)

We need a database

Build central database

httpswwwyoutubecomwatchv=GVDGuCjog_0

Build central database

httpswwwyoutubecomwatchv=GVDGuCjog_0

Step 1 Create Excel-sheet with bull Metadata about newspaper (from Winkel PDF)

bull Unique Wikipedia article title

bull Contextual info incl related persons amp titles (from Winkel PDF)

bull PPN unique ID linking newspaper to

KB-catalogue amp Delpher

Winkel-ID

Place of publication

Other metadata

Title

Unique Wikipedia article title =

ltNewspaper titlegt (verzetsblad ltPlace of publicationgt)

bull Contextual info bull Related persons

bull Related newspaper titles

PPN (107123223)

httpopc4kbnlDB=1PPNPPN=107123223

httpopc4kbnlDB=1PPNPPN=107123223

PPN (107123223)

PPN (107123223)

httpwwwdelphernlnlkrantenresultsindexcoll=dddtitelampcql[]=ppn3D107123223

httpwwwdelphernlnlkrantenresultsindexcoll=dddtitelampcql[]=ppn3D107123223

PPN (107123223)

Build central database

Step 2 Convert Excel into RDF triplestore (=special kind of online database anybody can access)

bull Steps 1-4 from httplinda-projecteulinked-

data-primer-2

bull Step 4 Vocubulary used = Bibframe (httpbibframeorgvocab)

Build central database

Step 3 Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

bull DBpedia = hub for linking different data sets on the Web to each

other Linked Open Data cloud

bull Connect persons amp places in newspaper database to external resources via DBpedia

Step 1c Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data

bull Connect persons amp places in newspaper database to external

resources via DBpedia

httplod-cloudnetversions2010-09-22lod-cloud_coloredpng

Linked Open Data cloud

Build central database

Step 3 Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

bull DBpedia = hub for linking different data sets on the Web to each

other Linked Open Data cloud

bull We use DBpdia to connect persons amp places in our newspaper database to information in other databases

httpnldbpediaorgpageHuib_Drion

httpsnlwikipediaorgwikiHuib_Drion

httpwwwdbnlorgauteursauteurphpid=drio001

httpwwwbiografischportaalnlpersoon41181342

Build central database

Added value of Linked Open Data amp DBpedia Software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in bull KB-catalogue bull Delpher bull De Winkel

Summary: data about 1300 newspapers

• Available online
• Structured data (RDF-triples)
• Open license (CC-BY-SA)
• Open standard (RDF)
• Contextual information
• Links between titles, Delpher & KB-cat (via PPNs)
• Links between titles, places & persons and external sources (via DBpedia)
• Relations:
  • Titles ↔ Places
  • Titles ↔ Persons
  • Titles ↔ Other titles

http://5stardata.info/en

Wikiproject Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

We need a template

Using an article template we can generate 1300 uniform and interlinked Wikipedia articles from the LOD-database


LOD-database + article template = Wikipedia article

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

From database:

• Titles
• Metadata
• Related persons
• Related newspapers
• Winkel-ID
• Link to full-texts in Delpher
• Link to KB-catalogue record

Predefined fixed strings:

• Fixed categories on WP and Commons

Grey = from database or predefined fixed strings: uniformity between articles guaranteed

All that WP-writers need to add manually
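The "database + template = article" idea can be sketched in a few lines of Python. Everything specific here is an assumption made for illustration: the wikitext skeleton, its English wording, the category name and the row keys are hypothetical, since the project's real article template is not reproduced in these slides.

```python
from string import Template

# Hypothetical, heavily simplified wikitext skeleton (the fixed strings).
ARTICLE_TEMPLATE = Template(
    "'''$title''' was an underground newspaper published in [[$place]].\n"
    "\n"
    "== Related persons ==\n"
    "$persons\n"
    "\n"
    "== Related newspapers ==\n"
    "$newspapers\n"
    "\n"
    "Winkel-ID: $winkel_id\n"
    "[[Category:$category]]\n"
)


def wiki_list(items):
    """Render names as a wikitext bullet list of internal links."""
    return "\n".join(f"* [[{item}]]" for item in items)


def render_article(row: dict) -> str:
    """Fill the fixed template with the values of one database row."""
    return ARTICLE_TEMPLATE.substitute(
        title=row["title"],
        place=row["place"],
        persons=wiki_list(row["persons"]),
        newspapers=wiki_list(row["related_titles"]),
        winkel_id=row["winkel_id"],
        category="Underground newspapers",  # predefined fixed string (made up)
    )


# Illustrative row; the Winkel-ID and related title are sample values only.
row = {
    "title": "De Geus onder studenten",
    "place": "Den Haag",
    "persons": ["Jan Drion", "Huib Drion"],
    "related_titles": ["Leidsche Brief"],
    "winkel_id": 199,
}
print(render_article(row))
```

Because every article passes through the same template, layout and category assignment cannot drift between articles; only the free-text body is left to the writer.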

Problem with Delpher (and KB-catalogue)

Very little contextual information about the newspaper(title)s

https://thejungleisneutral.files.wordpress.com/2013/11/lost.jpg

The KB can re-use (embed) the Wikipedia content in its own websites to tackle this problem

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_%28verzetsblad%29

Delpher - search results

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

Embedded contextual snippet from Wikipedia:

De geus (onder studenten) was a resistance newspaper from the Second World War that from 4 October 1940 until 13 July 1944 … Read more on Wikipedia

Delpher - object presentation

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

Embedded contextual snippet from Wikipedia:

About De Geus onder studenten

De geus (onder studenten) was a resistance newspaper from the Second World War, published in The Hague from 4 October 1940 until 13 July 1944. The paper appeared monthly in 1940, 1941 and 1943 and irregularly otherwise, with a print run of between 250 and 8000 copies. It was initially stencilled and from November 1942 printed, and its contents consisted mainly of opinion articles.

The paper was published by Jan Drion and Huib Drion, two Leiden…

Read more on Wikipedia
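The embedded snippets are shortened article intros followed by a link to the full Wikipedia article. A rough Python sketch of that shortening step is given below. How the KB actually produces its snippets is not described in the slides, so the function name, the default length and the HTML shape are all assumptions.

```python
from html import escape


def make_snippet(intro: str, article_url: str, max_chars: int = 120,
                 link_text: str = "Read more on Wikipedia") -> str:
    """Cut an article intro at a word boundary and append a read-more link."""
    text = " ".join(intro.split())  # normalise whitespace
    if len(text) > max_chars:
        # Cut at the last space at or before the limit so no word is split
        cut = text.rfind(" ", 0, max_chars + 1)
        text = text[: cut if cut > 0 else max_chars].rstrip() + " …"
    # Escape text and URL so the result is safe to embed in an HTML page
    return f'{escape(text)} <a href="{escape(article_url)}">{escape(link_text)}</a>'


print(make_snippet(
    "De geus (onder studenten) was a resistance newspaper from the Second World War.",
    "https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)",
    max_chars=60,
))
```

A short `max_chars` would suit a search-results list, a longer one the object presentation page, where the slides show a fuller excerpt.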

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web 20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together.

• https://en.wikipedia.org/wiki/Linked_data Wikipedia article related to the above video

• http://5stardata.info/en The 5 stars of Linked Open Data (Tim Berners-Lee)

• http://linda-project.eu/linked-data-primer-2 Short primer about creating LOD in practice, starting from an Excel sheet

• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31 The figure near the bottom of the first page is a good illustration of the concept of (linked) triples

• https://en.wikipedia.org/wiki/DBpedia

• https://en.wikipedia.org/wiki/Semantic_network


Questions?

olaf.janssen@kb.nl - @ookgezellig

tinyurl.com/verzetskranten


Over half of us think that Wikipedia and Google contribute to our knowledge and understanding of history

When we have to find information about WW2 outside the class setting we fully concentrate on digital resources like Google and Wikipedia (school kids)

Context about De Geus onder studenten (http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)) is given in http://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

But now… we have another problem…

https://nl.wikipedia.org/wiki/Categorie:Illegale_pers_in_de_Tweede_Wereldoorlog

… De Geus on Wikipedia is an exception

1. Very few underground newspapers have their own WP articles

2. The overview of these newspapers on WP is far from complete

Good news


There is a 1-fix solution to this contextual problem

De Ondergrondse Pers 1940-1945

By Lydia E. Winkel & H. de Vries, 1989, ISBN 9021837463, Veen Uitgevers

This book ("De Winkel") contains contextual articles about (nearly) all ± 1300 illegal WW2 newspapers

"De Winkel" – nr. 199

Every article has a unique ID ("Winkel-ID")

Every article has metadata

• Title, subtitle, motto
• Place of publication
• Period of publication
• Publication frequency (daily, weekly, one-off, irregular)
• Method of reproduction (stenciled, printed, typed, handwritten)
• Contents (news, opinions, poems, illustrations, humor)
• Number of copies printed (min – max)
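To make the metadata list concrete, here is one way such an entry could be modelled in Python. The class and field names are hypothetical and chosen only to mirror the bullets. The sample values come from the De Geus description elsewhere in the deck, except the Winkel-ID, which is assumed to be the entry shown on the "nr. 199" slide.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class WinkelEntry:
    """One newspaper entry from De Winkel; field names are illustrative."""
    winkel_id: int
    title: str
    place_of_publication: str
    period_of_publication: str  # kept as free text
    subtitle: Optional[str] = None
    motto: Optional[str] = None
    frequency: List[str] = field(default_factory=list)     # daily, weekly, ...
    reproduction: List[str] = field(default_factory=list)  # stenciled, printed, ...
    contents: List[str] = field(default_factory=list)      # news, opinions, ...
    min_copies: Optional[int] = None
    max_copies: Optional[int] = None

    def __post_init__(self):
        # Basic sanity check on the print-run range
        if (self.min_copies is not None and self.max_copies is not None
                and self.min_copies > self.max_copies):
            raise ValueError("min_copies must not exceed max_copies")


de_geus = WinkelEntry(
    winkel_id=199,  # assumption, see lead-in
    title="De Geus onder studenten",
    place_of_publication="Den Haag",
    period_of_publication="4 October 1940 - 13 July 1944",
    frequency=["monthly", "irregular"],
    reproduction=["stenciled", "printed"],
    contents=["opinions"],
    min_copies=250,
    max_copies=8000,
)
print(de_geus.title, de_geus.min_copies, de_geus.max_copies)
```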

Relation 1/3: Newspaper ↔ Placename (semantics, linked data)

Contextual information: nice material for a Wikipedia article

Very often persons related to this newspaper are mentioned

Relation 2/3: Newspaper ↔ Persons (semantics, linked data)

Many articles also contain references to other newspapers

• 106 = Cereales Vadeness (student resistance newspaper, Wageningen)
• 360 = Leidsche Brief (student resistance newspaper, Leiden)
• 748 = Sol Justitiae (student resistance newspaper, Utrecht)

Relation 3/3: Newspaper ↔ Other newspapers (semantics, linked data)


Too bad it's a paper book: hard to find, multiply, distribute and build upon

We need it digital


1. Clear copyright with copyright holder (NIOD): open CC-BY-SA license

2. Scan & OCR

3. Convert into PDF

4. Put online: NIOD site & Wikimedia Commons

De Winkel as PDF on NIOD website (CC-BY-SA): http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons: https://commons.wikimedia.org/wiki/File:PDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989.pdf

Saved us €13330: http://www.brill.com/dutch-underground-press-1940-1945

Wikipedia article about De Winkel: http://nl.wikipedia.org/wiki/De_ondergrondse_pers_1940-1945

Wikipedia article about the author: http://nl.wikipedia.org/wiki/Lydia_Winkel

Winkel: the pluses

• Available online (PDF, flat file)
• Open license (CC-BY-SA)
• Contextual information
• Relations:
  • Titles ↔ Places
  • Titles ↔ Persons
  • Titles ↔ Other titles


Winkel: the minuses

• Unstructured data (PDF, flat file)
• Not very machine-readable (unlike CSV, XML, JSON, RDF)
• PDF is no (real) open standard (unlike CSV, XML, JSON, RDF)
• No links between titles, Delpher & KB-cat
• No links between titles, places & persons and external sources (like Wikipedia)

To summarize: a lot of information is available about these WW2 underground newspapers

1. Metadata (KB-cat)
2. Content (full-text, Delpher)
3. Context (Winkel PDF)
4. Relations: titles, places, persons, other titles (Winkel PDF)
5. External resources about titles, places and persons

but the data sources are unconnected (and for 3+4: unstructured & not machine-readable), making discovery, understanding & research of these newspapers (and related places & persons) more difficult than necessary

We can solve all these issues

Good news

Wikiproject(*) Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

tinyurl.com/verzetskranten (in Dutch)

(*) https://nl.wikipedia.org/wiki/Wikipedia:Wikiproject, https://en.wikipedia.org/wiki/Wikipedia:Wikiproject

From 14 to 1300 titles


We need a database

Build central database

https://www.youtube.com/watch?v=GVDGuCjog_0

Step 1: Create Excel-sheet with

• Metadata about newspaper (from Winkel PDF): Winkel-ID, title, place of publication, other metadata
• Unique Wikipedia article title = <Newspaper title> (verzetsblad, <Place of publication>)
• Contextual info, incl. related persons & related newspaper titles (from Winkel PDF)
• PPN: unique ID linking newspaper to KB-catalogue & Delpher
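The article-title rule can be written as a small Python helper. This is a sketch under assumptions: the comma between "verzetsblad" and the place is a guess at the punctuation of the pattern, the place is treated as optional because the De Geus example has none, and both function names are hypothetical.

```python
from urllib.parse import quote


def article_title(newspaper_title: str, place: str = "") -> str:
    """Build a unique Wikipedia article title from title and place."""
    # Assumption: the place, when present, follows "verzetsblad" after a comma
    qualifier = f"verzetsblad, {place}" if place else "verzetsblad"
    return f"{newspaper_title} ({qualifier})"


def article_url(title: str) -> str:
    """Turn an article title into a Dutch Wikipedia URL."""
    # MediaWiki URLs use underscores for spaces; keep brackets and commas readable
    page_name = quote(title.replace(" ", "_"), safe="(),")
    return f"https://nl.wikipedia.org/wiki/{page_name}"


print(article_url(article_title("De Geus onder studenten")))
# "Voorbeeldkrant" is a made-up title used only to show the place variant
print(article_title("Voorbeeldkrant", "Utrecht"))
```

Adding the place only matters when several newspapers share a title; the qualifier is what keeps 1300 article names unique.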

PPN (107123223)

• KB-catalogue: http://opc4.kb.nl/DB=1/PPN?PPN=107123223
• Delpher: http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223
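Because the PPN is the shared key, both links can be generated from it. The Python sketch below rebuilds the two example addresses from the PPN alone. The URL patterns are inferred from those two examples only and may have changed since the lecture; the function names are hypothetical.

```python
from urllib.parse import quote


def kb_catalogue_url(ppn: str) -> str:
    """Link to the KB-catalogue record for a PPN (pattern inferred from the example)."""
    return f"http://opc4.kb.nl/DB=1/PPN?PPN={ppn}"


def delpher_titles_url(ppn: str) -> str:
    """Link to the Delpher search results for all issues of one title."""
    cql = quote(f"ppn={ppn}", safe="")  # percent-encode "=" as %3D
    return f"http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]={cql}"


ppn = "107123223"
print(kb_catalogue_url(ppn))
print(delpher_titles_url(ppn))
```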

Build central database

Step 2: Convert Excel into RDF triplestore (= special kind of online database anybody can access)

• Steps 1-4 from http://linda-project.eu/linked-data-primer-2

• Step 4: Vocabulary used = Bibframe (http://bibframe.org/vocab)
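As an illustration of what "converting a sheet into triples" means, the Python sketch below turns rows into N-Triples lines using only the standard library. It is not the conversion route of the primer. It assumes the Excel sheet has been exported to CSV, and both namespaces are placeholders: `VOCAB` merely stands in for real Bibframe terms, which are not spelled out in the slides.

```python
import csv
import io

BASE = "http://example.org/verzetskranten/"  # hypothetical namespace for newspapers
VOCAB = "http://example.org/vocab/"          # placeholder, stands in for Bibframe terms


def literal(value: str) -> str:
    """Serialise a plain string as an N-Triples literal."""
    escaped = (value.replace("\\", "\\\\")
                    .replace('"', '\\"')
                    .replace("\n", "\\n"))
    return f'"{escaped}"'


def row_to_triples(row: dict) -> list:
    """One subject per newspaper (keyed on Winkel-ID), one triple per filled cell."""
    subject = f"<{BASE}{row['winkel_id']}>"
    triples = []
    for column, value in row.items():
        if column == "winkel_id" or not value:
            continue  # the ID is already in the subject; skip empty cells
        triples.append(f"{subject} <{VOCAB}{column}> {literal(value)} .")
    return triples


def sheet_to_ntriples(csv_text: str) -> str:
    """Convert a CSV export of the sheet into N-Triples text."""
    lines = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        lines.extend(row_to_triples(row))
    return "\n".join(lines)


# Illustrative one-row sheet; the Winkel-ID is a sample value
sample = "winkel_id,title,place\n199,De Geus onder studenten,Den Haag\n"
print(sheet_to_ntriples(sample))
```

Each spreadsheet cell becomes one subject-predicate-object statement, which is the shape a triplestore loads and a SPARQL query reads.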



Page 29: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

But nowhellip

We have another problemhellip

httpsnlwikipediaorgwikiCategorieIllegale_pers_in_de_Tweede_Wereldoorlog

hellip De Geus on Wikipedia is an exception

1 Very few underground newspapers have their own WP articles

2 The overview of these newspapers on WP is far from complete

Good news

htt

p

ww

wa

rch

ives

go

vre

sear

chm

ilita

ryw

w2

ph

oto

sim

ages

ww

2-1

94

jpg

There is a 1-fix solution to this contextual problem

De Ondergrondse Pers 1940-1945

By Lydia E. Winkel & H. de Vries

1989, ISBN 9021837463, Veen Uitgevers

This book ("De Winkel") contains contextual articles about (nearly) all ± 1300 illegal WW2 newspapers

"De Winkel" - nr. 199

Every article has a unique ID ("Winkel-ID")

Every article has metadata:

• Title, subtitle, motto

• Place of publication

• Period of publication

• Publication frequency (daily, weekly, one-off, irregular)

• Reproduction method (stenciled, printed, typed, handwritten)

• Contents (news, opinions, poems, illustrations, humor)

• Print run (min - max)

Relation 1/3: Newspaper - Placename (semantics, linked data)

Contextual information: nice material for a Wikipedia article

Very often, persons related to this newspaper are mentioned

Relation 2/3: Newspaper - Persons (semantics, linked data)

Many articles also contain references to other newspapers:

• 106 = Cereales Vadeness (students' resistance newspaper, Wageningen)

• 360 = Leidsche Brief (students' resistance newspaper, Leiden)

• 748 = Sol Justitiae (students' resistance newspaper, Utrecht)

Relation 3/3: Newspaper - Other newspapers (semantics, linked data)

Too bad it's a paper book: hard to find, copy, distribute and build upon

We need it digital

1. Clear copyright with the copyright holder (NIOD): open CC-BY-SA license

2. Scan & OCR

3. Convert into PDF

4. Put online: NIOD site & Wikimedia Commons

De Winkel as PDF on the NIOD website (CC-BY-SA): http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons: https://commons.wikimedia.org/wiki/File:PDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989.pdf

Saved us €133.30: http://www.brill.com/dutch-underground-press-1940-1945

Wikipedia article about De Winkel: http://nl.wikipedia.org/wiki/De_ondergrondse_pers_1940-1945

Wikipedia article about the author: http://nl.wikipedia.org/wiki/Lydia_Winkel

Winkel: the plusses

• Available online (PDF, flat file)

• Open license (CC-BY-SA)

• Contextual information

• Relations: Titles - Places, Titles - Persons, Titles - Other titles

Winkel: the minuses

• Unstructured data (PDF, flat file)

• Not very machine-readable (unlike CSV, XML, JSON, RDF)

• PDF is no (real) open standard (unlike CSV, XML, JSON, RDF)

• No links between titles and Delpher & KB-cat

• No links between titles, places & persons and external sources (like Wikipedia)

To summarize: a lot of information is available about these WW2 underground newspapers

1. Metadata (KB-cat)

2. Content (full-text, Delpher)

3. Context (Winkel PDF)

4. Relations: titles, places, persons, other titles (Winkel PDF)

5. External resources about titles, places and persons

but the data sources are unconnected (and for 3+4: unstructured & not machine-readable), making discovery, understanding & research of these newspapers (and related places & persons) more difficult than necessary

We can solve all these issues

Good news

Wikiproject(*) Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

tinyurl.com/verzetskranten (in Dutch)

(*) https://nl.wikipedia.org/wiki/Wikipedia:Wikiproject, https://en.wikipedia.org/wiki/Wikipedia:Wikiproject

From 14 to 1300 titles

We need a database

Build central database

https://www.youtube.com/watch?v=GVDGuCjog_0

Step 1: Create an Excel sheet with:

• Metadata about the newspaper (from the Winkel PDF): Winkel-ID, title, place of publication, other metadata

• Unique Wikipedia article title = <Newspaper title> (verzetsblad <Place of publication>)

• Contextual info, incl. related persons & related newspaper titles (from the Winkel PDF)

• PPN: unique ID linking the newspaper to the KB-catalogue & Delpher

Example, PPN 107123223:

KB-catalogue: http://opc4.kb.nl/DB=1/PPN?PPN=107123223

Delpher: http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223
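A minimal sketch of how one spreadsheet row could yield the article title and the two PPN-based links described in Step 1. The function names are hypothetical, the URL patterns are reconstructed from the (punctuation-stripped) links on the slides, and the exact separator inside the title pattern is an assumption because the transcript lost it.

```python
from urllib.parse import quote


def article_title(newspaper_title: str, place: str) -> str:
    # Pattern as shown on the slide; punctuation inside the brackets is assumed.
    return f"{newspaper_title} (verzetsblad {place})"


def catalogue_url(ppn: str) -> str:
    # KB-catalogue record for a PPN (URL shape reconstructed from the slide).
    return f"http://opc4.kb.nl/DB=1/PPN?PPN={ppn}"


def delpher_url(ppn: str) -> str:
    # Delpher title search; "ppn=<id>" is percent-encoded, as in the slide's "ppn%3D...".
    return (
        "http://www.delpher.nl/nl/kranten/results/index"
        "?coll=dddtitel&cql[]=" + quote(f"ppn={ppn}")
    )


title = article_title("De Geus onder studenten", "Den Haag")
kb_link = catalogue_url("107123223")
delpher_link = delpher_url("107123223")
```

Because both links are derived from the PPN alone, the spreadsheet only has to store the identifier, not the URLs.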

Build central database

Step 2: Convert the Excel sheet into an RDF triplestore (= a special kind of online database anybody can access)

• Steps 1-4 from http://linda-project.eu/linked-data-primer-2

• Step 4: Vocabulary used = Bibframe (http://bibframe.org/vocab)
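To make the idea of "rows become triples" concrete, here is a small sketch that turns one spreadsheet row into N-Triples lines. It is not the project's actual conversion: the slides name Bibframe as the vocabulary, whereas this sketch swaps in a few Dublin Core terms as a stand-in, and the base URI, the dictionary keys and the Winkel-ID value are illustrative assumptions.

```python
DCT = "http://purl.org/dc/terms/"  # Dublin Core terms, used here instead of Bibframe


def nt_literal(value: str) -> str:
    # Escape backslashes and double quotes for an N-Triples string literal.
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def row_to_ntriples(row: dict, base: str = "http://example.org/verzetskranten/") -> list[str]:
    # One subject URI per newspaper, built from its Winkel-ID (hypothetical URI scheme).
    subject = f"<{base}{row['winkel_id']}>"
    return [
        f"{subject} <{DCT}title> {nt_literal(row['title'])} .",
        f"{subject} <{DCT}spatial> {nt_literal(row['place'])} .",
        f"{subject} <{DCT}identifier> {nt_literal(row['ppn'])} .",
    ]


example_row = {
    "winkel_id": "199",  # illustrative value
    "title": "De Geus onder studenten",
    "place": "Den Haag",
    "ppn": "107123223",
}
example_triples = row_to_ntriples(example_row)
```

Each line is one subject-predicate-object statement; loading such lines into a triplestore is what makes the spreadsheet queryable as linked data.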

Build central database

Step 3: Link to external resources

• Step 5 from http://linda-project.eu/linked-data-primer-2

• DBpedia = machine-readable, structured version of Wikipedia

• DBpedia = hub for linking different data sets on the Web to each other (Linked Open Data cloud: http://lod-cloud.net/versions/2010-09-22/lod-cloud_colored.png)

• DBpedia allows you to ask sophisticated queries against Wikipedia, and to link the different data sets on the Web to Wikipedia data

• We use DBpedia to connect persons & places in our newspaper database to information in other databases

Example, Huib Drion:

http://nl.dbpedia.org/page/Huib_Drion

https://nl.wikipedia.org/wiki/Huib_Drion

http://www.dbnl.org/auteurs/auteur.php?id=drio001

http://www.biografischportaal.nl/persoon/41181342

Build central database

Added value of Linked Open Data & DBpedia: software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in:

• KB-catalogue

• Delpher

• De Winkel
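As an illustration of "software can automatically query", the sketch below builds a SPARQL request URL that asks DBpedia for everything it states about one resource. It only constructs the URL and does not send it. The endpoint address, the resource URI pattern and the `format` parameter are assumptions about the Dutch DBpedia service, not something the slides specify.

```python
from urllib.parse import urlencode


def dbpedia_query_url(resource_name: str,
                      endpoint: str = "http://nl.dbpedia.org/sparql") -> str:
    # Ask for all predicate/object pairs of one resource (assumed URI pattern).
    query = (
        "SELECT ?p ?o WHERE { "
        f"<http://nl.dbpedia.org/resource/{resource_name}> ?p ?o "
        "} LIMIT 50"
    )
    params = {"query": query, "format": "application/sparql-results+json"}
    return endpoint + "?" + urlencode(params)


drion_url = dbpedia_query_url("Huib_Drion")
```

Fetching such a URL for every person and place in the newspaper database is one way the links to external sources could be followed without manual lookups.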

Summary: data about 1300 newspapers

• Available online

• Open license (CC-BY-SA)

• Contextual information

• Relations: Titles - Places, Titles - Persons, Titles - Other titles

• Structured data (RDF triples)

• Open standard (RDF)

• Links between titles and Delpher & KB-cat (via PPNs)

• Links between titles, places & persons and external sources (via DBpedia)

http://5stardata.info/en/

We need a template

Using an article template we can generate 1300 uniform and interlinked Wikipedia articles

from the LOD-database

LOD-database + article template = Wikipedia article

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

From the database:

• Titles

• Metadata

• Related persons

• Related newspapers

• Winkel-ID

• Link to full-texts in Delpher

• Link to KB-catalogue record

Predefined fixed strings:

• Fixed categories on WP and Commons

Grey = from database or predefined fixed strings: uniformity between articles guaranteed

All that WP-writers need to add manually
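The "database + template = article" idea can be sketched with Python's standard `string.Template`. The template wording below mirrors the opening sentence of the De Geus article quoted in this deck and the category from the category link shown earlier; the field names and the template itself are hypothetical and are not the project's real article template.

```python
from string import Template

# Fixed strings live in the template; $-fields are filled from the database.
ARTICLE_TEMPLATE = Template(
    "'''$title''' was een verzetsblad uit de Tweede Wereldoorlog "
    "dat vanaf $start tot en met $end in $place werd uitgegeven.\n\n"
    "[[Categorie:Illegale pers in de Tweede Wereldoorlog]]\n"
)

record = {
    "title": "De Geus onder studenten",
    "start": "4 oktober 1940",
    "end": "13 juli 1944",
    "place": "Den Haag",
}

# substitute() raises KeyError if a field is missing, so gaps in the data surface early.
wikitext = ARTICLE_TEMPLATE.substitute(record)
```

Since every article is rendered from the same template, the fixed parts stay identical across all titles and only the database fields vary.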

Problem with Delpher (and KB-catalogue)

Véry little contextual information about the newspaper(title)s

https://thejungleisneutral.files.wordpress.com/2013/11/lost.jpg

The KB can re-use (embed) the Wikipedia content in its own websites to tackle this problem

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_%28verzetsblad%29

Delpher - search results

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

Embedded contextual snippet from Wikipedia (translated from Dutch):

De geus (onder studenten) was a resistance newspaper from the Second World War that, from 4 October 1940 up to and including 13 July 1944 … Read more on Wikipedia

Delpher - object presentation

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

Embedded contextual snippet from Wikipedia (translated from Dutch):

About De Geus onder studenten

De geus (onder studenten) was a resistance newspaper from the Second World War that was published in Den Haag from 4 October 1940 up to and including 13 July 1944. The paper appeared monthly in 1940, 1941 and 1943, and otherwise irregularly, in a print run of between 250 and 8000 copies. It was initially stenciled and, from November 1942, printed; its contents consisted mainly of opinion articles.

The paper was published by Jan Drion and Huib Drion, two Leiden…

Read more on Wikipedia
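A short sketch of how such an embedded snippet could be cut from the article's opening text: truncate at a word boundary and append the "read more" link text. The function name, the length limit and the truncation rule are assumptions for illustration; the slides do not say how Delpher would produce the snippet.

```python
def make_snippet(text: str, max_chars: int = 120,
                 more: str = "Lees verder op Wikipedia") -> str:
    # Collapse whitespace so the length limit applies to visible characters.
    text = " ".join(text.split())
    if len(text) <= max_chars:
        return f"{text} {more}"
    # Cut at the last space before the limit to avoid splitting a word.
    cut = text[:max_chars].rsplit(" ", 1)[0]
    return f"{cut} … {more}"


intro = (
    "De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog "
    "dat vanaf 4 oktober 1940 tot en met 13 juli 1944 in Den Haag werd uitgegeven"
)
short_snippet = make_snippet(intro, max_chars=60)
```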

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web: 20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together

• https://en.wikipedia.org/wiki/Linked_data: Wikipedia article related to the above video

• http://5stardata.info/en/: The 5 stars of Linked Open Data (Tim Berners-Lee)

• http://linda-project.eu/linked-data-primer-2: Short primer about creating LOD in practice, starting from an Excel sheet

• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31: The figure near the bottom of the first page is a good illustration of the concept of (linked) triples

• https://en.wikipedia.org/wiki/DBpedia

• https://en.wikipedia.org/wiki/Semantic_network


Link to KB-catalogue record from database

Fixed categories on WP and Commons

Predefined fixed strings

Grey = bull From database bull Predefined fixed strings

Uniformity between articles guaranteed

All that WP-writers need to add manually

Problem with Delpher (and KB-catalogue)

Veacutery little contextual information about the newspaper(title)s

httpsthejungleisneutralfileswordpresscom201311lostjpg

Problem with Delpher (and KB-catalogue)

Veacutery little contextual information about the newspaper(title)s

httpsthejungleisneutralfileswordpresscom201311lostjpg

The KB can re-use (embed) the Wikipedia content in its own

websites to tackle this problem

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_28verzetsblad29

Delpher - search results

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog

dat vanaf 4 oktober 1940 tot en met 13 juli 1944 hellip Lees verder op Wikipedia

Embedded contextual snippet from Wikipedia

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

Delpher - search results

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog

dat vanaf 4 oktober 1940 tot en met 13 juli 1944 hellip Lees verder op Wikipedia

Delpher - object presentation

httpresolverkbnlresolveurn=ddd010424553mpeg21p001

Over De Geus onder studenten

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog dat vanaf 4 oktober 1940 tot en met 13 juli 1944 in Den Haag werd uitgegeven Het blad verscheen in 1940 1941 en 1943 maandelijks verder onregelmatig in een oplage tussen de 250 en 8000 exemplaren Het werd aanvankelijk gestencild en vanaf november 1942 gedrukt en de inhoud bestond voornamelijk uit opinie-artikelen

Het blad werd uitgegeven door Jan Drion en Huib Drion twee Leidsehellip

Lees verder op Wikipedia

Embedded contextual snippet from Wikipedia

httpresolverkbnlresolveurn=ddd010424553mpeg21p001

Delpher - object presentation

Suggested reading

bull httpwwwtedcomtalkstim_berners_lee_on_the_next_web 20 years ago Tim Berners-Lee invented the World Wide Web For his next project hes building a web for open linked data that could do for numbers what the Web did for words pictures video unlock our data and reframe the way we use it together

bull httpsenwikipediaorgwikiLinked_data Wikipedia article related to the above video

bull http5stardatainfoen The 5 stars of Linked Open Data (Tim Berners-Lee)

bull httplinda-projecteulinked-data-primer-2 Short primer about creating LOD in practice starting from an Excel sheet

bull httpwwwprogrammablewebcomnewshow-linked-data-solved-digital-age-marketing-problemanalysis20150831 The figure near the bottom of the first page is a good illustration of the concept of (linked) triples

bull httpsenwikipediaorgwikiDBpedia

bull httpsenwikipediaorgwikiSemantic_network

htt

p

ww

wg

etty

imag

esn

ld

etai

ln

ieu

wsf

oto

st

hre

e-w

om

en-o

f-th

e-at

s-lig

ht-

up

-to

geth

er-a

ts-r

egu

lati

on

s-n

ieu

wsf

oto

s3

09

42

65

Questions

olafjanssenkbnl - ookgezellig

tinyurlcomverzetskranten

htt

p

ww

wg

etty

imag

esn

ld

etai

ln

ieu

wsf

oto

st

hre

e-w

om

en-o

f-th

e-at

s-lig

ht-

up

-to

geth

er-a

ts-r

egu

lati

on

s-n

ieu

wsf

oto

s3

09

42

65

Page 32: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

De Ondergrondse Pers 1940-1945

By Lydia E. Winkel & H. de Vries

1989, ISBN 9021837463, Veen Uitgevers

This book ("De Winkel") contains contextual articles about (nearly) all ±1300 illegal WW2 newspapers

"De Winkel" – nr. 199

Every article has a unique ID ("Winkel-ID")

Every article has metadata

• Title, subtitle, motto
• Place of publication
• Period of publication
• Publication frequency (daily, weekly, one-off, irregular)
• Multiplication (stenciled, printed, typed, handwritten)
• Contents (news, opinions, poems, illustrations, humor)
• Number of prints (min – max)

Relation 1/3

Newspaper ↔ Placename

semantics, linked data

Contextual information

Nice material for a Wikipedia article

Very often persons related to this newspaper are mentioned

Relation 2/3

Newspaper ↔ Persons

semantics, linked data

Many articles also contain references to other newspapers

• 106 = Cereales Vadeness (students' resistance newspaper, Wageningen)
• 360 = Leidsche Brief (students' resistance newspaper, Leiden)
• 748 = Sol Justitiae (students' resistance newspaper, Utrecht)

Relation 3/3

Newspaper ↔ Other newspapers

semantics, linked data
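The three relation types above map naturally onto subject-predicate-object triples. A minimal sketch, assuming hypothetical predicate names (`publishedIn`, `relatedPerson`, `relatedTitle`) and a made-up `winkel:` prefix for Winkel-IDs; the place, persons and related titles are the ones this deck mentions for nr. 199:

```python
from typing import NamedTuple, Optional


class Triple(NamedTuple):
    subject: str
    predicate: str
    obj: str


# Predicate names and the "winkel:" prefix are hypothetical, for illustration only.
TRIPLES = [
    Triple("winkel:199", "publishedIn", "Den Haag"),      # relation 1/3
    Triple("winkel:199", "relatedPerson", "Jan Drion"),   # relation 2/3
    Triple("winkel:199", "relatedPerson", "Huib Drion"),
    Triple("winkel:199", "relatedTitle", "winkel:106"),   # relation 3/3
    Triple("winkel:199", "relatedTitle", "winkel:360"),
    Triple("winkel:199", "relatedTitle", "winkel:748"),
]


def match(triples, subject: Optional[str] = None,
          predicate: Optional[str] = None, obj: Optional[str] = None):
    """Return all triples matching the pattern; None acts as a wildcard."""
    return [
        t for t in triples
        if (subject is None or t.subject == subject)
        and (predicate is None or t.predicate == predicate)
        and (obj is None or t.obj == obj)
    ]


persons = [t.obj for t in match(TRIPLES, "winkel:199", "relatedPerson")]
print(persons)
```

Once the relations are stored this way, "which persons belong to newspaper 199?" becomes a pattern match instead of a reading exercise.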

Too bad it's a paper book: hard to find, multiply, distribute and build upon

We need it digital

1. Clear copyright with copyright holder (NIOD): open CC-BY-SA license
2. Scan & OCR
3. Convert into PDF
4. Put online: NIOD site & Wikimedia Commons

De Winkel as PDF on NIOD website (CC-BY-SA) http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons https://commons.wikimedia.org/wiki/File:PDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989.pdf

Saved us €13330 http://www.brill.com/dutch-underground-press-1940-1945

Wikipedia article about De Winkel http://nl.wikipedia.org/wiki/De_ondergrondse_pers_1940-1945

Wikipedia article about the author http://nl.wikipedia.org/wiki/Lydia_Winkel

Winkel: the plusses

Available online (PDF, flat file)

Open license (CC-BY-SA)

Contextual information

Relations

• Titles ↔ Places
• Titles ↔ Persons
• Titles ↔ Other titles

Winkel: the minuses

Unstructured data (PDF, flat file)

Not very machine-readable (unlike CSV, XML, JSON, RDF)

PDF is no (real) open standard (unlike CSV, XML, JSON, RDF)

No links between titles ↔ Delpher & KB-cat

No links between titles, places & persons ↔ external sources (like Wikipedia)

To summarize: a lot of information is available about these WW2 underground newspapers

1. Metadata (KB-cat)
2. Content (full-text, Delpher)
3. Context (Winkel PDF)
4. Relations: titles ↔ places, persons, other titles (Winkel PDF)
5. External resources about titles, places and persons

but the data sources are unconnected (and for 3+4: unstructured & not machine-readable), making discovery, understanding & research of these newspapers (and related places & persons) more difficult than necessary

Good news: we can solve all these issues

Wikiproject Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

tinyurl.com/verzetskranten (in Dutch)

https://nl.wikipedia.org/wiki/Wikipedia:Wikiproject https://en.wikipedia.org/wiki/Wikipedia:Wikiproject

From 14 → 1300 titles

https://nl.wikipedia.org/wiki/Categorie:Illegale_pers_in_de_Tweede_Wereldoorlog

We need a database

Build central database

https://www.youtube.com/watch?v=GVDGuCjog_0

Step 1: Create Excel sheet with

• Metadata about newspaper (from Winkel PDF)
• Unique Wikipedia article title
• Contextual info incl. related persons & titles (from Winkel PDF)
• PPN: unique ID linking newspaper to KB-catalogue & Delpher

Winkel-ID

Place of publication

Other metadata

Title

Unique Wikipedia article title = <Newspaper title> (verzetsblad, <Place of publication>)

• Contextual info
• Related persons
• Related newspaper titles
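The article-title pattern above is easy to generate from the sheet. A minimal sketch; the function names are made up here, the comma before the place is an assumption (punctuation is lost in this transcript), and the second call is illustrative only, not a claim about an existing article:

```python
def article_title(newspaper_title: str, place: str = "") -> str:
    """Build a Wikipedia article title following the pattern on the slide.

    Assumption: the place is separated by a comma and omitted when empty.
    """
    qualifier = f"verzetsblad, {place}" if place else "verzetsblad"
    return f"{newspaper_title} ({qualifier})"


def title_to_url(title: str) -> str:
    """Turn an article title into a Dutch Wikipedia URL (spaces become underscores)."""
    return "https://nl.wikipedia.org/wiki/" + title.replace(" ", "_")


print(title_to_url(article_title("De Geus onder studenten")))
print(article_title("Leidsche Brief", "Leiden"))  # illustrative only
```

Generating the title from the data, rather than typing it by hand, is what keeps 1300 article names uniform and unique.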

PPN (107123223)

KB-catalogue: http://opc4.kb.nl/DB=1/PPN?PPN=107123223

Delpher: http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223
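Because the PPN is the shared key, both links can be derived from it. A sketch under the assumption that the two URL patterns are as reconstructed from the slide (the punctuation in the URLs is restored by hand, so treat the exact paths as unverified); the function names are made up:

```python
from urllib.parse import quote


def catalogue_url(ppn: str) -> str:
    """KB-catalogue record for a PPN (URL pattern assumed from the slide)."""
    return f"http://opc4.kb.nl/DB=1/PPN?PPN={ppn}"


def delpher_url(ppn: str) -> str:
    """Delpher full-text results for a PPN (URL pattern assumed from the slide)."""
    cql = quote(f"ppn={ppn}", safe="")  # percent-encodes '=' as %3D
    return f"http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]={cql}"


print(catalogue_url("107123223"))
print(delpher_url("107123223"))
```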

Build central database

Step 2: Convert Excel into RDF triplestore (= special kind of online database anybody can access)

• Steps 1-4 from http://linda-project.eu/linked-data-primer-2
• Step 4: Vocabulary used = Bibframe (http://bibframe.org/vocab)
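What "convert Excel into triples" means in practice can be sketched with the standard library alone. This is not the tooling used in the project: the sheet is stood in for by a small CSV, `BASE` and `VOCAB` are placeholder namespaces (the deck uses Bibframe terms, which are not reproduced here), and the column names are hypothetical. The output is N-Triples, one statement per line:

```python
import csv
import io

BASE = "http://example.org/verzetskranten/"  # hypothetical namespace for our own records
VOCAB = "http://example.org/vocab/"          # placeholder, stands in for the real vocabulary

# Stand-in for the Excel sheet, exported as CSV (column names are assumptions).
SHEET = """winkel_id,title,place
199,De Geus onder studenten,Den Haag
360,Leidsche Brief,Leiden
"""


def literal(value: str) -> str:
    """Serialize a plain string as an N-Triples literal."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def row_to_ntriples(row: dict) -> list:
    """Turn one sheet row into two N-Triples statements."""
    subject = f"<{BASE}{row['winkel_id']}>"
    return [
        f"{subject} <{VOCAB}title> {literal(row['title'])} .",
        f"{subject} <{VOCAB}placeOfPublication> {literal(row['place'])} .",
    ]


lines = []
for row in csv.DictReader(io.StringIO(SHEET)):
    lines.extend(row_to_ntriples(row))
print("\n".join(lines))
```

Each cell of the sheet becomes one statement about the newspaper identified by its Winkel-ID, which is what a triplestore can then load and query.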

Build central database

Step 3: Link to external resources

• Step 5 from http://linda-project.eu/linked-data-primer-2
• DBpedia = machine-readable, structured version of Wikipedia
• DBpedia = hub for linking different data sets on the Web to each other (Linked Open Data cloud, http://lod-cloud.net/versions/2010-09-22/lod-cloud_colored.png)
• DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data
• We use DBpedia to connect persons & places in our newspaper database to information in other databases

http://nl.dbpedia.org/page/Huib_Drion

https://nl.wikipedia.org/wiki/Huib_Drion

http://www.dbnl.org/auteurs/auteur.php?id=drio001

http://www.biografischportaal.nl/persoon/41181342
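How software might ask DBpedia for the other identifiers of a person can be sketched as a SPARQL request. Assumptions: the endpoint lives at `http://nl.dbpedia.org/sparql`, it accepts a `format` parameter next to the standard `query` parameter, and the links to other databases are expressed as `owl:sameAs`. The sketch only builds the request URL and does not send it:

```python
from urllib.parse import urlencode

ENDPOINT = "http://nl.dbpedia.org/sparql"  # assumed endpoint location


def sameas_query(resource_uri: str) -> str:
    """SPARQL query listing what a resource is declared identical to."""
    return (
        "PREFIX owl: <http://www.w3.org/2002/07/owl#>\n"
        f"SELECT ?other WHERE {{ <{resource_uri}> owl:sameAs ?other }}"
    )


def query_url(resource_uri: str) -> str:
    """Build a GET URL for the query; 'format' is an endpoint-specific assumption."""
    params = {
        "query": sameas_query(resource_uri),
        "format": "application/sparql-results+json",
    }
    return ENDPOINT + "?" + urlencode(params)


url = query_url("http://nl.dbpedia.org/resource/Huib_Drion")
print(url)
```

Fetching that URL (for example with `urllib.request.urlopen`) would return the linked identifiers, if the endpoint behaves as assumed.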

Build central database

Added value of Linked Open Data & DBpedia: software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in

• KB-catalogue
• Delpher
• De Winkel

Summary: data about 1300 newspapers

Available online

Structured data (RDF triples)

Open license (CC-BY-SA)

Open standard (RDF)

Contextual information

Relations

• Titles ↔ Places
• Titles ↔ Persons
• Titles ↔ Other titles

Links between titles ↔ Delpher & KB-cat (via PPNs)

Links between titles, places & persons ↔ external sources (via DBpedia)

http://5stardata.info/en

Wikiproject Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

We need a template

Using an article template we can generate 1300 uniform and interlinked Wikipedia articles from the LOD-database

LOD-database + article template = Wikipedia article

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

Titles from database

Metadata from database

Related persons from database

Related newspapers from database

Winkel-ID from database

Link to full-texts in Delpher from database

Link to KB-catalogue record from database

Fixed categories on WP and Commons

Predefined fixed strings

Grey =

• From database
• Predefined fixed strings

Uniformity between articles guaranteed

All that WP-writers need to add manually
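"Database + template = article" can be sketched with Python's `string.Template`. The wikitext skeleton below is hypothetical and much shorter than a real article; only the category name and the sample values come from this deck:

```python
from string import Template

# Hypothetical wikitext skeleton; the project's real template is not reproduced here.
ARTICLE = Template(
    "'''$title''' was een verzetsblad uit de Tweede Wereldoorlog "
    "dat in $place werd uitgegeven.\n\n"
    "== Gerelateerde personen ==\n$persons\n\n"
    "[[Categorie:Illegale pers in de Tweede Wereldoorlog]]\n"
)


def render(record: dict) -> str:
    """Fill the fixed skeleton with the fields of one database record."""
    persons = "\n".join(f"* [[{name}]]" for name in record["persons"])
    return ARTICLE.substitute(
        title=record["title"], place=record["place"], persons=persons
    )


record = {
    "title": "De Geus onder studenten",
    "place": "Den Haag",
    "persons": ["Jan Drion", "Huib Drion"],
}
wikitext = render(record)
print(wikitext)
```

The fixed strings live in the template and the variable parts come from the record, so every generated article has the same structure.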

Problem with Delpher (and KB-catalogue)

Very little contextual information about the newspaper (title)s

The KB can re-use (embed) the Wikipedia content in its own websites to tackle this problem

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

Delpher - search results

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

Embedded contextual snippet from Wikipedia

De Geus (onder studenten) was a resistance newspaper from the Second World War that, from 4 October 1940 up to and including 13 July 1944, … Read more on Wikipedia

Delpher - object presentation

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

Embedded contextual snippet from Wikipedia

About De Geus onder studenten

De Geus (onder studenten) was a resistance newspaper from the Second World War that was published in The Hague from 4 October 1940 up to and including 13 July 1944. The paper appeared monthly in 1940, 1941 and 1943 and irregularly otherwise, with a print run of between 250 and 8000 copies. It was initially stenciled and, from November 1942, printed; its contents consisted mainly of opinion articles.

The paper was published by Jan Drion and Huib Drion, two Leiden…

Read more on Wikipedia
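One way such a snippet could be produced is to request the article's plain-text introduction from the MediaWiki API and shorten it. This is a sketch, not a description of how Delpher does it: it assumes the TextExtracts parameters (`prop=extracts`, `exintro`, `explaintext`) are available on nl.wikipedia.org, only builds the request URL, and the function names are made up:

```python
from urllib.parse import urlencode

API = "https://nl.wikipedia.org/w/api.php"


def extract_url(title: str) -> str:
    """URL asking for the plain-text intro of one article (TextExtracts assumed)."""
    params = {
        "action": "query",
        "prop": "extracts",
        "exintro": 1,
        "explaintext": 1,
        "format": "json",
        "titles": title,
    }
    return API + "?" + urlencode(params)


def snippet(text: str, max_chars: int = 80) -> str:
    """Cut text at the last word boundary within max_chars and add an ellipsis."""
    if len(text) <= max_chars:
        return text
    cut = text[:max_chars].rsplit(" ", 1)[0]
    return cut + " …"


print(extract_url("De Geus onder studenten (verzetsblad)"))
print(snippet("De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog", 40))
```

The shortened text, followed by a "Lees verder op Wikipedia" link to the article, gives the kind of teaser shown in the search results.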

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web – 20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together.
• https://en.wikipedia.org/wiki/Linked_data – Wikipedia article related to the above video
• http://5stardata.info/en – The 5 stars of Linked Open Data (Tim Berners-Lee)
• http://linda-project.eu/linked-data-primer-2 – Short primer about creating LOD in practice, starting from an Excel sheet
• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31 – The figure near the bottom of the first page is a good illustration of the concept of (linked) triples
• https://en.wikipedia.org/wiki/DBpedia
• https://en.wikipedia.org/wiki/Semantic_network

Questions

olafjanssenkbnl - ookgezellig

tinyurl.com/verzetskranten

Page 34: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

Every article has a unique ID

(ldquoWinkel-IDrdquo)

Every article has metadata

bull Title subtitle motto bull Place of publication bull Period of publication bull Publication frequency (daily weekly one-off irregular)

bull Multiplication (stenciled printed typed handwritten)

bull Contents (news opinions poems illustrations humor)

bull Number of prints (min ndash max)

Relation 13

Newspaper Placename

semantics linked data

Relation 13

Newspaper Placename

semantics linked data

Contextual information

Nice material

for a Wikipedia article

Very often persons related to this newspaper are mentioned

Relation 23

Newspaper Persons

semantics linked data

Many articles also contain references to other newspapers

bull 106 = Cereales Vadeness (students resistance newspaper Wageningen) bull 360 = Leidsche Brief (students resistance newspaper Leiden) bull 748 = Sol Justitiae (students resistance newspaper Utrecht)

Relation 33

Newspaper Other newspapers

semantics linked data

htt

p

htt

ps

kn

ow

led

geu

top

iaf

iles

wo

rdp

ress

co

m2

01

40

1h

olla

nd

ho

use

libra

ryb

litz1

94

0j

pg

Too bad itrsquos a paper book hard to find multiply distribute and build upon

We need it digital

htt

p

htt

ps

kn

ow

led

geu

top

iaf

iles

wo

rdp

ress

co

m2

01

40

1h

olla

nd

ho

use

libra

ryb

litz1

94

0j

pg

We need it digital

1 Clear copyright with copyright holder (NIOD) Open CC-BY-SA license

2 Scan amp OCR

3 Convert into PDF

4 Put online NIOD site amp Wikimedia Commons

De Winkel as PDF on NIOD website (CC-BY-SA): http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons: https://commons.wikimedia.org/wiki/File:PDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989.pdf

Saved us €13330: http://www.brill.com/dutch-underground-press-1940-1945

Wikipedia article about De Winkel: http://nl.wikipedia.org/wiki/De_ondergrondse_pers_1940-1945

Wikipedia article about the author: http://nl.wikipedia.org/wiki/Lydia_Winkel

Winkel: the plusses

• Available online (PDF, flat file)
• Open license (CC-BY-SA)
• Contextual information
• Relations
  • Titles ↔ Places
  • Titles ↔ Persons
  • Titles ↔ Other titles

Image: http://www.archives.gov/research/military/ww2/photos/images/ww2-194.jpg

Image: http://2.bp.blogspot.com/_BWzuYwiS6-I/TMgeRsFd3mI/AAAAAAAAElw/3cvgbZSPWcs/s1600/doctor+macro+judy+scared.jpg

Winkel: the minusses

• Unstructured data (PDF, flat file)
• Not very machine-readable (unlike CSV, XML, JSON, RDF)
• PDF is no (real) open standard (unlike CSV, XML, JSON, RDF)
• No links between titles, Delpher & KB-cat
• No links between titles, places & persons and external sources (like Wikipedia)

To summarize: a lot of information is available about these WW2 underground newspapers

1. Metadata (KB-cat)
2. Content (full-text, Delpher)
3. Context (Winkel PDF)
4. Relations: titles ↔ places, persons, other titles (Winkel PDF)
5. External resources about titles, places and persons

but the data sources are unconnected (and for 3+4, unstructured & not machine-readable), making discovery, understanding & research of these newspapers (and related places & persons) more difficult than necessary

Good news: we can solve all these issues

Wikiproject(*) Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

tinyurl.com/verzetskranten (in Dutch)

(*) https://nl.wikipedia.org/wiki/Wikipedia:Wikiproject, https://en.wikipedia.org/wiki/Wikipedia:Wikiproject

From 14 to 1300 titles

https://nl.wikipedia.org/wiki/Categorie:Illegale_pers_in_de_Tweede_Wereldoorlog

We need a database

Build central database

https://www.youtube.com/watch?v=GVDGuCjog_0

Step 1: Create Excel sheet with

• Metadata about the newspaper (from Winkel PDF)
• Unique Wikipedia article title
• Contextual info, incl. related persons & titles (from Winkel PDF)
• PPN: unique ID linking the newspaper to KB-catalogue & Delpher

Spreadsheet columns:

• Winkel-ID
• Title
• Place of publication
• Other metadata
• Unique Wikipedia article title = <Newspaper title> (verzetsblad, <Place of publication>)
• Contextual info
• Related persons
• Related newspaper titles
• PPN (107123223)

PPN (107123223) in the KB-catalogue: http://opc4.kb.nl/DB=1/PPN?PPN=107123223

PPN (107123223) in Delpher: http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223
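The derived spreadsheet fields above could be computed rather than typed by hand. The following is a minimal sketch, assuming the title pattern separates "verzetsblad" and the place with a comma and that the two URL patterns are the ones shown on the slides; the function names are hypothetical and not part of the project.

```python
from urllib.parse import quote


def article_title(newspaper_title, place=None):
    """Build the unique Wikipedia article title for one newspaper.

    Assumption: the place of publication is appended after a comma,
    and left out when it is not needed to make the title unique.
    """
    if place:
        return f"{newspaper_title} (verzetsblad, {place})"
    return f"{newspaper_title} (verzetsblad)"


def catalogue_url(ppn):
    """Link to the KB-catalogue record for a PPN (pattern taken from the slides)."""
    return f"http://opc4.kb.nl/DB=1/PPN?PPN={ppn}"


def delpher_url(ppn):
    """Link to the Delpher full-text search for a PPN (pattern taken from the slides)."""
    # The query "ppn=<number>" is percent-encoded, so "=" becomes "%3D".
    cql = quote(f"ppn={ppn}", safe="")
    return f"http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]={cql}"


if __name__ == "__main__":
    print(article_title("Example title", "Leiden"))
    print(catalogue_url("107123223"))
    print(delpher_url("107123223"))
```

Because both links are derived from the PPN alone, the PPN is the only identifier the spreadsheet needs to store for the connection to KB-catalogue and Delpher.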

Build central database

Step 2: Convert Excel into RDF triplestore (= special kind of online database anybody can access)

• Steps 1-4 from http://linda-project.eu/linked-data-primer-2
• Step 4: Vocabulary used = Bibframe (http://bibframe.org/vocab)
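To make the idea of "converting a spreadsheet into triples" concrete, here is a minimal sketch that turns one spreadsheet row into N-Triples lines. It is an illustration only: the `example.org` namespaces, the column names and the Winkel-ID value are hypothetical placeholders, and a real conversion would map each column to a term from the Bibframe vocabulary instead.

```python
# Hypothetical namespaces, used only for illustration.
BASE = "http://example.org/verzetskranten/"
VOCAB = "http://example.org/vocab/"


def literal(value):
    """Format a value as an N-Triples string literal, escaping special characters."""
    escaped = (
        str(value)
        .replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
    )
    return f'"{escaped}"'


def row_to_triples(row):
    """Turn one spreadsheet row (a dict) into subject-predicate-object lines.

    The Winkel-ID identifies the newspaper (subject); every other
    non-empty column becomes one predicate/object pair.
    """
    subject = f"<{BASE}{row['winkel_id']}>"
    triples = []
    for column, value in row.items():
        if column == "winkel_id" or value in (None, ""):
            continue
        triples.append(f"{subject} <{VOCAB}{column}> {literal(value)} .")
    return triples


if __name__ == "__main__":
    # Title, place and PPN come from the slides; the Winkel-ID is made up.
    row = {
        "winkel_id": "999",
        "title": "De Geus onder studenten",
        "place": "Den Haag",
        "ppn": "107123223",
    }
    for line in row_to_triples(row):
        print(line)
```

Each output line is one statement about the newspaper, which is what allows a triplestore to answer questions across all 1300 rows at once.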

Build central database

Step 3: Link to external resources

• Step 5 from http://linda-project.eu/linked-data-primer-2
• DBpedia = machine-readable, structured version of Wikipedia
• DBpedia = hub for linking different data sets on the Web to each other (Linked Open Data cloud)
• "DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data"
• We use DBpedia to connect persons & places in our newspaper database to information in other databases

Linked Open Data cloud: http://lod-cloud.net/versions/2010-09-22/lod-cloud_colored.png

Example: one person, several connected sources

• http://nl.dbpedia.org/page/Huib_Drion
• https://nl.wikipedia.org/wiki/Huib_Drion
• http://www.dbnl.org/auteurs/auteur.php?id=drio001
• http://www.biografischportaal.nl/persoon/41181342

Build central database

Added value of Linked Open Data & DBpedia: software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in

• KB-catalogue
• Delpher
• De Winkel
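What "software can automatically query" might look like is sketched below: the code builds a SPARQL query asking DBpedia which other resources are declared the same as a given person, and wraps it in a request URL. Several things are assumptions here: the endpoint address, the `format` parameter, the `/resource/` form of the DBpedia URI, and the use of `owl:sameAs` as the linking property. The sketch only constructs the request and does not send it.

```python
from urllib.parse import urlencode

# Assumed SPARQL endpoint of the Dutch DBpedia; check before use.
ENDPOINT = "http://nl.dbpedia.org/sparql"


def same_as_query(resource_uri):
    """Build a SPARQL query listing resources linked to resource_uri via owl:sameAs."""
    return (
        "PREFIX owl: <http://www.w3.org/2002/07/owl#>\n"
        f"SELECT ?other WHERE {{ <{resource_uri}> owl:sameAs ?other }}"
    )


def request_url(resource_uri):
    """Build the GET URL that would submit the query to the endpoint."""
    params = {
        "query": same_as_query(resource_uri),
        # Assumption: the endpoint accepts a 'format' parameter for JSON results.
        "format": "application/sparql-results+json",
    }
    return f"{ENDPOINT}?{urlencode(params)}"


if __name__ == "__main__":
    person = "http://nl.dbpedia.org/resource/Huib_Drion"
    print(same_as_query(person))
    print(request_url(person))
```

Running the same query for every person and place in the newspaper database is what turns a single manual lookup into an automatic enrichment step.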

Summary: data about 1300 newspapers

• Available online
• Structured data (RDF triples)
• Open license (CC-BY-SA)
• Open standard (RDF)
• Contextual information
• Relations
  • Titles ↔ Places
  • Titles ↔ Persons
  • Titles ↔ Other titles
• Links between titles, Delpher & KB-cat (via PPNs)
• Links between titles, places & persons and external sources (via DBpedia)

http://5stardata.info/en/

Wikiproject Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

We need a template

Using an article template, we can generate 1300 uniform and interlinked Wikipedia articles from the LOD-database

Image: https://c1.staticflickr.com/9/8281/7699231918_11a7356c38_b.jpg

LOD-database + article template = Wikipedia article

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

Parts of the article and where they come from:

• Titles: from database
• Metadata: from database
• Related persons: from database
• Related newspapers: from database
• Winkel-ID: from database
• Link to full-texts in Delpher: from database
• Link to KB-catalogue record: from database
• Fixed categories on WP and Commons
• Predefined fixed strings

Grey = from database or predefined fixed strings. Uniformity between articles guaranteed.

All that WP-writers need to add manually
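The combination "database + article template = article" can be sketched with Python's standard `string.Template`. This is an illustration under assumptions, not the project's actual template: the wikitext wording and the field names are hypothetical, while the category name and the catalogue URL pattern are the ones that appear on the slides.

```python
from string import Template

# Fixed strings are identical in every article; $-placeholders are filled
# from one database row. The wording is a hypothetical example.
ARTICLE_TEMPLATE = Template(
    "'''$title''' was een verzetsblad uit de Tweede Wereldoorlog "
    "dat in $place werd uitgegeven.\n"
    "\n"
    "== Externe links ==\n"
    "* [http://opc4.kb.nl/DB=1/PPN?PPN=$ppn Catalogusrecord KB]\n"
    "\n"
    "[[Categorie:Illegale pers in de Tweede Wereldoorlog]]\n"
)


def render_article(row):
    """Fill the template from one row; raises KeyError if a field is missing."""
    return ARTICLE_TEMPLATE.substitute(row)


if __name__ == "__main__":
    row = {"title": "De Geus onder studenten", "place": "Den Haag", "ppn": "107123223"}
    print(render_article(row))
```

Using `substitute` rather than `safe_substitute` is a deliberate choice in this sketch: an incomplete row fails loudly instead of producing an article with a leftover placeholder, which supports the uniformity the slides aim for.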

Problem with Delpher (and KB-catalogue): very little contextual information about the newspaper (title)s

Image: https://thejungleisneutral.files.wordpress.com/2013/11/lost.jpg

The KB can re-use (embed) the Wikipedia content in its own websites to tackle this problem

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_%28verzetsblad%29

Delpher - search results, with embedded contextual snippet from Wikipedia

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

"De geus (onder studenten) was a resistance newspaper from the Second World War that, from 4 October 1940 to 13 July 1944, … Read more on Wikipedia"

Delpher - object presentation, with embedded contextual snippet from Wikipedia

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

About De Geus onder studenten

"De geus (onder studenten) was a resistance newspaper from the Second World War that was published in The Hague from 4 October 1940 to 13 July 1944. The paper appeared monthly in 1940, 1941 and 1943 and irregularly otherwise, in a print run of between 250 and 8000 copies. It was initially stenciled and, from November 1942, printed; its contents consisted mainly of opinion articles.

The paper was published by Jan Drion and Huib Drion, two Leiden… Read more on Wikipedia"
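The embedded snippet could be produced along the following lines. This is a minimal sketch, assuming the article's introduction has already been retrieved as plain text; how it is retrieved is left open, and the function name and the length limit are hypothetical. The Dutch link text "Lees verder op Wikipedia" ("Read more on Wikipedia") is the one shown on the slides.

```python
from html import escape


def make_snippet(intro, article_url, max_chars=120):
    """Shorten an article introduction and append a 'read more' link.

    The text is cut at the last space before max_chars so that no word
    is broken, and an ellipsis marks the cut.
    """
    if len(intro) <= max_chars:
        text = intro
    else:
        text = intro[:max_chars].rsplit(" ", 1)[0] + " …"
    link = f'<a href="{escape(article_url, quote=True)}">Lees verder op Wikipedia</a>'
    return f"{escape(text)} {link}"


if __name__ == "__main__":
    intro = (
        "De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog "
        "dat vanaf 4 oktober 1940 tot en met 13 juli 1944 in Den Haag werd uitgegeven."
    )
    url = "https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)"
    print(make_snippet(intro, url))
```

A short limit suits the search-results page, while a larger `max_chars` would give the longer version shown on the object presentation page.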

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web : "20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together."
• https://en.wikipedia.org/wiki/Linked_data : Wikipedia article related to the above video
• http://5stardata.info/en/ : The 5 stars of Linked Open Data (Tim Berners-Lee)
• http://linda-project.eu/linked-data-primer-2 : Short primer about creating LOD in practice, starting from an Excel sheet
• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31 : The figure near the bottom of the first page is a good illustration of the concept of (linked) triples
• https://en.wikipedia.org/wiki/DBpedia
• https://en.wikipedia.org/wiki/Semantic_network

Questions?

olaf.janssen@kb.nl - @ookgezellig

tinyurl.com/verzetskranten

Image: http://www.gettyimages.nl/detail/nieuwsfotos/three-women-of-the-ats-light-up-together-ats-regulations-nieuwsfotos/3094265

Page 37: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

Relation 1/3: Newspaper and placename (semantics, linked data)

Contextual information: nice material for a Wikipedia article

Very often, persons related to this newspaper are mentioned

Relation 2/3: Newspaper and persons (semantics, linked data)

Many articles also contain references to other newspapers

• 106 = Cereales Vadeness (students' resistance newspaper, Wageningen)
• 360 = Leidsche Brief (students' resistance newspaper, Leiden)
• 748 = Sol Justitiae (students' resistance newspaper, Utrecht)

Relation 3/3: Newspaper and other newspapers (semantics, linked data)


Too bad it's a paper book: hard to find, multiply, distribute and build upon

We need it digital


1. Clear copyright with copyright holder (NIOD): open CC-BY-SA license
2. Scan & OCR
3. Convert into PDF
4. Put online: NIOD site & Wikimedia Commons

De Winkel as PDF on NIOD website (CC-BY-SA): http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons: https://commons.wikimedia.org/wiki/File:PDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989.pdf

Saved us €13330: http://www.brill.com/dutch-underground-press-1940-1945

Wikipedia article about De Winkel: http://nl.wikipedia.org/wiki/De_ondergrondse_pers_1940-1945

Wikipedia article about the author: http://nl.wikipedia.org/wiki/Lydia_Winkel

Winkel: the plusses

• Available online (PDF flat file)
• Open license (CC-BY-SA)
• Contextual information
• Relations
  • Titles and places
  • Titles and persons
  • Titles and other titles


Winkel: the minuses

• Unstructured data (PDF flat file)
• Not very machine-readable (unlike CSV, XML, JSON, RDF)
• PDF is not a (real) open standard (unlike CSV, XML, JSON, RDF)
• No links between titles and Delpher & KB-cat
• No links between titles, places & persons and external sources (like Wikipedia)

To summarize: a lot of information is available about these WW2 underground newspapers

1. Metadata (KB-cat)
2. Content (full-text, Delpher)
3. Context (Winkel PDF)
4. Relations: titles, places, persons, other titles (Winkel PDF)
5. External resources about titles, places and persons

but the data sources are unconnected (and for 3+4, unstructured & not machine-readable), making discovery, understanding & research of these newspapers (and related places & persons) more difficult than necessary

Good news: we can solve all these issues

Wikiproject(*) Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

tinyurl.com/verzetskranten (in Dutch)

(*) https://nl.wikipedia.org/wiki/Wikipedia:Wikiproject, https://en.wikipedia.org/wiki/Wikipedia:Wikiproject

From 14 to 1300 titles (https://nl.wikipedia.org/wiki/Categorie:Illegale_pers_in_de_Tweede_Wereldoorlog)


We need a database

Build central database

https://www.youtube.com/watch?v=GVDGuCjog_0

Step 1: Create Excel sheet with

• Metadata about newspaper (from Winkel PDF)
• Unique Wikipedia article title
• Contextual info incl. related persons & titles (from Winkel PDF)
• PPN: unique ID linking newspaper to KB-catalogue & Delpher

Winkel-ID

Place of publication

Other metadata

Title

Unique Wikipedia article title = <Newspaper title> (verzetsblad <Place of publication>)

• Contextual info
• Related persons
• Related newspaper titles
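The title rule on this slide can be sketched in a few lines of Python. This is only an illustration: the function names are made up, and both the comma before the place name and the idea that the place can be left out (as in the example article "De Geus onder studenten (verzetsblad)") are assumptions, since the punctuation on the slide did not survive.

```python
from typing import Optional


def article_title(newspaper_title: str, place: Optional[str] = None) -> str:
    """Build a disambiguated Dutch Wikipedia article title for a newspaper."""
    title = newspaper_title.strip()
    if place:
        # Assumption: the place of publication follows "verzetsblad" after a comma.
        return f"{title} (verzetsblad, {place.strip()})"
    # Assumption: without a place, only the "(verzetsblad)" suffix is used.
    return f"{title} (verzetsblad)"


def wikipedia_url(title: str) -> str:
    """Turn an article title into a nl.wikipedia.org URL (spaces become underscores)."""
    return "https://nl.wikipedia.org/wiki/" + title.replace(" ", "_")


if __name__ == "__main__":
    print(wikipedia_url(article_title("De Geus onder studenten")))
```

Because the title is derived from spreadsheet columns, every article name follows the same pattern and can be predicted before the article exists.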

PPN (107123223) in the KB-catalogue: http://opc4.kb.nl/DB=1/PPN?PPN=107123223

PPN (107123223) in Delpher: http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223
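Since the PPN is the only variable part of both links, they can be generated per newspaper. The sketch below assumes the two URL patterns are as reconstructed from the slides (they may no longer resolve today); the function names are hypothetical.

```python
from urllib.parse import quote


def kb_catalogue_url(ppn: str) -> str:
    """Link to the KB-catalogue record for a PPN (URL pattern taken from the slides)."""
    return f"http://opc4.kb.nl/DB=1/PPN?PPN={ppn.strip()}"


def delpher_title_url(ppn: str) -> str:
    """Link to the Delpher full-text results for a PPN (URL pattern taken from the slides)."""
    # The CQL expression "ppn=<PPN>" is percent-encoded, so "=" becomes "%3D".
    cql = quote(f"ppn={ppn.strip()}", safe="")
    return f"http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]={cql}"


if __name__ == "__main__":
    print(kb_catalogue_url("107123223"))
    print(delpher_title_url("107123223"))
```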

Build central database

Step 2: Convert Excel into RDF triplestore (= special kind of online database anybody can access)

• Steps 1-4 from http://linda-project.eu/linked-data-primer-2
• Step 4: Vocabulary used = Bibframe (http://bibframe.org/vocab)
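To make the idea of "spreadsheet row to triples" concrete, here is a minimal sketch that writes one row as N-Triples using only the Python standard library. It is not the project's actual conversion: the `example.org` namespace, the column names and the Winkel-ID `999` are hypothetical, and Dublin Core terms stand in for the Bibframe vocabulary named on the slide.

```python
from typing import Dict, List

BASE = "http://example.org/verzetskranten/"  # hypothetical namespace, not the project's
DCT = "http://purl.org/dc/terms/"            # Dublin Core terms, used instead of Bibframe


def literal(value: str) -> str:
    """Serialise a Python string as an N-Triples plain literal."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")
    return f'"{escaped}"'


def row_to_triples(row: Dict) -> List[str]:
    """Turn one spreadsheet row (as a dict) into a list of N-Triples lines."""
    subject = f"<{BASE}newspaper/{row['winkel_id']}>"
    triples = [
        f"{subject} <{DCT}title> {literal(row['title'])} .",
        f"{subject} <{DCT}identifier> {literal(row['ppn'])} .",
        f"{subject} <{BASE}placeOfPublication> {literal(row['place'])} .",
    ]
    # One triple per related newspaper, pointing at that newspaper's own URI.
    for other_id in row.get("related_winkel_ids", []):
        triples.append(f"{subject} <{DCT}relation> <{BASE}newspaper/{other_id}> .")
    return triples


if __name__ == "__main__":
    row = {
        "winkel_id": "999",  # hypothetical ID, for illustration only
        "title": "De Geus onder studenten",
        "ppn": "107123223",
        "place": "Den Haag",
        "related_winkel_ids": ["106", "360", "748"],
    }
    print("\n".join(row_to_triples(row)))
```

Each output line is one subject-predicate-object statement, which is the form a triplestore loads.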

Build central database

Step 3: Link to external resources

• Step 5 from http://linda-project.eu/linked-data-primer-2
• DBpedia = machine-readable, structured version of Wikipedia. DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data
• DBpedia = hub for linking different data sets on the Web to each other (Linked Open Data cloud: http://lod-cloud.net/versions/2010-09-22/lod-cloud_colored.png)
• We use DBpedia to connect persons & places in our newspaper database to information in other databases

http://nl.dbpedia.org/page/Huib_Drion

https://nl.wikipedia.org/wiki/Huib_Drion

http://www.dbnl.org/auteurs/auteur.php?id=drio001

http://www.biografischportaal.nl/persoon/41181342
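As a small illustration of this linking step: a DBpedia identifier can be derived from a Wikipedia article URL, and the link itself is one more triple. The sketch assumes the usual DBpedia convention that `/resource/` URIs identify a thing while `/page/` URLs (as on the slide) are the human-readable view; the local person URI in the usage example is hypothetical.

```python
from urllib.parse import urlsplit

OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


def dbpedia_resource(wikipedia_url: str) -> str:
    """Map a Wikipedia article URL to the matching DBpedia resource URI."""
    parts = urlsplit(wikipedia_url)
    host = parts.netloc  # e.g. "nl.wikipedia.org"
    if not host.endswith(".wikipedia.org") or not parts.path.startswith("/wiki/"):
        raise ValueError(f"not a Wikipedia article URL: {wikipedia_url}")
    lang = host.split(".")[0]
    article = parts.path[len("/wiki/"):]
    # English DBpedia lives on dbpedia.org, other languages on <lang>.dbpedia.org.
    dbpedia_host = "dbpedia.org" if lang == "en" else f"{lang}.dbpedia.org"
    return f"http://{dbpedia_host}/resource/{article}"


def same_as_triple(local_uri: str, wikipedia_url: str) -> str:
    """State, as one N-Triples line, that a local entity equals a DBpedia resource."""
    return f"<{local_uri}> <{OWL_SAME_AS}> <{dbpedia_resource(wikipedia_url)}> ."


if __name__ == "__main__":
    # The local URI below is a hypothetical identifier in the newspaper database.
    print(same_as_triple("http://example.org/verzetskranten/person/huib-drion",
                         "https://nl.wikipedia.org/wiki/Huib_Drion"))
```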

Build central database

Added value of Linked Open Data & DBpedia: software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in

• KB-catalogue
• Delpher
• De Winkel
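What "automatically query" could look like in practice: a program builds a SPARQL query for a person or place and sends it to a DBpedia endpoint. The sketch below only constructs the query and the request URL and makes no network call; the endpoint address is an assumption and may differ or be unavailable.

```python
from urllib.parse import urlencode

SPARQL_ENDPOINT = "http://nl.dbpedia.org/sparql"  # assumed endpoint address


def properties_query(resource_uri: str, limit: int = 50) -> str:
    """SPARQL query listing every property and value known for one resource."""
    return (
        "SELECT ?property ?value WHERE { "
        f"<{resource_uri}> ?property ?value . "
        "} "
        f"LIMIT {limit}"
    )


def query_url(resource_uri: str, limit: int = 50) -> str:
    """GET URL that carries the query in the standard SPARQL 'query' parameter."""
    return SPARQL_ENDPOINT + "?" + urlencode({"query": properties_query(resource_uri, limit)})


if __name__ == "__main__":
    print(properties_query("http://nl.dbpedia.org/resource/Huib_Drion"))
    print(query_url("http://nl.dbpedia.org/resource/Huib_Drion"))
```

Fetching that URL (for example with `urllib.request.urlopen`) would return whatever DBpedia knows about the person, which is the information missing from the KB-catalogue, Delpher and De Winkel.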


Summary: data about 1300 newspapers

• Available online
• Structured data (RDF triples)
• Open license (CC-BY-SA)
• Open standard (RDF)
• Contextual information
• Links between titles and Delpher & KB-cat (via PPNs)
• Relations
  • Titles and places
  • Titles and persons
  • Titles and other titles (PPNs)
• Links between titles, places & persons and external sources (via DBpedia)

http://5stardata.info/en

Wikiproject Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

We need a template

Using an article template we can generate 1300 uniform and interlinked Wikipedia articles from the LOD-database


LOD-database + article template = Wikipedia article

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

• Titles from database
• Metadata from database
• Related persons from database
• Related newspapers from database
• Winkel-ID from database
• Link to full-texts in Delpher from database
• Link to KB-catalogue record from database
• Fixed categories on WP and Commons
• Predefined fixed strings

Grey = from database or predefined fixed strings: uniformity between articles guaranteed

All that WP-writers need to add manually
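The "database + template = article" idea can be sketched with Python's `string.Template`. The wikitext wording, the field names and the record below are illustrative assumptions, not the project's real template; only the category name, the persons and the PPN-based links come from the slides.

```python
from string import Template
from typing import Dict

# Fixed strings live in the template; $fields are filled from the database record.
ARTICLE_TEMPLATE = Template("""'''$title''' was een verzetsblad uit de Tweede Wereldoorlog, uitgegeven in $place.

Betrokken personen: $persons

== Externe links ==
* [$catalogue_url Catalogusrecord bij de KB]
* [$delpher_url Volledige tekst in Delpher]

[[Categorie:Illegale pers in de Tweede Wereldoorlog]]
""")


def render_article(record: Dict) -> str:
    """Fill the template from one record; a missing field raises KeyError."""
    persons = ", ".join(f"[[{name}]]" for name in record["persons"])
    return ARTICLE_TEMPLATE.substitute(
        title=record["title"],
        place=record["place"],
        persons=persons,
        catalogue_url=record["catalogue_url"],
        delpher_url=record["delpher_url"],
    )


if __name__ == "__main__":
    record = {
        "title": "De Geus onder studenten",
        "place": "Den Haag",
        "persons": ["Jan Drion", "Huib Drion"],
        "catalogue_url": "http://opc4.kb.nl/DB=1/PPN?PPN=107123223",
        # Square brackets are percent-encoded so they cannot end the wikitext link early.
        "delpher_url": "http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql%5B%5D=ppn%3D107123223",
    }
    print(render_article(record))
```

Using `substitute` rather than `safe_substitute` is deliberate in this sketch: an incomplete record fails loudly instead of producing an article with a hole in it.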

Problem with Delpher (and KB-catalogue): very little contextual information about the newspaper (title)s

The KB can re-use (embed) the Wikipedia content in its own websites to tackle this problem

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_%28verzetsblad%29

Delpher - search results

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

Embedded contextual snippet from Wikipedia:

De geus (onder studenten) was a resistance newspaper from the Second World War that from 4 October 1940 up to and including 13 July 1944 … Read more on Wikipedia

Delpher - object presentation

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

Embedded contextual snippet from Wikipedia:

About De Geus onder studenten

De geus (onder studenten) was a resistance newspaper from the Second World War, published in The Hague from 4 October 1940 up to and including 13 July 1944. The paper appeared monthly in 1940, 1941 and 1943 and irregularly otherwise, with a circulation between 250 and 8000 copies. It was initially stencilled and, from November 1942, printed; its content consisted mainly of opinion articles.

The paper was published by Jan Drion and Huib Drion, two Leiden…

Read more on Wikipedia
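The embedded snippets are a shortened article introduction followed by a link back to Wikipedia. A minimal sketch of that shortening step is given below; the function name, the default length and the HTML shape are assumptions, and the default label is the Dutch "Lees verder op Wikipedia" ("Read more on Wikipedia") shown in the Delpher mock-ups.

```python
from html import escape


def make_snippet(intro: str, article_url: str, max_chars: int = 120,
                 label: str = "Lees verder op Wikipedia") -> str:
    """Shorten an article introduction and append a 'read more' link as HTML."""
    text = " ".join(intro.split())  # collapse line breaks and repeated spaces
    if len(text) > max_chars:
        cut = text[:max_chars]
        # Back up to the last space so no word is cut in half.
        if " " in cut:
            cut = cut.rsplit(" ", 1)[0]
        text = cut + " …"
    return f'{escape(text)} <a href="{escape(article_url)}">{escape(label)}</a>'


if __name__ == "__main__":
    intro = ("De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog "
             "dat vanaf 4 oktober 1940 tot en met 13 juli 1944 in Den Haag werd uitgegeven.")
    print(make_snippet(intro, "https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)"))
```

A shorter `max_chars` would suit the search-results view and a longer one the object presentation, where the slides show a fuller extract.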

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web
  20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together.
• https://en.wikipedia.org/wiki/Linked_data
  Wikipedia article related to the above video
• http://5stardata.info/en
  The 5 stars of Linked Open Data (Tim Berners-Lee)
• http://linda-project.eu/linked-data-primer-2
  Short primer about creating LOD in practice, starting from an Excel sheet
• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31
  The figure near the bottom of the first page is a good illustration of the concept of (linked) triples
• https://en.wikipedia.org/wiki/DBpedia
• https://en.wikipedia.org/wiki/Semantic_network


Questions?

olafjanssenkbnl - @ookgezellig

tinyurl.com/verzetskranten



httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

Delpher - search results

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog

dat vanaf 4 oktober 1940 tot en met 13 juli 1944 hellip Lees verder op Wikipedia

Delpher - object presentation

httpresolverkbnlresolveurn=ddd010424553mpeg21p001

Over De Geus onder studenten

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog dat vanaf 4 oktober 1940 tot en met 13 juli 1944 in Den Haag werd uitgegeven Het blad verscheen in 1940 1941 en 1943 maandelijks verder onregelmatig in een oplage tussen de 250 en 8000 exemplaren Het werd aanvankelijk gestencild en vanaf november 1942 gedrukt en de inhoud bestond voornamelijk uit opinie-artikelen

Het blad werd uitgegeven door Jan Drion en Huib Drion twee Leidsehellip

Lees verder op Wikipedia

Embedded contextual snippet from Wikipedia

httpresolverkbnlresolveurn=ddd010424553mpeg21p001

Delpher - object presentation

Suggested reading

bull httpwwwtedcomtalkstim_berners_lee_on_the_next_web 20 years ago Tim Berners-Lee invented the World Wide Web For his next project hes building a web for open linked data that could do for numbers what the Web did for words pictures video unlock our data and reframe the way we use it together

bull httpsenwikipediaorgwikiLinked_data Wikipedia article related to the above video

bull http5stardatainfoen The 5 stars of Linked Open Data (Tim Berners-Lee)

bull httplinda-projecteulinked-data-primer-2 Short primer about creating LOD in practice starting from an Excel sheet

bull httpwwwprogrammablewebcomnewshow-linked-data-solved-digital-age-marketing-problemanalysis20150831 The figure near the bottom of the first page is a good illustration of the concept of (linked) triples

bull httpsenwikipediaorgwikiDBpedia

bull httpsenwikipediaorgwikiSemantic_network

htt

p

ww

wg

etty

imag

esn

ld

etai

ln

ieu

wsf

oto

st

hre

e-w

om

en-o

f-th

e-at

s-lig

ht-

up

-to

geth

er-a

ts-r

egu

lati

on

s-n

ieu

wsf

oto

s3

09

42

65

Questions

olafjanssenkbnl - ookgezellig

tinyurlcomverzetskranten

htt

p

ww

wg

etty

imag

esn

ld

etai

ln

ieu

wsf

oto

st

hre

e-w

om

en-o

f-th

e-at

s-lig

ht-

up

-to

geth

er-a

ts-r

egu

lati

on

s-n

ieu

wsf

oto

s3

09

42

65

Page 40: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

Relation 2/3

Newspaper ↔ Persons

semantics linked data

Relation 3/3

Newspaper ↔ Other newspapers

Many articles also contain references to other newspapers

• 106 = Cereales Vadeness (student resistance newspaper, Wageningen)
• 360 = Leidsche Brief (student resistance newspaper, Leiden)
• 748 = Sol Justitiae (student resistance newspaper, Utrecht)

semantics linked data

Too bad it's a paper book: hard to find, multiply, distribute and build upon

We need it digital

Image: https://knowledgeutopia.files.wordpress.com/2014/01/hollandhouselibraryblitz1940.jpg

1. Clear copyright with copyright holder (NIOD); open CC-BY-SA license

2. Scan & OCR

3. Convert into PDF

4. Put online: NIOD site & Wikimedia Commons

De Winkel as PDF on NIOD website (CC-BY-SA): http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons: https://commons.wikimedia.org/wiki/File:PDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989.pdf

Saved us €13330: http://www.brill.com/dutch-underground-press-1940-1945

Wikipedia article about De Winkel: http://nl.wikipedia.org/wiki/De_ondergrondse_pers_1940-1945

Wikipedia article about the author: http://nl.wikipedia.org/wiki/Lydia_Winkel

Winkel: the plusses

Available online (PDF, flat file)
Open license (CC-BY-SA)
Contextual information
Relations:
• Titles ↔ Places
• Titles ↔ Persons
• Titles ↔ Other titles

Image: http://www.archives.gov/research/military/ww2/photos/images/ww2-194.jpg

Image: http://2.bp.blogspot.com/_BWzuYwiS6-I/TMgeRsFd3mI/AAAAAAAAElw/3cvgbZSPWcs/s1600/doctor+macro+judy+scared.jpg

Winkel: the minuses

Unstructured data (PDF, flat file)
Not very machine-readable (unlike CSV, XML, JSON, RDF)
PDF is not a (real) open standard (unlike CSV, XML, JSON, RDF)
No links between titles, Delpher & KB-cat
No links between titles, places & persons and external sources (like Wikipedia)

To summarize: a lot of information is available about these WW2 underground newspapers

1. Metadata (KB-cat)
2. Content (full-text, Delpher)
3. Context (Winkel PDF)
4. Relations: titles ↔ places, persons, other titles (Winkel PDF)
5. External resources about titles, places and persons

but the data sources are unconnected (and for 3+4 unstructured & not machine-readable), making discovery, understanding & research of these newspapers (and related places & persons) more difficult than necessary

Good news: we can solve all these issues

Wikiproject(*) Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

tinyurl.com/verzetskranten (in Dutch)

(*) https://nl.wikipedia.org/wiki/Wikipedia:Wikiproject, https://en.wikipedia.org/wiki/Wikipedia:Wikiproject

From 14 to 1300 titles: https://nl.wikipedia.org/wiki/Categorie:Illegale_pers_in_de_Tweede_Wereldoorlog

We need a database

Build central database

https://www.youtube.com/watch?v=GVDGuCjog_0

Step 1: Create Excel-sheet with
• Metadata about newspaper (from Winkel PDF): Winkel-ID, title, place of publication, other metadata
• Unique Wikipedia article title = <Newspaper title> (verzetsblad, <Place of publication>)
• Contextual info, incl. related persons & related newspaper titles (from Winkel PDF)
• PPN = unique ID linking newspaper to KB-catalogue & Delpher

PPN (107123223) in the KB-catalogue: http://opc4.kb.nl/DB=1/PPN?PPN=107123223

PPN (107123223) in Delpher: http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223
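The title convention and the two PPN-based links from Step 1 can be derived mechanically from one spreadsheet row. The following is a minimal Python sketch, not the project's actual tooling: the function names and the dictionary keys are made up for illustration, and leaving out the place when it is missing is an assumption. The URL patterns are the ones shown on the slides.

```python
from urllib.parse import quote


def article_title(title, place=None):
    """Build '<Newspaper title> (verzetsblad, <Place of publication>)'.

    Assumption of this sketch: the place is omitted when it is not given.
    """
    if place:
        return f"{title} (verzetsblad, {place})"
    return f"{title} (verzetsblad)"


def catalogue_url(ppn):
    """KB-catalogue record for a PPN, following the URL pattern on the slide."""
    return f"http://opc4.kb.nl/DB=1/PPN?PPN={ppn}"


def delpher_url(ppn):
    """Delpher search results for a PPN; 'ppn=<PPN>' is percent-encoded."""
    query = quote(f"ppn={ppn}", safe="")
    return ("http://www.delpher.nl/nl/kranten/results/index"
            f"?coll=dddtitel&cql[]={query}")


# Hypothetical row layout; only the PPN and the title come from the slides.
row = {"title": "De Geus onder studenten", "place": None, "ppn": "107123223"}
print(article_title(row["title"], row["place"]))
print(catalogue_url(row["ppn"]))
print(delpher_url(row["ppn"]))
```

Because the PPN is the only key shared by the catalogue and Delpher, storing it once per row is enough to regenerate both links whenever a URL pattern changes.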

Build central database

Step 2: Convert Excel into RDF triplestore (= special kind of online database anybody can access)
• Steps 1-4 from http://linda-project.eu/linked-data-primer-2
• Step 4: Vocabulary used = Bibframe (http://bibframe.org/vocab)
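To make "convert Excel into RDF" concrete, here is a minimal Python sketch that turns one spreadsheet row into N-Triples lines, one triple per filled cell. It is only an illustration under stated assumptions: the base URI, the placeholder vocabulary namespace and the column names are hypothetical, and it does not use the Bibframe terms the project chose. The Winkel-ID, title and place come from the relation slide.

```python
BASE = "http://example.org/verzetskranten/"  # hypothetical base URI
VOCAB = "http://example.org/vocab/"          # placeholder namespace, not Bibframe


def literal(value):
    """Serialize a value as an N-Triples string literal with basic escaping."""
    escaped = (str(value).replace("\\", "\\\\").replace('"', '\\"')
               .replace("\n", "\\n").replace("\r", "\\r"))
    return f'"{escaped}"'


def row_to_triples(row):
    """Turn one spreadsheet row (a dict) into a list of N-Triples lines."""
    subject = f"<{BASE}{row['winkel_id']}>"
    triples = []
    for column, value in row.items():
        if column == "winkel_id" or value in (None, ""):
            continue  # the ID forms the subject; empty cells yield no triple
        triples.append(f"{subject} <{VOCAB}{column}> {literal(value)} .")
    return triples


row = {"winkel_id": "360", "title": "Leidsche Brief", "place": "Leiden", "ppn": ""}
for line in row_to_triples(row):
    print(line)
```

Each output line is a subject, predicate and object, which is the triple shape a triplestore loads; swapping the placeholder namespace for a published vocabulary is what makes the data interoperable.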

Build central database

Step 3: Link to external resources
• Step 5 from http://linda-project.eu/linked-data-primer-2
• DBpedia = machine-readable, structured version of Wikipedia. DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data
• DBpedia = hub for linking different data sets on the Web to each other (Linked Open Data cloud: http://lod-cloud.net/versions/2010-09-22/lod-cloud_colored.png)
• We use DBpedia to connect persons & places in our newspaper database to information in other databases, for example:

http://nl.dbpedia.org/page/Huib_Drion

https://nl.wikipedia.org/wiki/Huib_Drion

http://www.dbnl.org/auteurs/auteur.php?id=drio001

http://www.biografischportaal.nl/persoon/41181342
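A DBpedia resource can point to the same person in other databases through owl:sameAs links, which is what lets software follow the hub. The Python sketch below only builds the SPARQL query and the request URL; it sends nothing over the network. The endpoint address and the `format` parameter are assumptions about how the Dutch DBpedia chapter is served, and the `/resource/` URI is the usual DBpedia counterpart of the `/page/` address on the slide.

```python
from urllib.parse import urlencode

# Assumption: the Dutch DBpedia chapter exposes its SPARQL endpoint here.
ENDPOINT = "http://nl.dbpedia.org/sparql"


def same_as_query(resource_uri):
    """SPARQL query listing the owl:sameAs links of one DBpedia resource."""
    return (
        "PREFIX owl: <http://www.w3.org/2002/07/owl#>\n"
        "SELECT ?other WHERE {\n"
        f"  <{resource_uri}> owl:sameAs ?other .\n"
        "}"
    )


def request_url(resource_uri):
    """GET URL asking the endpoint for JSON results (built, not sent)."""
    params = {
        "query": same_as_query(resource_uri),
        "format": "application/sparql-results+json",  # assumed to be accepted
    }
    return f"{ENDPOINT}?{urlencode(params)}"


huib = "http://nl.dbpedia.org/resource/Huib_Drion"
print(same_as_query(huib))
print(request_url(huib))
```

Fetching the printed URL with any HTTP client would return the linked identifiers, if the endpoint behaves as assumed.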

Build central database

Added value of Linked Open Data & DBpedia

Software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in
• KB-catalogue
• Delpher
• De Winkel

Summary: data about 1300 newspapers

Available online
Structured data (RDF-triples)
Open license (CC-BY-SA)
Open standard (RDF)
Contextual information
Relations:
• Titles ↔ Places
• Titles ↔ Persons
• Titles ↔ Other titles
Links between titles, Delpher & KB-cat (via PPNs)
Links between titles, places & persons and external sources (via DBpedia)

http://5stardata.info/en/

Wikiproject Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

We need a template

Using an article template we can generate 1300 uniform and interlinked Wikipedia articles from the LOD-database

Image: https://c1.staticflickr.com/9/8281/7699231918_11a7356c38_b.jpg

LOD-database + article template = Wikipedia article

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

• Titles: from database
• Metadata: from database
• Related persons: from database
• Related newspapers: from database
• Winkel-ID: from database
• Link to full-texts in Delpher: from database
• Link to KB-catalogue record: from database
• Fixed categories on WP and Commons: predefined fixed strings

Grey =
• From database
• Predefined fixed strings

Uniformity between articles guaranteed

All that WP-writers need to add manually
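The "database + template = article" idea can be sketched in a few lines of Python: fixed strings live in a skeleton, and the database supplies the values. This is a hedged illustration only. The wikitext skeleton, the section heading and the record keys are invented for the example, and the project's real template is not shown on the slides. The category name and the opening phrase come from the slides, and the related titles are illustrative values from the relation slide.

```python
from string import Template

# Hypothetical wikitext skeleton (Dutch fixed strings, as the target is Dutch Wikipedia).
ARTICLE = Template(
    "'''$title''' was een verzetsblad uit de Tweede Wereldoorlog, "
    "uitgegeven in $place.\n\n"
    "== Gerelateerde kranten ==\n"
    "$related\n\n"
    "[[Categorie:Illegale pers in de Tweede Wereldoorlog]]\n"
)


def render_article(record):
    """Fill the fixed skeleton with database values for one newspaper."""
    related = "\n".join(f"* [[{name}]]" for name in record["related"])
    return ARTICLE.substitute(
        title=record["title"], place=record["place"], related=related
    )


record = {
    "title": "De Geus onder studenten",
    "place": "Den Haag",
    "related": ["Leidsche Brief", "Sol Justitiae"],  # illustrative values
}
print(render_article(record))
```

Because every article is rendered from the same skeleton, wording and categories stay uniform, and writers only add the free text that the database cannot supply.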

Problem with Delpher (and KB-catalogue)

Véry little contextual information about the newspaper(title)s

Image: https://thejungleisneutral.files.wordpress.com/2013/11/lost.jpg

The KB can re-use (embed) the Wikipedia content in its own websites to tackle this problem

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_%28verzetsblad%29

Delpher - search results

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

Embedded contextual snippet from Wikipedia (translated from Dutch):

De geus (onder studenten) was a resistance newspaper from the Second World War that, from 4 October 1940 until 13 July 1944 … Read more on Wikipedia

Delpher - object presentation

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

Embedded contextual snippet from Wikipedia (translated from Dutch):

About De Geus onder studenten

De geus (onder studenten) was a resistance newspaper from the Second World War that was published in The Hague from 4 October 1940 until 13 July 1944. The paper appeared monthly in 1940, 1941 and 1943, and otherwise irregularly, with a circulation of between 250 and 8000 copies. It was initially stencilled and, from November 1942, printed, and its content consisted mainly of opinion articles.

The paper was published by Jan Drion and Huib Drion, two Leiden…

Read more on Wikipedia
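One plausible way to obtain such a snippet is the MediaWiki API's extracts module, which returns the plain-text introduction of an article. The Python sketch below builds the request URL and trims an intro to snippet length; it makes no network call. How Delpher would really embed the text is not described in the slides, so the function names, the 200-character limit and the trimming rule are all assumptions.

```python
from urllib.parse import urlencode

API = "https://nl.wikipedia.org/w/api.php"


def intro_request_url(article_title):
    """URL asking the MediaWiki API for the plain-text intro of an article."""
    params = {
        "action": "query",
        "prop": "extracts",
        "exintro": 1,
        "explaintext": 1,
        "format": "json",
        "titles": article_title,
    }
    return f"{API}?{urlencode(params)}"


def make_snippet(intro, article_url, max_chars=200,
                 label="Lees verder op Wikipedia"):
    """Cut the intro at a word boundary and append a 'read more' pointer.

    The default label is Dutch for 'Read more on Wikipedia'.
    """
    if len(intro) > max_chars:
        intro = intro[:max_chars].rsplit(" ", 1)[0] + " …"
    return f"{intro} {label}: {article_url}"


print(intro_request_url("De Geus onder studenten (verzetsblad)"))
print(make_snippet(
    "De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog",
    "https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)",
    max_chars=40,
))
```

Keeping the link back to the article in every snippet also serves the attribution that the CC-BY-SA license requires.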

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web 20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together.

• https://en.wikipedia.org/wiki/Linked_data Wikipedia article related to the above video

• http://5stardata.info/en/ The 5 stars of Linked Open Data (Tim Berners-Lee)

• http://linda-project.eu/linked-data-primer-2 Short primer about creating LOD in practice, starting from an Excel sheet

• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31 The figure near the bottom of the first page is a good illustration of the concept of (linked) triples

• https://en.wikipedia.org/wiki/DBpedia

• https://en.wikipedia.org/wiki/Semantic_network

Questions?

olaf.janssen@kb.nl - @ookgezellig

tinyurl.com/verzetskranten

Image: http://www.gettyimages.nl/detail/nieuwsfoto's/three-women-of-the-ats-light-up-together-ats-regulations-nieuwsfotos/3094265


Page 42: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

Relation 33

Newspaper Other newspapers

semantics linked data

htt

p

htt

ps

kn

ow

led

geu

top

iaf

iles

wo

rdp

ress

co

m2

01

40

1h

olla

nd

ho

use

libra

ryb

litz1

94

0j

pg

Too bad itrsquos a paper book hard to find multiply distribute and build upon

We need it digital

htt

p

htt

ps

kn

ow

led

geu

top

iaf

iles

wo

rdp

ress

co

m2

01

40

1h

olla

nd

ho

use

libra

ryb

litz1

94

0j

pg

We need it digital

1 Clear copyright with copyright holder (NIOD) Open CC-BY-SA license

2 Scan amp OCR

3 Convert into PDF

4 Put online NIOD site amp Wikimedia Commons

De Winkel as PDF on NIOD website (CC-BY-SA) httpwwwniodnlnlde-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons httpscommonswikimediaorgwikiFilePDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989pdf

De Winkel as PDF on NIOD website (CC-BY-SA) httpwwwniodnlnlde-ondergrondse-pers-1940-1945

Saved us euro13330 httpwwwbrillcomdutch-underground-press-1940-1945

Wikipedia article about De Winkel httpnlwikipediaorgwikiDe_ondergrondse_pers_1940-1945

Wikipedia article about the author httpnlwikipediaorgwikiLydia_Winkel

Winkel the plusses

Available online (PDF flat file)

Open license (CC-BY-SA) Contextual information Relations

bull Titles Places bull Titles Persons

bull Titles Other titles

htt

p

ww

wa

rch

ives

go

vre

sear

chm

ilita

ryw

w2

ph

oto

sim

ages

ww

2-1

94

jpg

htt

p

2b

pb

logsp

ot

com

_BW

zuYw

iS6-I

TM

geR

sFd3m

IAAAAAAAAElw

3cv

gbZSPW

css

1600d

oct

or+

macr

o+

judy+

scare

djpg

Winkel the minusses

Unstructured data (PDF flat file)

Not very machine readable (unlike CSV XML JSON RDF)

PDF is no (real) open standard (unlike CSV XML JSON RDF)

No links between titles Delpher amp KB-cat No links between titles places amp persons

external sources (like Wikipedia)

but the data sources are

unconnected (and for 3+4 unstructured amp not machine-readable)

To summarize

a lot of information is available about these WW2 underground newspapers

1 Metadata (KB-cat)

2 Content (full-text Delpher)

3 Context (Winkel PDF)

4 Relations titles places persons other titles (Winkel PDF)

5 External resources about titles places and persons

htt

p

2b

pb

logsp

ot

com

_BW

zuYw

iS6-I

TM

geR

sFd3m

IAAAAAAAAElw

3cv

gbZSPW

css

1600d

oct

or+

macr

o+

judy+

scare

djpg

making discovery understanding amp research

of these newspapers (and related places amp persons) more difficult than necessary

We can solve all these issues

Good news

Wikiproject() Verzetskranten

Systematically and uniformly describe amp link all 1300 Dutch underground newspapers from WW2

on Dutch Wikipedia

tinyurlcomverzetskranten (in Dutch)

httpsnlwikipediaorgwikiWikipediaWikiproject httpsenwikipediaorgwikiWikipediaWikiproject

From 14 1300 titles

htt

ps

n

lwik

iped

iao

rgw

iki

Cat

ego

rie

Illeg

ale_

per

s_in

_de_

Twee

de_

Wer

eld

oo

rlo

g

Wikiproject() Verzetskranten

Systematically and uniformly describe amp link all 1300 Dutch underground newspapers from WW2

on Dutch Wikipedia

tinyurlcomverzetskranten (in Dutch)

We need a database

Build central database

httpswwwyoutubecomwatchv=GVDGuCjog_0

Build central database

httpswwwyoutubecomwatchv=GVDGuCjog_0

Step 1 Create Excel-sheet with bull Metadata about newspaper (from Winkel PDF)

bull Unique Wikipedia article title

bull Contextual info incl related persons amp titles (from Winkel PDF)

bull PPN unique ID linking newspaper to

KB-catalogue amp Delpher

Winkel-ID

Place of publication

Other metadata

Title

Unique Wikipedia article title =

ltNewspaper titlegt (verzetsblad ltPlace of publicationgt)

bull Contextual info bull Related persons

bull Related newspaper titles

PPN (107123223)

httpopc4kbnlDB=1PPNPPN=107123223

httpopc4kbnlDB=1PPNPPN=107123223

PPN (107123223)

PPN (107123223)

httpwwwdelphernlnlkrantenresultsindexcoll=dddtitelampcql[]=ppn3D107123223

httpwwwdelphernlnlkrantenresultsindexcoll=dddtitelampcql[]=ppn3D107123223

PPN (107123223)

Build central database

Step 2 Convert Excel into RDF triplestore (=special kind of online database anybody can access)

bull Steps 1-4 from httplinda-projecteulinked-

data-primer-2

bull Step 4 Vocubulary used = Bibframe (httpbibframeorgvocab)

Build central database

Step 3: Link to external resources

• Step 5 from http://linda-project.eu/linked-data-primer-2

• DBpedia = machine-readable, structured version of Wikipedia

• DBpedia = hub for linking different data sets on the Web to each other (Linked Open Data cloud)

• DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data

• We use DBpedia to connect persons & places in our newspaper database to information in other databases

Linked Open Data cloud

http://lod-cloud.net/versions/2010-09-22/lod-cloud_colored.png

http://nl.dbpedia.org/page/Huib_Drion

https://nl.wikipedia.org/wiki/Huib_Drion

http://www.dbnl.org/auteurs/auteur.php?id=drio001

http://www.biografischportaal.nl/persoon/41181342
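One way to picture the linking step: DBpedia resource URIs mirror Wikipedia page titles, so a person's Wikipedia URL can be mapped to a DBpedia URI and attached to the local record with an `owl:sameAs` triple. The sketch below assumes that mapping convention; the local `example.org` URI and the function names are hypothetical, and this is not necessarily how the project created its links.

```python
from urllib.parse import urlparse

OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


def dbpedia_resource(wikipedia_url):
    """Map a Wikipedia article URL to the matching DBpedia resource URI."""
    parsed = urlparse(wikipedia_url)
    lang = parsed.netloc.split(".")[0]           # 'nl' in nl.wikipedia.org
    page = parsed.path.rsplit("/wiki/", 1)[-1]   # 'Huib_Drion'
    # English DBpedia has no language prefix; other chapters do.
    host = "dbpedia.org" if lang == "en" else f"{lang}.dbpedia.org"
    return f"http://{host}/resource/{page}"


def same_as_triple(local_uri, wikipedia_url):
    """N-Triples statement linking a local record to DBpedia."""
    return f"<{local_uri}> <{OWL_SAME_AS}> <{dbpedia_resource(wikipedia_url)}> ."


# Hypothetical local URI for the person in the newspaper database.
local = "http://example.org/verzetskranten/person/Huib_Drion"
print(same_as_triple(local, "https://nl.wikipedia.org/wiki/Huib_Drion"))
```

Once such a link exists, anything DBpedia itself links to for that person becomes reachable from the newspaper database.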

Build central database

Added value of Linked Open Data & DBpedia

Software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in

• KB-catalogue

• Delpher

• De Winkel
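What "software can automatically query" might look like in practice: the sketch below only builds a SPARQL query and the corresponding request URL for a person; it does not send it. The endpoint address `http://nl.dbpedia.org/sparql` and the choice of the `dbo:abstract` property are assumptions made for illustration, not details taken from the slides.

```python
from urllib.parse import urlencode

# Assumed endpoint of the Dutch DBpedia chapter.
ENDPOINT = "http://nl.dbpedia.org/sparql"


def person_query(resource_uri):
    """SPARQL query asking for the abstract of one DBpedia resource."""
    return (
        "PREFIX dbo: <http://dbpedia.org/ontology/>\n"
        "SELECT ?abstract WHERE {\n"
        f"  <{resource_uri}> dbo:abstract ?abstract .\n"
        "}"
    )


def sparql_url(endpoint, query):
    """Encode the query as a GET request URL asking for JSON results."""
    params = {"query": query, "format": "application/sparql-results+json"}
    return endpoint + "?" + urlencode(params)


query = person_query("http://nl.dbpedia.org/resource/Huib_Drion")
print(query)
print(sparql_url(ENDPOINT, query))
```

Fetching the printed URL with any HTTP client would return the result set, provided the endpoint is available and the resource has an abstract.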

Summary: data about 1300 newspapers

Available online

Structured data (RDF triples)

Open license (CC-BY-SA)

Open standard (RDF)

Contextual information

Links between titles, Delpher & KB-cat (via PPNs)

Relations

• Titles - Places

• Titles - Persons

• Titles - Other titles

Links between titles, places & persons and external sources (via DBpedia)

http://5stardata.info/en/

Wikiproject Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

We need a template

Using an article template we can generate 1300 uniform and interlinked Wikipedia articles from the LOD-database

https://c1.staticflickr.com/9/8281/7699231918_11a7356c38_b.jpg

LOD-database + article template = Wikipedia article

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

Titles: from database

Metadata: from database

Related persons: from database

Related newspapers: from database

Winkel-ID: from database

Link to full-texts in Delpher: from database

Link to KB-catalogue record: from database

Fixed categories on WP and Commons

Predefined fixed strings

Grey =

• From database

• Predefined fixed strings

Uniformity between articles guaranteed

All that WP-writers need to add manually
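The "database + template = article" idea can be sketched with a plain string template. Everything in the template below is illustrative: the wikitext wording, the field names and the `render_article` helper are hypothetical, the catalogue link pattern is reconstructed from the slides, and only the category name is taken from the category URL shown earlier in the deck.

```python
from string import Template

# Fixed strings live in the template; $-placeholders come from the database.
ARTICLE_TEMPLATE = Template(
    "'''$title''' was een verzetsblad uit de Tweede Wereldoorlog "
    "dat in $place werd uitgegeven.\n"
    "\n"
    "== Externe links ==\n"
    "* [http://opc4.kb.nl/DB=1/PPN?PPN=$ppn Record in de KB-catalogus]\n"
    "\n"
    "[[Categorie:Illegale pers in de Tweede Wereldoorlog]]\n"
)


def render_article(record):
    """Fill the template with one database record (a plain dict)."""
    return ARTICLE_TEMPLATE.substitute(record)


record = {"title": "De Geus onder studenten",
          "place": "Den Haag",
          "ppn": "107123223"}
print(render_article(record))
```

Running the same template over every record is what keeps the generated articles uniform; writers then only add the free text that the database cannot supply.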

Problem with Delpher (and KB-catalogue)

Véry little contextual information about the newspaper (title)s

https://thejungleisneutral.files.wordpress.com/2013/11/lost.jpg

The KB can re-use (embed) the Wikipedia content in its own websites to tackle this problem

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_%28verzetsblad%29

Delpher - search results

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

Embedded contextual snippet from Wikipedia:

De geus (onder studenten) was a resistance newspaper from the Second World War that appeared from 4 October 1940 up to and including 13 July 1944 … Read more on Wikipedia

Delpher - object presentation

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

Embedded contextual snippet from Wikipedia:

About De Geus onder studenten

De geus (onder studenten) was a resistance newspaper from the Second World War, published in The Hague from 4 October 1940 up to and including 13 July 1944. It appeared monthly in 1940, 1941 and 1943 and irregularly otherwise, with a print run between 250 and 8000 copies. It was initially stencilled and, from November 1942, printed; the content consisted mainly of opinion articles.

The paper was published by Jan Drion and Huib Drion, two Leiden…

Read more on Wikipedia
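A minimal sketch of how such an embedded snippet could be produced: cut the article's introduction at a word boundary and append a link back to Wikipedia. The `snippet` function and its `max_chars` parameter are hypothetical; how the introduction text is obtained from Wikipedia, and how Delpher would actually render it, is outside this sketch.

```python
def snippet(intro, article_url, max_chars=120):
    """Shorten an article introduction and add a 'read more' link.

    The Delpher mock-ups on the slides use the Dutch label
    'Lees verder op Wikipedia'; an English label is used here.
    """
    if len(intro) <= max_chars:
        teaser = intro
    else:
        # Cut at the last space before the limit so no word is split.
        teaser = intro[:max_chars].rsplit(" ", 1)[0] + " …"
    return f"{teaser} (Read more on Wikipedia: {article_url})"


intro = ("De geus (onder studenten) was a resistance newspaper from the "
         "Second World War, published in The Hague from 4 October 1940 "
         "up to and including 13 July 1944.")
url = "https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)"
print(snippet(intro, url))
```

A short `max_chars` would suit the search-results list, while a larger value would fit the longer "About" box on the object presentation page.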

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web 20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together.

• https://en.wikipedia.org/wiki/Linked_data Wikipedia article related to the above video

• http://5stardata.info/en/ The 5 stars of Linked Open Data (Tim Berners-Lee)

• http://linda-project.eu/linked-data-primer-2 Short primer about creating LOD in practice, starting from an Excel sheet

• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31 The figure near the bottom of the first page is a good illustration of the concept of (linked) triples

• https://en.wikipedia.org/wiki/DBpedia

• https://en.wikipedia.org/wiki/Semantic_network

http://www.gettyimages.nl/detail/nieuwsfoto's/three-women-of-the-ats-light-up-together-ats-regulations-nieuwsfotos/3094265

Questions?

olaf.janssen@kb.nl - @ookgezellig

tinyurl.com/verzetskranten



Too bad it's a paper book: hard to find, multiply, distribute and build upon

We need it digital

1. Clear copyright with copyright holder (NIOD): open CC-BY-SA license

2. Scan & OCR

3. Convert into PDF

4. Put online: NIOD site & Wikimedia Commons

De Winkel as PDF on NIOD website (CC-BY-SA): http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons: https://commons.wikimedia.org/wiki/File:PDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989.pdf

Saved us €13330: http://www.brill.com/dutch-underground-press-1940-1945

Wikipedia article about De Winkel: http://nl.wikipedia.org/wiki/De_ondergrondse_pers_1940-1945

Wikipedia article about the author: http://nl.wikipedia.org/wiki/Lydia_Winkel

Winkel: the plusses

Available online (PDF flat file)

Open license (CC-BY-SA)

Contextual information

Relations

• Titles - Places

• Titles - Persons

• Titles - Other titles


Winkel: the minusses

Unstructured data (PDF flat file)

Not very machine-readable (unlike CSV, XML, JSON, RDF)

PDF is no (real) open standard (unlike CSV, XML, JSON, RDF)

No links between titles, Delpher & KB-cat

No links between titles, places & persons and external sources (like Wikipedia)

To summarize: a lot of information is available about these WW2 underground newspapers

1. Metadata (KB-cat)

2. Content (full-text, Delpher)

3. Context (Winkel PDF)

4. Relations: titles, places, persons, other titles (Winkel PDF)

5. External resources about titles, places and persons

but the data sources are unconnected (and for 3+4: unstructured & not machine-readable), making discovery, understanding & research of these newspapers (and related places & persons) more difficult than necessary

Good news: we can solve all these issues



Page 45: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

We need it digital

1 Clear copyright with copyright holder (NIOD) Open CC-BY-SA license

2 Scan amp OCR

3 Convert into PDF

4 Put online NIOD site amp Wikimedia Commons

De Winkel as PDF on NIOD website (CC-BY-SA) httpwwwniodnlnlde-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons httpscommonswikimediaorgwikiFilePDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989pdf

De Winkel as PDF on NIOD website (CC-BY-SA) httpwwwniodnlnlde-ondergrondse-pers-1940-1945

Saved us euro13330 httpwwwbrillcomdutch-underground-press-1940-1945

Wikipedia article about De Winkel httpnlwikipediaorgwikiDe_ondergrondse_pers_1940-1945

Wikipedia article about the author httpnlwikipediaorgwikiLydia_Winkel

Winkel the plusses

Available online (PDF flat file)

Open license (CC-BY-SA) Contextual information Relations

bull Titles Places bull Titles Persons

bull Titles Other titles

htt

p

ww

wa

rch

ives

go

vre

sear

chm

ilita

ryw

w2

ph

oto

sim

ages

ww

2-1

94

jpg

htt

p

2b

pb

logsp

ot

com

_BW

zuYw

iS6-I

TM

geR

sFd3m

IAAAAAAAAElw

3cv

gbZSPW

css

1600d

oct

or+

macr

o+

judy+

scare

djpg

Winkel the minusses

Unstructured data (PDF flat file)

Not very machine readable (unlike CSV XML JSON RDF)

PDF is no (real) open standard (unlike CSV XML JSON RDF)

No links between titles Delpher amp KB-cat No links between titles places amp persons

external sources (like Wikipedia)

but the data sources are

unconnected (and for 3+4 unstructured amp not machine-readable)

To summarize

a lot of information is available about these WW2 underground newspapers

1 Metadata (KB-cat)

2 Content (full-text Delpher)

3 Context (Winkel PDF)

4 Relations titles places persons other titles (Winkel PDF)

5 External resources about titles places and persons

htt

p

2b

pb

logsp

ot

com

_BW

zuYw

iS6-I

TM

geR

sFd3m

IAAAAAAAAElw

3cv

gbZSPW

css

1600d

oct

or+

macr

o+

judy+

scare

djpg

making discovery understanding amp research

of these newspapers (and related places amp persons) more difficult than necessary

We can solve all these issues

Good news

Wikiproject() Verzetskranten

Systematically and uniformly describe amp link all 1300 Dutch underground newspapers from WW2

on Dutch Wikipedia

tinyurlcomverzetskranten (in Dutch)

httpsnlwikipediaorgwikiWikipediaWikiproject httpsenwikipediaorgwikiWikipediaWikiproject

From 14 1300 titles

htt

ps

n

lwik

iped

iao

rgw

iki

Cat

ego

rie

Illeg

ale_

per

s_in

_de_

Twee

de_

Wer

eld

oo

rlo

g

Wikiproject() Verzetskranten

Systematically and uniformly describe amp link all 1300 Dutch underground newspapers from WW2

on Dutch Wikipedia

tinyurlcomverzetskranten (in Dutch)

We need a database

Build central database

httpswwwyoutubecomwatchv=GVDGuCjog_0

Build central database

httpswwwyoutubecomwatchv=GVDGuCjog_0

Step 1 Create Excel-sheet with bull Metadata about newspaper (from Winkel PDF)

bull Unique Wikipedia article title

bull Contextual info incl related persons amp titles (from Winkel PDF)

bull PPN unique ID linking newspaper to

KB-catalogue amp Delpher

Winkel-ID

Place of publication

Other metadata

Title

Unique Wikipedia article title =

ltNewspaper titlegt (verzetsblad ltPlace of publicationgt)

bull Contextual info bull Related persons

bull Related newspaper titles

PPN (107123223)

KB-catalogue: http://opc4.kb.nl/DB=1/PPN?PPN=107123223

Delpher: http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223
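The spreadsheet row described in Step 1 can be sketched in a few lines of Python. This is only an illustration, not the project's actual tooling: the function names are made up, the URL patterns are reconstructed from the example links on the slides, and the rule that the place is appended only when given (with a comma) is an assumption.

```python
from urllib.parse import quote


def article_title(newspaper_title: str, place: str = "") -> str:
    """Build the unique Wikipedia article title for one spreadsheet row.

    Assumption: the place of publication is only appended when it is
    supplied, and the comma before it is a guess at the slide's format.
    """
    qualifier = f"verzetsblad, {place}" if place else "verzetsblad"
    return f"{newspaper_title} ({qualifier})"


def ppn_links(ppn: str) -> dict:
    """Derive the KB-catalogue and Delpher links from a PPN.

    Assumption: both URL patterns are reconstructed from the example
    links for PPN 107123223 and may not match the live services.
    """
    return {
        "kb_catalogue": f"http://opc4.kb.nl/DB=1/PPN?PPN={ppn}",
        "delpher": (
            "http://www.delpher.nl/nl/kranten/results/index"
            f"?coll=dddtitel&cql[]={quote('ppn=' + ppn)}"
        ),
    }


if __name__ == "__main__":
    print(article_title("De Geus onder studenten"))
    for name, url in ppn_links("107123223").items():
        print(name, url)
```

Because every link is derived from the PPN, one identifier per row is enough to connect a title to both the catalogue record and the full text.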

Build central database

Step 2: Convert Excel into RDF triplestore (= special kind of online database anybody can access)
• Steps 1-4 from http://linda-project.eu/linked-data-primer-2
• Step 4: Vocabulary used = Bibframe (http://bibframe.org/vocab)
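To make the idea of "converting Excel into triples" concrete, here is a minimal sketch that turns one spreadsheet row into N-Triples lines using only the Python standard library. It is not the conversion route from the primer: the base URI, the row keys, the placeholder Winkel-ID and the property names (`title`, `identifier`, `placeOfPublication`) are all hypothetical and have not been checked against the Bibframe vocabulary.

```python
BF = "http://bibframe.org/vocab/"             # namespace named on the slide
BASE = "http://example.org/verzetskranten/"   # hypothetical base URI


def literal(value: str) -> str:
    """Format a single-line string as an N-Triples literal."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def row_to_triples(row: dict) -> list:
    """Turn one spreadsheet row into subject-predicate-object statements.

    The property names below are illustrative assumptions only.
    """
    subject = f"<{BASE}{row['winkel_id']}>"
    triples = [
        (subject, f"<{BF}title>", literal(row["title"])),
        (subject, f"<{BF}identifier>", literal(row["ppn"])),
        (subject, f"<{BASE}placeOfPublication>", literal(row["place"])),
    ]
    return [f"{s} {p} {o} ." for s, p, o in triples]


if __name__ == "__main__":
    example_row = {
        "winkel_id": "W000",  # placeholder, not a real Winkel-ID
        "title": "De Geus onder studenten",
        "ppn": "107123223",
        "place": "Den Haag",
    }
    for line in row_to_triples(example_row):
        print(line)
```

Each output line is one triple; loading such lines into a triplestore is what makes the data queryable by anyone.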

Build central database

Step 3: Link to external resources
• Step 5 from http://linda-project.eu/linked-data-primer-2
• DBpedia = machine-readable, structured version of Wikipedia
• DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data
• DBpedia = hub for linking different data sets on the Web to each other (Linked Open Data cloud, http://lod-cloud.net/versions/2010-09-22/lod-cloud_colored.png)
• We use DBpedia to connect persons & places in our newspaper database to information in other databases

http://nl.dbpedia.org/page/Huib_Drion

https://nl.wikipedia.org/wiki/Huib_Drion

http://www.dbnl.org/auteurs/auteur.php?id=drio001

http://www.biografischportaal.nl/persoon/41181342
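As a rough illustration of how software could follow such links, the sketch below builds a SPARQL query asking DBpedia which other resources are declared the same as a given person. It only constructs the query and the request URL and does not contact any server. The endpoint address, the `format` parameter and the use of `owl:sameAs` for these links are assumptions, as is the `/resource/` form of the Huib Drion URI.

```python
from urllib.parse import urlencode

ENDPOINT = "http://nl.dbpedia.org/sparql"  # assumed SPARQL endpoint


def same_as_query(resource_uri: str) -> str:
    """Build a query for resources linked to resource_uri via owl:sameAs."""
    return (
        "PREFIX owl: <http://www.w3.org/2002/07/owl#>\n"
        "SELECT ?other WHERE {\n"
        f"  <{resource_uri}> owl:sameAs ?other .\n"
        "}"
    )


def query_url(resource_uri: str) -> str:
    """Encode the query as a GET request URL for the assumed endpoint."""
    params = {
        "query": same_as_query(resource_uri),
        "format": "application/sparql-results+json",  # assumed parameter
    }
    return f"{ENDPOINT}?{urlencode(params)}"


if __name__ == "__main__":
    uri = "http://nl.dbpedia.org/resource/Huib_Drion"
    print(same_as_query(uri))
    print(query_url(uri))
```

The point of the hub model is that one query against one resource can lead to several external databases without the newspaper database storing those links itself.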

Build central database

Added value of Linked Open Data & DBpedia

Software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in
• KB-catalogue
• Delpher
• De Winkel

Summary: data about 1300 newspapers

• Available online
• Structured data (RDF-triples)
• Open license (CC-BY-SA)
• Open standard (RDF)
• Contextual information
• Relations between titles and places, persons, other titles
• Links between titles, Delpher & KB-cat (via PPNs)
• Links between titles, places & persons and external sources (via DBpedia)

http://5stardata.info/en

Wikiproject Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

We need a template

Using an article template we can generate 1300 uniform and interlinked Wikipedia articles from the LOD-database

LOD-database + article template = Wikipedia article

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

Titles from database

Metadata from database

Related persons from database

Related newspapers from database

Winkel-ID from database

Link to full-texts in Delpher from database

Link to KB-catalogue record from database

Fixed categories on WP and Commons

Predefined fixed strings

Grey =
• From database
• Predefined fixed strings

Uniformity between articles guaranteed

All that WP-writers need to add manually
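The "database + template = article" idea can be sketched with Python's `string.Template`. The wikitext below is a hypothetical stub, not the project's real article template: the fixed Dutch sentence is modelled on the opening line of the De Geus article, the category name is taken from the Dutch Wikipedia category mentioned earlier, and the record keys are invented for the example.

```python
from string import Template

# Hypothetical article stub: fixed strings plus placeholders filled
# from one database record.
ARTICLE_TEMPLATE = Template(
    "'''$title''' was een verzetsblad uit de Tweede Wereldoorlog "
    "dat in $place werd uitgegeven.\n"
    "\n"
    "== Externe links ==\n"
    "* [$delpher_url Full-text in Delpher]\n"
    "* [$catalogue_url Record in KB-catalogue]\n"
    "\n"
    "[[Categorie:Illegale pers in de Tweede Wereldoorlog]]\n"
)


def render_article(record: dict) -> str:
    """Fill the template; a missing field raises KeyError instead of
    silently producing an incomplete article."""
    return ARTICLE_TEMPLATE.substitute(record)


if __name__ == "__main__":
    record = {
        "title": "De Geus onder studenten",
        "place": "Den Haag",
        "delpher_url": "http://www.delpher.nl/nl/kranten/results/index"
                       "?coll=dddtitel&cql[]=ppn%3D107123223",
        "catalogue_url": "http://opc4.kb.nl/DB=1/PPN?PPN=107123223",
    }
    print(render_article(record))
```

Using `substitute` rather than `safe_substitute` is a deliberate choice here: a record with a missing field fails loudly, which supports the uniformity the template is meant to guarantee.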

Problem with Delpher (and KB-catalogue)

Very little contextual information about the newspaper (title)s

Image: https://thejungleisneutral.files.wordpress.com/2013/11/lost.jpg

The KB can re-use (embed) the Wikipedia content in its own websites to tackle this problem

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_%28verzetsblad%29

Delpher - search results

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

Embedded contextual snippet from Wikipedia:

De Geus (onder studenten) was a resistance newspaper from the Second World War that from 4 October 1940 until 13 July 1944 … Read more on Wikipedia

Delpher - object presentation

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

Embedded contextual snippet from Wikipedia:

About De Geus onder studenten

De Geus (onder studenten) was a resistance newspaper from the Second World War that was published in The Hague from 4 October 1940 until 13 July 1944. The paper appeared monthly in 1940, 1941 and 1943 and otherwise irregularly, with a circulation of between 250 and 8000 copies. It was initially stencilled and, from November 1942, printed; its content consisted mainly of opinion articles.

The paper was published by Jan Drion and Huib Drion, two Leiden…

Read more on Wikipedia
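One way such an embedded snippet could be produced is sketched below: request the article's introduction through the MediaWiki API, then cut it to a fixed length and attach a link back to the article. The sketch only builds the request URL and formats a snippet from text you pass in; it does not fetch anything. The function names and the length limit are invented, and it assumes the `extracts` query module is available on Dutch Wikipedia.

```python
from urllib.parse import quote, urlencode

API = "https://nl.wikipedia.org/w/api.php"


def extract_api_url(title: str) -> str:
    """Build a MediaWiki API request for the plain-text article intro."""
    params = {
        "action": "query",
        "prop": "extracts",
        "exintro": 1,
        "explaintext": 1,
        "titles": title,
        "format": "json",
    }
    return f"{API}?{urlencode(params)}"


def make_snippet(extract: str, title: str, max_chars: int = 120,
                 link_text: str = "Lees verder op Wikipedia") -> dict:
    """Shorten an extract and pair it with a link to the full article.

    link_text defaults to the Dutch for 'Read more on Wikipedia'.
    """
    text = extract.strip()
    if len(text) > max_chars:
        # Cut at the last space before the limit so no word is split.
        text = text[:max_chars].rsplit(" ", 1)[0] + " …"
    url = "https://nl.wikipedia.org/wiki/" + quote(title.replace(" ", "_"))
    return {"text": text, "link_text": link_text, "link_url": url}


if __name__ == "__main__":
    title = "De Geus onder studenten (verzetsblad)"
    intro = ("De geus (onder studenten) was een verzetsblad uit de Tweede "
             "Wereldoorlog dat vanaf 4 oktober 1940 tot en met 13 juli 1944 "
             "in Den Haag werd uitgegeven.")
    print(extract_api_url(title))
    print(make_snippet(intro, title, max_chars=80))
```

A short limit would suit a search-results page and a longer one an object page, which mirrors the two snippet lengths shown on the slides.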

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web
  20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together.

• https://en.wikipedia.org/wiki/Linked_data
  Wikipedia article related to the above video

• http://5stardata.info/en
  The 5 stars of Linked Open Data (Tim Berners-Lee)

• http://linda-project.eu/linked-data-primer-2
  Short primer about creating LOD in practice, starting from an Excel sheet

• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31
  The figure near the bottom of the first page is a good illustration of the concept of (linked) triples

• https://en.wikipedia.org/wiki/DBpedia

• https://en.wikipedia.org/wiki/Semantic_network

Questions?

olaf.janssen@kb.nl - @ookgezellig

tinyurl.com/verzetskranten

De Winkel as PDF on NIOD website (CC-BY-SA): http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945

De Winkel as PDF on Wikimedia Commons: https://commons.wikimedia.org/wiki/File:PDF_of_De_Ondergrondse_Pers_1940-1945_-_derde_druk_-_1989.pdf

Saved us €13330: http://www.brill.com/dutch-underground-press-1940-1945

Wikipedia article about De Winkel: http://nl.wikipedia.org/wiki/De_ondergrondse_pers_1940-1945

Wikipedia article about the author: http://nl.wikipedia.org/wiki/Lydia_Winkel

Winkel: the plusses

• Available online (PDF, flat file)
• Open license (CC-BY-SA)
• Contextual information
• Relations between titles and places, persons, other titles

Winkel: the minuses

• Unstructured data (PDF, flat file)
• Not very machine-readable (unlike CSV, XML, JSON, RDF)
• PDF is no (real) open standard (unlike CSV, XML, JSON, RDF)
• No links between titles, Delpher & KB-cat
• No links between titles, places & persons and external sources (like Wikipedia)
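To illustrate what "machine-readable" buys you, the sketch below reads the same kind of facts from a small CSV text with Python's standard `csv` module. The CSV layout and column names are hypothetical; De Winkel itself is only available as a PDF, from which such fields cannot be read this directly.

```python
import csv
import io

# Hypothetical CSV export of facts that De Winkel holds as running text.
CSV_TEXT = """title,place,ppn
De Geus onder studenten,Den Haag,107123223
"""


def read_records(text: str) -> list:
    """Parse CSV text into one dictionary per newspaper title."""
    return list(csv.DictReader(io.StringIO(text)))


if __name__ == "__main__":
    for record in read_records(CSV_TEXT):
        # Every field is addressable by name, with no text mining needed.
        print(record["title"], "-", record["place"], "-", record["ppn"])
```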

Page 48: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

De Winkel as PDF on NIOD website (CC-BY-SA) httpwwwniodnlnlde-ondergrondse-pers-1940-1945

Saved us euro13330 httpwwwbrillcomdutch-underground-press-1940-1945

Wikipedia article about De Winkel httpnlwikipediaorgwikiDe_ondergrondse_pers_1940-1945

Wikipedia article about the author httpnlwikipediaorgwikiLydia_Winkel

Winkel the plusses

Available online (PDF flat file)

Open license (CC-BY-SA) Contextual information Relations

bull Titles Places bull Titles Persons

bull Titles Other titles

htt

p

ww

wa

rch

ives

go

vre

sear

chm

ilita

ryw

w2

ph

oto

sim

ages

ww

2-1

94

jpg

htt

p

2b

pb

logsp

ot

com

_BW

zuYw

iS6-I

TM

geR

sFd3m

IAAAAAAAAElw

3cv

gbZSPW

css

1600d

oct

or+

macr

o+

judy+

scare

djpg

Winkel the minusses

Unstructured data (PDF flat file)

Not very machine readable (unlike CSV XML JSON RDF)

PDF is no (real) open standard (unlike CSV XML JSON RDF)

No links between titles Delpher amp KB-cat No links between titles places amp persons

external sources (like Wikipedia)

but the data sources are

unconnected (and for 3+4 unstructured amp not machine-readable)

To summarize

a lot of information is available about these WW2 underground newspapers

1 Metadata (KB-cat)

2 Content (full-text Delpher)

3 Context (Winkel PDF)

4 Relations titles places persons other titles (Winkel PDF)

5 External resources about titles places and persons

htt

p

2b

pb

logsp

ot

com

_BW

zuYw

iS6-I

TM

geR

sFd3m

IAAAAAAAAElw

3cv

gbZSPW

css

1600d

oct

or+

macr

o+

judy+

scare

djpg

making discovery understanding amp research

of these newspapers (and related places amp persons) more difficult than necessary

We can solve all these issues

Good news

Wikiproject() Verzetskranten

Systematically and uniformly describe amp link all 1300 Dutch underground newspapers from WW2

on Dutch Wikipedia

tinyurlcomverzetskranten (in Dutch)

httpsnlwikipediaorgwikiWikipediaWikiproject httpsenwikipediaorgwikiWikipediaWikiproject

From 14 1300 titles

htt

ps

n

lwik

iped

iao

rgw

iki

Cat

ego

rie

Illeg

ale_

per

s_in

_de_

Twee

de_

Wer

eld

oo

rlo

g

Wikiproject() Verzetskranten

Systematically and uniformly describe amp link all 1300 Dutch underground newspapers from WW2

on Dutch Wikipedia

tinyurlcomverzetskranten (in Dutch)

We need a database

Build central database

httpswwwyoutubecomwatchv=GVDGuCjog_0

Build central database

httpswwwyoutubecomwatchv=GVDGuCjog_0

Step 1 Create Excel-sheet with bull Metadata about newspaper (from Winkel PDF)

bull Unique Wikipedia article title

bull Contextual info incl related persons amp titles (from Winkel PDF)

bull PPN unique ID linking newspaper to

KB-catalogue amp Delpher

Winkel-ID

Place of publication

Other metadata

Title

Unique Wikipedia article title =

ltNewspaper titlegt (verzetsblad ltPlace of publicationgt)

bull Contextual info bull Related persons

bull Related newspaper titles

PPN (107123223)

httpopc4kbnlDB=1PPNPPN=107123223

httpopc4kbnlDB=1PPNPPN=107123223

PPN (107123223)

PPN (107123223)

httpwwwdelphernlnlkrantenresultsindexcoll=dddtitelampcql[]=ppn3D107123223

httpwwwdelphernlnlkrantenresultsindexcoll=dddtitelampcql[]=ppn3D107123223

PPN (107123223)

Build central database

Step 2 Convert Excel into RDF triplestore (=special kind of online database anybody can access)

bull Steps 1-4 from httplinda-projecteulinked-

data-primer-2

bull Step 4 Vocubulary used = Bibframe (httpbibframeorgvocab)

Build central database

Step 3: Link to external resources
• Step 5 from http://linda-project.eu/linked-data-primer-2
• DBpedia = machine-readable, structured version of Wikipedia
• DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data
• DBpedia = hub for linking different data sets on the Web to each other: the Linked Open Data cloud
• We use DBpedia to connect persons & places in our newspaper database to information in other databases

Linked Open Data cloud: http://lod-cloud.net/versions/2010-09-22/lod-cloud_colored.png

http://nl.dbpedia.org/page/Huib_Drion

https://nl.wikipedia.org/wiki/Huib_Drion

http://www.dbnl.org/auteurs/auteur.php?id=drio001

http://www.biografischportaal.nl/persoon/41181342
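
Linking a person in the newspaper database to DBpedia can be expressed as a single triple. The sketch below assumes the usual DBpedia convention that the /resource/ URI identifies the person while /page/ is its human-readable view; the local namespace and person identifier are hypothetical.

```python
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"
BASE = "http://example.org/verzetskranten/person/"  # hypothetical namespace


def dbpedia_uri(wikipedia_title, lang="nl"):
    """Resource URI in a language edition of DBpedia for a Wikipedia article title."""
    return f"http://{lang}.dbpedia.org/resource/{wikipedia_title.replace(' ', '_')}"


def same_as_triple(local_id, wikipedia_title):
    """N-Triples line stating that a local person equals a DBpedia resource."""
    return f"<{BASE}{local_id}> <{OWL_SAME_AS}> <{dbpedia_uri(wikipedia_title)}> ."


print(same_as_triple("huib-drion", "Huib Drion"))
```

Once this link exists, anything other data sets say about the same DBpedia resource becomes reachable from the newspaper database.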

Build central database

Added value of Linked Open Data & DBpedia

Software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in
• KB-catalogue
• Delpher
• De Winkel
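
As a hedged illustration of such an automatic query, the sketch below builds a SPARQL request for one person. The endpoint address and the choice of the DBpedia ontology properties dbo:birthPlace and dbo:birthDate are assumptions; whether the Dutch DBpedia holds these values for a given person is not guaranteed. The code only constructs the request and does not send it.

```python
from urllib.parse import urlencode

ENDPOINT = "http://nl.dbpedia.org/sparql"  # assumed endpoint of the Dutch DBpedia


def person_query(resource_uri):
    """SPARQL query asking for the birth place and birth date of one person."""
    return (
        "PREFIX dbo: <http://dbpedia.org/ontology/>\n"
        "SELECT ?birthPlace ?birthDate WHERE {\n"
        f"  OPTIONAL {{ <{resource_uri}> dbo:birthPlace ?birthPlace }}\n"
        f"  OPTIONAL {{ <{resource_uri}> dbo:birthDate ?birthDate }}\n"
        "}"
    )


def request_url(resource_uri):
    """GET URL that would send the query to the endpoint and ask for JSON."""
    params = {
        "query": person_query(resource_uri),
        "format": "application/sparql-results+json",
    }
    return f"{ENDPOINT}?{urlencode(params)}"


print(person_query("http://nl.dbpedia.org/resource/Huib_Drion"))
```

The same query text works for every person in the database; only the resource URI changes.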

Summary: data about 1300 newspapers
• Available online
• Structured data (RDF triples)
• Open license (CC-BY-SA)
• Open standard (RDF)
• Contextual information
• Relations
  • Titles ↔ Places
  • Titles ↔ Persons
  • Titles ↔ Other titles
• Links between titles, Delpher & KB-cat (via PPNs)
• Links between titles, places & persons and external sources (via DBpedia)

http://5stardata.info/en/

Wikiproject Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

We need a template

Using an article template we can generate 1300 uniform and interlinked Wikipedia articles from the LOD-database

Image source: https://c1.staticflickr.com/9/8281/7699231918_11a7356c38_b.jpg

LOD-database + article template = Wikipedia article

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

Titles from database

Metadata from database

Related persons from database

Related newspapers from database

Winkel-ID from database

Link to full-texts in Delpher from database

Link to KB-catalogue record from database

Fixed categories on WP and Commons

Predefined fixed strings

Grey =
• From database
• Predefined fixed strings

Uniformity between articles guaranteed

All that WP-writers need to add manually
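
The "LOD-database + article template = Wikipedia article" idea can be sketched with Python's standard string templates. The wikitext skeleton, the record layout and the related title are hypothetical and not the project's actual template; only the category name and the example newspaper come from the slides.

```python
from string import Template

# Hypothetical wikitext skeleton: fixed Dutch strings plus database fields.
ARTICLE = Template(
    "'''$title''' was een verzetsblad uit de Tweede Wereldoorlog"
    " dat in $place werd uitgegeven.\n\n"
    "== Gerelateerde kranten ==\n$related\n\n"
    "== Externe links ==\n"
    "* [$delpher Volledige tekst in Delpher]\n"
    "* [$catalogue Record in de KB-catalogus]\n\n"
    "[[Categorie:Illegale pers in de Tweede Wereldoorlog]]\n"
)


def render(record):
    """Fill the fixed skeleton with the fields of one database record."""
    related = "\n".join(f"* [[{t}]]" for t in record["related_titles"])
    return ARTICLE.substitute(
        title=record["title"],
        place=record["place"],
        related=related,
        delpher=record["delpher_url"],
        catalogue=record["catalogue_url"],
    )


record = {
    "title": "De Geus onder studenten",
    "place": "Den Haag",
    "related_titles": ["Voorbeeldblad (verzetsblad)"],  # made-up related title
    "delpher_url": "http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223",
    "catalogue_url": "http://opc4.kb.nl/DB=1/PPN?PPN=107123223",
}
print(render(record))
```

Since every article is rendered from the same skeleton, uniformity follows from the template rather than from manual editing.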

Problem with Delpher (and KB-catalogue)

Véry little contextual information about the newspaper (title)s

Image source: https://thejungleisneutral.files.wordpress.com/2013/11/lost.jpg

The KB can re-use (embed) the Wikipedia content in its own websites to tackle this problem

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_%28verzetsblad%29

Delpher - search results

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

Embedded contextual snippet from Wikipedia:

De geus (onder studenten) was a resistance paper from the Second World War that, from 4 October 1940 up to and including 13 July 1944 … Read more on Wikipedia

Delpher - object presentation

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

Embedded contextual snippet from Wikipedia:

About De Geus onder studenten

De geus (onder studenten) was a resistance paper from the Second World War that was published in The Hague from 4 October 1940 up to and including 13 July 1944. The paper appeared monthly in 1940, 1941 and 1943 and otherwise irregularly, with a circulation between 250 and 8000 copies. It was initially stencilled and, from November 1942, printed; its content consisted mainly of opinion articles.

The paper was published by Jan Drion and Huib Drion, two Leiden…

Read more on Wikipedia
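
One possible way to obtain such a snippet is the MediaWiki API's extracts property, followed by trimming to a fixed length. This is a sketch under assumptions, not a description of how the KB embeds the text: the helper names and the length limit are hypothetical, and the code only builds the request URL without sending it.

```python
from urllib.parse import urlencode

API = "https://nl.wikipedia.org/w/api.php"


def extract_request_url(article_title):
    """URL asking the MediaWiki API for the plain-text intro of one article."""
    params = {
        "action": "query",
        "prop": "extracts",
        "exintro": 1,       # only the text before the first section heading
        "explaintext": 1,   # plain text instead of HTML
        "titles": article_title,
        "format": "json",
    }
    return f"{API}?{urlencode(params)}"


def snippet(text, max_chars=120):
    """Cut an intro text at a word boundary and mark the cut with an ellipsis."""
    if len(text) <= max_chars:
        return text
    return text[:max_chars].rsplit(" ", 1)[0] + " …"


print(extract_request_url("De Geus onder studenten (verzetsblad)"))
```

A short limit would suit the search-results view and a longer one the object presentation, each followed by a link back to the full article.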

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web - 20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together.
• https://en.wikipedia.org/wiki/Linked_data - Wikipedia article related to the above video
• http://5stardata.info/en/ - The 5 stars of Linked Open Data (Tim Berners-Lee)
• http://linda-project.eu/linked-data-primer-2 - Short primer about creating LOD in practice, starting from an Excel sheet
• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31 - The figure near the bottom of the first page is a good illustration of the concept of (linked) triples
• https://en.wikipedia.org/wiki/DBpedia
• https://en.wikipedia.org/wiki/Semantic_network

Image source: http://www.gettyimages.nl/detail/nieuwsfotos/three-women-of-the-ats-light-up-together-ats-regulations-nieuwsfotos/3094265

Questions?

olaf.janssen@kb.nl - @ookgezellig

tinyurl.com/verzetskranten

Page 49: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

Wikipedia article about De Winkel: http://nl.wikipedia.org/wiki/De_ondergrondse_pers_1940-1945

Wikipedia article about the author: http://nl.wikipedia.org/wiki/Lydia_Winkel

Winkel: the plusses
• Available online (PDF, flat file)
• Open license (CC-BY-SA)
• Contextual information
• Relations
  • Titles ↔ Places
  • Titles ↔ Persons
  • Titles ↔ Other titles

Image source: http://www.archives.gov/research/military/ww2/photos/images/ww2-194.jpg

Image source: http://2.bp.blogspot.com/_BWzuYwiS6-I/TMgeRsFd3mI/AAAAAAAAElw/3cvgbZSPWcs/s1600/doctor+macro+judy+scared.jpg

Winkel: the minuses
• Unstructured data (PDF, flat file)
• Not very machine-readable (unlike CSV, XML, JSON, RDF)
• PDF is no (real) open standard (unlike CSV, XML, JSON, RDF)
• No links between titles, Delpher & KB-cat
• No links between titles, places & persons and external sources (like Wikipedia)

To summarize: a lot of information is available about these WW2 underground newspapers
1. Metadata (KB-cat)
2. Content (full-text, Delpher)
3. Context (Winkel PDF)
4. Relations: titles, places, persons, other titles (Winkel PDF)
5. External resources about titles, places and persons

but the data sources are unconnected (and, for 3+4, unstructured & not machine-readable), making discovery, understanding & research of these newspapers (and related places & persons) more difficult than necessary

Good news: we can solve all these issues


De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog

dat vanaf 4 oktober 1940 tot en met 13 juli 1944 hellip Lees verder op Wikipedia

Embedded contextual snippet from Wikipedia

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

Delpher - search results

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog

dat vanaf 4 oktober 1940 tot en met 13 juli 1944 hellip Lees verder op Wikipedia

Delpher - object presentation

httpresolverkbnlresolveurn=ddd010424553mpeg21p001

Over De Geus onder studenten

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog dat vanaf 4 oktober 1940 tot en met 13 juli 1944 in Den Haag werd uitgegeven Het blad verscheen in 1940 1941 en 1943 maandelijks verder onregelmatig in een oplage tussen de 250 en 8000 exemplaren Het werd aanvankelijk gestencild en vanaf november 1942 gedrukt en de inhoud bestond voornamelijk uit opinie-artikelen

Het blad werd uitgegeven door Jan Drion en Huib Drion twee Leidsehellip

Lees verder op Wikipedia

Embedded contextual snippet from Wikipedia

httpresolverkbnlresolveurn=ddd010424553mpeg21p001

Delpher - object presentation

Suggested reading

bull httpwwwtedcomtalkstim_berners_lee_on_the_next_web 20 years ago Tim Berners-Lee invented the World Wide Web For his next project hes building a web for open linked data that could do for numbers what the Web did for words pictures video unlock our data and reframe the way we use it together

bull httpsenwikipediaorgwikiLinked_data Wikipedia article related to the above video

bull http5stardatainfoen The 5 stars of Linked Open Data (Tim Berners-Lee)

bull httplinda-projecteulinked-data-primer-2 Short primer about creating LOD in practice starting from an Excel sheet

bull httpwwwprogrammablewebcomnewshow-linked-data-solved-digital-age-marketing-problemanalysis20150831 The figure near the bottom of the first page is a good illustration of the concept of (linked) triples

bull httpsenwikipediaorgwikiDBpedia

bull httpsenwikipediaorgwikiSemantic_network

htt

p

ww

wg

etty

imag

esn

ld

etai

ln

ieu

wsf

oto

st

hre

e-w

om

en-o

f-th

e-at

s-lig

ht-

up

-to

geth

er-a

ts-r

egu

lati

on

s-n

ieu

wsf

oto

s3

09

42

65

Questions

olafjanssenkbnl - ookgezellig

tinyurlcomverzetskranten

htt

p

ww

wg

etty

imag

esn

ld

etai

ln

ieu

wsf

oto

st

hre

e-w

om

en-o

f-th

e-at

s-lig

ht-

up

-to

geth

er-a

ts-r

egu

lati

on

s-n

ieu

wsf

oto

s3

09

42

65


Winkel: the minuses

• Unstructured data (PDF, flat file)
• Not very machine-readable (unlike CSV, XML, JSON, RDF)
• PDF is not a (real) open standard (unlike CSV, XML, JSON, RDF)
• No links between titles and Delpher & KB-cat
• No links from titles, places & persons to external sources (like Wikipedia)

To summarize

A lot of information is available about these WW2 underground newspapers:

1. Metadata (KB-cat)
2. Content (full text, Delpher)
3. Context (Winkel PDF)
4. Relations: titles, places, persons, other titles (Winkel PDF)
5. External resources about titles, places and persons

but the data sources are unconnected (and for 3 + 4, unstructured & not machine-readable), making discovery, understanding & research of these newspapers (and related places & persons) more difficult than necessary.

Good news

We can solve all these issues.

Wikiproject Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

tinyurl.com/verzetskranten (in Dutch)

Wikiproject: https://nl.wikipedia.org/wiki/Wikipedia:Wikiproject, https://en.wikipedia.org/wiki/Wikipedia:Wikiproject

From 14 to 1300 titles

https://nl.wikipedia.org/wiki/Categorie:Illegale_pers_in_de_Tweede_Wereldoorlog

We need a database

Build central database

https://www.youtube.com/watch?v=GVDGuCjog_0

Step 1: Create an Excel sheet with

• Metadata about the newspaper (from Winkel PDF): Winkel-ID, title, place of publication, other metadata
• Unique Wikipedia article title = <Newspaper title> (verzetsblad, <Place of publication>)
• Contextual info, incl. related persons & related newspaper titles (from Winkel PDF)
• PPN: unique ID linking the newspaper to the KB-catalogue & Delpher

Example: PPN 107123223

• KB-catalogue: http://opc4.kb.nl/DB=1/PPN?PPN=107123223
• Delpher: http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223
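The derived columns of such a spreadsheet row can be sketched in Python. This is only an illustration, not the project's actual tooling: the function names and the newspaper title "Voorbeeldkrant" are hypothetical, the comma in the article title is an assumption, and the two URL patterns are inferred from the single PPN example on the slides.

```python
from urllib.parse import quote


def article_title(newspaper_title: str, place: str) -> str:
    """Build the unique Wikipedia article title for one newspaper."""
    # Pattern from the slide: <Newspaper title> (verzetsblad, <Place of publication>)
    return f"{newspaper_title} (verzetsblad, {place})"


def catalogue_url(ppn: str) -> str:
    """KB-catalogue record for a PPN (pattern assumed from the slide example)."""
    return f"http://opc4.kb.nl/DB=1/PPN?PPN={ppn}"


def delpher_url(ppn: str) -> str:
    """Delpher title search for a PPN (pattern assumed from the slide example)."""
    query = quote(f"ppn={ppn}", safe="")  # "=" is percent-encoded as "%3D"
    return f"http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]={query}"


# "Voorbeeldkrant" is a made-up title; the PPN is the one shown on the slides.
row = {
    "title": article_title("Voorbeeldkrant", "Den Haag"),
    "ppn": "107123223",
}
row["catalogue"] = catalogue_url(row["ppn"])
row["delpher"] = delpher_url(row["ppn"])
```

Because both links are derived from the PPN alone, the PPN is the only identifier the sheet needs to store for each title.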

Build central database

Step 2: Convert the Excel sheet into an RDF triplestore (= special kind of online database anybody can access)

• Steps 1-4 from http://linda-project.eu/linked-data-primer-2
• Step 4: Vocabulary used = Bibframe (http://bibframe.org/vocab)
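To make the idea of triples concrete, here is a minimal sketch that turns one spreadsheet row into N-Triples lines (subject, predicate, object). It is an assumption-laden illustration: the `example.org` namespaces and the column names are hypothetical placeholders, not the Bibframe terms or URIs the project actually used.

```python
from __future__ import annotations


def literal(value: str) -> str:
    """Serialise a plain string as an N-Triples literal."""
    escaped = (
        value.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )
    return f'"{escaped}"'


def row_to_triples(row: dict, base: str, vocab: str) -> list[str]:
    """Turn one spreadsheet row into N-Triples lines."""
    subject = f"<{base}{row['ppn']}>"
    triples = []
    for column, value in row.items():
        if column == "ppn" or not value:
            continue  # the PPN is already part of the subject URI
        triples.append(f"{subject} <{vocab}{column}> {literal(value)} .")
    return triples


# Hypothetical namespaces and column names, for illustration only.
BASE = "http://example.org/newspaper/"
VOCAB = "http://example.org/vocab/"

row = {"ppn": "107123223", "title": "De Geus onder studenten", "place": "Den Haag"}
triples = row_to_triples(row, BASE, VOCAB)
print("\n".join(triples))
```

Each column becomes one predicate, so every cell of the sheet ends up as a separate statement about the newspaper identified by its PPN.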

Build central database

Step 3: Link to external resources

• Step 5 from http://linda-project.eu/linked-data-primer-2
• DBpedia = machine-readable, structured version of Wikipedia
• DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data
• DBpedia = hub for linking different data sets on the Web to each other (Linked Open Data cloud: http://lod-cloud.net/versions/2010-09-22/lod-cloud_colored.png)
• We use DBpedia to connect persons & places in our newspaper database to information in other databases

• http://nl.dbpedia.org/page/Huib_Drion
• https://nl.wikipedia.org/wiki/Huib_Drion
• http://www.dbnl.org/auteurs/auteur.php?id=drio001
• http://www.biografischportaal.nl/persoon/41181342

Build central database

Added value of Linked Open Data & DBpedia

Software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in

• KB-catalogue
• Delpher
• De Winkel
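As a rough sketch of what "software can automatically query" might look like, the Python below builds a SPARQL request for one person. Several things here are assumptions rather than facts from the slides: the endpoint address, the `/resource/` form of the DBpedia URI, and the `format` parameter (a convention of the Virtuoso server that DBpedia runs on). The request is only constructed, not sent.

```python
from urllib.parse import urlencode

# Assumed endpoint and resource URI; both follow common DBpedia conventions
# but are not given on the slides.
ENDPOINT = "http://nl.dbpedia.org/sparql"
RESOURCE = "http://nl.dbpedia.org/resource/Huib_Drion"


def build_query(resource: str, limit: int = 25) -> str:
    """SPARQL query listing properties and values of one resource."""
    return (
        f"SELECT ?property ?value WHERE {{ <{resource}> ?property ?value }} "
        f"LIMIT {limit}"
    )


def request_url(endpoint: str, query: str) -> str:
    """GET URL for a SPARQL endpoint, asking for JSON results."""
    params = {"query": query, "format": "application/sparql-results+json"}
    return f"{endpoint}?{urlencode(params)}"


url = request_url(ENDPOINT, build_query(RESOURCE))
print(url)
```

Fetching that URL with any HTTP client would return the statements DBpedia holds about the person, which is the kind of extra information none of the three KB sources carries.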

Summary: data about 1300 newspapers

• Available online
• Structured data (RDF triples)
• Open license (CC-BY-SA)
• Open standard (RDF)
• Contextual information: links between titles and Delpher & KB-cat (via PPNs)
• Relations: links between titles and places, titles and persons, titles and other titles (PPNs)
• Links between places & persons and external sources (via DBpedia)

http://5stardata.info/en/

Wikiproject Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

We need a template

Using an article template we can generate 1300 uniform and interlinked Wikipedia articles from the LOD-database


LOD-database + article template = Wikipedia article

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

From the database:

• Titles
• Metadata
• Related persons
• Related newspapers
• Winkel-ID
• Link to full texts in Delpher
• Link to KB-catalogue record

Predefined fixed strings:

• Fixed categories on WP and Commons

Grey = from database or predefined fixed strings

Uniformity between articles guaranteed

The rest is all that WP-writers need to add manually
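The "database + template = article" step can be sketched with Python's standard `string.Template`. This is a hypothetical, heavily simplified template; the real one used in the project is not shown on the slides. The Dutch fixed strings are assumptions modelled on the opening sentence of the example article, and the field names are invented for the example.

```python
from string import Template

# Hypothetical, simplified article template (Dutch, as the target is Dutch Wikipedia).
ARTICLE = Template(
    "'''$title''' was een verzetsblad uit de Tweede Wereldoorlog "
    "dat in $place werd uitgegeven.\n\n"
    "== Betrokken personen ==\n$persons\n\n"
    "== Externe links ==\n"
    "* [$delpher Volledige tekst in Delpher]\n"
    "* [$catalogue Record in de KB-catalogus]\n"
)


def render(record: dict) -> str:
    """Fill the fixed template strings with values from one database record."""
    persons = "\n".join(f"* [[{name}]]" for name in record["persons"])
    return ARTICLE.substitute(
        title=record["title"],
        place=record["place"],
        persons=persons,
        delpher=record["delpher"],
        catalogue=record["catalogue"],
    )


record = {
    "title": "De Geus onder studenten",
    "place": "Den Haag",
    "persons": ["Jan Drion", "Huib Drion"],
    "delpher": "http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223",
    "catalogue": "http://opc4.kb.nl/DB=1/PPN?PPN=107123223",
}
article = render(record)
print(article)
```

Because every article is rendered from the same template, the fixed strings and section order are identical across titles, and only the record values differ.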

Problem with Delpher (and KB-catalogue)

Very little contextual information about the newspaper (title)s

The KB can re-use (embed) the Wikipedia content in its own websites to tackle this problem

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

Delpher - search results

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

Embedded contextual snippet from Wikipedia (translated from Dutch):

De Geus (onder studenten) was a resistance paper from the Second World War that, from 4 October 1940 until 13 July 1944 … Read more on Wikipedia

Delpher - object presentation

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

Embedded contextual snippet from Wikipedia (translated from Dutch):

About De Geus onder studenten

De Geus (onder studenten) was a resistance paper from the Second World War, published in The Hague from 4 October 1940 until 13 July 1944. It appeared monthly in 1940, 1941 and 1943 and irregularly otherwise, in print runs of between 250 and 8000 copies. It was initially stencilled and, from November 1942, printed; its content consisted mainly of opinion articles.

The paper was published by Jan Drion and Huib Drion, two Leiden …

Read more on Wikipedia
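The short search-result snippet can be approximated by trimming an article's introduction at a word boundary and appending a link back to Wikipedia. The sketch below assumes the intro text has already been obtained somehow; the function name and the character limit are hypothetical, and this is not how Delpher is known to implement it.

```python
from html import escape


def snippet(text: str, url: str, max_chars: int = 120) -> str:
    """Cut an article intro at a word boundary and append a read-more link."""
    text = " ".join(text.split())  # normalise whitespace
    if len(text) > max_chars:
        # Drop the last, possibly partial, word before adding the ellipsis.
        text = text[:max_chars].rsplit(" ", 1)[0] + " …"
    return f'{escape(text)} <a href="{escape(url, quote=True)}">Lees verder op Wikipedia</a>'


intro = (
    "De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog "
    "dat vanaf 4 oktober 1940 tot en met 13 juli 1944 in Den Haag werd uitgegeven."
)
html = snippet(intro, "https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)")
print(html)
```

A larger `max_chars` would give the longer variant shown on the object presentation page, using the same function.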

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web : 20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together.
• https://en.wikipedia.org/wiki/Linked_data : Wikipedia article related to the above video
• http://5stardata.info/en/ : The 5 stars of Linked Open Data (Tim Berners-Lee)
• http://linda-project.eu/linked-data-primer-2 : Short primer about creating LOD in practice, starting from an Excel sheet
• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31 : The figure near the bottom of the first page is a good illustration of the concept of (linked) triples
• https://en.wikipedia.org/wiki/DBpedia
• https://en.wikipedia.org/wiki/Semantic_network


Questions?

olaf.janssen@kb.nl - @ookgezellig

tinyurl.com/verzetskranten




Page 55: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

We can solve all these issues

Good news

Wikiproject Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

tinyurl.com/verzetskranten (in Dutch)

About Wikiprojects: https://nl.wikipedia.org/wiki/Wikipedia:Wikiproject and https://en.wikipedia.org/wiki/Wikipedia:Wikiproject

From 14 to 1300 titles

https://nl.wikipedia.org/wiki/Categorie:Illegale_pers_in_de_Tweede_Wereldoorlog


We need a database

Build central database

https://www.youtube.com/watch?v=GVDGuCjog_0

Step 1: Create Excel sheet with

• Metadata about the newspaper (from Winkel PDF): Winkel-ID, title, place of publication, other metadata
• Unique Wikipedia article title = <Newspaper title> (verzetsblad, <Place of publication>)
• Contextual info, incl. related persons & related newspaper titles (from Winkel PDF)
• PPN: unique ID linking the newspaper to KB-catalogue & Delpher
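The naming rule for the article title column can be sketched in a few lines of Python. This is only an illustration: the function names are made up for this example, the comma inside the parentheses is an assumption about the pattern on the slide, and leaving out the place when none is given is an assumption based on the example article "De Geus onder studenten (verzetsblad)".

```python
from typing import Optional


def article_title(newspaper_title: str, place: Optional[str] = None) -> str:
    """Return '<title> (verzetsblad, <place>)', or '<title> (verzetsblad)' without a place."""
    title = " ".join(newspaper_title.split())  # collapse stray whitespace from the sheet
    if place and place.strip():
        return f"{title} (verzetsblad, {place.strip()})"
    return f"{title} (verzetsblad)"


def duplicate_titles(titles: list) -> set:
    """Return every article title that occurs more than once, so it can be disambiguated."""
    seen, duplicates = set(), set()
    for title in titles:
        if title in seen:
            duplicates.add(title)
        seen.add(title)
    return duplicates
```

Running duplicate_titles over the whole column is one simple way to check that every article title really is unique before anything is generated from the sheet.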

PPN (107123223) in the KB-catalogue: http://opc4.kb.nl/DB=1/PPN?PPN=107123223

PPN (107123223) in Delpher: http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223
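Because the PPN is the only variable part of both links, the two URLs can be derived from it mechanically. A minimal sketch, assuming the URL patterns are exactly the ones shown on the slides (they may have changed since 2016); the function names are invented for this example.

```python
from urllib.parse import quote

# URL patterns as they appear on the slides; treat them as assumptions, not a stable API.
KB_CATALOGUE = "http://opc4.kb.nl/DB=1/PPN?PPN={ppn}"
DELPHER = "http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]={query}"


def catalogue_url(ppn: str) -> str:
    """Link from a PPN to the record in the KB-catalogue."""
    return KB_CATALOGUE.format(ppn=ppn)


def delpher_url(ppn: str) -> str:
    """Link from a PPN to the full texts in Delpher."""
    # The query "ppn=<id>" is percent-encoded, so "=" becomes "%3D" as in the slide's URL.
    return DELPHER.format(query=quote(f"ppn={ppn}", safe=""))


print(catalogue_url("107123223"))
print(delpher_url("107123223"))
```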

Build central database

Step 2: Convert Excel into RDF triplestore (= special kind of online database anybody can access)

• Steps 1-4 from http://linda-project.eu/linked-data-primer-2
• Step 4: Vocabulary used = Bibframe (http://bibframe.org/vocab)
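To give an idea of what "converting a sheet into triples" means, here is a small sketch that turns rows exported as CSV into N-Triples lines. It does not reproduce the primer's tooling: the namespace http://example.org/verzetskranten/ and the predicate names are hypothetical placeholders, not Bibframe terms, and the column names are invented for this example. The sample row uses the PPN, title and place that appear elsewhere in these slides.

```python
import csv
import io
from typing import Dict, List

BASE = "http://example.org/verzetskranten/"  # hypothetical namespace
VOCAB = BASE + "vocab/"                       # hypothetical predicates, NOT Bibframe terms


def literal(value: str) -> str:
    """Format a plain string as an N-Triples literal."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def row_to_triples(row: Dict[str, str]) -> List[str]:
    """One subject per newspaper (identified by PPN), one triple per filled-in cell."""
    subject = f"<{BASE}title/{row['ppn']}>"
    triples = []
    for column in ("title", "place", "winkel_id"):
        if row.get(column):  # empty cells produce no triple
            triples.append(f"{subject} <{VOCAB}{column}> {literal(row[column])} .")
    return triples


def sheet_to_ntriples(text: str) -> List[str]:
    """Convert a CSV export of the sheet into a list of N-Triples lines."""
    triples = []
    for row in csv.DictReader(io.StringIO(text)):
        triples.extend(row_to_triples(row))
    return triples


SHEET = """ppn,title,place,winkel_id
107123223,De Geus onder studenten,Den Haag,
"""

for line in sheet_to_ntriples(SHEET):
    print(line)
```

Each printed line is one subject-predicate-object statement; a triplestore is in essence a database that stores and queries such statements.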

Build central database

Step 3: Link to external resources

• Step 5 from http://linda-project.eu/linked-data-primer-2
• DBpedia = machine-readable structured version of Wikipedia
• DBpedia = hub for linking different data sets on the Web to each other (Linked Open Data cloud: http://lod-cloud.net/versions/2010-09-22/lod-cloud_colored.png)
• DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data
• We use DBpedia to connect persons & places in our newspaper database to information in other databases

Example: Huib Drion

http://nl.dbpedia.org/page/Huib_Drion

https://nl.wikipedia.org/wiki/Huib_Drion

http://www.dbnl.org/auteurs/auteur.php?id=drio001

http://www.biografischportaal.nl/persoon/41181342
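One common way to express such a link in RDF is an owl:sameAs statement between the local record for a person and the matching DBpedia resource. The sketch below assumes that approach; the slides do not say which linking property the project actually used. The local namespace is hypothetical, and the DBpedia URI is derived from the Wikipedia article title using the usual /resource/ convention.

```python
from urllib.parse import quote

OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"
LOCAL = "http://example.org/verzetskranten/person/"  # hypothetical local namespace
DBPEDIA_NL = "http://nl.dbpedia.org/resource/"


def dbpedia_uri(wikipedia_title: str) -> str:
    """Derive the Dutch DBpedia resource URI from a Wikipedia article title."""
    return DBPEDIA_NL + quote(wikipedia_title.replace(" ", "_"), safe="_(),")


def same_as_triple(local_id: str, wikipedia_title: str) -> str:
    """State that a local person record and a DBpedia resource describe the same person."""
    return f"<{LOCAL}{local_id}> <{OWL_SAME_AS}> <{dbpedia_uri(wikipedia_title)}> ."


print(same_as_triple("huib-drion", "Huib Drion"))
```

Once such a link exists, software can follow it from the newspaper database to DBpedia, and from there to whatever other data sets DBpedia links to.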

Build central database

Added value of Linked Open Data & DBpedia: software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in

• KB-catalogue
• Delpher
• De Winkel

Summary: data about 1300 newspapers

• Available online: structured data (RDF-triples)
• Open license (CC-BY-SA), open standard (RDF)
• Contextual information: links between titles and Delpher & KB-catalogue (via PPNs)
• Relations: titles to places, titles to persons, titles to other titles
• Links between places & persons and external sources (via DBpedia)

http://5stardata.info/en/

Wikiproject Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

We need a template

Using an article template we can generate 1300 uniform and interlinked Wikipedia articles from the LOD-database

Image: https://c1.staticflickr.com/9/8281/7699231918_11a7356c38_b.jpg

LOD-database + article template = Wikipedia article

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

From database:

• Titles
• Metadata
• Related persons
• Related newspapers
• Winkel-ID
• Link to full-texts in Delpher
• Link to KB-catalogue record

Predefined fixed strings:

• Fixed categories on WP and Commons

Grey = from database or predefined fixed strings. Uniformity between articles guaranteed.

Everything else is all that WP-writers need to add manually.
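The idea of "database + template = article" can be illustrated with Python's string.Template. This is a toy template, not the one the project used: its wording and section headings are invented, the link patterns are the ones from the PPN slides, and the category name is taken from the category URL shown earlier.

```python
from string import Template

# Hypothetical article skeleton: fixed strings plus $placeholders filled from the database.
ARTICLE_TEMPLATE = Template("""'''$title''' ($place)

== Related persons ==
$persons

== External links ==
* [http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D$ppn Full texts in Delpher]
* [http://opc4.kb.nl/DB=1/PPN?PPN=$ppn KB-catalogue record]

[[Categorie:Illegale pers in de Tweede Wereldoorlog]]
""")


def render(record: dict) -> str:
    """Fill the fixed article skeleton with the values of one database record."""
    persons = "\n".join(f"* [[{name}]]" for name in record["persons"])
    return ARTICLE_TEMPLATE.substitute(
        title=record["title"],
        place=record["place"],
        persons=persons,
        ppn=record["ppn"],
    )


RECORD = {
    "title": "De Geus onder studenten",
    "place": "Den Haag",
    "persons": ["Jan Drion", "Huib Drion"],
    "ppn": "107123223",
}

print(render(RECORD))
```

Because every article comes out of the same skeleton, links and categories are uniform by construction, and writers only have to add the free text.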

Problem with Delpher (and KB-catalogue)

Very little contextual information about the newspaper (title)s

Image: https://thejungleisneutral.files.wordpress.com/2013/11/lost.jpg

The KB can re-use (embed) the Wikipedia content in its own websites to tackle this problem

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_%28verzetsblad%29

Delpher - search results

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

Embedded contextual snippet from Wikipedia:

De Geus (onder studenten) was a resistance paper from the Second World War that from 4 October 1940 through 13 July 1944 … Read more on Wikipedia

Delpher - object presentation

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

Embedded contextual snippet from Wikipedia:

About De Geus onder studenten

De Geus (onder studenten) was a resistance paper from the Second World War that was published in The Hague from 4 October 1940 through 13 July 1944. The paper appeared monthly in 1940, 1941 and 1943 and otherwise irregularly, with a circulation between 250 and 8000 copies. It was initially stencilled and, from November 1942, printed; its content consisted mainly of opinion articles.

The paper was published by Jan Drion and Huib Drion, two Leiden…

Read more on Wikipedia
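How a short snippet like the ones above could be cut from a longer Wikipedia introduction is sketched below. The slides do not describe how the embedding is implemented, so this only shows the shortening step under simple assumptions: the introduction text is already available as a string, the function name is invented, and the cut is made at a word boundary.

```python
def make_snippet(extract: str, article_url: str, max_chars: int = 80) -> str:
    """Shorten an article introduction and append a 'Read more' link to the full article."""
    text = " ".join(extract.split())  # normalise whitespace and line breaks
    if len(text) > max_chars:
        # Cut at the last space before the limit so no word is split in half.
        text = text[:max_chars].rsplit(" ", 1)[0] + " …"
    return f"{text} Read more on Wikipedia: {article_url}"


INTRO = (
    "De Geus (onder studenten) was a resistance paper from the Second World War "
    "that was published in The Hague from 4 October 1940 through 13 July 1944."
)

print(make_snippet(INTRO, "https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)"))
```

A short limit suits a search results list, while an object page has room for a longer excerpt, so max_chars is left as a parameter.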

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web
  20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together.
• https://en.wikipedia.org/wiki/Linked_data
  Wikipedia article related to the above video
• http://5stardata.info/en/
  The 5 stars of Linked Open Data (Tim Berners-Lee)
• http://linda-project.eu/linked-data-primer-2
  Short primer about creating LOD in practice, starting from an Excel sheet
• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31
  The figure near the bottom of the first page is a good illustration of the concept of (linked) triples
• https://en.wikipedia.org/wiki/DBpedia
• https://en.wikipedia.org/wiki/Semantic_network

Questions

olaf.janssen@kb.nl - @ookgezellig

tinyurl.com/verzetskranten

Image: http://www.gettyimages.nl/detail/nieuwsfoto's/three-women-of-the-ats-light-up-together-ats-regulations-nieuwsfotos/3094265

Page 58: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

Wikiproject() Verzetskranten

Systematically and uniformly describe amp link all 1300 Dutch underground newspapers from WW2

on Dutch Wikipedia

tinyurlcomverzetskranten (in Dutch)

We need a database

Build central database

httpswwwyoutubecomwatchv=GVDGuCjog_0

Build central database

httpswwwyoutubecomwatchv=GVDGuCjog_0

Step 1 Create Excel-sheet with bull Metadata about newspaper (from Winkel PDF)

bull Unique Wikipedia article title

bull Contextual info incl related persons amp titles (from Winkel PDF)

bull PPN unique ID linking newspaper to

KB-catalogue amp Delpher

Winkel-ID

Place of publication

Other metadata

Title

Unique Wikipedia article title =

ltNewspaper titlegt (verzetsblad ltPlace of publicationgt)

bull Contextual info bull Related persons

bull Related newspaper titles

PPN (107123223)

httpopc4kbnlDB=1PPNPPN=107123223

httpopc4kbnlDB=1PPNPPN=107123223

PPN (107123223)

PPN (107123223)

httpwwwdelphernlnlkrantenresultsindexcoll=dddtitelampcql[]=ppn3D107123223

httpwwwdelphernlnlkrantenresultsindexcoll=dddtitelampcql[]=ppn3D107123223

PPN (107123223)

Build central database

Step 2 Convert Excel into RDF triplestore (=special kind of online database anybody can access)

bull Steps 1-4 from httplinda-projecteulinked-

data-primer-2

bull Step 4 Vocubulary used = Bibframe (httpbibframeorgvocab)

Build central database

Step 3 Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

bull DBpedia = hub for linking different data sets on the Web to each

other Linked Open Data cloud

bull Connect persons amp places in newspaper database to external resources via DBpedia

Step 1c Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data

bull Connect persons amp places in newspaper database to external

resources via DBpedia

httplod-cloudnetversions2010-09-22lod-cloud_coloredpng

Linked Open Data cloud

Build central database

Step 3 Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

bull DBpedia = hub for linking different data sets on the Web to each

other Linked Open Data cloud

bull We use DBpdia to connect persons amp places in our newspaper database to information in other databases

httpnldbpediaorgpageHuib_Drion

httpsnlwikipediaorgwikiHuib_Drion

httpwwwdbnlorgauteursauteurphpid=drio001

httpwwwbiografischportaalnlpersoon41181342

Build central database

Added value of Linked Open Data amp DBpedia Software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in bull KB-catalogue bull Delpher bull De Winkel

Summary data about 1300 newspapers

Available online Structured data (RDF-triples) Open license (CC-BY-SA) Open standard (RDF) Contextual information Links between titles Delpher amp KB-cat Relations Links between titles places

bull Titles Places amp persons external bull Titles Persons sources (via DBpedia) bull Titles Other titles

Summary data about 1300 newspapers

Available online Structured data (RDF-triples) Open license (CC-BY-SA) Open standard (RDF) Contextual information Links between titles Delpher amp KB-cat Relations Links between titles places

bull Titles Places amp persons external bull Titles Persons sources (via DBpedia) bull Titles Other titles

Summary data about 1300 newspapers

Available online Structured data (RDF-triples)

Open license (CC-BY-SA) Open standard (RDF)

Contextual information Links between titles Delpher amp KB-cat (via PPNs)

Relations Links between places

bull Titles Places amp persons external bull Titles Persons sources (via DBpedia)

bull Titles Other titles

(PPNs)

http5stardatainfoen

Wikiproject Verzetskranten

Systematically and uniformly describe amp link all 1300 Dutch underground newspapers from WW2

on Dutch Wikipedia

We need a template

Using an article template we can generate 1300 uniform and interlinked Wikipedia articles

from the LOD-database

htt

ps

c1

sta

ticf

lickr

co

m9

82

81

76

99

23

19

18

_11

a73

56

c38

_bjp

g

LOD-database + article template = Wikipedia article

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

Titles from database

Metadata from database

Related persons from database

Related newspapers from database

Winkel-ID from database

Link to full-texts in Delpher from database

Link to KB-catalogue record from database

Fixed categories on WP and Commons

Predefined fixed strings

Grey = bull From database bull Predefined fixed strings

Uniformity between articles guaranteed

All that WP-writers need to add manually

Problem with Delpher (and KB-catalogue)

Veacutery little contextual information about the newspaper(title)s

httpsthejungleisneutralfileswordpresscom201311lostjpg

Problem with Delpher (and KB-catalogue)

Veacutery little contextual information about the newspaper(title)s

httpsthejungleisneutralfileswordpresscom201311lostjpg

The KB can re-use (embed) the Wikipedia content in its own

websites to tackle this problem

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_28verzetsblad29

Delpher - search results

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

De geus (onder studenten) was a resistance paper from the Second World War, published from 4 October 1940 until 13 July 1944 … Read more on Wikipedia

Embedded contextual snippet from Wikipedia

Delpher - object presentation

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

About De Geus onder studenten

De geus (onder studenten) was a resistance paper from the Second World War, published in The Hague from 4 October 1940 until 13 July 1944. The paper appeared monthly in 1940, 1941 and 1943, and otherwise irregularly, in a print run of between 250 and 8000 copies. It was initially stencilled and, from November 1942, printed; its content consisted mainly of opinion articles.

The paper was published by Jan Drion and Huib Drion, two Leiden…

Read more on Wikipedia

Embedded contextual snippet from Wikipedia
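A minimal sketch of how such an embedded snippet could be produced. The slides do not say how the KB fetches the text. The sketch assumes the public MediaWiki API with the TextExtracts parameters (`prop=extracts`, `exintro`, `explaintext`). The function names and the `max_chars` cut-off are made up for illustration, and no network request is made here.

```python
from urllib.parse import quote, urlencode


def extract_api_url(title: str, lang: str = "nl") -> str:
    """Build a MediaWiki API URL that asks for the plain-text article intro."""
    params = {
        "action": "query",
        "prop": "extracts",
        "exintro": 1,
        "explaintext": 1,
        "titles": title,
        "format": "json",
    }
    return f"https://{lang}.wikipedia.org/w/api.php?" + urlencode(params)


def make_snippet(extract: str, title: str, max_chars: int = 120,
                 lang: str = "nl",
                 more_label: str = "Read more on Wikipedia") -> str:
    """Cut the intro at a word boundary and append a link to the full article."""
    if len(extract) > max_chars:
        extract = extract[:max_chars].rsplit(" ", 1)[0] + " …"
    link = f"https://{lang}.wikipedia.org/wiki/" + quote(title.replace(" ", "_"))
    return f"{extract} {more_label}: {link}"


title = "De Geus onder studenten (verzetsblad)"
intro = ("De geus (onder studenten) was a resistance paper from the Second "
         "World War, published in The Hague from 4 October 1940 until "
         "13 July 1944.")

api_url = extract_api_url(title)
snippet = make_snippet(intro, title)
print(api_url)
print(snippet)
```

The short form suits the search results page; a larger `max_chars` gives the longer text shown on the object presentation page.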

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web 20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together.

• https://en.wikipedia.org/wiki/Linked_data Wikipedia article related to the above video

• http://5stardata.info/en/ The 5 stars of Linked Open Data (Tim Berners-Lee)

• http://linda-project.eu/linked-data-primer-2 Short primer about creating LOD in practice, starting from an Excel sheet

• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31 The figure near the bottom of the first page is a good illustration of the concept of (linked) triples

• https://en.wikipedia.org/wiki/DBpedia

• https://en.wikipedia.org/wiki/Semantic_network

Image: gettyimages.nl, "Three women of the ATS light up together" (ATS regulations), image 3094265

Questions

olaf.janssen@kb.nl - @ookgezellig

tinyurl.com/verzetskranten

Page 59: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

Build central database

https://www.youtube.com/watch?v=GVDGuCjog_0

Step 1: Create Excel-sheet with
• Metadata about newspaper (from Winkel PDF)
• Unique Wikipedia article title
• Contextual info incl. related persons & titles (from Winkel PDF)
• PPN: unique ID linking newspaper to KB-catalogue & Delpher

Winkel-ID

Place of publication

Other metadata

Title

Unique Wikipedia article title = <Newspaper title> (verzetsblad, <Place of publication>)

Contextual info
• Related persons
• Related newspaper titles

PPN (107123223)

http://opc4.kb.nl/DB=1/PPN?PPN=107123223

http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223
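The two derived values in the Excel sheet, the unique article title and the PPN-based links, can be sketched as small helper functions. The function names are hypothetical, the comma in the title suffix is an assumption about the naming pattern, and the URL patterns are taken from the example links for PPN 107123223 on the slide.

```python
from typing import Optional
from urllib.parse import quote


def article_title(title: str, place: Optional[str] = None) -> str:
    """Unique Wikipedia title: '<title> (verzetsblad[, <place>])'."""
    # Assumption: the place is only needed when the bare title is ambiguous.
    suffix = f"verzetsblad, {place}" if place else "verzetsblad"
    return f"{title} ({suffix})"


def catalogue_url(ppn: str) -> str:
    """Link to the KB-catalogue record for this PPN."""
    return f"http://opc4.kb.nl/DB=1/PPN?PPN={ppn}"


def delpher_url(ppn: str) -> str:
    """Link to the Delpher search results for this PPN ('=' becomes %3D)."""
    query = quote(f"ppn={ppn}", safe="")
    return ("http://www.delpher.nl/nl/kranten/results/index"
            f"?coll=dddtitel&cql[]={query}")


ppn = "107123223"
print(article_title("De Geus onder studenten"))
print(article_title("De Geus onder studenten", "Den Haag"))
print(catalogue_url(ppn))
print(delpher_url(ppn))
```

Storing only the PPN and deriving both links keeps the sheet small and makes the link format easy to change in one place.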

Build central database

Step 2: Convert Excel into RDF triplestore (= special kind of online database anybody can access)
• Steps 1-4 from http://linda-project.eu/linked-data-primer-2
• Step 4: Vocabulary used = Bibframe (http://bibframe.org/vocab)
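To make "convert Excel into RDF" concrete, here is a minimal sketch that turns spreadsheet rows (exported as CSV) into N-Triples lines. It is not the tooling used in the project. The base URI is hypothetical, and because the slides do not list which Bibframe properties were used, two Dublin Core terms (`dcterms:identifier`, `dcterms:title`) are swapped in as stand-ins.

```python
import csv
import io

DCT = "http://purl.org/dc/terms/"            # stand-in vocabulary, not Bibframe
BASE = "http://example.org/verzetskranten/"  # hypothetical base URI

# One row of the sheet, as it might look after a CSV export.
SHEET = "ppn,title\n107123223,De Geus onder studenten\n"


def literal(value: str) -> str:
    """Serialise a string as an N-Triples literal (escape \\ and \")."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def row_to_triples(row: dict) -> list:
    """Each cell becomes one subject-predicate-object statement."""
    subject = f"<{BASE}{row['ppn']}>"
    return [
        f"{subject} <{DCT}identifier> {literal(row['ppn'])} .",
        f"{subject} <{DCT}title> {literal(row['title'])} .",
    ]


triples = [
    triple
    for row in csv.DictReader(io.StringIO(SHEET))
    for triple in row_to_triples(row)
]
print("\n".join(triples))
```

Each row gets its own URI built from the PPN, which is what later allows other data sets to point at the same newspaper.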

Build central database

Step 3: Link to external resources
• Step 5 from http://linda-project.eu/linked-data-primer-2
• DBpedia = machine-readable, structured version of Wikipedia
• DBpedia = hub for linking different data sets on the Web to each other
• DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data
• We use DBpedia to connect persons & places in our newspaper database to information in other databases

Linked Open Data cloud: http://lod-cloud.net/versions/2010-09-22/lod-cloud_colored.png

Example: Huib Drion
http://nl.dbpedia.org/page/Huib_Drion
https://nl.wikipedia.org/wiki/Huib_Drion
http://www.dbnl.org/auteurs/auteur.php?id=drio001
http://www.biografischportaal.nl/persoon/41181342
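Linking a person in the newspaper database to DBpedia comes down to adding one extra triple that says "this is the same thing as that". A minimal sketch follows, assuming a hypothetical local URI scheme and the usual DBpedia convention that the resource itself lives under `/resource/` (the `/page/` address is its HTML view). `owl:sameAs` is one common choice of linking property, and the slides do not state which property was actually used.

```python
from urllib.parse import quote

OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"
LOCAL_PERSONS = "http://example.org/verzetskranten/person/"  # hypothetical
DBPEDIA_NL = "http://nl.dbpedia.org/resource/"


def dbpedia_uri(name: str) -> str:
    """DBpedia resource URI for a Wikipedia article name."""
    return DBPEDIA_NL + quote(name.replace(" ", "_"))


def same_as_triple(local_id: str, name: str) -> str:
    """N-Triples statement linking a local person record to DBpedia."""
    return f"<{LOCAL_PERSONS}{local_id}> <{OWL_SAME_AS}> <{dbpedia_uri(name)}> ."


link = same_as_triple("huib-drion", "Huib Drion")
print(link)
```

Once this link exists, anything DBpedia knows about the person, including its own links to sources such as DBNL and Biografisch Portaal, is reachable from the newspaper record.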

Build central database

Added value of Linked Open Data & DBpedia

Software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in
• KB-catalogue
• Delpher
• De Winkel
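What "software can automatically query" might look like in practice: a sketch that builds a SPARQL request asking DBpedia for everything it knows about one person. The endpoint address and the `format` parameter are assumptions about the Dutch DBpedia service, and the request is only constructed here, not sent.

```python
from urllib.parse import urlencode

ENDPOINT = "http://nl.dbpedia.org/sparql"  # assumed endpoint address


def build_query(resource_uri: str, limit: int = 25) -> str:
    """SPARQL query listing all properties and values of one resource."""
    return (
        "SELECT ?property ?value "
        f"WHERE {{ <{resource_uri}> ?property ?value }} "
        f"LIMIT {limit}"
    )


def query_url(resource_uri: str) -> str:
    """GET URL that a script could fetch to run the query."""
    params = {
        "query": build_query(resource_uri),
        "format": "application/sparql-results+json",  # assumed parameter
    }
    return ENDPOINT + "?" + urlencode(params)


person = "http://nl.dbpedia.org/resource/Huib_Drion"
print(build_query(person))
print(query_url(person))
```

The same two functions work for places: only the resource URI changes, which is the practical benefit of describing persons and places with shared identifiers.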

Summary: data about 1300 newspapers

• Available online
• Structured data (RDF-triples)
• Open license (CC-BY-SA)
• Open standard (RDF)
• Contextual information: links between titles and Delpher & KB-cat (via PPNs)
• Relations:
  Titles - Places
  Titles - Persons
  Titles - Other titles (PPNs)
• Links between places & persons and external sources (via DBpedia)

http://5stardata.info/en/





Other metadata

Title

Unique Wikipedia article title =

<Newspaper title> (verzetsblad, <Place of publication>)

• Contextual info
• Related persons
• Related newspaper titles

PPN (107123223)

http://opc4.kb.nl/DB=1/PPN?PPN=107123223

http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223
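The PPN is what ties a title to both its KB-catalogue record and its issues in Delpher. A minimal sketch of that idea, assuming the two URL patterns shown on the slide; the function name and the validation rule are illustrative and not part of the KB's own tooling:

```python
from urllib.parse import quote

# URL patterns as they appear on the slide; treat them as assumptions.
KB_CATALOGUE = "http://opc4.kb.nl/DB=1/PPN?PPN={ppn}"
DELPHER_TITLE = (
    "http://www.delpher.nl/nl/kranten/results/index"
    "?coll=dddtitel&cql[]={query}"
)


def links_for_ppn(ppn: str) -> dict:
    """Build the catalogue link and the Delpher link for one PPN."""
    if not ppn.isalnum():
        raise ValueError(f"unexpected PPN: {ppn!r}")
    return {
        "catalogue": KB_CATALOGUE.format(ppn=ppn),
        # The Delpher query "ppn=<PPN>" is percent-encoded ("=" becomes %3D).
        "delpher": DELPHER_TITLE.format(query=quote(f"ppn={ppn}", safe="")),
    }


links = links_for_ppn("107123223")
print(links["catalogue"])
print(links["delpher"])
```

Because both links are derived from the same identifier, one database field is enough to generate them for every title.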

Build central database

Step 2: Convert Excel into an RDF triplestore (= a special kind of online database that anybody can access)

• Steps 1-4 from http://linda-project.eu/linked-data-primer-2
• Step 4: Vocabulary used = Bibframe (http://bibframe.org/vocab)
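To make the Excel-to-RDF step concrete, here is a small sketch that turns spreadsheet rows (exported as CSV) into N-Triples lines. It is an illustration only: the column names, the `example.org` namespace and the use of Dublin Core terms are assumptions made for this example. The project itself used Bibframe, whose property names are not reproduced here.

```python
import csv
import io

# Hypothetical namespace for the newspaper records (not the real one).
BASE = "http://example.org/verzetskranten/"
# Dublin Core terms, swapped in for illustration instead of Bibframe.
DCT = "http://purl.org/dc/terms/"


def literal(value: str) -> str:
    """Format a plain string as an N-Triples literal."""
    escaped = (
        value.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")
    )
    return f'"{escaped}"'


def rows_to_ntriples(csv_text: str) -> list:
    """Convert each CSV row into three subject-predicate-object triples."""
    triples = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        subject = f"<{BASE}{row['ppn']}>"
        triples.append(f"{subject} <{DCT}title> {literal(row['title'])} .")
        triples.append(f"{subject} <{DCT}identifier> {literal(row['ppn'])} .")
        triples.append(f"{subject} <{DCT}spatial> {literal(row['place'])} .")
    return triples


SAMPLE = "ppn,title,place\n107123223,De Geus onder studenten,Den Haag\n"
for line in rows_to_ntriples(SAMPLE):
    print(line)
```

Each spreadsheet cell becomes one statement about the newspaper, which is what lets a triplestore answer questions across all rows and columns at once.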

Build central database

Step 3: Link to external resources

• Step 5 from http://linda-project.eu/linked-data-primer-2
• DBpedia = machine-readable, structured version of Wikipedia
• DBpedia = hub for linking different data sets on the Web to each other
• DBpedia allows you to run sophisticated queries against Wikipedia and to link different data sets on the Web to Wikipedia data
• We use DBpedia to connect persons & places in our newspaper database to information in other databases

Linked Open Data cloud

http://lod-cloud.net/versions/2010-09-22/lod-cloud_colored.png

Example: Huib Drion

http://nl.dbpedia.org/page/Huib_Drion

https://nl.wikipedia.org/wiki/Huib_Drion

http://www.dbnl.org/auteurs/auteur.php?id=drio001

http://www.biografischportaal.nl/persoon/41181342
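As a rough sketch of what "querying DBpedia" looks like in practice, the snippet below builds a SPARQL request that asks which other resources DBpedia considers identical (`owl:sameAs`) to a given person. The endpoint address, the `/resource/` form of the URI and the `format` parameter are assumptions; only the URL is constructed here, no request is sent.

```python
from urllib.parse import urlencode

# Assumed SPARQL endpoint of the Dutch DBpedia chapter.
ENDPOINT = "http://nl.dbpedia.org/sparql"


def same_as_query(resource_uri: str) -> str:
    """Return a SPARQL query listing owl:sameAs links of one resource."""
    if ">" in resource_uri or " " in resource_uri:
        raise ValueError("resource URI must not contain '>' or spaces")
    return (
        "PREFIX owl: <http://www.w3.org/2002/07/owl#>\n"
        f"SELECT ?other WHERE {{ <{resource_uri}> owl:sameAs ?other }}"
    )


def request_url(resource_uri: str) -> str:
    """Encode the query as a GET request URL for the endpoint."""
    params = {
        "query": same_as_query(resource_uri),
        # Result format parameter; its name and value are an assumption.
        "format": "application/sparql-results+json",
    }
    return ENDPOINT + "?" + urlencode(params)


print(request_url("http://nl.dbpedia.org/resource/Huib_Drion"))
```

The results of such a query are what allow software to hop from a person in the newspaper database to records about the same person in other data sets.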

Build central database

Added value of Linked Open Data & DBpedia

Software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in:

• KB-catalogue
• Delpher
• De Winkel

Summary: data about 1300 newspapers

• Available online
• Structured data (RDF triples)
• Open license (CC-BY-SA)
• Open standard (RDF)
• Contextual information
• Links between titles, Delpher & KB-cat (via PPNs)
• Relations:
  • Titles - Places
  • Titles - Persons
  • Titles - Other titles (PPNs)
• Links between titles, places & persons and external sources (via DBpedia)

http://5stardata.info/en
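The relations listed above (titles to places, persons and other titles) can be pictured as a list of triples that software filters by pattern. A toy sketch, using only facts mentioned in this lecture; the predicate names `place`, `person` and `ppn` are made up for the illustration:

```python
# Each entry is (subject, predicate, object); predicate names are hypothetical.
TRIPLES = [
    ("De Geus onder studenten", "place", "Den Haag"),
    ("De Geus onder studenten", "person", "Jan Drion"),
    ("De Geus onder studenten", "person", "Huib Drion"),
    ("De Geus onder studenten", "ppn", "107123223"),
]


def match(triples, s=None, p=None, o=None):
    """Return all triples that fit the pattern; None acts as a wildcard."""
    return [
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    ]


# Which persons are related to this title?
persons = [obj for _, _, obj in match(TRIPLES, s="De Geus onder studenten", p="person")]
print(persons)

# Which titles are related to Huib Drion?
titles = [subj for subj, _, _ in match(TRIPLES, p="person", o="Huib Drion")]
print(titles)
```

A real triplestore does the same kind of pattern matching, only at scale and through a query language such as SPARQL.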

Wikiproject Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

We need a template

Using an article template we can generate 1300 uniform and interlinked Wikipedia articles from the LOD-database

https://c1.staticflickr.com/9/8281/7699231918_11a7356c38_b.jpg

LOD-database + article template = Wikipedia article

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

Titles from database

Metadata from database

Related persons from database

Related newspapers from database

Winkel-ID from database

Link to full-texts in Delpher from database

Link to KB-catalogue record from database

Fixed categories on WP and Commons

Predefined fixed strings

Grey =
• From database
• Predefined fixed strings

Uniformity between articles guaranteed

All that WP-writers need to add manually
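The idea of "database + article template = article" can be sketched with a plain string template: fields from the database are filled in, fixed strings and categories come from the template, and the free-text part is left for Wikipedia writers. The wikitext layout, the category name and the record structure below are invented for illustration and are not the template the project actually used.

```python
from string import Template

# Hypothetical article skeleton; fixed strings live in the template itself.
ARTICLE_TEMPLATE = Template(
    "'''$title''' was an underground newspaper from WW2, published in $place.\n"
    "\n"
    "== Related persons ==\n"
    "$persons\n"
    "\n"
    "== External links ==\n"
    "* [$catalogue_url KB-catalogue record]\n"
    "\n"
    "[[Category:$category]]\n"
)

# Placeholder for a fixed category; the real category names are not known here.
FIXED_CATEGORY = "Underground newspapers (hypothetical category)"


def render_article(record: dict) -> str:
    """Fill the template with one database record."""
    persons = "\n".join(f"* [[{name}]]" for name in record["persons"])
    return ARTICLE_TEMPLATE.substitute(
        title=record["title"],
        place=record["place"],
        persons=persons,
        catalogue_url="http://opc4.kb.nl/DB=1/PPN?PPN=" + record["ppn"],
        category=FIXED_CATEGORY,
    )


RECORD = {
    "title": "De Geus onder studenten",
    "place": "Den Haag",
    "persons": ["Jan Drion", "Huib Drion"],
    "ppn": "107123223",
}
print(render_article(RECORD))
```

Running the same function over every record yields articles with identical structure, which is where the guaranteed uniformity comes from.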

Problem with Delpher (and KB-catalogue)

Véry little contextual information about the newspaper (title)s

https://thejungleisneutral.files.wordpress.com/2013/11/lost.jpg

The KB can re-use (embed) the Wikipedia content in its own websites to tackle this problem

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_%28verzetsblad%29

Delpher - search results

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

Embedded contextual snippet from Wikipedia (translated from Dutch):

De Geus (onder studenten) was an underground newspaper from the Second World War that was published from 4 October 1940 until 13 July 1944 … Read more on Wikipedia

Delpher - object presentation

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

Embedded contextual snippet from Wikipedia (translated from Dutch):

About De Geus onder studenten

De Geus (onder studenten) was an underground newspaper from the Second World War, published in The Hague from 4 October 1940 until 13 July 1944. It appeared monthly in 1940, 1941 and 1943 and irregularly otherwise, with a circulation between 250 and 8000 copies. It was initially stencilled and, from November 1942, printed; its content consisted mainly of opinion articles.

The paper was published by Jan Drion and Huib Drion, two Leiden…

Read more on Wikipedia
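A sketch of how such an embedded snippet could be produced once the opening text of the Wikipedia article has been retrieved. How the KB actually fetches and embeds the text is not described in the slides, so the function below only covers the trimming and the "read more" link; its name and parameters are hypothetical.

```python
from urllib.parse import quote


def make_snippet(extract: str, article_title: str, max_chars: int = 120,
                 lang: str = "nl") -> dict:
    """Shorten an article extract and attach a link back to Wikipedia."""
    text = " ".join(extract.split())  # normalise whitespace
    if len(text) > max_chars:
        # Cut at the last word boundary before the limit, then add an ellipsis.
        text = text[:max_chars].rsplit(" ", 1)[0] + " …"
    url = f"https://{lang}.wikipedia.org/wiki/" + quote(
        article_title.replace(" ", "_")
    )
    return {"text": text, "read_more": url}


snippet = make_snippet(
    "De geus (onder studenten) was een verzetsblad uit de Tweede "
    "Wereldoorlog dat vanaf 4 oktober 1940 tot en met 13 juli 1944 "
    "in Den Haag werd uitgegeven.",
    "De Geus onder studenten (verzetsblad)",
    max_chars=80,
)
print(snippet["text"])
print(snippet["read_more"])
```

Keeping the link next to the shortened text gives readers the context on the spot and a route to the full article.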

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web : "20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together."

• https://en.wikipedia.org/wiki/Linked_data : Wikipedia article related to the above video

• http://5stardata.info/en : The 5 stars of Linked Open Data (Tim Berners-Lee)

• http://linda-project.eu/linked-data-primer-2 : Short primer about creating LOD in practice, starting from an Excel sheet

• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31 : The figure near the bottom of the first page is a good illustration of the concept of (linked) triples

• https://en.wikipedia.org/wiki/DBpedia

• https://en.wikipedia.org/wiki/Semantic_network

http://www.gettyimages.nl/detail/nieuwsfotos/three-women-of-the-ats-light-up-together-ats-regulations-nieuwsfotos/3094265

Questions?

olaf.janssen@kb.nl - @ookgezellig

tinyurl.com/verzetskranten

Page 67: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

The PPN (107123223) links the KB-catalogue record and the Delpher issues of the same title:

KB-catalogue: http://opc4.kb.nl/DB=1/PPN?PPN=107123223

Delpher: http://www.delpher.nl/nl/kranten/results/index?coll=dddtitel&cql[]=ppn%3D107123223
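A minimal sketch of how one PPN can produce both links. The base URLs are reconstructed from the slide and should be treated as assumptions; the function names are hypothetical.

```python
from urllib.parse import urlencode

# Base URLs reconstructed from the slide text; treat them as assumptions.
CATALOGUE_BASE = "http://opc4.kb.nl/DB=1/PPN"
DELPHER_BASE = "http://www.delpher.nl/nl/kranten/results/index"


def catalogue_url(ppn: str) -> str:
    """Link to the KB-catalogue record for a given PPN."""
    return f"{CATALOGUE_BASE}?{urlencode({'PPN': ppn})}"


def delpher_url(ppn: str) -> str:
    """Link to the Delpher search results for all issues of a title."""
    query = {"coll": "dddtitel", "cql[]": f"ppn={ppn}"}
    # Keep the square brackets readable; '=' inside the value becomes %3D.
    return f"{DELPHER_BASE}?{urlencode(query, safe='[]')}"


if __name__ == "__main__":
    print(catalogue_url("107123223"))
    print(delpher_url("107123223"))
```

Because both links derive from the same identifier, storing only the PPN per title is enough to connect the two systems.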

Build central database

Step 2: Convert Excel into RDF triplestore (= special kind of online database anybody can access)

• Steps 1-4 from http://linda-project.eu/linked-data-primer-2

• Step 4: Vocabulary used = BIBFRAME (http://bibframe.org/vocab)
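To make the spreadsheet-to-triples step concrete, here is a rough sketch that turns rows into N-Triples lines using only the standard library. The column names, the base URI for titles and the property names placed under the BIBFRAME namespace are assumptions made for illustration; the project's actual mapping is not shown on the slides.

```python
import csv
import io

# Hypothetical namespaces: the base URI for newspaper titles is invented for
# this sketch, and the property names under the BIBFRAME namespace are assumed.
BASE = "http://example.org/verzetskranten/"
BF = "http://bibframe.org/vocab/"


def literal(value: str) -> str:
    """Serialise a string as a simple N-Triples literal (escapes \\ and ")."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def row_to_triples(row: dict) -> list:
    """Turn one spreadsheet row into subject-predicate-object lines."""
    subject = f"<{BASE}{row['winkel_id']}>"
    return [
        f"{subject} <{BF}title> {literal(row['title'])} .",
        f"{subject} <{BF}identifier> {literal(row['ppn'])} .",
    ]


def sheet_to_ntriples(csv_text: str) -> str:
    """Convert a CSV export of the Excel sheet into N-Triples text."""
    reader = csv.DictReader(io.StringIO(csv_text))
    lines = [triple for row in reader for triple in row_to_triples(row)]
    return "\n".join(lines)


if __name__ == "__main__":
    # 'X1' is a made-up identifier; the PPN is the one shown on the slides.
    sheet = "winkel_id,title,ppn\nX1,De Geus onder studenten,107123223\n"
    print(sheet_to_ntriples(sheet))
```

Each output line is one triple, so the file can be loaded into any triplestore that accepts N-Triples.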

Build central database

Step 3: Link to external resources

• Step 5 from http://linda-project.eu/linked-data-primer-2

• DBpedia = machine-readable, structured version of Wikipedia

• DBpedia = hub for linking different data sets on the Web to each other (Linked Open Data cloud: http://lod-cloud.net/versions/2010-09-22/lod-cloud_colored.png)

• DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data

• We use DBpedia to connect persons & places in our newspaper database to information in other databases

Example: Huib Drion

• http://nl.dbpedia.org/page/Huib_Drion

• https://nl.wikipedia.org/wiki/Huib_Drion

• http://www.dbnl.org/auteurs/auteur.php?id=drio001

• http://www.biografischportaal.nl/persoon/41181342
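A small sketch of what such a link could look like as data: a person in the newspaper database is declared to be the same entity as a DBpedia resource. The local person URI is hypothetical, and the use of owl:sameAs is an assumption about how the link might be expressed, not something the slides state.

```python
from urllib.parse import quote

OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"
# DBpedia identifies things with /resource/ URIs; /page/ is the HTML view.
DBPEDIA_NL = "http://nl.dbpedia.org/resource/"


def dbpedia_uri(wikipedia_title: str) -> str:
    """Derive a Dutch DBpedia resource URI from a Wikipedia article title."""
    # Simplification: non-ASCII characters are percent-encoded here.
    return DBPEDIA_NL + quote(wikipedia_title.replace(" ", "_"), safe="_()")


def same_as_triple(local_uri: str, wikipedia_title: str) -> str:
    """Express 'our person is the same as the DBpedia person' as a triple."""
    return f"<{local_uri}> <{OWL_SAME_AS}> <{dbpedia_uri(wikipedia_title)}> ."


if __name__ == "__main__":
    # The example.org URI is a made-up local identifier.
    print(same_as_triple("http://example.org/persons/huib-drion", "Huib Drion"))
```

Once this triple exists, software can follow it to DBpedia and, from there, to other data sets that describe the same person.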

Build central database

Added value of Linked Open Data & DBpedia

Software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in:

• KB-catalogue

• Delpher

• De Winkel

Summary: data about 1300 newspapers

• Available online

• Structured data (RDF triples)

• Open license (CC-BY-SA)

• Open standard (RDF)

• Contextual information: links between titles, Delpher & KB-catalogue (via PPNs)

• Relations: links between titles and places, titles and persons, titles and other titles

• Links between places & persons and external sources (via DBpedia)

http://5stardata.info/en

Wikiproject Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

We need a template

Using an article template we can generate 1300 uniform and interlinked Wikipedia articles from the LOD-database

LOD-database + article template = Wikipedia article

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

Parts of the article that are filled in automatically (grey on the slide):

• Titles (from database)

• Metadata (from database)

• Related persons (from database)

• Related newspapers (from database)

• Winkel-ID (from database)

• Link to full-texts in Delpher (from database)

• Link to KB-catalogue record (from database)

• Fixed categories on WP and Commons

• Predefined fixed strings

Grey = from database or predefined fixed strings: uniformity between articles guaranteed

The rest is all that WP-writers need to add manually
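The idea of "database record + template = article" can be sketched with Python's standard `string.Template`. The wikitext wording, the field names and the category below are invented for illustration (a real template for Dutch Wikipedia would be in Dutch); this is not the project's actual template.

```python
from string import Template

# Hypothetical wikitext template: wording, field names and category are
# invented for this sketch and are not the project's actual template.
ARTICLE_TEMPLATE = Template(
    "'''$title''' was an underground newspaper from the Second World War, "
    "published in $place.\n\n"
    "== External links ==\n"
    "* [$delpher_url Full text in Delpher]\n"
    "* [$catalogue_url Record in the KB-catalogue]\n\n"
    "[[Category:Underground newspapers]]\n"
)


def render_article(record: dict) -> str:
    """Fill the fixed strings of the template with one database record.

    substitute() raises KeyError when a field is missing, so an incomplete
    record cannot silently produce a non-uniform article.
    """
    return ARTICLE_TEMPLATE.substitute(record)


if __name__ == "__main__":
    record = {
        "title": "De Geus onder studenten",
        "place": "Den Haag",
        "delpher_url": "http://example.org/delpher",      # placeholder link
        "catalogue_url": "http://example.org/catalogue",  # placeholder link
    }
    print(render_article(record))
```

Running the same function over every record yields articles with identical structure, leaving only the free-text parts for Wikipedia writers.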

Problem with Delpher (and KB-catalogue)

Very little contextual information about the newspaper (titles)

Image: https://thejungleisneutral.files.wordpress.com/2013/11/lost.jpg

The KB can re-use (embed) the Wikipedia content in its own websites to tackle this problem

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_%28verzetsblad%29

Delpher - search results

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

Embedded contextual snippet from Wikipedia:

De geus (onder studenten) was an underground newspaper from the Second World War that was published from 4 October 1940 up to and including 13 July 1944 … Read more on Wikipedia

Delpher - object presentation

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

Embedded contextual snippet from Wikipedia:

About De Geus onder studenten

De geus (onder studenten) was an underground newspaper from the Second World War that was published in The Hague from 4 October 1940 up to and including 13 July 1944. The paper appeared monthly in 1940, 1941 and 1943, and otherwise irregularly, in a print run of between 250 and 8000 copies. It was initially stencilled and, from November 1942, printed; the content consisted mainly of opinion articles.

The paper was published by Jan Drion and Huib Drion, two Leiden…

Read more on Wikipedia
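How such an embedded teaser might be assembled can be sketched as follows: the introduction text is shortened at a word boundary and followed by a "read more" link. The function name and the HTML shape are assumptions; how Delpher actually retrieves and renders the Wikipedia text is not described on the slides.

```python
import html
import textwrap


def contextual_snippet(extract: str, article_url: str, width: int = 120) -> str:
    """Build an HTML teaser: shortened intro text plus a 'read more' link."""
    # shorten() cuts at word boundaries and appends the placeholder if needed.
    teaser = textwrap.shorten(extract, width=width, placeholder=" …")
    link = html.escape(article_url, quote=True)
    return (
        f"<p>{html.escape(teaser)} "
        f'<a href="{link}">Read more on Wikipedia</a></p>'
    )


if __name__ == "__main__":
    intro = (
        "De geus (onder studenten) was an underground newspaper from the "
        "Second World War that was published in The Hague from 4 October "
        "1940 up to and including 13 July 1944."
    )
    url = "https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)"
    print(contextual_snippet(intro, url, width=80))
```

Escaping the text and the URL keeps the embedded snippet safe to place inside an existing page.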

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web - 20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures and video: unlock our data and reframe the way we use it together.

• https://en.wikipedia.org/wiki/Linked_data - Wikipedia article related to the above video

• http://5stardata.info/en - The 5 stars of Linked Open Data (Tim Berners-Lee)

• http://linda-project.eu/linked-data-primer-2 - Short primer about creating LOD in practice, starting from an Excel sheet

• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31 - The figure near the bottom of the first page is a good illustration of the concept of (linked) triples

• https://en.wikipedia.org/wiki/DBpedia

• https://en.wikipedia.org/wiki/Semantic_network

Questions?

olaf.janssen@kb.nl - @ookgezellig

tinyurl.com/verzetskranten

Page 68: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

httpopc4kbnlDB=1PPNPPN=107123223

PPN (107123223)

PPN (107123223)

httpwwwdelphernlnlkrantenresultsindexcoll=dddtitelampcql[]=ppn3D107123223

httpwwwdelphernlnlkrantenresultsindexcoll=dddtitelampcql[]=ppn3D107123223

PPN (107123223)

Build central database

Step 2 Convert Excel into RDF triplestore (=special kind of online database anybody can access)

bull Steps 1-4 from httplinda-projecteulinked-

data-primer-2

bull Step 4 Vocubulary used = Bibframe (httpbibframeorgvocab)

Build central database

Step 3 Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

bull DBpedia = hub for linking different data sets on the Web to each

other Linked Open Data cloud

bull Connect persons amp places in newspaper database to external resources via DBpedia

Step 1c Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data

bull Connect persons amp places in newspaper database to external

resources via DBpedia

httplod-cloudnetversions2010-09-22lod-cloud_coloredpng

Linked Open Data cloud

Build central database

Step 3 Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

bull DBpedia = hub for linking different data sets on the Web to each

other Linked Open Data cloud

bull We use DBpdia to connect persons amp places in our newspaper database to information in other databases

httpnldbpediaorgpageHuib_Drion

httpsnlwikipediaorgwikiHuib_Drion

httpwwwdbnlorgauteursauteurphpid=drio001

httpwwwbiografischportaalnlpersoon41181342

Build central database

Added value of Linked Open Data amp DBpedia Software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in bull KB-catalogue bull Delpher bull De Winkel

Summary data about 1300 newspapers

Available online Structured data (RDF-triples) Open license (CC-BY-SA) Open standard (RDF) Contextual information Links between titles Delpher amp KB-cat Relations Links between titles places

bull Titles Places amp persons external bull Titles Persons sources (via DBpedia) bull Titles Other titles

Summary data about 1300 newspapers

Available online Structured data (RDF-triples) Open license (CC-BY-SA) Open standard (RDF) Contextual information Links between titles Delpher amp KB-cat Relations Links between titles places

bull Titles Places amp persons external bull Titles Persons sources (via DBpedia) bull Titles Other titles

Summary data about 1300 newspapers

Available online Structured data (RDF-triples)

Open license (CC-BY-SA) Open standard (RDF)

Contextual information Links between titles Delpher amp KB-cat (via PPNs)

Relations Links between places

bull Titles Places amp persons external bull Titles Persons sources (via DBpedia)

bull Titles Other titles

(PPNs)

http5stardatainfoen

Wikiproject Verzetskranten

Systematically and uniformly describe amp link all 1300 Dutch underground newspapers from WW2

on Dutch Wikipedia

We need a template

Using an article template we can generate 1300 uniform and interlinked Wikipedia articles

from the LOD-database

htt

ps

c1

sta

ticf

lickr

co

m9

82

81

76

99

23

19

18

_11

a73

56

c38

_bjp

g

LOD-database + article template = Wikipedia article

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

Titles from database

Metadata from database

Related persons from database

Related newspapers from database

Winkel-ID from database

Link to full-texts in Delpher from database

Link to KB-catalogue record from database

Fixed categories on WP and Commons

Predefined fixed strings

Grey = bull From database bull Predefined fixed strings

Uniformity between articles guaranteed

All that WP-writers need to add manually

Problem with Delpher (and KB-catalogue)

Veacutery little contextual information about the newspaper(title)s

httpsthejungleisneutralfileswordpresscom201311lostjpg

Problem with Delpher (and KB-catalogue)

Veacutery little contextual information about the newspaper(title)s

httpsthejungleisneutralfileswordpresscom201311lostjpg

The KB can re-use (embed) the Wikipedia content in its own

websites to tackle this problem

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_28verzetsblad29

Delpher - search results

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog

dat vanaf 4 oktober 1940 tot en met 13 juli 1944 hellip Lees verder op Wikipedia

Embedded contextual snippet from Wikipedia

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

Delpher - search results

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog

dat vanaf 4 oktober 1940 tot en met 13 juli 1944 hellip Lees verder op Wikipedia

Delpher - object presentation

httpresolverkbnlresolveurn=ddd010424553mpeg21p001

Over De Geus onder studenten

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog dat vanaf 4 oktober 1940 tot en met 13 juli 1944 in Den Haag werd uitgegeven Het blad verscheen in 1940 1941 en 1943 maandelijks verder onregelmatig in een oplage tussen de 250 en 8000 exemplaren Het werd aanvankelijk gestencild en vanaf november 1942 gedrukt en de inhoud bestond voornamelijk uit opinie-artikelen

Het blad werd uitgegeven door Jan Drion en Huib Drion twee Leidsehellip

Lees verder op Wikipedia

Embedded contextual snippet from Wikipedia

httpresolverkbnlresolveurn=ddd010424553mpeg21p001

Delpher - object presentation

Suggested reading

bull httpwwwtedcomtalkstim_berners_lee_on_the_next_web 20 years ago Tim Berners-Lee invented the World Wide Web For his next project hes building a web for open linked data that could do for numbers what the Web did for words pictures video unlock our data and reframe the way we use it together

bull httpsenwikipediaorgwikiLinked_data Wikipedia article related to the above video

bull http5stardatainfoen The 5 stars of Linked Open Data (Tim Berners-Lee)

bull httplinda-projecteulinked-data-primer-2 Short primer about creating LOD in practice starting from an Excel sheet

bull httpwwwprogrammablewebcomnewshow-linked-data-solved-digital-age-marketing-problemanalysis20150831 The figure near the bottom of the first page is a good illustration of the concept of (linked) triples

bull httpsenwikipediaorgwikiDBpedia

bull httpsenwikipediaorgwikiSemantic_network

htt

p

ww

wg

etty

imag

esn

ld

etai

ln

ieu

wsf

oto

st

hre

e-w

om

en-o

f-th

e-at

s-lig

ht-

up

-to

geth

er-a

ts-r

egu

lati

on

s-n

ieu

wsf

oto

s3

09

42

65

Questions

olafjanssenkbnl - ookgezellig

tinyurlcomverzetskranten

htt

p

ww

wg

etty

imag

esn

ld

etai

ln

ieu

wsf

oto

st

hre

e-w

om

en-o

f-th

e-at

s-lig

ht-

up

-to

geth

er-a

ts-r

egu

lati

on

s-n

ieu

wsf

oto

s3

09

42

65

Page 69: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

PPN (107123223)

httpwwwdelphernlnlkrantenresultsindexcoll=dddtitelampcql[]=ppn3D107123223

httpwwwdelphernlnlkrantenresultsindexcoll=dddtitelampcql[]=ppn3D107123223

PPN (107123223)

Build central database

Step 2 Convert Excel into RDF triplestore (=special kind of online database anybody can access)

bull Steps 1-4 from httplinda-projecteulinked-

data-primer-2

bull Step 4 Vocubulary used = Bibframe (httpbibframeorgvocab)

Build central database

Step 3 Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

bull DBpedia = hub for linking different data sets on the Web to each

other Linked Open Data cloud

bull Connect persons amp places in newspaper database to external resources via DBpedia

Step 1c Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data

bull Connect persons amp places in newspaper database to external

resources via DBpedia

httplod-cloudnetversions2010-09-22lod-cloud_coloredpng

Linked Open Data cloud

Build central database

Step 3 Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

bull DBpedia = hub for linking different data sets on the Web to each

other Linked Open Data cloud

bull We use DBpdia to connect persons amp places in our newspaper database to information in other databases

httpnldbpediaorgpageHuib_Drion

httpsnlwikipediaorgwikiHuib_Drion

httpwwwdbnlorgauteursauteurphpid=drio001

httpwwwbiografischportaalnlpersoon41181342

Build central database

Added value of Linked Open Data amp DBpedia Software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in bull KB-catalogue bull Delpher bull De Winkel

Summary data about 1300 newspapers

Available online Structured data (RDF-triples) Open license (CC-BY-SA) Open standard (RDF) Contextual information Links between titles Delpher amp KB-cat Relations Links between titles places

bull Titles Places amp persons external bull Titles Persons sources (via DBpedia) bull Titles Other titles

Summary data about 1300 newspapers

Available online Structured data (RDF-triples) Open license (CC-BY-SA) Open standard (RDF) Contextual information Links between titles Delpher amp KB-cat Relations Links between titles places

bull Titles Places amp persons external bull Titles Persons sources (via DBpedia) bull Titles Other titles

Summary data about 1300 newspapers

Available online Structured data (RDF-triples)

Open license (CC-BY-SA) Open standard (RDF)

Contextual information Links between titles Delpher amp KB-cat (via PPNs)

Relations Links between places

bull Titles Places amp persons external bull Titles Persons sources (via DBpedia)

bull Titles Other titles

(PPNs)

http5stardatainfoen

Wikiproject Verzetskranten

Systematically and uniformly describe amp link all 1300 Dutch underground newspapers from WW2

on Dutch Wikipedia

We need a template

Using an article template we can generate 1300 uniform and interlinked Wikipedia articles

from the LOD-database

htt

ps

c1

sta

ticf

lickr

co

m9

82

81

76

99

23

19

18

_11

a73

56

c38

_bjp

g

LOD-database + article template = Wikipedia article

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

Titles from database

Metadata from database

Related persons from database

Related newspapers from database

Winkel-ID from database

Link to full-texts in Delpher from database

Link to KB-catalogue record from database

Fixed categories on WP and Commons

Predefined fixed strings

Grey = bull From database bull Predefined fixed strings

Uniformity between articles guaranteed

All that WP-writers need to add manually

Problem with Delpher (and KB-catalogue)

Veacutery little contextual information about the newspaper(title)s

httpsthejungleisneutralfileswordpresscom201311lostjpg

Problem with Delpher (and KB-catalogue)

Veacutery little contextual information about the newspaper(title)s

httpsthejungleisneutralfileswordpresscom201311lostjpg

The KB can re-use (embed) the Wikipedia content in its own

websites to tackle this problem

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_28verzetsblad29

Delpher - search results

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog

dat vanaf 4 oktober 1940 tot en met 13 juli 1944 hellip Lees verder op Wikipedia

Embedded contextual snippet from Wikipedia

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

Delpher - search results

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog

dat vanaf 4 oktober 1940 tot en met 13 juli 1944 hellip Lees verder op Wikipedia

Delpher - object presentation

httpresolverkbnlresolveurn=ddd010424553mpeg21p001

Over De Geus onder studenten

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog dat vanaf 4 oktober 1940 tot en met 13 juli 1944 in Den Haag werd uitgegeven Het blad verscheen in 1940 1941 en 1943 maandelijks verder onregelmatig in een oplage tussen de 250 en 8000 exemplaren Het werd aanvankelijk gestencild en vanaf november 1942 gedrukt en de inhoud bestond voornamelijk uit opinie-artikelen

Het blad werd uitgegeven door Jan Drion en Huib Drion twee Leidsehellip

Lees verder op Wikipedia

Embedded contextual snippet from Wikipedia

httpresolverkbnlresolveurn=ddd010424553mpeg21p001

Delpher - object presentation

Suggested reading

bull httpwwwtedcomtalkstim_berners_lee_on_the_next_web 20 years ago Tim Berners-Lee invented the World Wide Web For his next project hes building a web for open linked data that could do for numbers what the Web did for words pictures video unlock our data and reframe the way we use it together

bull httpsenwikipediaorgwikiLinked_data Wikipedia article related to the above video

bull http5stardatainfoen The 5 stars of Linked Open Data (Tim Berners-Lee)

bull httplinda-projecteulinked-data-primer-2 Short primer about creating LOD in practice starting from an Excel sheet

bull httpwwwprogrammablewebcomnewshow-linked-data-solved-digital-age-marketing-problemanalysis20150831 The figure near the bottom of the first page is a good illustration of the concept of (linked) triples

bull httpsenwikipediaorgwikiDBpedia

bull httpsenwikipediaorgwikiSemantic_network

htt

p

ww

wg

etty

imag

esn

ld

etai

ln

ieu

wsf

oto

st

hre

e-w

om

en-o

f-th

e-at

s-lig

ht-

up

-to

geth

er-a

ts-r

egu

lati

on

s-n

ieu

wsf

oto

s3

09

42

65

Questions

olafjanssenkbnl - ookgezellig

tinyurlcomverzetskranten

htt

p

ww

wg

etty

imag

esn

ld

etai

ln

ieu

wsf

oto

st

hre

e-w

om

en-o

f-th

e-at

s-lig

ht-

up

-to

geth

er-a

ts-r

egu

lati

on

s-n

ieu

wsf

oto

s3

09

42

65

Page 70: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

httpwwwdelphernlnlkrantenresultsindexcoll=dddtitelampcql[]=ppn3D107123223

PPN (107123223)

Build central database

Step 2 Convert Excel into RDF triplestore (=special kind of online database anybody can access)

bull Steps 1-4 from httplinda-projecteulinked-

data-primer-2

bull Step 4 Vocubulary used = Bibframe (httpbibframeorgvocab)

Build central database

Step 3 Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

bull DBpedia = hub for linking different data sets on the Web to each

other Linked Open Data cloud

bull Connect persons amp places in newspaper database to external resources via DBpedia

Step 1c Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data

bull Connect persons amp places in newspaper database to external

resources via DBpedia

httplod-cloudnetversions2010-09-22lod-cloud_coloredpng

Linked Open Data cloud

Build central database

Step 3 Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

bull DBpedia = hub for linking different data sets on the Web to each

other Linked Open Data cloud

bull We use DBpdia to connect persons amp places in our newspaper database to information in other databases

httpnldbpediaorgpageHuib_Drion

httpsnlwikipediaorgwikiHuib_Drion

httpwwwdbnlorgauteursauteurphpid=drio001

httpwwwbiografischportaalnlpersoon41181342

Build central database

Added value of Linked Open Data amp DBpedia Software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in bull KB-catalogue bull Delpher bull De Winkel

Summary data about 1300 newspapers

Available online Structured data (RDF-triples) Open license (CC-BY-SA) Open standard (RDF) Contextual information Links between titles Delpher amp KB-cat Relations Links between titles places

bull Titles Places amp persons external bull Titles Persons sources (via DBpedia) bull Titles Other titles

Summary data about 1300 newspapers

Available online Structured data (RDF-triples) Open license (CC-BY-SA) Open standard (RDF) Contextual information Links between titles Delpher amp KB-cat Relations Links between titles places

bull Titles Places amp persons external bull Titles Persons sources (via DBpedia) bull Titles Other titles

Summary data about 1300 newspapers

Available online Structured data (RDF-triples)

Open license (CC-BY-SA) Open standard (RDF)

Contextual information Links between titles Delpher amp KB-cat (via PPNs)

Relations Links between places

bull Titles Places amp persons external bull Titles Persons sources (via DBpedia)

bull Titles Other titles

(PPNs)

http5stardatainfoen

Wikiproject Verzetskranten

Systematically and uniformly describe amp link all 1300 Dutch underground newspapers from WW2

on Dutch Wikipedia

We need a template

Using an article template we can generate 1300 uniform and interlinked Wikipedia articles

from the LOD-database

htt

ps

c1

sta

ticf

lickr

co

m9

82

81

76

99

23

19

18

_11

a73

56

c38

_bjp

g

LOD-database + article template = Wikipedia article

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

Titles from database

Metadata from database

Related persons from database

Related newspapers from database

Winkel-ID from database

Link to full-texts in Delpher from database

Link to KB-catalogue record from database

Fixed categories on WP and Commons

Predefined fixed strings

Grey = bull From database bull Predefined fixed strings

Uniformity between articles guaranteed

All that WP-writers need to add manually

Problem with Delpher (and KB-catalogue)

Veacutery little contextual information about the newspaper(title)s

httpsthejungleisneutralfileswordpresscom201311lostjpg

Problem with Delpher (and KB-catalogue)

Veacutery little contextual information about the newspaper(title)s

httpsthejungleisneutralfileswordpresscom201311lostjpg

The KB can re-use (embed) the Wikipedia content in its own

websites to tackle this problem

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_28verzetsblad29

Delpher - search results

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog

dat vanaf 4 oktober 1940 tot en met 13 juli 1944 hellip Lees verder op Wikipedia

Embedded contextual snippet from Wikipedia

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

Delpher - search results

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog

dat vanaf 4 oktober 1940 tot en met 13 juli 1944 hellip Lees verder op Wikipedia

Delpher - object presentation

httpresolverkbnlresolveurn=ddd010424553mpeg21p001

Over De Geus onder studenten

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog dat vanaf 4 oktober 1940 tot en met 13 juli 1944 in Den Haag werd uitgegeven Het blad verscheen in 1940 1941 en 1943 maandelijks verder onregelmatig in een oplage tussen de 250 en 8000 exemplaren Het werd aanvankelijk gestencild en vanaf november 1942 gedrukt en de inhoud bestond voornamelijk uit opinie-artikelen

Het blad werd uitgegeven door Jan Drion en Huib Drion twee Leidsehellip

Lees verder op Wikipedia

Embedded contextual snippet from Wikipedia

httpresolverkbnlresolveurn=ddd010424553mpeg21p001

Delpher - object presentation

Suggested reading

bull httpwwwtedcomtalkstim_berners_lee_on_the_next_web 20 years ago Tim Berners-Lee invented the World Wide Web For his next project hes building a web for open linked data that could do for numbers what the Web did for words pictures video unlock our data and reframe the way we use it together

bull httpsenwikipediaorgwikiLinked_data Wikipedia article related to the above video

bull http5stardatainfoen The 5 stars of Linked Open Data (Tim Berners-Lee)

bull httplinda-projecteulinked-data-primer-2 Short primer about creating LOD in practice starting from an Excel sheet

bull httpwwwprogrammablewebcomnewshow-linked-data-solved-digital-age-marketing-problemanalysis20150831 The figure near the bottom of the first page is a good illustration of the concept of (linked) triples

bull httpsenwikipediaorgwikiDBpedia

bull httpsenwikipediaorgwikiSemantic_network

htt

p

ww

wg

etty

imag

esn

ld

etai

ln

ieu

wsf

oto

st

hre

e-w

om

en-o

f-th

e-at

s-lig

ht-

up

-to

geth

er-a

ts-r

egu

lati

on

s-n

ieu

wsf

oto

s3

09

42

65

Questions

olafjanssenkbnl - ookgezellig

tinyurlcomverzetskranten

htt

p

ww

wg

etty

imag

esn

ld

etai

ln

ieu

wsf

oto

st

hre

e-w

om

en-o

f-th

e-at

s-lig

ht-

up

-to

geth

er-a

ts-r

egu

lati

on

s-n

ieu

wsf

oto

s3

09

42

65

Page 71: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

Build central database

Step 2 Convert Excel into RDF triplestore (=special kind of online database anybody can access)

bull Steps 1-4 from httplinda-projecteulinked-

data-primer-2

bull Step 4 Vocubulary used = Bibframe (httpbibframeorgvocab)

Build central database

Step 3 Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

bull DBpedia = hub for linking different data sets on the Web to each

other Linked Open Data cloud

bull Connect persons amp places in newspaper database to external resources via DBpedia

Step 1c Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

DBpedia allows you to ask sophisticated queries against Wikipedia and to link the different data sets on the Web to Wikipedia data

bull Connect persons amp places in newspaper database to external

resources via DBpedia

httplod-cloudnetversions2010-09-22lod-cloud_coloredpng

Linked Open Data cloud

Build central database

Step 3 Link to external resources bull Step 5 from httplinda-projecteulinked-data-primer-2

bull DBpedia = machine-readable structured version of Wikipedia

bull DBpedia = hub for linking different data sets on the Web to each

other Linked Open Data cloud

bull We use DBpdia to connect persons amp places in our newspaper database to information in other databases

httpnldbpediaorgpageHuib_Drion

httpsnlwikipediaorgwikiHuib_Drion

httpwwwdbnlorgauteursauteurphpid=drio001

httpwwwbiografischportaalnlpersoon41181342

Build central database

Added value of Linked Open Data amp DBpedia Software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in bull KB-catalogue bull Delpher bull De Winkel

Summary data about 1300 newspapers

Available online Structured data (RDF-triples) Open license (CC-BY-SA) Open standard (RDF) Contextual information Links between titles Delpher amp KB-cat Relations Links between titles places

bull Titles Places amp persons external bull Titles Persons sources (via DBpedia) bull Titles Other titles

Summary data about 1300 newspapers

Available online Structured data (RDF-triples) Open license (CC-BY-SA) Open standard (RDF) Contextual information Links between titles Delpher amp KB-cat Relations Links between titles places

bull Titles Places amp persons external bull Titles Persons sources (via DBpedia) bull Titles Other titles

Summary data about 1300 newspapers

Available online Structured data (RDF-triples)

Open license (CC-BY-SA) Open standard (RDF)

Contextual information Links between titles Delpher amp KB-cat (via PPNs)

Relations Links between places

bull Titles Places amp persons external bull Titles Persons sources (via DBpedia)

bull Titles Other titles

(PPNs)

http5stardatainfoen

Wikiproject Verzetskranten

Systematically and uniformly describe & link all 1300 Dutch underground newspapers from WW2 on Dutch Wikipedia

We need a template

Using an article template we can generate 1300 uniform and interlinked Wikipedia articles from the LOD-database

Image: https://c1.staticflickr.com/9/8281/7699231918_11a7356c38_b.jpg

LOD-database + article template = Wikipedia article

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)

Grey = from database or predefined fixed strings (uniformity between articles guaranteed):
• Titles from database
• Metadata from database
• Related persons from database
• Related newspapers from database
• Winkel-ID from database
• Link to full-texts in Delpher from database
• Link to KB-catalogue record from database
• Fixed categories on WP and Commons
• Predefined fixed strings

All that WP-writers need to add manually
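The "LOD-database + article template = Wikipedia article" idea can be sketched in a few lines. This is a hedged illustration, not the template used by the Wikiproject: the template wording, the field names and the category name are hypothetical. The record values (title, place, dates, persons, PPN) are the ones shown for De Geus onder studenten in these slides.

```python
from string import Template

# Hypothetical article template: fixed strings plus $placeholders
# that are filled in from one database record.
ARTICLE_TEMPLATE = Template(
    "'''$title''' was an underground newspaper from the Second World War, "
    "published in $place from $start through $end.\n\n"
    "Related persons: $persons\n\n"
    "* Full texts in Delpher: $delpher_url\n\n"
    "[[Category:Underground newspaper from WW2]]"
)


def render_article(record):
    """Fill the template with the fields of one newspaper record."""
    persons = ", ".join(f"[[{name}]]" for name in record["persons"])
    delpher_url = (
        "http://www.delpher.nl/nl/kranten/results"
        f"?coll=dddtitel&cql[]=ppn+any+({record['ppn']})"
    )
    return ARTICLE_TEMPLATE.substitute(
        title=record["title"],
        place=record["place"],
        start=record["start"],
        end=record["end"],
        persons=persons,
        delpher_url=delpher_url,
    )


record = {
    "title": "De Geus onder studenten",
    "place": "Den Haag",
    "start": "4 October 1940",
    "end": "13 July 1944",
    "persons": ["Jan Drion", "Huib Drion"],
    "ppn": "107123223",
}

print(render_article(record))
```

Because every article is produced from the same template, the fixed parts are identical across all titles and only the database fields differ; writers then add the free text by hand.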

Problem with Delpher (and KB-catalogue)

Very little contextual information about the newspaper (title)s

Image: https://thejungleisneutral.files.wordpress.com/2013/11/lost.jpg

The KB can re-use (embed) the Wikipedia content in its own websites to tackle this problem

https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_%28verzetsblad%29

Delpher - search results

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

Embedded contextual snippet from Wikipedia:

De geus (onder studenten) was a resistance paper from the Second World War that, from 4 October 1940 through 13 July 1944 … Read more on Wikipedia

Delpher - object presentation

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

Embedded contextual snippet from Wikipedia:

About De Geus onder studenten

De geus (onder studenten) was a resistance paper from the Second World War, published in The Hague from 4 October 1940 through 13 July 1944. It appeared monthly in 1940, 1941 and 1943 and irregularly otherwise, in a print run of between 250 and 8000 copies. It was initially stencilled and, from November 1942, printed; its content consisted mainly of opinion articles.

The paper was published by Jan Drion and Huib Drion, two Leiden…

Read more on Wikipedia
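The embedded snippets above are shortened versions of the Wikipedia article's opening text followed by a link. A minimal sketch of that shortening step is given below. The function name, the length limit and the link label are assumptions for illustration, and how Delpher would actually fetch and display the text is not covered here.

```python
def make_snippet(text, max_chars, link_label="Read more on Wikipedia"):
    """Cut text at the last word boundary within max_chars, then add an ellipsis and link label."""
    if len(text) <= max_chars:
        return text  # short enough to show in full
    cut = text[:max_chars].rsplit(" ", 1)[0]  # avoid cutting a word in half
    return f"{cut} … {link_label}"


intro = (
    "De Geus (onder studenten) was a resistance paper from the Second World War, "
    "published in The Hague from 4 October 1940 through 13 July 1944."
)

print(make_snippet(intro, 30))
```

A search-results page could use a small `max_chars` value and an object page a larger one, so that the same Wikipedia text serves both views.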

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web 20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together.

• https://en.wikipedia.org/wiki/Linked_data Wikipedia article related to the above video

• http://5stardata.info/en/ The 5 stars of Linked Open Data (Tim Berners-Lee)

• http://linda-project.eu/linked-data-primer-2 Short primer about creating LOD in practice, starting from an Excel sheet

• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31 The figure near the bottom of the first page is a good illustration of the concept of (linked) triples

• https://en.wikipedia.org/wiki/DBpedia

• https://en.wikipedia.org/wiki/Semantic_network

Questions

olaf.janssen@kb.nl - @ookgezellig

tinyurl.com/verzetskranten

Image: http://www.gettyimages.nl/detail/nieuwsfotos/three-women-of-the-ats-light-up-together-ats-regulations-nieuwsfotos/3094265

Page 76: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

Build central database

Added value of Linked Open Data amp DBpedia Software can automatically query for additional information about places and persons mentioned in De Winkel that is not available in bull KB-catalogue bull Delpher bull De Winkel

Summary data about 1300 newspapers

Available online Structured data (RDF-triples) Open license (CC-BY-SA) Open standard (RDF) Contextual information Links between titles Delpher amp KB-cat Relations Links between titles places

bull Titles Places amp persons external bull Titles Persons sources (via DBpedia) bull Titles Other titles

Summary data about 1300 newspapers

Available online Structured data (RDF-triples) Open license (CC-BY-SA) Open standard (RDF) Contextual information Links between titles Delpher amp KB-cat Relations Links between titles places

bull Titles Places amp persons external bull Titles Persons sources (via DBpedia) bull Titles Other titles

Summary data about 1300 newspapers

Available online Structured data (RDF-triples)

Open license (CC-BY-SA) Open standard (RDF)

Contextual information Links between titles Delpher amp KB-cat (via PPNs)

Relations Links between places

bull Titles Places amp persons external bull Titles Persons sources (via DBpedia)

bull Titles Other titles

(PPNs)

http5stardatainfoen

Wikiproject Verzetskranten

Systematically and uniformly describe amp link all 1300 Dutch underground newspapers from WW2

on Dutch Wikipedia

We need a template

Using an article template we can generate 1300 uniform and interlinked Wikipedia articles

from the LOD-database

htt

ps

c1

sta

ticf

lickr

co

m9

82

81

76

99

23

19

18

_11

a73

56

c38

_bjp

g

LOD-database + article template = Wikipedia article

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

Titles from database

Metadata from database

Related persons from database

Related newspapers from database

Winkel-ID from database

Link to full-texts in Delpher from database

Link to KB-catalogue record from database

Fixed categories on WP and Commons

Predefined fixed strings

Grey = bull From database bull Predefined fixed strings

Uniformity between articles guaranteed

All that WP-writers need to add manually

Problem with Delpher (and KB-catalogue)

Veacutery little contextual information about the newspaper(title)s

httpsthejungleisneutralfileswordpresscom201311lostjpg

Problem with Delpher (and KB-catalogue)

Veacutery little contextual information about the newspaper(title)s

httpsthejungleisneutralfileswordpresscom201311lostjpg

The KB can re-use (embed) the Wikipedia content in its own

websites to tackle this problem

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_28verzetsblad29

Delpher - search results

http://www.delpher.nl/nl/kranten/results?coll=dddtitel&cql[]=ppn+any+(107123223)

Embedded contextual snippet from Wikipedia (translated from Dutch):

De Geus (onder studenten) was a resistance paper from the Second World War that appeared from 4 October 1940 until 13 July 1944 … Read more on Wikipedia ("Lees verder op Wikipedia")

Delpher - object presentation

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

Embedded contextual snippet from Wikipedia (translated from Dutch):

About De Geus onder studenten

De Geus (onder studenten) was a resistance paper from the Second World War, published in The Hague from 4 October 1940 until 13 July 1944. The paper appeared monthly in 1940, 1941 and 1943, and otherwise irregularly, with a circulation of between 250 and 8000 copies. It was initially stencilled and, from November 1942, printed; its content consisted mainly of opinion articles.

The paper was published by Jan Drion and Huib Drion, two Leiden…

Read more on Wikipedia ("Lees verder op Wikipedia")
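
As a hedged sketch of how such an embedded snippet might be produced: the slides do not say how the KB fetches or trims the Wikipedia text, so the approach below is an assumption. It builds a request URL for the MediaWiki TextExtracts API (`prop=extracts` with `exintro` and `explaintext`), which returns the plain-text introduction of an article, and shortens a text at a word boundary so that a "Lees verder op Wikipedia" link can follow it. The function names and the length limit are illustrative, and no network request is made here.

```python
from urllib.parse import urlencode


def extract_api_url(title: str, lang: str = "nl") -> str:
    """Build a MediaWiki API URL asking for an article's plain-text intro."""
    params = {
        "action": "query",
        "prop": "extracts",   # TextExtracts extension
        "exintro": 1,         # only the text before the first section heading
        "explaintext": 1,     # plain text instead of HTML
        "format": "json",
        "titles": title,
    }
    return f"https://{lang}.wikipedia.org/w/api.php?{urlencode(params)}"


def make_snippet(text: str, max_chars: int = 80) -> str:
    """Shorten `text` to at most `max_chars`, cutting at a word boundary."""
    if len(text) <= max_chars:
        return text
    cut = text[:max_chars].rsplit(" ", 1)[0]
    return cut + " …"


if __name__ == "__main__":
    print(extract_api_url("De Geus onder studenten (verzetsblad)"))
    intro = ("De geus (onder studenten) was een verzetsblad uit de Tweede "
             "Wereldoorlog dat vanaf 4 oktober 1940 tot en met 13 juli 1944 "
             "in Den Haag werd uitgegeven.")
    print(make_snippet(intro), "Lees verder op Wikipedia")
```

A short limit would suit the search-results view, while a larger one would suit the longer text on the object presentation page.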

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web 20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together.

• https://en.wikipedia.org/wiki/Linked_data Wikipedia article related to the above video

• http://5stardata.info/en/ The 5 stars of Linked Open Data (Tim Berners-Lee)

• http://linda-project.eu/linked-data-primer-2 Short primer about creating LOD in practice, starting from an Excel sheet

• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31 The figure near the bottom of the first page is a good illustration of the concept of (linked) triples

• https://en.wikipedia.org/wiki/DBpedia

• https://en.wikipedia.org/wiki/Semantic_network

Image: http://www.gettyimages.nl/detail/nieuwsfoto's/three-women-of-the-ats-light-up-together-ats-regulations-nieuwsfoto's/3094265

Questions?

olaf.janssen@kb.nl - @ookgezellig

tinyurl.com/verzetskranten

Page 83: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

LOD-database + article template = Wikipedia article

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_(verzetsblad)

Titles from database

Metadata from database

Related persons from database

Related newspapers from database

Winkel-ID from database

Link to full-texts in Delpher from database

Link to KB-catalogue record from database

Fixed categories on WP and Commons

Predefined fixed strings

Grey = bull From database bull Predefined fixed strings

Uniformity between articles guaranteed

All that WP-writers need to add manually

Problem with Delpher (and KB-catalogue)

Veacutery little contextual information about the newspaper(title)s

httpsthejungleisneutralfileswordpresscom201311lostjpg

Problem with Delpher (and KB-catalogue)

Veacutery little contextual information about the newspaper(title)s

httpsthejungleisneutralfileswordpresscom201311lostjpg

The KB can re-use (embed) the Wikipedia content in its own

websites to tackle this problem

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_28verzetsblad29

Delpher - search results

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog

dat vanaf 4 oktober 1940 tot en met 13 juli 1944 hellip Lees verder op Wikipedia

Embedded contextual snippet from Wikipedia

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

Delpher - search results

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog

dat vanaf 4 oktober 1940 tot en met 13 juli 1944 hellip Lees verder op Wikipedia

Delpher - object presentation

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

About De Geus onder studenten

De geus (onder studenten) was a resistance newspaper from the Second World War, published in The Hague from 4 October 1940 until 13 July 1944. It appeared monthly in 1940, 1941 and 1943 and irregularly otherwise, with a print run of between 250 and 8,000 copies. It was initially stencilled and, from November 1942, printed; its content consisted mainly of opinion pieces.

The newspaper was published by Jan Drion and Huib Drion, two Leiden…

Read more on Wikipedia

Embedded contextual snippet from Wikipedia
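The embedded snippets on these slides could be produced roughly as sketched below. The slides do not say how the KB actually retrieves the text; this sketch assumes the public MediaWiki API with the TextExtracts parameters (`prop=extracts`, `exintro`, `explaintext`). The function names, the User-Agent string and the 120-character cut-off are hypothetical.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

API = "https://nl.wikipedia.org/w/api.php"


def extract_url(title: str) -> str:
    """Build an API URL that asks for the plain-text intro of one article."""
    params = {
        "action": "query",
        "prop": "extracts",
        "exintro": 1,
        "explaintext": 1,
        "format": "json",
        "titles": title,
    }
    return f"{API}?{urlencode(params)}"


def fetch_intro(title: str) -> str:
    """Download the intro text. Needs network access; nothing calls it on import."""
    request = Request(extract_url(title),
                      headers={"User-Agent": "snippet-sketch/0.1 (example)"})
    with urlopen(request) as response:
        pages = json.load(response)["query"]["pages"]
    # The result is keyed by page id; a single title gives a single page.
    return next(iter(pages.values())).get("extract", "")


def make_snippet(intro: str, max_chars: int = 120) -> str:
    """Cut the intro at a word boundary and append the 'read more' label."""
    if len(intro) > max_chars:
        intro = intro[:max_chars].rsplit(" ", 1)[0] + " …"
    return f"{intro} Read more on Wikipedia"
```

A search-results page would call `make_snippet(fetch_intro("De_Geus_onder_studenten_(verzetsblad)"))` and link the label to the article; an object page could use a larger `max_chars` to show the longer text.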

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web 20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together.

• https://en.wikipedia.org/wiki/Linked_data Wikipedia article related to the above video

• http://5stardata.info/en/ The 5 stars of Linked Open Data (Tim Berners-Lee)

• http://linda-project.eu/linked-data-primer-2/ Short primer about creating LOD in practice, starting from an Excel sheet

• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31 The figure near the bottom of the first page is a good illustration of the concept of (linked) triples

• https://en.wikipedia.org/wiki/DBpedia

• https://en.wikipedia.org/wiki/Semantic_network

http://www.gettyimages.nl/detail/nieuwsfotos/three-women-of-the-ats-light-up-together-ats-regulations-nieuwsfotos/3094265

Questions

olaf.janssen@kb.nl - @ookgezellig

tinyurl.com/verzetskranten

Page 92: Linked Open Data case study (illegal newspapers WW2, Wikipedia, DBpedia) - Lecture Leiden University 3-3-2016

Fixed categories on WP and Commons

Predefined fixed strings

Grey = bull From database bull Predefined fixed strings

Uniformity between articles guaranteed

All that WP-writers need to add manually

Problem with Delpher (and KB-catalogue)

Veacutery little contextual information about the newspaper(title)s

httpsthejungleisneutralfileswordpresscom201311lostjpg

Problem with Delpher (and KB-catalogue)

Veacutery little contextual information about the newspaper(title)s

httpsthejungleisneutralfileswordpresscom201311lostjpg

The KB can re-use (embed) the Wikipedia content in its own

websites to tackle this problem

httpsnlwikipediaorgwikiDe_Geus_onder_studenten_28verzetsblad29

Delpher - search results

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog

dat vanaf 4 oktober 1940 tot en met 13 juli 1944 hellip Lees verder op Wikipedia

Embedded contextual snippet from Wikipedia

httpwwwdelphernlnlkrantenresultscoll=dddtitelampcql[]=ppn+any+(107123223)

Delpher - search results

De geus (onder studenten) was een verzetsblad uit de Tweede Wereldoorlog

dat vanaf 4 oktober 1940 tot en met 13 juli 1944 hellip Lees verder op Wikipedia

Delpher - object presentation

http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001

Embedded contextual snippet from Wikipedia

About De Geus onder studenten

De Geus (onder studenten) was a resistance paper from the Second World War that was published in The Hague from 4 October 1940 until 13 July 1944. The paper appeared monthly in 1940, 1941 and 1943, and otherwise irregularly, with a print run of between 250 and 8,000 copies. It was initially stencilled and, from November 1942, printed, and its content consisted mainly of opinion articles.

The paper was published by Jan Drion and Huib Drion, two Leiden …

Read more on Wikipedia
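The slides do not say how the snippet is retrieved, so the following is only a sketch of one possible approach: asking the MediaWiki Action API of the Dutch Wikipedia for the plain-text intro of an article (the `prop=extracts` query of the TextExtracts extension) and shortening it for display. The function names, the User-Agent string and the 120-character limit are assumptions for illustration.

```python
import json
import urllib.parse
import urllib.request

API = "https://nl.wikipedia.org/w/api.php"


def extract_url(title: str) -> str:
    """Build the API URL that requests the plain-text intro of one article."""
    params = {
        "action": "query",
        "prop": "extracts",
        "exintro": 1,        # only the section before the first heading
        "explaintext": 1,    # plain text instead of HTML
        "redirects": 1,
        "format": "json",
        "titles": title,
    }
    return API + "?" + urllib.parse.urlencode(params)


def fetch_intro(title: str) -> str:
    """Download the intro text (network call; defined but not run here)."""
    req = urllib.request.Request(
        extract_url(title), headers={"User-Agent": "snippet-sketch/0.1"}
    )
    with urllib.request.urlopen(req) as resp:
        pages = json.load(resp)["query"]["pages"]
    # The result is keyed by page id; take the single page we asked for.
    return next(iter(pages.values())).get("extract", "")


def make_snippet(text: str, max_chars: int = 120) -> str:
    """Cut the text at a word boundary and mark the truncation."""
    if len(text) <= max_chars:
        return text
    cut = text[:max_chars].rsplit(" ", 1)[0]
    return cut + " …"
```

A page such as the Delpher search results could then show `make_snippet(fetch_intro("De Geus onder studenten (verzetsblad)"))` followed by a "Read more on Wikipedia" link to the article.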

Suggested reading

• http://www.ted.com/talks/tim_berners_lee_on_the_next_web 20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together.

• https://en.wikipedia.org/wiki/Linked_data Wikipedia article related to the above video

• http://5stardata.info/en/ The 5 stars of Linked Open Data (Tim Berners-Lee)

• http://linda-project.eu/linked-data-primer-2 Short primer about creating LOD in practice, starting from an Excel sheet

• http://www.programmableweb.com/news/how-linked-data-solved-digital-age-marketing-problem/analysis/2015/08/31 The figure near the bottom of the first page is a good illustration of the concept of (linked) triples

• https://en.wikipedia.org/wiki/DBpedia

• https://en.wikipedia.org/wiki/Semantic_network
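As a small companion to the reading on (linked) triples, the idea can be sketched in a few lines of Python. Each statement is a (subject, predicate, object) triple; the predicate names below are hypothetical and do not come from DBpedia or any other real vocabulary, and the facts are taken from the De Geus onder studenten example in this lecture.

```python
# Each statement is one triple: (subject, predicate, object).
# Predicate names are made up for this illustration.
triples = [
    ("De Geus onder studenten", "isA", "resistance paper"),
    ("De Geus onder studenten", "publishedIn", "The Hague"),
    ("De Geus onder studenten", "publishedBy", "Jan Drion"),
    ("De Geus onder studenten", "publishedBy", "Huib Drion"),
    ("The Hague", "locatedIn", "Netherlands"),
]


def objects(subject, predicate):
    """Return all objects of the triples matching subject and predicate."""
    return [o for s, p, o in triples if s == subject and p == predicate]


def follow(subject, *path):
    """Follow a chain of predicates from one node to the next."""
    current = [subject]
    for predicate in path:
        current = [o for node in current for o in objects(node, predicate)]
    return current


print(objects("De Geus onder studenten", "publishedBy"))
print(follow("De Geus onder studenten", "publishedIn", "locatedIn"))
```

The `follow` helper shows why the links matter: because the object of one triple is the subject of another, a question about the newspaper can be answered with a fact recorded about the city.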

http://www.gettyimages.nl/detail/nieuwsfoto's/three-women-of-the-ats-light-up-together-ats-regulations-nieuwsfotos/3094265

Questions

olaf.janssen@kb.nl - @ookgezellig

tinyurl.com/verzetskranten












