NLW Datasets National Library of Wales Owain Roberts @owainrr Glen Robson @glenrobson

NLW Linked Open Data Sets

Page 1: NLW Linked Open Data Sets

NLW Datasets
National Library of Wales

Owain Roberts @owainrr
Glen Robson @glenrobson

Page 2: NLW Linked Open Data Sets

Background

• NLW has been digitising since the late 90s

• Digitised material tends to be static: once digitised, it doesn't change over time

• Datasets and databases are treated outside the collections systems

• Infrastructure gap identified for dealing with datasets

• Move to dealing with born-digital material and datasets

Page 3: NLW Linked Open Data Sets

Datasets / Derived Content

[Diagram: physical collections become digitised collections through digitisation; derived content is produced from physical, digitised and digital collections through transcription and automation / crowdsourcing.]

Page 4: NLW Linked Open Data Sets

The Storage Problem

Where do we put these?

• Datasets derived from physical collections

• Data derived from digital collections

Page 5: NLW Linked Open Data Sets

What is linked data?

A way of connecting silos of data

A way of enhancing existing data

A way of structuring data (like a database or XML)

A standard way of sharing data (like an API)

Page 6: NLW Linked Open Data Sets

Triples

Subject Predicate Object

• Person hasName Owain
• Person hasAge 24
• Person worksIn NLW - literal
• Person worksIn http://www.llgc.org.uk/ - URI
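The triples on this slide can be sketched in a few lines of Python. This is only an illustration, assuming a hypothetical `URI` marker class and `Triple` type that are not part of any NLW system; it shows the difference between an object that is a plain literal and one that is a URI other datasets can link to.

```python
from typing import NamedTuple, Union


class URI(str):
    """Hypothetical marker: a string that is a URI rather than a plain literal."""


class Triple(NamedTuple):
    subject: str
    predicate: str
    obj: Union[str, int, URI]


# The statements from the slide, expressed as subject / predicate / object.
triples = [
    Triple("Person", "hasName", "Owain"),
    Triple("Person", "hasAge", 24),
    Triple("Person", "worksIn", "NLW"),                           # literal
    Triple("Person", "worksIn", URI("http://www.llgc.org.uk/")),  # URI
]


def is_link(triple: Triple) -> bool:
    """True when the object is a URI, i.e. something another dataset can point at."""
    return isinstance(triple.obj, URI)


links = [t for t in triples if is_link(t)]
```

Only the last statement is a link: the literal "NLW" is just a label, while the URI identifies the organisation in a way other datasets can reuse.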

Page 7: NLW Linked Open Data Sets

Aberystwyth Shipping Records

Transcribed as part of NLW Volunteer Programme
544 ships covering the period 1856-1914

An example…

Page 8: NLW Linked Open Data Sets
Page 9: NLW Linked Open Data Sets

[Timeline diagram of datasets]

• Cynefin Tithe Maps: 1838 - 1947
• Cymru 1914.org and Wales at War: 1914 - 1918
• Shipping Records: 1856 - 1914
• Crime and Punishment Database: 1730 - 1830
• Welsh Biography Online: 0 - 1970
• Welsh Newspapers Online: 1804 - 1919
• Cymru 1900: ~ 1900
• External datasets (three shown)

Page 10: NLW Linked Open Data Sets

Aberystwyth Observer
23 March 1905

Page 11: NLW Linked Open Data Sets

Aberystwyth Observer
23 April 1905

Page 12: NLW Linked Open Data Sets

Events

Page 13: NLW Linked Open Data Sets

[Timeline diagram of datasets]

• Cynefin Tithe Maps: 1838 - 1947
• Cymru 1914.org and Wales at War: 1914 - 1918
• Shipping Records: 1856 - 1914
• Crime and Punishment Database: 1730 - 1830
• Welsh Biography Online: 0 - 1970
• Welsh Newspapers Online: 1804 - 1919
• Cymru 1900: ~ 1900
• External datasets (three shown)

Page 14: NLW Linked Open Data Sets
Page 15: NLW Linked Open Data Sets

New Developments

• IIIF
• Linked Open Data

Page 16: NLW Linked Open Data Sets

Community

National Libraries
• Austria
• British Library
• France
• Denmark
• Egypt
• Israel
• New Zealand
• Norway
• Poland
• Serbia
• Vatican
• Wales

Research Institutions
• C2RMF (France)
• Cornell University
• Johns Hopkins Univ.
• Harvard University
• Oxford University
• Princeton University
• Stanford University
• Wellcome Library
• Yale University
• plus several more

Museums
• YCBA
• British Museum

Aggregators
• Artstor
• DPLA
• Europeana

Projects
• Biblissima
• e-codices
• TPEN
• TextGrid

http://www.slideshare.net/azaroth42/introduction-to-iiif

Page 17: NLW Linked Open Data Sets

What can I do with it?

[Diagram: repositories, IIIF APIs and viewers]

Repositories
• National Library of Wales Repository
• British Library Digital Library
• National Library of Norway Repository
• BnF Repository

APIs
• Image API
• Presentation API

Viewers
• Mirador (IIIF Viewer)
• Wellcome/Universal Viewer (IIIF Viewer)
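To make the Image API on this slide concrete, here is a minimal sketch of how a client might build an image request URL. It follows the URI pattern from the IIIF Image API specification ({base}/{identifier}/{region}/{size}/{rotation}/{quality}.{format}); the server address and identifier are placeholders, not real NLW endpoints, and the helper function names are invented for illustration.

```python
from urllib.parse import quote


def iiif_image_url(base, identifier, region="full", size="max",
                   rotation="0", quality="default", fmt="jpg"):
    """Build an IIIF Image API request URL.

    size="max" is the spelling used by Image API 2.1 and later;
    servers that only implement 2.0 expect size="full" instead.
    """
    # The identifier must be percent-encoded, including any "/" it contains.
    ident = quote(identifier, safe="")
    return f"{base.rstrip('/')}/{ident}/{region}/{size}/{rotation}/{quality}.{fmt}"


def iiif_info_url(base, identifier):
    """URL of the info.json document describing the image."""
    return f"{base.rstrip('/')}/{quote(identifier, safe='')}/info.json"


# Placeholder server and identifier, used only for illustration.
full_image = iiif_image_url("https://example.org/iiif", "ship-register/page1")
detail = iiif_image_url("https://example.org/iiif", "page1",
                        region="0,0,1024,768", size="512,")
```

Because every repository in the diagram answers the same URL pattern, a viewer such as Mirador can show images from several institutions side by side without institution-specific code.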

Page 18: NLW Linked Open Data Sets

Mirador

http://stanford.io/1PW789d

Page 19: NLW Linked Open Data Sets
Page 20: NLW Linked Open Data Sets
Page 21: NLW Linked Open Data Sets

Linked Open Data

• “A method of publishing structured data so that it can be interlinked and become more useful through semantic queries”

• Tim Berners-Lee coined the term in a 2006 design note about the Semantic Web project

https://en.wikipedia.org/wiki/Linked_data

• Turning the web into data rather than documents

Page 22: NLW Linked Open Data Sets

Benefits for Research

• Queryable: SPARQL
• Open Data
  – Not limited by website
  – Not limited to an API
  – Keys to the database
• Linkable to other datasets
  – Wikipedia
  – Geonames
• Built to be added to
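The "queryable" and "linkable" points above can be illustrated with a toy pattern matcher over triples. This is a sketch, not a SPARQL engine: the `ex:` identifiers, the predicates and the Geonames placeholder are all invented, and `match` simply treats `None` as a wildcard in the way a SPARQL variable would be.

```python
from typing import List, Optional, Tuple

Triple = Tuple[str, str, str]

# Illustrative data only: identifiers and predicates are invented.
TRIPLES: List[Triple] = [
    ("ex:ship1", "ex:registeredAt", "ex:Aberystwyth"),
    ("ex:ship1", "ex:visited", "ex:placeA"),
    ("ex:ship2", "ex:registeredAt", "ex:Aberystwyth"),
    ("ex:ship2", "ex:visited", "ex:placeB"),
    ("ex:placeA", "owl:sameAs", "geonames:placeholder-id"),
]


def match(triples: List[Triple], s: Optional[str] = None,
          p: Optional[str] = None, o: Optional[str] = None) -> List[Triple]:
    """Return the triples that fit the pattern; None matches anything."""
    return [
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    ]


# Roughly: SELECT ?ship WHERE { ?ship ex:visited ex:placeA }
ships_at_a = [s for s, _, _ in match(TRIPLES, p="ex:visited", o="ex:placeA")]


def external_ids(ship: str) -> List[str]:
    """Follow ship -> visited place -> owl:sameAs link to an external dataset."""
    places = [o for _, _, o in match(TRIPLES, s=ship, p="ex:visited")]
    return [o for place in places
            for _, _, o in match(TRIPLES, s=place, p="owl:sameAs")]
```

The second query is the interesting one for research: it crosses from the local dataset to an external one, which a website or a fixed API would have to anticipate in advance.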

Page 23: NLW Linked Open Data Sets

Book of Remembrance

• Once transcribed it will be a complete dataset of the Welsh fallen
  – Query by rank, location, service
• Linkable
  – Geonames – county/area
  – Wales at War - http://www.walesatwar.org

Page 24: NLW Linked Open Data Sets

Shipping Registers

• 544 merchant vessels registered at the port of Aberystwyth
• 1856-1914
• Crew lists – name, position, birth date, reason for leaving, location
• Transcribed by volunteers
• Modelled in Linked Open Data
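As a rough sketch of what modelling a crew list entry in Linked Open Data could look like, the code below turns one transcribed row into triples. The fields come from the slide (name, position, birth date, reason for leaving, location); the `ex:` vocabulary, the identifier scheme and the sample values are assumptions made for illustration and are not the model NLW actually used.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class CrewEntry:
    entry_id: str
    ship_id: str
    name: str
    position: str
    birth_date: Optional[str] = None
    reason_for_leaving: Optional[str] = None
    location: Optional[str] = None


def to_triples(entry: CrewEntry) -> List[Tuple[str, str, str]]:
    """Express one crew list row as triples, skipping fields left blank."""
    subject = f"ex:crew/{entry.entry_id}"
    candidates = [
        ("ex:servedOn", f"ex:ship/{entry.ship_id}"),  # link to the ship
        ("ex:name", entry.name),
        ("ex:position", entry.position),
        ("ex:birthDate", entry.birth_date),
        ("ex:reasonForLeaving", entry.reason_for_leaving),
        ("ex:location", entry.location),
    ]
    return [(subject, p, o) for p, o in candidates if o is not None]


# Invented sample row; not taken from the transcribed registers.
entry = CrewEntry("1", "42", "Example Sailor", "Mate", location="Aberystwyth")
crew_triples = to_triples(entry)
```

Blank fields simply produce no triple, which suits volunteer transcriptions where some columns are illegible or missing.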

Page 25: NLW Linked Open Data Sets
Page 26: NLW Linked Open Data Sets

Top 4 Places Visited
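A ranking like this one can be derived from the structured data with a simple count. The sketch below assumes visits are available as (ship, place) pairs; the place names and counts are placeholders, not figures from the shipping records.

```python
from collections import Counter
from typing import List, Tuple


def top_places(visits: List[Tuple[str, str]], n: int = 4) -> List[Tuple[str, int]]:
    """Count how often each place appears and return the n most visited."""
    return Counter(place for _, place in visits).most_common(n)


# Placeholder data for illustration only.
visits = [
    ("ship1", "Place A"), ("ship2", "Place A"), ("ship3", "Place A"),
    ("ship1", "Place B"), ("ship2", "Place B"),
    ("ship3", "Place C"),
]
ranking = top_places(visits, n=2)
```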

Page 27: NLW Linked Open Data Sets

Top Visits

Page 28: NLW Linked Open Data Sets

Problems

• Linking out
  – Places -> Geonames, Cynefin
  – Ships -> Wikipedia
  – Ships -> Newspapers?
• Disambiguation
  – Between people
    • In Shipping Records
    • Across resources e.g. Newspapers
  – Between resources
  – Dutch Shipping to Newspaper linking: http://bit.ly/1Talish/
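To show why disambiguation between people is hard, here is a deliberately naive sketch of candidate matching. It assumes each record has a name and an optional birth year, normalises the name, and allows a small tolerance on the year; the function names, the tolerance and the sample records are all invented, and real matching would need far more evidence than this.

```python
import re
from typing import Dict, Optional


def normalise_name(name: str) -> str:
    """Lower-case, drop punctuation and collapse whitespace."""
    cleaned = re.sub(r"[^\w\s]", "", name.lower())
    return " ".join(cleaned.split())


def same_person_candidate(a: Dict[str, object], b: Dict[str, object],
                          year_tolerance: int = 1) -> bool:
    """Flag two records as possibly the same person (a candidate, not a decision)."""
    if normalise_name(str(a["name"])) != normalise_name(str(b["name"])):
        return False
    year_a: Optional[int] = a.get("birth_year")  # type: ignore[assignment]
    year_b: Optional[int] = b.get("birth_year")  # type: ignore[assignment]
    if year_a is None or year_b is None:
        return True  # nothing contradicts the match, so keep it for review
    return abs(year_a - year_b) <= year_tolerance


# Invented sample records.
crew_record = {"name": "John  Jones", "birth_year": 1860}
newspaper_mention = {"name": "john jones.", "birth_year": 1861}
other_record = {"name": "John Jones", "birth_year": 1875}
```

With common names, a rule like this produces many false candidates, which is why the slide lists disambiguation as a problem rather than a solved step.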

Page 29: NLW Linked Open Data Sets

Going forward

• Release more datasets as LOD
• Crowdsourcing to create data
• Working with researchers on enhancing datasets
• Can we turn the Newspapers into a queryable dataset?
  – Named Entity Recognition
  – Crowd
  – Research Projects
• Can we link our digital resources together?