22
Ricardo Santos Muñoz (Speaker) Ana Manchado Mangas Biblioteca Nacional de España / National Library of Spain Daniel Vila Suero Ontology Engineering Group Universidad Politécnica de Madrid / Technical University of Madrid 81st IFLA World Library and Information Congress – Session 207 15-21 August, Cape Town, South Africa BIBLIOTECA NACIONAL DE ESPAÑA datos.bne.es: experimenting with LOD and FRBR to access National Library of Spain collections

datos.bne.es: experimenting with LOD and FRBR to access National Library of Spain collections. Ricardo Santos Muñoz, Ana Manchado Mangas, Daniel Vila Suero

Embed Size (px)

Citation preview

Page 1: datos.bne.es: experimenting with LOD and FRBR to access National Library of Spain collections. Ricardo Santos Muñoz, Ana Manchado Mangas, Daniel Vila Suero

Ricardo Santos Muñoz (Speaker) Ana Manchado Mangas Biblioteca Nacional de España / National Library of Spain Daniel Vila Suero Ontology Engineering Group Universidad Politécnica de Madrid / Technical University of Madrid

81st IFLA World Library and Information Congress – Session 207 15-21 August, Cape Town, South Africa

BIBLIOTECA NACIONAL DE ESPAÑA

datos.bne.es: experimenting with LOD and FRBR to access National Library of Spain collections

Page 2: datos.bne.es: experimenting with LOD and FRBR to access National Library of Spain collections. Ricardo Santos Muñoz, Ana Manchado Mangas, Daniel Vila Suero

BIBLIOTECA NACIONAL DE ESPAÑA 2

The players

Cataloguing

Digital Library & Information Systems

“Focus group”

LOD experts

Developers

Page 3: datos.bne.es: experimenting with LOD and FRBR to access National Library of Spain collections. Ricardo Santos Muñoz, Ana Manchado Mangas, Daniel Vila Suero

BIBLIOTECA NACIONAL DE ESPAÑA 3

Datos 1.0 (2011)

Publish data according to IFLA models and vocabularies: FRBR, ISBD

PRIMARY GOAL:

CONTENT:

2.4 millions bib records (ancient and modern monographs, printed music and sound recordings) and authority records

AVAILABILITY:

Datadumps and Sparql end-point access

TARGET AUDIENCE:

Developers, LOD experts, data consumers

Page 4: datos.bne.es: experimenting with LOD and FRBR to access National Library of Spain collections. Ricardo Santos Muñoz, Ana Manchado Mangas, Daniel Vila Suero

BIBLIOTECA NACIONAL DE ESPAÑA 4

Datos 2.0 (2014) PRIMARY GOAL:

CONTENT:

AVAILABILITY:

Human interface, Sparql end-point access and other services

Build an innovative gateway to collections, profiting from Linked Data technology and FRBR model

Almost all the catalogue (except serials and items), access to digital objects

TARGET AUDIENCE:

Common users, researchers, reference librarians

Page 5: datos.bne.es: experimenting with LOD and FRBR to access National Library of Spain collections. Ricardo Santos Muñoz, Ana Manchado Mangas, Daniel Vila Suero

BIBLIOTECA NACIONAL DE ESPAÑA 5

Datos 2.0 (2014) MAIN FEATURES:

Allow users to experiment with new ways of discovering resources Covering nearly all the resources in the library Access to digitized content Data enriched and linked to outside sources FRBR as a model of reference Resources to be discovered and accessed straight from Google Resources described with BNE ontology

Page 6: datos.bne.es: experimenting with LOD and FRBR to access National Library of Spain collections. Ricardo Santos Muñoz, Ana Manchado Mangas, Daniel Vila Suero

BIBLIOTECA NACIONAL DE ESPAÑA 6

Datos 2.0: front-end A BRAND NEW CATALOGUE!!!

MULTI-DEVICE RESPONSIVE!!!

ENTITY-DRIVEN SEARCH!!!

FULLY ENRICHED AND LINKED!!!

FRBR-BASED SEARCH RESULTS RANKING!!!

LIVE DEMO

(FINGERS CROSSED)

Page 7: datos.bne.es: experimenting with LOD and FRBR to access National Library of Spain collections. Ricardo Santos Muñoz, Ana Manchado Mangas, Daniel Vila Suero

BIBLIOTECA NACIONAL DE ESPAÑA 7

An FRBR flavour

FRBR/FRAD ENTITIES:

Persons / Corporate Bodies WORKs Expressions Manifestations

Datos 1.0 FRBR as a data model Datos 2.0 FRBR as a reference model

Authority records

Bibliographic records

Page 8: datos.bne.es: experimenting with LOD and FRBR to access National Library of Spain collections. Ricardo Santos Muñoz, Ana Manchado Mangas, Daniel Vila Suero

BIBLIOTECA NACIONAL DE ESPAÑA 8

FRBR links building

Expression manifested

Expression of Work

Author of Work

Page 9: datos.bne.es: experimenting with LOD and FRBR to access National Library of Spain collections. Ricardo Santos Muñoz, Ana Manchado Mangas, Daniel Vila Suero

BIBLIOTECA NACIONAL DE ESPAÑA 9

FRBR links building

Expression manifested

http://datos.bne.es/version/XX2135693eng

Page 10: datos.bne.es: experimenting with LOD and FRBR to access National Library of Spain collections. Ricardo Santos Muñoz, Ana Manchado Mangas, Daniel Vila Suero

BIBLIOTECA NACIONAL DE ESPAÑA 10

datos.bne.es: the making-of MARC21 data extracted

and analyzed Mapping 1: assigning data

to WEM entities

Mapping 2: relating entities

Mapping 3: Properties’ annotation

Publication pipeline

Page 11: datos.bne.es: experimenting with LOD and FRBR to access National Library of Spain collections. Ricardo Santos Muñoz, Ana Manchado Mangas, Daniel Vila Suero

BIBLIOTECA NACIONAL DE ESPAÑA 11

DATOS 1.0

Struggling with FRBR

Data that could not be linked was ignored Solid data model, but huge data loss

Page 12: datos.bne.es: experimenting with LOD and FRBR to access National Library of Spain collections. Ricardo Santos Muñoz, Ana Manchado Mangas, Daniel Vila Suero

BIBLIOTECA NACIONAL DE ESPAÑA 12

DATOS 1.0 DATOS 2.0

Creator bne:OP5002

Contributor bne:OP3005

has relation bne:3008 has subject

bne:7001

Struggling with FRBR

No data loss, but FRBR loopholes The “missing works” affair

Expressions as mere facets?

Page 13: datos.bne.es: experimenting with LOD and FRBR to access National Library of Spain collections. Ricardo Santos Muñoz, Ana Manchado Mangas, Daniel Vila Suero

BIBLIOTECA NACIONAL DE ESPAÑA 13

BIBO

RDA

Ontology DATOS 1.0

BNE data ISBD FRBR FRSAD

FRAD

DATOS 2.0

BNE data BNE ontology

Alignments

SKOS/ MADS for Subjects

Page 14: datos.bne.es: experimenting with LOD and FRBR to access National Library of Spain collections. Ricardo Santos Muñoz, Ana Manchado Mangas, Daniel Vila Suero

BIBLIOTECA NACIONAL DE ESPAÑA 14

Behind the scenes

MARiMbA

Data (Marc)

Data analysis Mapping & sorting

Linked Data generation

Data generation

Publication pipeline

Indexing

Double storing Front-end Triple store

Ranking based in FRBR relationships

No data transformation made after generation Schema.org annotations in every entity html page

Post processing Linking

Enrichment

Page 15: datos.bne.es: experimenting with LOD and FRBR to access National Library of Spain collections. Ricardo Santos Muñoz, Ana Manchado Mangas, Daniel Vila Suero

BIBLIOTECA NACIONAL DE ESPAÑA 15

Other [LOD] services

SPARQL end-point Search API Content negotiation In development: Extended API Dumps

Page 16: datos.bne.es: experimenting with LOD and FRBR to access National Library of Spain collections. Ricardo Santos Muñoz, Ana Manchado Mangas, Daniel Vila Suero

BIBLIOTECA NACIONAL DE ESPAÑA 16

Outcomes datos.bne.es as:

Source for data consuming applications Intermediate source of data

Expert users

Reference tool Integrating device Librarians

New ways to access and discover resources Access from search engine results

Common users

Page 17: datos.bne.es: experimenting with LOD and FRBR to access National Library of Spain collections. Ricardo Santos Muñoz, Ana Manchado Mangas, Daniel Vila Suero

BIBLIOTECA NACIONAL DE ESPAÑA 17

Internet exposure People coming straight from search engine results

Specially in the “long tail” resources

Page 18: datos.bne.es: experimenting with LOD and FRBR to access National Library of Spain collections. Ricardo Santos Muñoz, Ana Manchado Mangas, Daniel Vila Suero

BIBLIOTECA NACIONAL DE ESPAÑA 18

Internet exposure : contents

- Hidden resources made visible. (You got

pictures of my town!!, That’s the weird out-of-print issue I’ve been looking for!!)

- Errors warning

- Misconceptions about services (I want to buy

/ download this item !!!) - False expectations.

Page 19: datos.bne.es: experimenting with LOD and FRBR to access National Library of Spain collections. Ricardo Santos Muñoz, Ana Manchado Mangas, Daniel Vila Suero

BIBLIOTECA NACIONAL DE ESPAÑA 19

Internet exposure : people

- Hey¡ That’s me on the Internet!. Do you want more data? Do you want more of my books? - I know that author!!!. Do you want more data?

- Hey¡ That’s me on the Internet!. And I don’t want to be on the Internet!!!!

Page 20: datos.bne.es: experimenting with LOD and FRBR to access National Library of Spain collections. Ricardo Santos Muñoz, Ana Manchado Mangas, Daniel Vila Suero

BIBLIOTECA NACIONAL DE ESPAÑA 20

FRBR for users

Useful for:

- Grouping related things - As a general model for stablishing relationships

Lessons learned: - Use FRBR, don’t talk FRBR. - Simple relationships are harder to explain.

Moderador
Notas de la presentación
Page 21: datos.bne.es: experimenting with LOD and FRBR to access National Library of Spain collections. Ricardo Santos Muñoz, Ana Manchado Mangas, Daniel Vila Suero

BIBLIOTECA NACIONAL DE ESPAÑA 21

To be continued

Content Entities Relationships Data sources Data enrichment

Entities descriptions Catalogue procedures Search capabilities Ontology alignments LOD services & documentation

Page 22: datos.bne.es: experimenting with LOD and FRBR to access National Library of Spain collections. Ricardo Santos Muñoz, Ana Manchado Mangas, Daniel Vila Suero

BIBLIOTECA NACIONAL DE ESPAÑA

Pº de Recoletos 20-22 28071 Madrid

España T +34 915 807 800

www.bne.es

Ricardo Santos Muñoz Technical Processes Department National Library of Spain [email protected]

Many thanks for your attention