View
1.507
Download
0
Embed Size (px)
Citation preview
Ricardo Santos Muñoz (Speaker) Ana Manchado Mangas Biblioteca Nacional de España / National Library of Spain Daniel Vila Suero Ontology Engineering Group Universidad Politécnica de Madrid / Technical University of Madrid
81st IFLA World Library and Information Congress – Session 207 15-21 August, Cape Town, South Africa
BIBLIOTECA NACIONAL DE ESPAÑA
datos.bne.es: experimenting with LOD and FRBR to access National Library of Spain collections
BIBLIOTECA NACIONAL DE ESPAÑA 2
The players
Cataloguing
Digital Library & Information Systems
“Focus group”
LOD experts
Developers
BIBLIOTECA NACIONAL DE ESPAÑA 3
Datos 1.0 (2011)
Publish data according to IFLA models and vocabularies: FRBR, ISBD
PRIMARY GOAL:
CONTENT:
2.4 millions bib records (ancient and modern monographs, printed music and sound recordings) and authority records
AVAILABILITY:
Datadumps and Sparql end-point access
TARGET AUDIENCE:
Developers, LOD experts, data consumers
BIBLIOTECA NACIONAL DE ESPAÑA 4
Datos 2.0 (2014) PRIMARY GOAL:
CONTENT:
AVAILABILITY:
Human interface, Sparql end-point access and other services
Build an innovative gateway to collections, profiting from Linked Data technology and FRBR model
Almost all the catalogue (except serials and items), access to digital objects
TARGET AUDIENCE:
Common users, researchers, reference librarians
BIBLIOTECA NACIONAL DE ESPAÑA 5
Datos 2.0 (2014) MAIN FEATURES:
Allow users to experiment with new ways of discovering resources Covering nearly all the resources in the library Access to digitized content Data enriched and linked to outside sources FRBR as a model of reference Resources to be discovered and accessed straight from Google Resources described with BNE ontology
BIBLIOTECA NACIONAL DE ESPAÑA 6
Datos 2.0: front-end A BRAND NEW CATALOGUE!!!
MULTI-DEVICE RESPONSIVE!!!
ENTITY-DRIVEN SEARCH!!!
FULLY ENRICHED AND LINKED!!!
FRBR-BASED SEARCH RESULTS RANKING!!!
LIVE DEMO
(FINGERS CROSSED)
BIBLIOTECA NACIONAL DE ESPAÑA 7
An FRBR flavour
FRBR/FRAD ENTITIES:
Persons / Corporate Bodies WORKs Expressions Manifestations
Datos 1.0 FRBR as a data model Datos 2.0 FRBR as a reference model
Authority records
Bibliographic records
BIBLIOTECA NACIONAL DE ESPAÑA 8
FRBR links building
Expression manifested
Expression of Work
Author of Work
BIBLIOTECA NACIONAL DE ESPAÑA 9
FRBR links building
Expression manifested
http://datos.bne.es/version/XX2135693eng
BIBLIOTECA NACIONAL DE ESPAÑA 10
datos.bne.es: the making-of MARC21 data extracted
and analyzed Mapping 1: assigning data
to WEM entities
Mapping 2: relating entities
Mapping 3: Properties’ annotation
Publication pipeline
BIBLIOTECA NACIONAL DE ESPAÑA 11
DATOS 1.0
Struggling with FRBR
Data that could not be linked was ignored Solid data model, but huge data loss
BIBLIOTECA NACIONAL DE ESPAÑA 12
DATOS 1.0 DATOS 2.0
Creator bne:OP5002
Contributor bne:OP3005
has relation bne:3008 has subject
bne:7001
Struggling with FRBR
No data loss, but FRBR loopholes The “missing works” affair
Expressions as mere facets?
BIBLIOTECA NACIONAL DE ESPAÑA 13
BIBO
RDA
Ontology DATOS 1.0
BNE data ISBD FRBR FRSAD
FRAD
DATOS 2.0
BNE data BNE ontology
Alignments
SKOS/ MADS for Subjects
BIBLIOTECA NACIONAL DE ESPAÑA 14
Behind the scenes
MARiMbA
Data (Marc)
Data analysis Mapping & sorting
Linked Data generation
Data generation
Publication pipeline
Indexing
Double storing Front-end Triple store
Ranking based in FRBR relationships
No data transformation made after generation Schema.org annotations in every entity html page
Post processing Linking
Enrichment
BIBLIOTECA NACIONAL DE ESPAÑA 15
Other [LOD] services
SPARQL end-point Search API Content negotiation In development: Extended API Dumps
BIBLIOTECA NACIONAL DE ESPAÑA 16
Outcomes datos.bne.es as:
Source for data consuming applications Intermediate source of data
Expert users
Reference tool Integrating device Librarians
New ways to access and discover resources Access from search engine results
Common users
BIBLIOTECA NACIONAL DE ESPAÑA 17
Internet exposure People coming straight from search engine results
Specially in the “long tail” resources
BIBLIOTECA NACIONAL DE ESPAÑA 18
Internet exposure : contents
- Hidden resources made visible. (You got
pictures of my town!!, That’s the weird out-of-print issue I’ve been looking for!!)
- Errors warning
- Misconceptions about services (I want to buy
/ download this item !!!) - False expectations.
BIBLIOTECA NACIONAL DE ESPAÑA 19
Internet exposure : people
- Hey¡ That’s me on the Internet!. Do you want more data? Do you want more of my books? - I know that author!!!. Do you want more data?
- Hey¡ That’s me on the Internet!. And I don’t want to be on the Internet!!!!
BIBLIOTECA NACIONAL DE ESPAÑA 20
FRBR for users
Useful for:
- Grouping related things - As a general model for stablishing relationships
Lessons learned: - Use FRBR, don’t talk FRBR. - Simple relationships are harder to explain.
BIBLIOTECA NACIONAL DE ESPAÑA 21
To be continued
Content Entities Relationships Data sources Data enrichment
Entities descriptions Catalogue procedures Search capabilities Ontology alignments LOD services & documentation
BIBLIOTECA NACIONAL DE ESPAÑA
Pº de Recoletos 20-22 28071 Madrid
España T +34 915 807 800
www.bne.es
Ricardo Santos Muñoz Technical Processes Department National Library of Spain [email protected]
Many thanks for your attention