Upload
talis-consulting
View
1.886
Download
2
Tags:
Embed Size (px)
Citation preview
Jochen SchirrwagenBielefeld University Library, Germany
CRIS and OAR entities as Linked
Data in scholarly communication –
a vision scenario
About OpenAIRE - Motivation
Implementation of a support infrastructure for the European Open Access pilot (2009-2012)
Research Assessment
– Identification, Capture, Measurement of EC funded FP7 project results (Special Clause 39)
Addressing of Interoperability aspects regarding
– European Commission (EC) Tools
– Current Research Information Systems (CRIS) JISC Research Excellence Framework
– Open Access Repositories (OAR)
2 London, 14th of July 2011 – Linked Data and Libraries
OpenAIRE facts
Itself an EC funded FP7 project
38 OpenAIRE partners across Europe
27 National Open Access Liaison Offices
6887 projects in FP7
– Using CORDA as authoritative source
Some 10.000 publications estimated
Data-Sources (striving for OpenAIRE Guidelines compliancy)
– Institutional repositories Using OpenDOAR as authoritative source of ~850 repositories
– Coverage of subject-based repositories planned
– Coverage of OA journals planned
3 London, 14th of July 2011 – Linked Data and Libraries
Interoperability Scenario
4 London, 14th of July 2011 – Linked Data and Libraries
OpenAIRE
CRIS CORDA
OA-Repositories
Deposit/claiming
of publications
related to project
Project data
at ECProject data at
institution
Bibliographic
data; DC;
OAI-PMH
Author
KE-CRIS-OAR;
PMH; ORE
Candidates for Entities &
Vocabularies
5 London, 14th of July 2011 – Linked Data and Libraries
Challenges – Data and
Interoperability
Capturing research output from different domains involves:
Different responsibilities and tasks
Different metadata formats used
Different metadata exchange interfaces and protocols
Different metadata granularity
– In CRIS -> fine
– In OAR -> coarse
6 London, 14th of July 2011 – Linked Data and Libraries
Challenges – Data and
Interoperability
In the CRIS domain
– Covers the research process
– Run by the administrative department
– Broader view on research information
– Diverse data models and formats
CERIF (-like) models
DDF-MXD, METIS, PURE
In the OAR domain
– Covers research publications
– Run by the library department
– Focus on bibliographic quality
– Diverse metadata formats
DC, DIDL/MODS, EPrints
7London, 14th of July 2011 – Linked Data and Libraries
Issues addressed
by KE CRIS-OAR
Working group within the quadrolateral Knowledge Exchange-Initiative (KE: SURF-NL, JISC-UK, DFG-DE, DEFF-DK)
Aiming to increase interoperability between CRIS and OAR domains
– Increasing metadata quality and re-use
– Increasing level of interface standards
– By taking existing formats into account: Defining a metadata exchange format With a corresponding set of common
vocabularies
8 London, 14th of July 2011 – Linked Data and Libraries
Publication entity as the center
of interest
9 London, 14th of July 2011 – Linked Data and Libraries
Person
Organisation
Event
Project
Publication
How could “Linked Data” help ?
Common way of linkages of content from distinct domains
– Use of native web-technologies Controlled vocabularies may help to tame semantic variability
– URIfying named entities Data publishers keep control of their data
Avoids context loss compared to interchange formats
May avoid double input and thus redundant data in each domain
Vocabulary helps to tame semantic variability
May reduce the identifier problem by assigning persistent URIs to the entities
– Person (Author) identifier (DAI, ORCID)
– Publication identifier (DOI, URN, …)
– Project identifier (?)
– Event identifier (?)
– Organisation identifier (?)
10 London, 14th of July 2011 – Linked Data and Libraries
Aggregation of Interlinked Data
Task to be addressed:
– “bulk import/export” of publication and project data -> new wording “exposure”
– Representing different views on theinformation packages, e.g.: Publication as an Aggregation of Person,
Organisation, Project and Event entities
Nested aggregation as a collection of all publications, where each publication is itself an aggregation of relative CRIS-OAR entities
11 London, 14th of July 2011 – Linked Data and Libraries
OAI-ORE Approach Sample
London, 14th of July 2011 – Linked Data and Libraries12
ore:aggregates
ore:describes ore:isDescribedBy
ore:describes
ore:aggregates
ore:aggregates
ore:aggregates
ore:aggregatesore:aggregates
ore:aggregates
ore:aggregates
ore:aggregates
A-Pub
kecrisoar:
publication
AR-Pers
AR-Org
AR-Eve kecrisoar:event
kecrisoar:organization
kecrisoar:person
AR-Proj kecrisoar:projectore:aggregates
ReM-
Pub ReM-
Proj
Fulltext
Extract of a ORE RDF
Serialization
13 London, 14th of July 2011 – Linked Data and Libraries
Next Steps
Adopting KE CRIS-OAR model and vocabulary
Addressing “Linked Data” in OpenAIREplusstarting in 12/2011
– Linkage of publications and research data
– Linkage of CRIS and OAR domains
Further scenarios may include linked data ascitations or statistical data
14 London, 14th of July 2011 – Linked Data and Libraries
Further Links
• Portal and project home: www.openaire.eu
• EC pilot: ec.europa.eu/research/science-society/open_access
• CERIF: www.eurocris.org
• KE CRIS-OAR: http://knowledge-exchange.info/Default.aspx?ID=340
• CRIS-OAR schema and vocabulary: https://infoshare.dtv.dk/twiki/bin/view/KeCrisOar/KeCrisOarFormat
15 London, 14th of July 2011 – Linked Data and Libraries
Jochen [email protected]
Wolfram [email protected]