Upload
drusilla-melton
View
214
Download
0
Embed Size (px)
Citation preview
CRISP WP17 2/2
Data ContinuumAchievements & Perspectives
18th March 2013 Jean-François Perrin - Institut Laue Langevin - CRISP 2nd Annual Meeting 1
Data Continuum
Aim for better sustainable links, traceability an transparency for the whole chain of
experimental science.
18th March 2013 Jean-François Perrin - Institut Laue Langevin - CRISP 2nd Annual Meeting 2
Data Continuum
18th March 2013 Jean-François Perrin - Institut Laue Langevin - CRISP 2nd Annual Meeting 3
• Data archiving• Proper annotation (Exp context, sample description, …)• Cataloguing• Analysed / Derived data• Software archiving• Policies• Data persistent identification• Publications• Links with publication
– Publication -> Data– Data -> Publication
DoW
• T4 Identify the PI system which best fits the needs of the partners;
• T5 Elaborate a data publication process satisfying data policies of the participating RIs;
• T6 Implement the persistent Identifier technology;
• T7 Cooperate with the major publishers to ensure that publications, issued from data generated at the facilities, provide reference to the experimental data sets.
18th March 2013 Jean-François Perrin - Institut Laue Langevin - CRISP 2nd Annual Meeting 4
Persistent Identifiers Technologies
• Objectives :– Being able to identify permanently and
sustainably the Data Sets– Sealing the links between datasets and
publication • Which one, ARK, PURL(Z), HANDLE/DOI,
our own … ?
18th March 2013 Jean-François Perrin - Institut Laue Langevin - CRISP 2nd Annual Meeting 5
DOI
• DOI, Why ?– Technically (persistence, scope, scalability,
… ) fits our needs (MS7 - Laura Rueda)– DOI has a very good libraries/publisher
support and market penetration.– ISO standard– DOI is becoming The standard for PID– Worst case scenario -> handle system
18th March 2013 Jean-François Perrin - Institut Laue Langevin - CRISP 2nd Annual Meeting 6
Registration Agency• Datacite : amongst the 10 Ras, DataCite is
the only one that focus on scientific data. – DOI’s metadata are adapted for scientific
experimental data; – Provides APIs (Web Services) for automating the
generation of DOIs; and much more (export /OAI-PMH, metadata search engine, citation formatter, statistics …)
– Datacite is composed of major scientific Libraries;– Very active organisation - Agreements with
Publishers.
18th March 2013 Jean-François Perrin - Institut Laue Langevin - CRISP 2nd Annual Meeting 7
Contract• Have to deal with national representative
– ESRF & ILL with INIST (CNRS)• Annual fee in order to cover the Infrastructure
cost (1000€ for 10,000 DOIs)
• Difficulties– High level quality metadata (samples are
prepared by the users, difficult to asserts for the RIs)
– Mandatory metadata vs competition & non disclosure period
18th March 2013 Jean-François Perrin - Institut Laue Langevin - CRISP 2nd Annual Meeting 8
DOI Metadata
• DOI = 10.5291/ILL-DATA.6-05-579• Creator = Proposal team• URL of landing page• Title = Title of the experimental proposal• Publisher = Institut Laue - Langevin• Date = 3rd December 2010 • Data Format = ILL ascii / NXS• Data Citation = 1.Fontana, Aldo, Fabiani,
…18th March 2013 Jean-François Perrin - Institut Laue Langevin - CRISP 2nd Annual Meeting 9
Landing page
18th March 2013 Jean-François Perrin - Institut Laue Langevin - CRISP 2nd Annual Meeting 10
Workflow
• ILL - current– Mint DOI right after the end of the first
experiment– Might change, if Authors list and title appears
to be too sensitive– Update the full set of Metadata after the end
of the non-disclosure period.
18th March 2013 Jean-François Perrin - Institut Laue Langevin - CRISP 2nd Annual Meeting 11
Current status
• CERN – DESY – ESRF – ILL agreed on DOI and DataCite.
• Implementation of DOIs
– ILL and DataCite are signing the contract– Deployment DOIs generation mechanism– Development of registration tools– Landing page public rollout
18th March 2013 Jean-François Perrin - Institut Laue Langevin - CRISP 2nd Annual Meeting 12
Perspective & Open discussion
• T5: Elaborate a data publication process satisfying data policies of the participating RIs;
Who got a Data Policy ?
18th March 2013 Jean-François Perrin - Institut Laue Langevin - CRISP 2nd Annual Meeting 13
Perspective & Open discussion
• T7: Publication -> Data– Who has contact with publishers (commercial
or Open Access) ?• T7: Data -> Publication
– Harvesting ourselves using OAI-PMH ?– Co-development with specialized company
(e.g. http://www.symplectic.co.uk/) ?
– Work with DataCite ?
18th March 2013 Jean-François Perrin - Institut Laue Langevin - CRISP 2nd Annual Meeting 14
Questions ?
18th March 2013 Jean-François Perrin - Institut Laue Langevin - CRISP 2nd Annual Meeting 15