CRISP WP17 2/2 Data Continuum Achievements & Perspectives 18th March 2013 Jean-François Perrin - Institut Laue Langevin - CRISP 2nd Annual Meeting 1


CRISP WP17 2/2

Data Continuum Achievements & Perspectives


Data Continuum

Aim for better, sustainable links, traceability and transparency across the whole chain of experimental science.


Data Continuum


• Data archiving
• Proper annotation (experimental context, sample description, …)
• Cataloguing
• Analysed / derived data
• Software archiving
• Policies
• Persistent identification of data
• Publications
• Links with publications
  – Publication -> Data
  – Data -> Publication


DoW (Description of Work)

• T4 Identify the persistent identifier (PI) system which best fits the needs of the partners;

• T5 Elaborate a data publication process satisfying data policies of the participating RIs;

• T6 Implement the persistent Identifier technology;

• T7 Cooperate with the major publishers to ensure that publications, issued from data generated at the facilities, provide reference to the experimental data sets.


Persistent Identifier Technologies

• Objectives:
  – Being able to identify the data sets permanently and sustainably
  – Sealing the links between data sets and publications
• Which one: ARK, PURL(Z), Handle/DOI, our own … ?


DOI

• DOI, why?
  – Technically (persistence, scope, scalability, …) it fits our needs (MS7 - Laura Rueda)
  – DOI has very good library/publisher support and market penetration
  – ISO standard
  – DOI is becoming the standard for PIDs
  – Worst-case scenario -> fall back to the Handle System, on which DOI is built


Registration Agency

• DataCite: amongst the 10 RAs, DataCite is the only one that focuses on scientific data.
  – Its DOI metadata are adapted for scientific experimental data;
  – It provides APIs (Web Services) for automating the generation of DOIs, and much more (export / OAI-PMH, metadata search engine, citation formatter, statistics, …);
  – DataCite is composed of major scientific libraries;
  – Very active organisation, with agreements with publishers.


Contract

• Have to deal with the national representative
  – ESRF & ILL with INIST (CNRS)
• Annual fee to cover the infrastructure cost (1000€ for 10,000 DOIs)
• Difficulties
  – High-quality metadata (samples are prepared by the users, so the sample description is difficult for the RIs to verify)
  – Mandatory metadata vs competition & non-disclosure period


DOI Metadata

• DOI = 10.5291/ILL-DATA.6-05-579
• Creator = Proposal team
• URL of landing page
• Title = Title of the experimental proposal
• Publisher = Institut Laue - Langevin
• Date = 3rd December 2010
• Data Format = ILL ascii / NXS
• Data Citation = 1. Fontana, Aldo, Fabiani, …
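The metadata fields listed on this slide could be held in a small record type, sketched below in Python. This is only an illustration under stated assumptions: the class name `DatasetRecord`, its field names and the landing page URL are hypothetical and are not taken from ILL's or DataCite's actual schema. The only external facts relied on are that a DOI splits into a prefix and a suffix at the first "/" and that the public resolver lives at https://doi.org/.

```python
from dataclasses import dataclass
from typing import List, Tuple

DOI_RESOLVER = "https://doi.org/"  # public DOI resolver


@dataclass
class DatasetRecord:
    """Hypothetical container for the fields shown on the slide."""
    doi: str
    creators: List[str]
    title: str
    publisher: str
    date: str
    data_format: str
    landing_page: str

    def prefix_suffix(self) -> Tuple[str, str]:
        # A DOI is "<prefix>/<suffix>"; only the first "/" separates them.
        prefix, suffix = self.doi.split("/", 1)
        return prefix, suffix

    def resolver_url(self) -> str:
        # The resolver redirects this URL to the registered landing page.
        return DOI_RESOLVER + self.doi


record = DatasetRecord(
    doi="10.5291/ILL-DATA.6-05-579",
    creators=["Proposal team"],                  # placeholder, as on the slide
    title="Title of the experimental proposal",  # placeholder, as on the slide
    publisher="Institut Laue - Langevin",
    date="2010-12-03",
    data_format="ILL ascii / NXS",
    landing_page="https://example.org/landing/6-05-579",  # hypothetical URL
)
```

Calling `record.prefix_suffix()` separates the registrant prefix from the facility-chosen suffix, and `record.resolver_url()` gives the link a publication would cite.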


Landing page


Workflow

• ILL - current
  – Mint the DOI right after the end of the first experiment
  – Might change if the author list and title appear to be too sensitive
  – Update the full set of metadata after the end of the non-disclosure period.
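The workflow above can be sketched as a single filtering step: before the non-disclosure period ends, only non-sensitive metadata is exposed, and afterwards the full set is published. The sketch below is a simplified illustration, not ILL's implementation. The field names, the choice of `creators` and `title` as the sensitive fields, and the example dates are all assumptions.

```python
from datetime import date

# Hypothetical field names; the slides do not define a schema.
SENSITIVE_FIELDS = ("creators", "title")


def public_metadata(full, today, embargo_end, sensitive=SENSITIVE_FIELDS):
    """Return the subset of `full` that may be exposed on `today`."""
    if today >= embargo_end:
        # Non-disclosure period is over: publish the full metadata set.
        return dict(full)
    # Still under embargo: withhold the fields considered sensitive.
    return {key: value for key, value in full.items() if key not in sensitive}


full = {
    "doi": "10.5291/ILL-DATA.6-05-579",
    "creators": ["Proposal team"],
    "title": "Title of the experimental proposal",
    "publisher": "Institut Laue - Langevin",
}
embargo_end = date(2013, 12, 3)  # assumed date, for illustration only

at_minting = public_metadata(full, date(2010, 12, 3), embargo_end)
after_embargo = public_metadata(full, date(2014, 1, 1), embargo_end)
```

Here `at_minting` keeps only the DOI and publisher, while `after_embargo` equals the full record. Note that withholding creator and title conflicts with registries that treat them as mandatory, which is the tension raised under "Difficulties".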


Current status

• CERN – DESY – ESRF – ILL agreed on DOI and DataCite.

• Implementation of DOIs
  – ILL and DataCite are signing the contract
  – Deployment of the DOI generation mechanism
  – Development of registration tools
  – Landing page public rollout


Perspective & Open discussion

• T5: Elaborate a data publication process satisfying data policies of the participating RIs;

Who has a data policy?


Perspective & Open discussion

• T7: Publication -> Data
  – Who has contact with publishers (commercial or Open Access)?
• T7: Data -> Publication
  – Harvesting ourselves using OAI-PMH?
  – Co-development with a specialized company (e.g. http://www.symplectic.co.uk/)?
  – Work with DataCite?
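To give a feel for the "harvesting ourselves" option, here is a minimal Python sketch of the two building blocks of an OAI-PMH harvester: constructing a `ListRecords` request and extracting Dublin Core identifiers plus the resumption token from a response. The base URL and the sample response are made up for illustration, and no network call is made. A real harvester would fetch each URL, loop until no resumption token is returned, and then match the harvested identifiers against the facility's own DOIs.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

# Namespaces defined by OAI-PMH 2.0 and Dublin Core.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def list_records_url(base_url, metadata_prefix="oai_dc", resumption_token=None):
    """Build a ListRecords request; a resumption token replaces other arguments."""
    if resumption_token:
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return base_url + "?" + urlencode(params)


def extract_identifiers(xml_text):
    """Return (dc:identifier values, resumption token or None)."""
    root = ET.fromstring(xml_text)
    ids = [e.text for e in root.iterfind(".//dc:identifier", NS) if e.text]
    token = root.find(".//oai:resumptionToken", NS)
    return ids, (token.text if token is not None and token.text else None)


# Made-up response fragment, for illustration only.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
 <ListRecords>
  <record>
   <header><identifier>oai:example.org:1</identifier></header>
   <metadata>
    <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
               xmlns:dc="http://purl.org/dc/elements/1.1/">
     <dc:identifier>doi:10.5291/ILL-DATA.6-05-579</dc:identifier>
    </oai_dc:dc>
   </metadata>
  </record>
  <resumptionToken>page-2</resumptionToken>
 </ListRecords>
</OAI-PMH>"""

first_url = list_records_url("https://example.org/oai")  # hypothetical endpoint
ids, token = extract_identifiers(SAMPLE)
next_url = list_records_url("https://example.org/oai", resumption_token=token)
```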


Questions ?
