DECLARATION OF DATA CITATION PRINCIPLES AMSTERDAM MANIFESTO
Mercè Crosas Director of Data Science, IQSS, Harvard University
A li;le history
• Beyond the PDF in Amsterdam, March 2013, organized by Force11
• 1K Challenge: What would you do to change scholarly communica4on?
Amsterdam Manifesto: A simple set of Data CitaKon Principles “We wish to promote best pracKces in data citaKon to facilitate access to data sets and to enable a;ribuKon and reward for those who publish data. Through formal data citaKon, the contribuKons to science by those that share their data will be recognized and potenKally rewarded. To that end, we propose that:...”
Mercè Crosas, Todd Carpenter, David Sho;on and ChrisKne Borgman
But there were other Principles …
Slide: Maryann Martone
And others …
Many groups working on similar documents, each from a slightly different perspecKve
Slide: Maryann Martone
Photo: Flickr
Paul Uhlir “...a plea to come together”
Data CitaKon Synthesis Group
• 36 members, ~ 20 organizaKons • Weekly meeKngs for 3 months, led by Force 11 • Comparison of 4 sets of principles: – Amsterdam Manifesto – CoData Data CitaKon Principles – DataCite Data CitaKon Principles – Digital Data Center Data CitaKon Principles
• Merge them to create a synthesis set of principles -‐ high level and simple h;p://www.force11.org/node/4381
THE CONSENSUS: 8 PRINCIPLES
1. IMPORTANCE
Data should be considered legiKmate, citable products of research. Data citaKons should be accorded the same importance in the scholarly record as citaKons of other research objects, such as publicaKons.
2. CREDIT AND ATTRIBUTION
Data citaKons should facilitate giving scholarly credit and normaKve and legal a;ribuKon to all contributors to the data, recognizing that a single style or mechanism of a;ribuKon may not be applicable to all data.
3. EVIDENCE
Where a specific claim rests upon data, the corresponding data citaKon should be provided.
4. UNIQUE IDENTIFIERS
A data citaKon should include a persistent method for idenKficaKon that is machine acKonable, globally unique, and widely used by a community.
5. ACCESS
Data citaKons should facilitate access to the data themselves and to such associated metadata, documentaKon, and other materials, as are necessary for both humans and machines to make informed use of the referenced data.
6. PERSISTENCE
Metadata describing the data, and unique idenKfiers should persist, even beyond the lifespan of the data they describe.
7. VERSIONING AND GRANULARITY
Data citaKons should facilitate idenKficaKon and access to different versions and/or subsets of data. CitaKons should include sufficient detail to verifiably link the ciKng work to the porKon and version of data cited.
8. INTEROPERABILITY AND FLEXIBILITY
Data citaKon methods should be sufficiently flexible to accommodate the variant pracKces among communiKes but should not differ so much that they compromise interoperability of data citaKon pracKces across communiKes.
WHAT NEXT
Review Process
• Now to End of October: – Drag document by synthesis group: • Preamble • Principles (“as is”) • Supplementary materials (example + glossary + references)
• November – January: – Open document to wider review (publishers, data repositories, libraries, funders, gov agencies, …)
DisseminaKon Plan
• Beginning January: – Integrate comments from stakeholders/community
• Mid January: – Publish final Principles in web site – IniKate endorsement requests – Press releases, disseminaKon to scholars within insKtuKons, social media, media kit
– TransiKon to other groups next steps ager DeclaraKon is endorsed widely
THANK YOU! The Synthesis Group
h;p://www.force11.org/node/4381