19
Making Sense of Data: What You Need to know about Persistent Identifiers, Best Practices, and Funder Requirements Council of Science Editors May 23, 2017 Shelley Stall, MBA, CDMP, EDME AGU Assistant Director, Enterprise Data Management [email protected] @ShelleyStall

Making Sense of Data: What You Need to know about Persistent Identifiers… · 2017. 6. 5. · Making Sense of Data: What You Need to know about Persistent Identifiers, Best Practices,

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Making Sense of Data: What You Need to know about Persistent Identifiers… · 2017. 6. 5. · Making Sense of Data: What You Need to know about Persistent Identifiers, Best Practices,

Making Sense of Data: What You Need to know about

Persistent Identifiers, Best Practices, and Funder Requirements

Council of Science Editors May 23, 2017

Shelley Stall, MBA, CDMP, EDME

AGU Assistant Director, Enterprise Data Management

[email protected] @ShelleyStall

Page 2: Making Sense of Data: What You Need to know about Persistent Identifiers… · 2017. 6. 5. · Making Sense of Data: What You Need to know about Persistent Identifiers, Best Practices,

About AGU • Largest Earth and planetary science society

– 60,000 members; 137 countries – Much more than “geophysics” – Humongous annual meeting (22,000 abstracts; 25,000 attendees)

2

• Largest Society Publisher in the ESS – 20 journals; 4 gold open access – >6000 published papers in 2016 – Explaining science broadly

• Outreach to government leaders and the public – Eos.org – Large press effort; sharing science – Legislative affairs

Page 3: Making Sense of Data: What You Need to know about Persistent Identifiers… · 2017. 6. 5. · Making Sense of Data: What You Need to know about Persistent Identifiers, Best Practices,

3 https://sciencepolicy.agu.org/files/2013/07/AGU-Data-Position-Statement-Final-2015.pdf

Page 4: Making Sense of Data: What You Need to know about Persistent Identifiers… · 2017. 6. 5. · Making Sense of Data: What You Need to know about Persistent Identifiers, Best Practices,

AGU Data Engagement Activities • Started Earth and Space Science in 2014

– Data and methods papers (along with research) • Data Management Assessment Program for

Repositories • AGU Data Blog (Part of GeoSpace) • Coalition on Publishing Data in the Earth and Space

Sciences (COPDESS.org) and many other community efforts aimed at elevating best practices.

• Participating in elevating data best practices – Several Policy pieces in Eos.org and, with others, Science

• Data Fair at AGU Fall Meeting—Information to scientists around data skills and resources

• Many sessions around data and best practices at meetings

4

Page 5: Making Sense of Data: What You Need to know about Persistent Identifiers… · 2017. 6. 5. · Making Sense of Data: What You Need to know about Persistent Identifiers, Best Practices,

2017 SSP 39TH ANNUAL MEETING

5

Thursday, June 1, 2017 • Concurrent 1D: Research Data Policies: Craft

Effective Policy and Improve your Impact – CrossRef, Dryad, Dataverse, PLOS ONE, Springer Nature

• Concurrent 3B: Implementing Best Practices around Open Data, Samples, and Code in Scholarly Publications – AGU, Center for Open Science, Research Data Alliance, Science/AAAS

Page 6: Making Sense of Data: What You Need to know about Persistent Identifiers… · 2017. 6. 5. · Making Sense of Data: What You Need to know about Persistent Identifiers, Best Practices,

6

AGU’s Data Policy states…

All data necessary to understand, evaluate, replicate, and build upon the reported research must be made available and

accessible whenever possible.

Page 7: Making Sense of Data: What You Need to know about Persistent Identifiers… · 2017. 6. 5. · Making Sense of Data: What You Need to know about Persistent Identifiers, Best Practices,

Research Data – Best Practices for a Researcher • Deposit the data in support of your publication in a leading

domain repository that curates such data. • If a domain repository is not available for some or all of your

data, deposit your data in a general repository such as Zenodo, Dryad, or Figshare. All can assign a DOI for deposited data, or use your institution’s archive. Data should not be listed as “available from authors.”

• For small datasets, tables, or images you can also use supplemental material.

• Make sure that the data are available publicly at the time of publication and available to reviewers after submission.

• Cite data or code sets used in your study as part of the reference list. Citations should follow the Joint Declaration of Data Citation Principles.

7

Source: AGU FAQ and Best Practices (Authors) http://publications.agu.org/author-resource-center/publication-policies/data-policy/data-policy-faq/

Page 8: Making Sense of Data: What You Need to know about Persistent Identifiers… · 2017. 6. 5. · Making Sense of Data: What You Need to know about Persistent Identifiers, Best Practices,

Samples, Code and Identifiers – Best Practices for a Researcher • Develop and deposit software in github and cite that

or include simple scripts in a supplement. • Identify your samples with IGSN’s • List all funding sources (including in-kind support) in

the acknowledgments and indicate grant numbers and funders in your submission.

• Include ORCID’s for all authors • Use and follow these resources

(http://www.copdess.org/resources-for-researchers/) to develop projects that support reproducibility and integrity in research.

8

Source: AGU FAQ and Best Practices (Authors) http://publications.agu.org/author-resource-center/publication-policies/data-policy/data-policy-faq/

Page 9: Making Sense of Data: What You Need to know about Persistent Identifiers… · 2017. 6. 5. · Making Sense of Data: What You Need to know about Persistent Identifiers, Best Practices,

Publishers and Repositories are Working Together…

• TOP (Transparency and Openness Promotion) guidelines, signed by 2900 journals and organizations

• COPDESS.org (Coalition on Publishing Data in the Earth and Space Sciences)—Statement of Commitment endorsed by most publishers and repositories in the Earth and space sciences

• Joint Declaration of Data Citation Principles endorsed by 114 organizations including most major publishers.

• Reproducibility conferences and outcomes (AAAS and other orgs)

• Quality/certification standards for repositories expanding

9

Challenge is practicing what you preach

Page 10: Making Sense of Data: What You Need to know about Persistent Identifiers… · 2017. 6. 5. · Making Sense of Data: What You Need to know about Persistent Identifiers, Best Practices,

Coalition on Publishing Data in the Earth and

Space Sciences (COPDESS.org)

Connecting Earth Science publishers and Data Facilities to help translate the aspirations of open, available, and useful data from policy

into practice. – Formed in October 2014

– Endorsed a Statement of Commitment, 2015

– Includes: joint best practices between journals and repositories; references.

10

Page 11: Making Sense of Data: What You Need to know about Persistent Identifiers… · 2017. 6. 5. · Making Sense of Data: What You Need to know about Persistent Identifiers, Best Practices,

What are scholarly publishing best practices… • Joint declaration of data citation principles

– Citing data in the references – Separate data publications when appropriate (more journals now

available) • Transparency about how researchers can access data (e.g.,

statement in acknowledgements) • Include ORCID’s and other persistent identifiers

– Funders, samples, author-credit, institutions (still to come) • Use trusted domain repositories if they are available • Use repositories that allow for data access during peer review • Supplements should follow NISO guidelines • All references should be in main reference list (not in

supplements) • Key references and data should be available at time of

publication (no unpublished or in-press references)

11

Source: COPDESS Statement of Commitment http://www.copdess.org/statement-of- commitment/

Page 12: Making Sense of Data: What You Need to know about Persistent Identifiers… · 2017. 6. 5. · Making Sense of Data: What You Need to know about Persistent Identifiers, Best Practices,

Resources • AGU Data Position Statement:

– https://sciencepolicy.agu.org/files/2013/07/AGU-Data-Position-Statement-Final-2015.pdf

• AGU’s Data Policy – http://publications.agu.org/author-resource-center/publication-

policies/data-policy/ • AGU FAQ and Best Practices (Authors)

– http://publications.agu.org/author-resource-center/publication-policies/data-policy/data-policy-faq/

• COPDESS Statement of Commitment – http://www.copdess.org/statement-of- commitment/

• Transparency and Openness Promotion Guidelines (TOP) – https://cos.io/top/

• Joint Declaration of Data Citation Principles – https://www.force11.org/group/joint-declaration-data-citation-principles-

final

12

Page 13: Making Sense of Data: What You Need to know about Persistent Identifiers… · 2017. 6. 5. · Making Sense of Data: What You Need to know about Persistent Identifiers, Best Practices,

TOP – Modular Standards

Citation Standards Describes citation of data

Data Transparency Describes availability and sharing of data

Analytical Methods Transparency Describes analytical code accessibility

Research Materials Transparency Describes research materials accessibility

Design and Analysis Transparency Sets standards for research design disclosures

Preregistration of Studies Specification of study details before data collection

Preregistration of Analysis Plans Specification of analytical details before data collection

Replication Encourages publication of replication studies

13 Source: https://cos.io/top/

Page 14: Making Sense of Data: What You Need to know about Persistent Identifiers… · 2017. 6. 5. · Making Sense of Data: What You Need to know about Persistent Identifiers, Best Practices,

TOP – Implementation Levels LEVEL 0 LEVEL 1 LEVEL 2 LEVEL 3

CITATION STANDARDS

Journal encourages citation of data, code, and materials, or says nothing.

Journal describes citation of data in guidelines to authors with clear rules and examples.

Article provides appropriate citation for data and materials used consistent with journal's author guidelines.

Article is not published until providing appropriate citation for data and materials following journal's author guidelines.

DATA TRANSPARENCY

Journal encourages data sharing, or says nothing.

Article states whether data are available, and, if so, where to access them.

Data must be posted to a trusted repository. Exceptions must be identified at article submission.

Data must be posted to a trusted repository, and reported analyses will be reproduced independently prior to publication.

ANALYTIC METHODS (CODE) TRANSPARENCY

Journal encourages code sharing, or says nothing.

Article states whether code is available, and, if so, where to access- them.

Code must be posted to a trusted repository. Exceptions must be identified at article submission.

Code must be posted to a trusted repository, and reported analyses will be reproduced independently prior to publication.

14

Are You a Level 0?

Time to Get on

the Move!!

Page 15: Making Sense of Data: What You Need to know about Persistent Identifiers… · 2017. 6. 5. · Making Sense of Data: What You Need to know about Persistent Identifiers, Best Practices,

Next Steps • Review the Statement of Commitment on COPDESS.org

– Best Practices for Data Publication – Links to the policies written by ESS publishers based on the

statement are good examples. • Review the TOP Guidelines for data related standards, and

the others. – Be a signatory if you are not currently. – Make plans to move beyond Level 0.

• Work with your community organizations and institutions on communicating the need for well documented, citable data. – Research Data Alliance (RDA) is a good resource.

• Work together on messaging to authors and in author workshops around data. – Be consistent within your domain and cross domain as much as

possible. – Leverage existing standards.

Page 16: Making Sense of Data: What You Need to know about Persistent Identifiers… · 2017. 6. 5. · Making Sense of Data: What You Need to know about Persistent Identifiers, Best Practices,

Contact Information:

Shelley Stall [email protected]

AGU Data Management Program: http://dataservices.agu.org/dmm/

16

Page 17: Making Sense of Data: What You Need to know about Persistent Identifiers… · 2017. 6. 5. · Making Sense of Data: What You Need to know about Persistent Identifiers, Best Practices,

The details about data… • What data are required (do you really mean all of it)?

– Usually determined by the discipline and the data that are typically stored in domain repositories. More rarely the raw data, but certainly the tabulated data in support of reported results. Anonymized where appropriate.

• Enforcement of policies – We need to help authors (starts at funding agencies and data collection), and be aware

(acknowledgement statement). It is an ethical obligation and key for advancing science

– Ask reviewers and editors about data availability. – May require editorial statements of concern after publication if data are not provided. – Hold or coordinate publications until data and references are available.

• How about someone stealing my data or scooping me? – Widely adopted citation standards and norms – Some communities have much experience in open data – Some urban legends used as excuses

• I received data from someone else or it is commercial or restricted by my government or laws and I can’t release it. – Transparency in access. – IP is ok if data, data products, or software are available and scholarly reuse is allowed. – Alert authors to think about data and access up front, in negotiating transfer

agreements.

17

Page 18: Making Sense of Data: What You Need to know about Persistent Identifiers… · 2017. 6. 5. · Making Sense of Data: What You Need to know about Persistent Identifiers, Best Practices,

Questions for coordination in policies—what would be helpful? • ORCID • IGSN • Data availability statements • Code availability statements • Data citation • Cite COPDESS statement? • Joint statement to funders on data management

plans? • Joint statement on policies?

18

Page 19: Making Sense of Data: What You Need to know about Persistent Identifiers… · 2017. 6. 5. · Making Sense of Data: What You Need to know about Persistent Identifiers, Best Practices,