59
32 The Evolving World of Research Data Management Options and Opportunties @MarkHahnel @figshare

BLC & Digital Science: Mark Hahnel, Figshare

Embed Size (px)

Citation preview

Page 1: BLC & Digital Science: Mark Hahnel, Figshare

32

The Evolving World of Research Data Management

Options and Opportunties

@MarkHahnel @figshare

Page 2: BLC & Digital Science: Mark Hahnel, Figshare

“But taxpayers who are paying for that research will want to see something back. Directly – through open access to results and data. And indirectly – through making science work better for all of us. That’s why we will require open access to all publications stemming from EU-funded research. That’s why we will progressively open access to the research data, too. And why we’re asking national funding bodies to do the same.” Neelie Kroes. Vice President for the Eurpoean Commission

Page 3: BLC & Digital Science: Mark Hahnel, Figshare
Page 4: BLC & Digital Science: Mark Hahnel, Figshare

4

“The Obama Administration is committed to the proposition that citizens deserve easy access to the results of scientific research their tax dollars have paid for. That’s why, in a policy memorandum released today, OSTP Director John Holdren has directed Federal agencies with more than $100M in R&D expenditures to develop plans to make the published results of federally funded research freely available to the public within one year of publication and requiring researchers to better account for and manage the digital data resulting from federally funded scientific research.” February 22nd 2013

Page 5: BLC & Digital Science: Mark Hahnel, Figshare

“Investigators are expected to share with other researchers, at no more than incremental cost and within a reasonable time, the primary data, samples, physical collections and other supporting materials created or gathered in the course of work under NSF grants” http://www.nsf.gov/pubs/policydocs/pappguide/nsf11001/aag_6.jsp#VID4

“NIH expects the timely release and sharing of data to be no later than the acceptance for publication of the main findings from the final dataset” http://grants.nih.gov/grants/policy/data_sharingdata_sharing_guidance.htm#time

“NEH is committed to timely and rapid data distribution” http://www.neh.gov/files/grants/data_management_plans_2012.pdf

Page 6: BLC & Digital Science: Mark Hahnel, Figshare

6

"Products of research are not just publications.” NSF senior policy specialist Beth Strausser.

Biographical Sketch(es), has been revised to rename the “Publications” section to “Products” and amend terminology and instructions accordingly. 13 January 2013: "National Science Foundation’s Merit Review Criteria: Review and Revisions” Chapter II.C.2.f(i)(c),

Page 7: BLC & Digital Science: Mark Hahnel, Figshare
Page 8: BLC & Digital Science: Mark Hahnel, Figshare
Page 9: BLC & Digital Science: Mark Hahnel, Figshare
Page 10: BLC & Digital Science: Mark Hahnel, Figshare
Page 11: BLC & Digital Science: Mark Hahnel, Figshare

11

Page 12: BLC & Digital Science: Mark Hahnel, Figshare

1.  Recommended open access to scholarly papers of publicly funded research

2.  Recommended open access to all digital outputs of publicly funded research

3.  Mandated open access to scholarly papers of publicly funded research

4.  Mandated open access to all digital outputs of publicly funded research

5.  Enforced, mandated open access to scholarly papers of publicly funded research

6.  Enforced, mandated open access to all digital outputs of publicly funded research

The Open Academic Tidal Wave

Page 13: BLC & Digital Science: Mark Hahnel, Figshare

1.  Recommended open access to scholarly papers of publicly funded research

2.  Recommended open access to all digital outputs of publicly funded research

3.  Mandated open access to scholarly papers of publicly funded research

4.  Mandated open access to all digital outputs of publicly funded research

5.  Enforced, mandated open access to scholarly papers of publicly funded research

6.  Enforced, mandated open access to all digital outputs of publicly funded research

The Open Academic Tidal Wave

Page 14: BLC & Digital Science: Mark Hahnel, Figshare

14

Page 15: BLC & Digital Science: Mark Hahnel, Figshare

2  

A cloud based research data management system for academics and administrators:  

What is figshare?  

Manage their research outputs privately and securely, with controlled collaborative spaces

Public repository of all research outputs from an

institution, with impact and usage metrics

Page 16: BLC & Digital Science: Mark Hahnel, Figshare
Page 17: BLC & Digital Science: Mark Hahnel, Figshare

17

Page 18: BLC & Digital Science: Mark Hahnel, Figshare

Storing  it  properly  

Making  it  discoverable  

Managing  Open  Data  

Promo9ng  Sharing  

Page 19: BLC & Digital Science: Mark Hahnel, Figshare

Edi9ng  an  item  on  figshare  

Page 20: BLC & Digital Science: Mark Hahnel, Figshare

Confiden9al  item  on  figshare  

Page 21: BLC & Digital Science: Mark Hahnel, Figshare

Linked  item  on  figshare  

Page 22: BLC & Digital Science: Mark Hahnel, Figshare
Page 23: BLC & Digital Science: Mark Hahnel, Figshare
Page 24: BLC & Digital Science: Mark Hahnel, Figshare
Page 25: BLC & Digital Science: Mark Hahnel, Figshare

There are 109 metrics! ‘Greater effort than expected: over 500 person hours’ ‘A full audit would cost us 10,000 to 25,000 euro’s, a midterm review 5,000 to 10,000 euro’s. Every year such an effort would not be feasible and too costly’ ‘The formulation of the metrics is a bit idealistic (“down to the bit level”)… since no archive is perfect, what will be the ‘less than perfect’ level (or levels for the different metrics), which is acceptable and deserves certification?’ Feedback from test audits http://www.alliancepermanentaccess.org/wp-content/uploads/downloads/2012/04/APARSEN-REP-D33_1B-01-1_0.pdf

16363

Page 26: BLC & Digital Science: Mark Hahnel, Figshare

2 1

3 4 Reporting Dashboard

Impact and Usage Reporting.

Administrative Workflow Portal A portal where administrators can manage curation of files to be made public, storage space allocation and user rights.

Public Digital Research Repository A customisable public portal with all digital files made public at an institutional, departmental and group level.

Research Data Management Private, controlled storage and collaborative spaces for every academic at the institution.

4 Key Modules

Page 27: BLC & Digital Science: Mark Hahnel, Figshare
Page 28: BLC & Digital Science: Mark Hahnel, Figshare
Page 29: BLC & Digital Science: Mark Hahnel, Figshare

37  

Institutional API  

The figshare API allows you to push data to figshare, or pull data out. This allows you to build applications on top of your academic’s research.

Page 30: BLC & Digital Science: Mark Hahnel, Figshare
Page 31: BLC & Digital Science: Mark Hahnel, Figshare
Page 32: BLC & Digital Science: Mark Hahnel, Figshare

32  26

Page 33: BLC & Digital Science: Mark Hahnel, Figshare

33  27

Page 34: BLC & Digital Science: Mark Hahnel, Figshare
Page 35: BLC & Digital Science: Mark Hahnel, Figshare
Page 36: BLC & Digital Science: Mark Hahnel, Figshare
Page 37: BLC & Digital Science: Mark Hahnel, Figshare
Page 38: BLC & Digital Science: Mark Hahnel, Figshare
Page 39: BLC & Digital Science: Mark Hahnel, Figshare

• Incentivising compliance • Facilitating international collaboration • Integration into user workflows  

• Quantifying impact • Administrative curation layer • Embargo support  

• Open data principles • Citable – with DOIs • Increases impact of research  

• Trusted Repository • Persistent links • Heavyweight infrastructure

Page 40: BLC & Digital Science: Mark Hahnel, Figshare
Page 41: BLC & Digital Science: Mark Hahnel, Figshare
Page 42: BLC & Digital Science: Mark Hahnel, Figshare
Page 43: BLC & Digital Science: Mark Hahnel, Figshare

43  

Persistent identifiers are essential

Page 44: BLC & Digital Science: Mark Hahnel, Figshare

44  

Persistent identifiers are essential

Page 45: BLC & Digital Science: Mark Hahnel, Figshare

45  

APIs  are  essen9al  

Page 46: BLC & Digital Science: Mark Hahnel, Figshare

46  

Open  Access  is  essen9al  

Page 47: BLC & Digital Science: Mark Hahnel, Figshare

47  

Advocacy    is  essen9al  

Page 48: BLC & Digital Science: Mark Hahnel, Figshare

48  

Page 49: BLC & Digital Science: Mark Hahnel, Figshare

49  

Institutions Generating the world’s knowledge

Page 50: BLC & Digital Science: Mark Hahnel, Figshare

50  

Page 51: BLC & Digital Science: Mark Hahnel, Figshare

Thanks for your time.  

@markhahnel @figshare figshare.com api.figshare.com institutions.figshare.com [email protected]  

Page 52: BLC & Digital Science: Mark Hahnel, Figshare

51  

http://www.plosgenetics.org/article/info%3Adoi%2F10.1371%2Fjournal.pgen.1003094#s5 http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0059671#s4 http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0059503#s5 http://f1000research.com/articles/2-5/v1 http://f1000research.com/articles/1-47/v1

Publisher examples  

Page 53: BLC & Digital Science: Mark Hahnel, Figshare
Page 54: BLC & Digital Science: Mark Hahnel, Figshare
Page 55: BLC & Digital Science: Mark Hahnel, Figshare
Page 56: BLC & Digital Science: Mark Hahnel, Figshare
Page 57: BLC & Digital Science: Mark Hahnel, Figshare
Page 58: BLC & Digital Science: Mark Hahnel, Figshare

Figshare Mendelay Archivum Research Gate Dryad Eprints

Fedora+Front End Zenodo

Lab Archive

✓ ✓ no ✓ have the community

✓ Needs developers. Files all stored as individual objects

Can but don’t have a community of eyes on the system. Example of Missouri

✓ ✓

no no no no Can track use at level of article.

No - needs manual intervention

no no

✓ ✓ no ✓ ✓ ✓ ✓ ✓ ✓

✓ No – focused on papers. None of the permanence

✓ no ✓

but not an institutional offer

✓ Own servers so yes

✓ because its on the institutions servers

No – as only a 5 (2?) year funding plan

no Storing it properly

Making it discoverable

Managing Open Data

Promoting Sharing

•  advocacy – driving uptake of tools

•  training for researchers, •  incentives? •  facilitating international

collaboration

•  knowing the numbers. How many papers, how many citations, also for data

•  Allocation of space around the institution – e.g. 30GB / user. User management

•  Having a rights system for access approval. CCO, CCBY, CCNC etc

•  Configurable workflow?

•  Open data principles •  Having data stored somewhere

where – technically – it’s discoverable – ie not on hard drives

•  Ensuring metadata attached within 12 months

•  Raw storage capacity •  Security and back up •  Persitent links •  Storage for 10 years from last use

(which must therefore be known) •  Archiving for posterity

Active Data

Figshare’s  posi9oning:    the  only  player  to  support  ins9tu9ons  all  the  way  to  the  top  of  the  hierarchy:  ‘Ac9ve  Data’  

Page 59: BLC & Digital Science: Mark Hahnel, Figshare