Upload
others
View
1
Download
0
Embed Size (px)
Citation preview
Click to edit Master title style
Click to edit Master subtitle style
9/1/17 1
RDM in Canada & Portage Network
Jeff Moon, Director, Portage Network
&
James Doiron, RDM Services Coordinator, University of Alberta
ACCOLEDS, November 29, 2017Kamloops, BC
Data Repositories in Canada: Dataverse & FRDR
- Data repositories: What are they and what do they do?- Dataverse 101 - Dataverse in Canada- Portage Dataverse North Working Group - DV North Training survey- Federated Research Data Repository (FRDR): Teaser!
What is a research data repository?
A trusted space to:- deposit, store and preserve digital research data and
related (metadata) files - discover, explore, and analyze deposited research
data - access and download deposited research data
What are some of the things that research data repositories do for us?
● Provide a means to store and preserve research data and metadata
● Allow research data to be discoverable
● Increase visibility and impact of research
● Facilitates access and use of research data
● Help researchers to comply with funder and journal requirements regarding open access
Types of Data Repositories
- Open vs. Restricted access - Discipline specific repositories (e.g. - Aquatic Commons;
LingBuzz; EarthChem)
- Open source (Dataverse; Dryad; Zenodo)
- Proprietary (figshare; Open ICPSR; data.world) - Funding models (private; government; grants; institutional)- Free vs Paid for services
There is a nice comparative overview of various repositories available at: https://dataverse.org/blog/comparative-review-various-data-repositories
What is Dataverse?
• Open source web application to share, preserve, cite, explore and analyze research data
• Developed at Harvard University (2006)
• Makes data discoverable and accessible
• Researchers & affiliated institutions receive academic credit and web visibility
• Each Dataverse contains datasets, and each dataset contains descriptive metadata & data files
• Free end user access and use
Dataverse: A bird’s eye view
Dataverse features
Digital Object Identifier
Open source → development code available
Dataverse: versions & upcoming features
Dataverse is an ever evolving platform! *Right now, UAL is running v4.5, with plans to upgrade to the most recent version (currently v4.8.4) in early 2018
Upcoming features include:- Administrative dashboard (v4.7.1) (incl.: ORCID ID autofill; addt’l user support data - AWS S3 & Large Data upload integration (v4.8)- ORCID ID integration & API support (v4.8.3)- Schema.org support (4.8.4)- File level security & access and data provenance (v5.0)
On the radar:- File streaming - File hierarchy within datasets- Large data & http upload support - Embargoes datasets
Dataverse in Canada
• Currently there are five Dataverse installations in Canada:
- UBC- U of A- U of Manitoba- Scholars Portal (U of T)- UNB
Portage Dataverse North Working Group
• Pan-Canadian WG, with 26 members
• A community of practice to support Dataverse services, outreach & support strategies, as well as infrastructure development
• Nationally coordinating services & strategies
Dataverse North Working Group: Goals
● Provide equitable access for Canadian researchers
● Standardize services across Canada
● Support users (librarians and researchers)
● Guide feature developments and integrations
*Ahem* - thanks for the slide Corey Davis!
Dataverse North Working Group: sub-groups
• Business models– Develop a framework for hosting and support
services– Explore a common business model
• Training– Identify and coordinate training opportunities– Gather and develop training materials
• Metadata– Recommendations for templates, data files,
and DOIs
DV North Training Sub-Working Group
The mandate of the Dataverse North Training Working Group is to:
a. Identify the kinds of training that are most desirable for librarians, library staff (e.g. Dataverse admin, Dataverse uploads, analysis tools like R, Tableau)
b. Identify the kinds of training that are most desirable for faculty, researchers, and graduate students
c. Gather existing training documentation from members & related orgsd. Propose models for sharing docs & offering shared training sessionse. Assessment & evaluation of Dataverse as a tool for different types of
data
Dataverse Training Needs Survey
● Designed to capture key information regarding Dataverse training needs & delivery methods (e.g., web-based; in-person; videos; text; etc)
● 27 variables across three categories: Demographics, Experience with Dataverse, & Dataverse Training
● Survey data collected Nov 7th-24th● 117 responses● ~75% librarians; 25% researchers/research staff/student
Some preliminary results (English version only)
Some preliminary results (English version only)
DV North Sub-WG timelines
Focus at this time is towards year 1 work and objectives. The below timelines are for all three sub-Working Groups
● Preliminary outline of work & objectives (1-½ -2 pages) - August 31st, 2017
● Interim progress report - October 31st, 2017
● Circulate draft briefing paper and recommendations to all members - January
31st, 2018
● Final briefing paper & recommendations submitted to CARL/Portage - March
31st, 2018
CARL-Portage/Compute Canada: Federated Research Data Repository
CARL-Portage/Compute Canada: Federated Research Data Repository
● Federated storage model: Individual institutions or organizations can store data and collections locally by connecting their storage into FRDR.
● Federated support model: On-campus support for researchers.● Scalable model: Scalable systems accommodate growth in the adoption of FRDR by
researchers and in the amount of data they store.● National data discovery: Data collections hosted in FRDR and other existing data
repositories are discoverable through a single federated search tool.● Data preservation: FRDR can automatically create archival packages, with metadata and
alternate formats for data that are suitable for long-term preservation.● Access control mechanisms: Precise control over who can discover and access each data
set, with support for embargo periods.● Data set registration: DOIs can be automatically issued and registered to data sets, with
licensing arranged through DataCite Canada.
FRDR Demo Site
After the break…..FRDR presentation!
Lee WilsonInterim Service ManagerPortage/ACENET
Portage & RDM Training
RDM Training Expert Group overview
White paper: ‘RDM Training Landscape in Canada’
Portage RDM training resources currently available
Training resources in progress
Portage & RDM Training
Portage RDM-TEG members
● Jane Fry, Carleton University (Chair)● James Doiron, University of Alberta● Danny Létourneau, Université de Montréal● Laure Perrier, University of Toronto● Carol Perry, University of Guelph● Wendy Watkins, Carleton University, Emerita
RDM-TEG focus
The two main areas of focus of the RDM Training Expert Group are:
1) to develop and maintain a high-level training roadmap for RDM in Canada
2) to organize and oversee working groups to prepare and support training materials on RDM.
A few of the things we’ve been up too...
• White Paper: RDM Training Landscape in Canada
• External Training Resource Library
• Request for RDM Training: Process and support
• RDM Primer
• Development of web-based training resources
RDM Training Landscape White Paper
TEG White Paper Recommendations
External Training Resource Library
External Training Resource Library
Request for RDM Training
Request for RDM Training
RDM Training - lots of options!
• For all stakeholder types • General vs Specific training• In-person, web-based• Collaborative or fully delivered• Bilingual• Cost friendly (free where able!)• “Train the trainers”
RDM Primer
Development of web-based training resources
Development of web-based RDM training resources is currently underway!
First two Working Groups are:1. RDM 101(Carol Perry, Lead)
2. Data Management Planning (James Doiron, Lead)
Development of web-based training resources
DMP Training Resources
• Inspired by the ‘MANTRA’ model
• Includes general information, as well as specific relating to DMP content
• Walks users through using ‘DMP Assistant’
• Links, guidance, videos, and more!
• Content will be finalized Dec 2017
Group Discussion