38
rdc-drc.ca @rdc_drc Research Data Canada is supported by CANARIE, an organization dedicated to advancing Canada's knowledge and innovation infrastructure. National Data Services: Review Mark Leggott, Executive Director | ReConnect 16| Oct 25, 2016 Let’s connect: [email protected] | @mleggott

Building a Canadian National Research Data Management Framework - Mark Leggott

  • Upload
    casrai

  • View
    81

  • Download
    1

Embed Size (px)

Citation preview

Page 1: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc

Research Data Canada is supported by CANARIE, an organization dedicated to advancing Canada's knowledge and innovation infrastructure.

National Data Services: ReviewMark Leggott, Executive Director | ReConnect 16| Oct 25, 2016Let’s connect: [email protected] | @mleggott

Page 2: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 2

> Context

Page 3: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 3

Publish or Perish

Open by

Default

Page 4: Building a Canadian National Research Data Management Framework - Mark Leggott
Page 5: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 5

Data InputData

Enhancement

Data Validation

Reproducibility

Discoverability

Serendipity

Linkages

Innovation

Impact

Training

Reusability

Page 6: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 6

National Data Services˃Storage & Preservation Services˃Computational & Analysis Services˃Discovery Services˃Identifier Services˃Dissemination Services˃Support and Training Services˃Policy Rationalization and Development˃Communication and Coordination

Page 7: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 7

National Data Services - Level˃Level

• National• Regional• Consortial• Institutional• Project

˃Design• Centralized• Federated• Hybrid

Page 8: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 8

National Data Services - Scope˃Data has no boundaries

• Data as Research Outputs• Data as Research Inputs

˃Functions for managing data are pretty much the same for both

˃Can we use same infrastructure for both?

Page 9: Building a Canadian National Research Data Management Framework - Mark Leggott

9

Interoperability

Linda Naughton, Jisc. June 2016,

Page 10: Building a Canadian National Research Data Management Framework - Mark Leggott

Jisc - RDM shared services

Linda Naughton, Jisc. June 2016,

Front End/ User Interface

Middle Layer

Storage Layer

Preservation Layer

Basic Metadata EntryIngest UI

Registry/catalogue search function

Data discovery UILanding page with DOI,

Discovery Metadata, and metrics

Data Publication UI

Data Registry/ Catalogue/ Repository

API’s CRIS, DataCite, ORCID, LOD, funders Etc.

Archival Management

Access Data Storage

Access Data Storage

Archive Data Storage

Archive Data Storage

Preservation/ Curation Metadata

File Format Identification

tools

File/ media migration/

transformation tools

Emulation tools

Other preservation/ Curation tools

Page 11: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 11 11

Page 12: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 12

EUDAT Services

12

Page 13: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 13

Page 14: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 14

Page 16: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 16

Storage and Preservation – Possible?˃Pronom-like authority for identifying/transforming research data files and outputs.

˃Policy-based replication of all research outputs to regional and international storage.

˃One-Click acquisition of storage resources from a national shared infrastructure.

˃Synchronization of Active Data Management Plans and auto-provision of storage/compute resources.

˃Create preservation storage via backend allocation of a % of active storage from all institutions.

Page 17: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 17

Compute & Analysis Services - Current

˃Integration between HPC and Data platforms• EUDAT B2STAGE (iRODS/GridFTP)• VRE4EIC• Compute Canada Globus Portal

˃Integration of Science Workflow systems for computation AND RDM• Taverna, VisTrails, Kepler

˃Visualization Tools• Ninaliit

Page 18: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 18

Compute & Analysis Services – Possible?˃Automatic selection and analysis of slice of big data based on English language query

˃Virtual Research Data Centres – secure and accessible

˃EU Open Science Cloud˃BitTorrent for Live Research Data?

Page 20: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 20

Discovery Services – Possible?˃Siri for Research – AI Interfaces to all Outputs

˃Index fulltext/intelligent harvest of all outputs in domain/region

˃Rich Linked Data repository of all outputs• ResearchLink• Research Connection

˃Other Interesting Technologies• ContentMine, Research Data Switchboard

Page 22: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 22

Identifier Services – Possible?˃Automatic collaborator detection engine based on description of new research approach.

˃Auto-selection of peer reviewers attached to open peer review system.

˃Simpler harvest of disparate research/data systems via a single API (e.g. ORCID).

˃Development of lightweight ID minting services that can be integrated into any SW platform.

˃Adoption of ORCID by all Canadian organizations and uptake by 100% of researchers.

Page 23: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 23

Dissemination Services - Current˃Data Sharing

• EUDAT B2DROP• Compute Canada Globus Portal

˃Data Publication• OpenTrials, Open Lab/Note Books, Zenodo, Open Data

Button• Default publication of all results

– JNRBM, JNR, PLOS Missing Pieces• Danish Open Access Barometer

Page 24: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 24

Dissemination Services – Possible?˃CI service with full compute environment & data

˃Default to Containers for Reproducible Research• GUIdock, SSI, OSF Container Strategies Workshop,

ReproZip˃Innovation in data/outputs/alerting/editing• Biosharing• nowomics-style updates on the latest outputs• symplur-style “flattening” of data from all sources• Dokieli-style article publishing

Page 25: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 25

Support and Training - Current˃Support Networks

• Portage– DMP Tool, RDM Services, Network of Expertise

• GoC Open Data eXchange

Page 26: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 26

Support and Training – Possible?˃A modular international curriculum˃Development of an Open Textbook for RDM

˃Use of Open Notebooks and related Open Data frameworks as learning platforms

Page 27: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 27

Policy - Current˃Principles and Policies

• TC3 OA Policy and RDM Guidelines• RDC RDM Principles

˃Research Information Infrastructure• OpenRIF semantic efforts• CASRAI Community

Page 28: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 28

Policy – Possible?˃Allocation of 2% of total R&D annual spend by public institutions.

˃Adoption of a common set of RDM Principles by all publicly funded organizations by 2026.

˃Adoption of RDM and Open by Default Policies by 50% of publicly funded institutions by 2020.

˃Synchronization of Canadian policy frameworks with EU and other partners by 2020.

˃Require immediate data sharing for public health emergencies

Page 29: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 29

RDC Portage

CASRAI RDA

Re-Use

Research Data

Research Information

LCDICC

CANARIE

NRC

COU

ISED

CUCCIO

CARL CAULODC TC3+

Open Information

Open Data

ONC

Comms & Coordination - Current

Page 30: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 30

Comms & Coordination – Possible?˃A single source of coordination for Canada’s RDM and DRI organizations, with representation from all core organizations.

˃A coordination of funding for National Data Services.

Page 31: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 31

Portage

RDC

Coordination

Page 32: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 32

> Research Data Canada works with stakeholders to ensure research data is available to support innovation that benefits all Canadians.

Page 33: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 33

The DCC Curation Lifecycle Model: http://www.ijdc.net/index.php/ijdc/article/viewFile/69/48.

Universities

Federal Funding Agencies

Federal Research Agencies

Provincial Funding Agencies

Provincial Research Agencies

Open Data Organizatio

ns

Non-Profit & NGO

Research Organizatio

ns

Commercial Research

Organizations

International Agencies and Collaborators

Page 34: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 34

Role of RDC˃Engage full stakeholder community

• Organizations that receive public research funds• Organizations that give public research funds• Organizations that facilitate these efforts

˃Facilitation and Coordination˃Outreach and Communication˃Development and Promotion of Best Practices

˃International Liaison

Page 35: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 35

researchlink.rdc-drc.ca/vivo

Page 36: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 36

RDC Outputs

National Data Services

Framework Requirements

& Best PracticesMar 2017

Portage/CC/CASRAI Outputs

& Systems

Jul 2016 Jun 2017

RDA Outputs

Federal & Provincial Outputs

Other Canadian Outputs

Jisc OutputsOther

International Outputs

Vision for a National Data

Services FrameworkNov 2016

National Data

Services and Federated Research

Data Repository Framework

RDM Ecosystem Map

Semantic

Repository Pilot

ORCID-CA +

CAF SPs

DOI Service

s

Page 37: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drc 37

Brainstorming Session˃Charge

• Where do we want to be in 10 years?• Let’s Blue Sky, worry about how at the next meeting!• There will be a prize for the team that generates the

most ideas!˃Not allowed

• But there are privacy issues…• That would be so expensive…• Who would do that?

Page 38: Building a Canadian National Research Data Management Framework - Mark Leggott

rdc-drc.ca @rdc_drcContact me:

Research Data Canada is supported by CANARIE, an organization dedicated to advancing Canada's knowledge and innovation infrastructure.

[email protected] | @mleggott