29
Open Research Data: Present and planned EC Policy Jean-Claude Burgelman, S. Luber, R. Von Schomberg, D. Spichtinger, W.Lusoli Head of Unit European Commission DG RTD/A6 Keynote ORD Conference Open Research Data: Implications for Science and Society - Warsaw May 2015

Jean claude burgelman implications of open data

Embed Size (px)

Citation preview

Page 1: Jean claude burgelman implications of open data

Open Research Data: Present and planned EC

Policy Jean-Claude Burgelman, S. Luber, R. Von Schomberg, D.

Spichtinger, W.Lusoli

Head of Unit European Commission

DG RTD/A6

Keynote ORD Conference Open Research Data: Implications for

Science and Society - Warsaw May 2015

Page 2: Jean claude burgelman implications of open data

A new Commission (2014-19)

Andrus Ansip, Vice-President, Digital Single Market

Carlos Moedas, Commissioner forResearch, Science and Innovation

Günther Oettinger, Commissionerfor Digital Economy and Society

Page 3: Jean claude burgelman implications of open data

Commissioner view

"Open Science, of which Open Access is an important part, will be vital to ensuring European progress and prosperity in the future"(Speech at NETHER, January 26, 2015)

Page 4: Jean claude burgelman implications of open data

Open Research Data

• ORD refers to making research data freely available for reuse beyond the purpose for which they were originally collected.

• Making Research data freely available aid further discovery, make scientific process more cost efficient and reliable.

• ORD is part of a broader change: data driven science underpinning Open Science

Page 5: Jean claude burgelman implications of open data

• A systemic change in the modus operandi of science and research

• Affecting the whole research cycle and its stakeholders

Open Research Data - Open Science

Page 6: Jean claude burgelman implications of open data

Analysis

Publication

ReviewConceptualisation

Data gathering

Open access

Scientific blogs

Collaborative bibliographie

s

Alternative

Reputation systems

Citizens science Open

code

Open workflows

Open annotatio

n

Open data

Pre-print

Data-intensive

6

Sci-starter.com

Runmycode.org

ArXiv

Roar.eprints.org

Impact Story

Altmetric.com

Mendeley.comAcademia.edu

Researchgate.com

Openannotation.org

Datadryad.org

Myexperiment.org

Figshare.com

An emerging ecosystem of services and

standards

It's real!

Page 7: Jean claude burgelman implications of open data

Research and Innovation

OPEN SCIENCE

OA Research data

Data-intensive Science

Data-intensive Science

OA to research

publications

TDM

OPEN SCIENCE

BIG DATA

OA to research data

ORD “small” part of Open Data

Page 8: Jean claude burgelman implications of open data

Research and Innovation

ORD and Data Driven Science: Big Data in Open Science

• Traditional modus operandi for Science: scientific experiments and observations generate data to test Scientific Hypotheses. - many limitations (empirical blackholes)

• New opportunities due to “big data”:- The digitisation of science (e.g. DNA sequencing)- The internet of everything

• Data Driven Science is the application of Big Data in Science.• Enormous opportunities:

- 4th paradigm in science (inductive, computional)- “here are the data, where is the hypothesis?”- Potential to reboot completely SSH (“social physics”)

• Serious policy issues

Page 9: Jean claude burgelman implications of open data

Research and Innovation

Data Driven Science: Big Data in Science

Most important policy issues for DDS to take off:

• TDM• Open Access• Copyright• Data protection• Cloud

Page 10: Jean claude burgelman implications of open data

The EC wants to optimise the impact of public funded research

• At European level (FP7 & Horizon 2020)• At Member State level

One way to get there: open access • to peer-reviewed scientific publications • to research data

Expected benefits: • Better, more transparant & efficient science Open Science & RRI• Faster uptake of research leading to faster & better innovation &

economic growth Innovation Union

Page 11: Jean claude burgelman implications of open data

The ORD policies of the European Commission are threefold.

The EC as Policy maker• It proposes EU legislation & legislates with other EU

institutions• It invites Member States to act

The EC as Funding agency• It sets its own access and dissemination rules for EC-funded

research

The EC Capacity builder• It funds projects that support EC/EU policy

ORD Policy developed jointly in DG RTD and CNECT, with input from the R&I family

Page 12: Jean claude burgelman implications of open data

EC as funder - Open access in Horizon 2020

• Mandatory for publications• Pilot for data

Page 13: Jean claude burgelman implications of open data

H2020: OA to publications

FP7• Green open access pilot in 7

areas of FP7 with 'best effort' stipulation

• Allowed embargoes: 6/12 months

• Gold open access costs eligible for reimbursement as part of the project budget while the project runs

Horizon 2020 • Obligation to provide OA, either

through the Green or Gold way in all areas

• Allowed embargoes: 6/12 months• Gold open access costs eligible

for reimbursement as part of the project budget while the project runs & post-grant support being piloted

• Authors encouraged to retain copyright and grant licences instead

Page 14: Jean claude burgelman implications of open data

H2020: Pilot on Open Research Data

• Certain areas• Voluntary• Opt out of the pilot • Opt in for other areas• DMP provisions

Page 15: Jean claude burgelman implications of open data

H2020: Pilot on Open Research DataAreas of the 2014-2015 Work Programme participating in the Open Research Data Pilot are:

• Future and Emerging Technologies• Research infrastructures – part e-Infrastructures • Leadership in enabling and industrial technologies – ICT• Societal Challenge: Secure, Clean and Efficient Energy – part Smart cities

and communities• Societal Challenge: Climate Action, Environment, Resource Efficiency and

Raw materials – except raw materials• Societal Challenge: Europe in a changing world – inclusive, innovative and

reflective Societies• Science with and for SocietyProjects in other areas can participate on a voluntary basis (already new areas to be added for 2016-17 WP

Page 16: Jean claude burgelman implications of open data

Projects may opt out of the Pilot on ORD, if:• The project will not generate / collect any data• Conflict with obligation to protect results• Conflict with confidentiality obligations• Conflict with security obligations• Conflict with rules on protection of personal data• The achievement of the action’s main objective would be

jeopardised by making specific parts of the research data openly accessible (to be explained in data management plan)

H2020: Pilot on Open Research Data

Page 17: Jean claude burgelman implications of open data

H2020: Pilot on Open Research DataTypes of data concerned:• Data needed to validate the results presented in scientific publications

("underlying data")• Other data as specified in DMP (=up to projects)

Beneficiaries participating in the Pilot will:• Deposit this data in a research data repository of their choice• Take measures to make it possible to access, mine, exploit, reproduce

and disseminate free of charge• Provide information about tools and instruments at the disposal of the

beneficiaries and necessary for validating the results (where possible, provide the tools and instruments themselves)

Page 18: Jean claude burgelman implications of open data

H2020: Pilot on Open Research Data: Data Management Plan

• DMPs are NOT part of the proposal evaluation: to be delivered within the first six months of the project and updated as needed

• DMP’s mandatory for all projects participating in the Pilot, optional for others

• DMP questions:•What data will be collected / generated?•What standards will be used / how will metadata be generated?•What data will be exploited? What data will be shared / made open?•How will data be curated and preserved?

Page 19: Jean claude burgelman implications of open data

H2020: Pilot on ORD: take-up in first callsBasis: 3054 Horizon 2020 proposals

Calls in core-areas: opt out 24.2% (442 of 1824 proposals) – range from 9,1-29,1%Other areas: voluntary opt in 27.2% (334 of 1230 proposals) – range from 9 to 50%

Preliminary but encouraging results

Page 20: Jean claude burgelman implications of open data

EC as policy maker on OA: currently ongoing…

• Analysis of Uptake of ORD Pilot in signed grant agreements (versus proposals)

• DMP implementation: investigating best-practice; tools to be developed (2015)

• Top-notch monitoring of OA policies is crucial for further policy development

Page 21: Jean claude burgelman implications of open data

EC as capacity builder on OA: coordination and support actions (ongoing - FP7 funded)

PASTEUR4OA (Open Access Policy Alignment Strategies for European Union Research) Started 2014

FOSTER (Foster Open Science Training for European Research) Started 2014

RECODE - (Policy Recommendations for Open Access to Research Data in Europe) – 2013, finishing

OpenAIRE/OpenAIRE+: supporting the implementation of Open Access in Europe (publications and data)

Infrastructure projects(with OA components), e.g. GEO/GEOSS, ELIXIR…

Page 22: Jean claude burgelman implications of open data

EC as policymaker - Open access policies across the EU

Preliminary findings from (i) NPR reporting template of 13 EU MS & 1 Associated Country (ii) Results of 2014 ERA Progress Report

General findings

1. Mostly soft measures rather than legislation: exceptions exist2. OA to publications > than OA to data. Progress as to infrastructures for

data (repositories), but openness still quite complex an issue and not addressed in many countries (for data)

3. Bigger and “richer” countries have more comprehensive OA policies and OA enabling infrastructures, as well as tend to lead or participate more actively in OA networking initiatives

4. Nevertheless, smaller or less federated countries have the advantage of easier coordination and better synergistic capabilities

Page 23: Jean claude burgelman implications of open data

ORD in Poland

• OCEAN – a new national research datacentre with focuses on Big and Open Data is being built. Its tasks will include large scale data mining as well as long term preservation of research results.

• There are about 100 digital libraries which contain scientific papers and several classical scientific repositories.

• Poland is in the FOSTER (Facilitate Open Science Training for European Research) project, where one of the areas covered is to provide training on open access to research data.

Page 24: Jean claude burgelman implications of open data

EC as policymaker – A European Open Science Cloud?

• Rationale and first ideas• Work in progress. • Not to be quoted

Page 25: Jean claude burgelman implications of open data

Science 2.0 consultation (July-Sept 2014 + validation WS)

o ~ 85 % agree to some extent that data infrastructures are a bottlenecko Spontaneous position papers from research stakeholders

Possible actions

1. Mandate the development of common interfaces and data standards

2. Coordinate at European Level the funding/ maintenance and interoperability of research infrastructures

3. Support the development of a European Open Science Cloud for research

European Open Science

Agenda

Page 26: Jean claude burgelman implications of open data

A European Open Science Cloud: part of Europe´s ambition to support the transition to Open Science and make the most of data-driven science.

o European scientists strongly stated the need for a research data infrastructure that is cost-effective, preserves privacy and is IPR-conscious (Science 2.0 consultation).

o The cloud provides all EU researchers a virtual environment with free, open and seamless services for data storage, management, analysis and re-use, across disciplines.

o The cloud will federate existing and emerging horizontal and thematic data infrastructures, effectively bridging todays fragmentation and ad-hoc solutions.

o The cloud adds value - scale, data-driven science, inter-disciplinarity, data to knowledge to innovation - and leverages current and past infrastructure investment (10b per year by MS, two decades EU investment).

European Open Science

Cloud

Page 27: Jean claude burgelman implications of open data

Life

sci

ence

s

Lead users… Scientific communities …long tail

Phys

ics

Eart

h sc

ienc

es

Econ

omic

s

Soci

alsc

ienc

esScal

e of

sci

enti

fic a

ctiv

ity

(da

ta-

driv

en s

cien

ce)

Appl

ied

- eng

inee

ring

… …

Humanities Citizen science

European Open Science

Cloud

Page 28: Jean claude burgelman implications of open data

Data layer

Service layer

Governance layer

Life

sci

ence

s

Lead users… Scientific communities …long tail

Phys

ics

Eart

h sc

ienc

es

Econ

omic

s

Soci

alsc

ienc

esScal

e of

sci

enti

fic a

ctiv

ity

(da

ta-d

rive

n sc

ienc

e)

High performance computingData fusion across disciplines

Big data analyticsPrivacy and personal data protection

… … Data discovery and catalogue

Data manipulation and export

Data access and re-use

TrustLeverage of MS investment

Legacy and sustainability

IPR protection

Federation

Appl

ied

- eng

inee

ring

… …

Humanities

Data storage

Citizen science

European Open Science

CloudBottom-up governance

Page 29: Jean claude burgelman implications of open data

In summary• The EC is a strong funder, policy maker and capacity builder

with regard to ORD in particular and OA in general• It sees it as part of an irreversible change in the modus

operandi of science: open science• A lot is ongoing and planned• Much more needs to be done if we want Europe and its

science stakeholders to capitalise on the opportunities ORD, OA and OS offer.

• The EC follows a bottom up and stakeholder driven apporach