View
140
Download
2
Category
Preview:
Citation preview
Open Research Data: Present and planned EC
Policy Jean-Claude Burgelman, S. Luber, R. Von Schomberg, D.
Spichtinger, W.Lusoli
Head of Unit European Commission
DG RTD/A6
Keynote ORD Conference Open Research Data: Implications for
Science and Society - Warsaw May 2015
A new Commission (2014-19)
Andrus Ansip, Vice-President, Digital Single Market
Carlos Moedas, Commissioner forResearch, Science and Innovation
Günther Oettinger, Commissionerfor Digital Economy and Society
Commissioner view
"Open Science, of which Open Access is an important part, will be vital to ensuring European progress and prosperity in the future"(Speech at NETHER, January 26, 2015)
Open Research Data
• ORD refers to making research data freely available for reuse beyond the purpose for which they were originally collected.
• Making Research data freely available aid further discovery, make scientific process more cost efficient and reliable.
• ORD is part of a broader change: data driven science underpinning Open Science
• A systemic change in the modus operandi of science and research
• Affecting the whole research cycle and its stakeholders
Open Research Data - Open Science
Analysis
Publication
ReviewConceptualisation
Data gathering
Open access
Scientific blogs
Collaborative bibliographie
s
Alternative
Reputation systems
Citizens science Open
code
Open workflows
Open annotatio
n
Open data
Pre-print
Data-intensive
6
Sci-starter.com
Runmycode.org
ArXiv
Roar.eprints.org
Impact Story
Altmetric.com
Mendeley.comAcademia.edu
Researchgate.com
Openannotation.org
Datadryad.org
Myexperiment.org
Figshare.com
An emerging ecosystem of services and
standards
It's real!
Research and Innovation
OPEN SCIENCE
OA Research data
Data-intensive Science
Data-intensive Science
OA to research
publications
TDM
OPEN SCIENCE
BIG DATA
OA to research data
ORD “small” part of Open Data
Research and Innovation
ORD and Data Driven Science: Big Data in Open Science
• Traditional modus operandi for Science: scientific experiments and observations generate data to test Scientific Hypotheses. - many limitations (empirical blackholes)
• New opportunities due to “big data”:- The digitisation of science (e.g. DNA sequencing)- The internet of everything
• Data Driven Science is the application of Big Data in Science.• Enormous opportunities:
- 4th paradigm in science (inductive, computional)- “here are the data, where is the hypothesis?”- Potential to reboot completely SSH (“social physics”)
• Serious policy issues
Research and Innovation
Data Driven Science: Big Data in Science
Most important policy issues for DDS to take off:
• TDM• Open Access• Copyright• Data protection• Cloud
The EC wants to optimise the impact of public funded research
• At European level (FP7 & Horizon 2020)• At Member State level
One way to get there: open access • to peer-reviewed scientific publications • to research data
Expected benefits: • Better, more transparant & efficient science Open Science & RRI• Faster uptake of research leading to faster & better innovation &
economic growth Innovation Union
The ORD policies of the European Commission are threefold.
The EC as Policy maker• It proposes EU legislation & legislates with other EU
institutions• It invites Member States to act
The EC as Funding agency• It sets its own access and dissemination rules for EC-funded
research
The EC Capacity builder• It funds projects that support EC/EU policy
ORD Policy developed jointly in DG RTD and CNECT, with input from the R&I family
EC as funder - Open access in Horizon 2020
• Mandatory for publications• Pilot for data
H2020: OA to publications
FP7• Green open access pilot in 7
areas of FP7 with 'best effort' stipulation
• Allowed embargoes: 6/12 months
• Gold open access costs eligible for reimbursement as part of the project budget while the project runs
Horizon 2020 • Obligation to provide OA, either
through the Green or Gold way in all areas
• Allowed embargoes: 6/12 months• Gold open access costs eligible
for reimbursement as part of the project budget while the project runs & post-grant support being piloted
• Authors encouraged to retain copyright and grant licences instead
H2020: Pilot on Open Research Data
• Certain areas• Voluntary• Opt out of the pilot • Opt in for other areas• DMP provisions
H2020: Pilot on Open Research DataAreas of the 2014-2015 Work Programme participating in the Open Research Data Pilot are:
• Future and Emerging Technologies• Research infrastructures – part e-Infrastructures • Leadership in enabling and industrial technologies – ICT• Societal Challenge: Secure, Clean and Efficient Energy – part Smart cities
and communities• Societal Challenge: Climate Action, Environment, Resource Efficiency and
Raw materials – except raw materials• Societal Challenge: Europe in a changing world – inclusive, innovative and
reflective Societies• Science with and for SocietyProjects in other areas can participate on a voluntary basis (already new areas to be added for 2016-17 WP
Projects may opt out of the Pilot on ORD, if:• The project will not generate / collect any data• Conflict with obligation to protect results• Conflict with confidentiality obligations• Conflict with security obligations• Conflict with rules on protection of personal data• The achievement of the action’s main objective would be
jeopardised by making specific parts of the research data openly accessible (to be explained in data management plan)
H2020: Pilot on Open Research Data
H2020: Pilot on Open Research DataTypes of data concerned:• Data needed to validate the results presented in scientific publications
("underlying data")• Other data as specified in DMP (=up to projects)
Beneficiaries participating in the Pilot will:• Deposit this data in a research data repository of their choice• Take measures to make it possible to access, mine, exploit, reproduce
and disseminate free of charge• Provide information about tools and instruments at the disposal of the
beneficiaries and necessary for validating the results (where possible, provide the tools and instruments themselves)
H2020: Pilot on Open Research Data: Data Management Plan
• DMPs are NOT part of the proposal evaluation: to be delivered within the first six months of the project and updated as needed
• DMP’s mandatory for all projects participating in the Pilot, optional for others
• DMP questions:•What data will be collected / generated?•What standards will be used / how will metadata be generated?•What data will be exploited? What data will be shared / made open?•How will data be curated and preserved?
H2020: Pilot on ORD: take-up in first callsBasis: 3054 Horizon 2020 proposals
Calls in core-areas: opt out 24.2% (442 of 1824 proposals) – range from 9,1-29,1%Other areas: voluntary opt in 27.2% (334 of 1230 proposals) – range from 9 to 50%
Preliminary but encouraging results
EC as policy maker on OA: currently ongoing…
• Analysis of Uptake of ORD Pilot in signed grant agreements (versus proposals)
• DMP implementation: investigating best-practice; tools to be developed (2015)
• Top-notch monitoring of OA policies is crucial for further policy development
EC as capacity builder on OA: coordination and support actions (ongoing - FP7 funded)
PASTEUR4OA (Open Access Policy Alignment Strategies for European Union Research) Started 2014
FOSTER (Foster Open Science Training for European Research) Started 2014
RECODE - (Policy Recommendations for Open Access to Research Data in Europe) – 2013, finishing
OpenAIRE/OpenAIRE+: supporting the implementation of Open Access in Europe (publications and data)
Infrastructure projects(with OA components), e.g. GEO/GEOSS, ELIXIR…
EC as policymaker - Open access policies across the EU
Preliminary findings from (i) NPR reporting template of 13 EU MS & 1 Associated Country (ii) Results of 2014 ERA Progress Report
General findings
1. Mostly soft measures rather than legislation: exceptions exist2. OA to publications > than OA to data. Progress as to infrastructures for
data (repositories), but openness still quite complex an issue and not addressed in many countries (for data)
3. Bigger and “richer” countries have more comprehensive OA policies and OA enabling infrastructures, as well as tend to lead or participate more actively in OA networking initiatives
4. Nevertheless, smaller or less federated countries have the advantage of easier coordination and better synergistic capabilities
ORD in Poland
• OCEAN – a new national research datacentre with focuses on Big and Open Data is being built. Its tasks will include large scale data mining as well as long term preservation of research results.
• There are about 100 digital libraries which contain scientific papers and several classical scientific repositories.
• Poland is in the FOSTER (Facilitate Open Science Training for European Research) project, where one of the areas covered is to provide training on open access to research data.
EC as policymaker – A European Open Science Cloud?
• Rationale and first ideas• Work in progress. • Not to be quoted
Science 2.0 consultation (July-Sept 2014 + validation WS)
o ~ 85 % agree to some extent that data infrastructures are a bottlenecko Spontaneous position papers from research stakeholders
Possible actions
1. Mandate the development of common interfaces and data standards
2. Coordinate at European Level the funding/ maintenance and interoperability of research infrastructures
3. Support the development of a European Open Science Cloud for research
European Open Science
Agenda
A European Open Science Cloud: part of Europe´s ambition to support the transition to Open Science and make the most of data-driven science.
o European scientists strongly stated the need for a research data infrastructure that is cost-effective, preserves privacy and is IPR-conscious (Science 2.0 consultation).
o The cloud provides all EU researchers a virtual environment with free, open and seamless services for data storage, management, analysis and re-use, across disciplines.
o The cloud will federate existing and emerging horizontal and thematic data infrastructures, effectively bridging todays fragmentation and ad-hoc solutions.
o The cloud adds value - scale, data-driven science, inter-disciplinarity, data to knowledge to innovation - and leverages current and past infrastructure investment (10b per year by MS, two decades EU investment).
European Open Science
Cloud
Life
sci
ence
s
Lead users… Scientific communities …long tail
Phys
ics
Eart
h sc
ienc
es
Econ
omic
s
Soci
alsc
ienc
esScal
e of
sci
enti
fic a
ctiv
ity
(da
ta-
driv
en s
cien
ce)
Appl
ied
- eng
inee
ring
… …
Humanities Citizen science
European Open Science
Cloud
Data layer
Service layer
Governance layer
Life
sci
ence
s
Lead users… Scientific communities …long tail
Phys
ics
Eart
h sc
ienc
es
Econ
omic
s
Soci
alsc
ienc
esScal
e of
sci
enti
fic a
ctiv
ity
(da
ta-d
rive
n sc
ienc
e)
High performance computingData fusion across disciplines
Big data analyticsPrivacy and personal data protection
… … Data discovery and catalogue
Data manipulation and export
Data access and re-use
TrustLeverage of MS investment
Legacy and sustainability
IPR protection
Federation
Appl
ied
- eng
inee
ring
… …
Humanities
Data storage
Citizen science
European Open Science
CloudBottom-up governance
In summary• The EC is a strong funder, policy maker and capacity builder
with regard to ORD in particular and OA in general• It sees it as part of an irreversible change in the modus
operandi of science: open science• A lot is ongoing and planned• Much more needs to be done if we want Europe and its
science stakeholders to capitalise on the opportunities ORD, OA and OS offer.
• The EC follows a bottom up and stakeholder driven apporach
Recommended