22
••• 1 Data infrastructures for Science DRIVER Summit Towards a Confederation of Digital Repositories Göttingen, 16 th January 2008 Mário Campolargo European Commission - DG INFSO F Acting Director "The views expressed in this presentation are those of the author and do not necessarily reflect the views of the European Commission"

Data infrastructures for Science

  • Upload
    tessa

  • View
    41

  • Download
    0

Embed Size (px)

DESCRIPTION

Mário Campolargo European Commission - DG INFSO F Acting Director. Data infrastructures for Science. DRIVER Summit Towards a Confederation of Digital Repositories G ö ttingen, 16 th January 2008. - PowerPoint PPT Presentation

Citation preview

Page 1: Data infrastructures for Science

••• 1

Data infrastructures for Science

DRIVER SummitTowards a Confederation of Digital Repositories

Göttingen, 16th January 2008

Mário CampolargoEuropean Commission - DG INFSO F

Acting Director"The views expressed in this presentation are those of the author and do not necessarily reflect the views of the European

Commission"

Page 2: Data infrastructures for Science

••• 2

a new vision for Science

Global challenges with high societal impact

Data deluge… wet-labs versus virtual-labs

Improved scientific process… role of simulation

Cross-disciplinarity

Virtual Research Communities

networkinggridsinstrumentationcomputingdata curation…

Tech

nolo

gy

pu

sh

value added of distributed

collaborative research (virtual

communities)

Ap

plic

atio

n p

ull

Page 3: Data infrastructures for Science

••• 3

building Science through collaboration

Research Communities common goals, complementary and shared

information, tools and knowledge, awareness of research protocols, effective means of collaboration, interest in being part of the communityVirtual research

from empirical, experimental, theoretical and computational science… to intensive use of data… abstraction… models… simulation… e-Science

Virtual communities no geographical, time or institutional boundaries

Globalisation Global challenges, global dimension, win-win

situation

Page 4: Data infrastructures for Science

••• 4

VirtualCommunity

Meetings, etc.

Workspace

Virtual Labs

Scientific Data

Grid

Network

VirtualCommunity

Meetings, etc.

Workspace

Virtual Labs

Scientific Data

Grid

Network

global virtual research community

VirtualCommunity

Meetings, etc.

Workspace

Virtual Labs

Scientific Data

Grid

Network

Scientific Data

Grid

NetworkEcon

om

ies

of

Scale

Eff

icie

ncy

Gain

s

Page 5: Data infrastructures for Science

••• 5

ICT for Science: e-Infrastructure

Linking at the speed of the lightLinking at the speed of the light

Sharing computers, instruments and applicationsSharing computers, instruments and applications

Sharing and federating scientific dataSharing and federating scientific data

. . . . . .

.

Astrophysics

community

WeatherForecast

community

Biomedics

community

Connecting the finest mindsSharing and federating the best scientific

resourcesBuilding global virtual communities

Page 6: Data infrastructures for Science

••• 6

network

e-Infrastructure - implementation

middleware

data

virtual labs

user communities

Page 7: Data infrastructures for Science

••• 7

Framework Programme 7 (2007-13)

Capacities4097 M€

JRC1751 M€

Ideas7510 M€

Euratom4062 M€

People4750 M€

Cooperation32413 M€

Dev. of policiesINCO

Sciencein Society

Research Infrastructures 42% - 1715 M€

SMEsResearch Potential

Regions of Knowledge

e-Infrastructures(ICT for Science)

572 M€

Page 8: Data infrastructures for Science

••• 8

e-Infrastructure - implementation

network

middleware

data

virtual labs

user communities

Su

pp

ort

acti

on

s

Dep

loym

en

t of

e-I

nfr

ast

ructu

re f

or

scie

nti

fic c

om

mu

nit

ies

Scie

nti

fic

Dig

ital

Rep

osi

t.

e-S

cie

nce

gri

d

Infr

ast

.

Scie

nti

fic

Data

In

frast

.

AN

T

new infrastructures - PRACE

35M€

Page 9: Data infrastructures for Science

••• 9

importance of data for Science

extract knowledg

e

addvalue

publishdisseminat

e

store validatemanage

capturecollectcreate

From raw data to publications

From research to education

From one to multiple disciplines

From past and present to future

From one to multiple organisations

Several "continuum"

Page 10: Data infrastructures for Science

••• 10

implementation strategy

Policy actions Studies

e-Infrastructure projects

Scientific Digital Rep. Scientific Data Infr. User communities Support actions

ICTe-Contents

eSciDR(Towards a Europ.

e-Infrastructure for e-Science Digital

Repositories)

creating an European

Scientific Data Infrastructure Council Decision

CommunicationScientific Inf. /ERA

ESFRI WG e-IRG

Page 11: Data infrastructures for Science

••• 11

Improved access to, and sharing of, research data:

Promotes new research practices;

Makes possible the testing of new or alternative hypotheses and methods of analysis;

Enables new scientific insights by the exploration of topics not envisioned by the initial investigators;

Permits the creation of new data sets by the combination of data from multiple sources.

Facilitates the education of new researchers.

sharing scientific data

Page 12: Data infrastructures for Science

••• 12

Improved access to, and sharing of, research data:

Supports good management of public investment.

Potentially creates strong innovation value chains.

Enhances the value of global co-operations.

Impacts outside science.

sharing scientific data

Policy requirements identified by e-IRG and ESFRI

(availability, preservation/curation, quality, rights of use, interoperability)

Conclusions of eSciDR study coming soon

Page 13: Data infrastructures for Science

••• 13

qualities of repositories

repositoriesinfrastructure

sustained

protected

discoverable

selected contents

managed

trusted

concern for quality

organisational context

source: eSciDR study (adapted)

Page 14: Data infrastructures for Science

••• 14

Council of European Union, 22/23 Nov 2007: Considering

Access to and dissemination of publications and data crucial for the European Research Area and innovation

Effective long lasting preservation is fundamental

Invites the Member States / CommissionEnhance coordination between MS and large research organisation and funding bodies on access, preservation and dissemination policies and practices

Experiment OA to data and publication from EU projects

Encourages research into digital preservation

Wide deployment of scientific data infrastructures with cross border, cross institution and cross discipline value added for OA and preservation

council conclusions

Page 15: Data infrastructures for Science

••• 15

working with scientific communities

EGEEDEISA

DRIVER

GÉANT

neuGRIDEUFORIAD4SCIENCEETSF

IMPACTMETAFOREuroVO-AIDAGENESI-DR

netw

ork

mid

dle

ware

data

generic e-Infrastructure… user communities involvement

FEDERICAEVALSOEDGES

Page 16: Data infrastructures for Science

••• 16

FP7: data repositories

IMPACT

NMDB

DRIVER II

METAFOR

EuroVO-AIDA

GENESI-DR

bio-informatics

space physics

federated digital repositories

climatology

astronomy

geosciences

Page 17: Data infrastructures for Science

••• 17

data repositories projects - highlights

IMPACT

unifies data from 10 major databases related to protein families.

NMDB

establishes a digital repository for cosmic-ray data, and develops a real-time database from many neutron monitoring stations.

METAFOR

defines a Common Information Model (CIM) for climate data handling heterogeneous metadata stand.

Page 18: Data infrastructures for Science

••• 18

data repositories projects - highlights

EuroVO-AIDA

unifies digital data collections of astronomy, integrating European data centres into a global Virtual Observatory.

GENESI-DR

provides access to earth science digital repository (data from space, airborne, in-situ sensors).

DRIVER II

federates scientific repositories based on open standards, supporting complex information objects, cross-discipline.

Page 19: Data infrastructures for Science

••• 19

FP7: user communities/support actions*

EUFORIA

neuGRID

D4Science

ETSF

 e-NMR

DORII

 PESI

PARSE.insight

magnetic fusion

medical e-Support

environment - e-Infrastructure

spectroscopy

 data analysis - biology

environmental science, seismology

biodiversity

permanent access records of science

* Proposals under negotiation at this stage

Page 20: Data infrastructures for Science

••• 20

data deluge?

The animals boarding Noah’s Ark two by two

painting by the American Edwards Hicks (1780-1849)

Page 21: Data infrastructures for Science

••• 21

Modern Science makes increasing use of ICTs to change the way it is operating.

Amount and importance of scientific data produced is growing exponentially.

e-Infrastructure must support the management of data throughout the whole life cycle (including curation and long term preservation).

Data “is” an infrastructure for Science; scientific data infrastructure is therefore a key perspective in building a European e-Infrastructure.

Europe launched the basis for a consistent action (studies, policies, projects).

conclusions

Page 22: Data infrastructures for Science

••• 22

further information

www.cordis.europa.eu/fp7/ict/e-infrastructure/