46
Horizon 2020 Coordination and Support Action GARRI-3-2014 Scientific Information in the Digital Age: Text and Data Mining (TDM) Project number: 665940 Increasing Uptake of Text and Data Mining in the EU FutureTDM Reducing Barriers and Increasing Uptake of Text and Data Mining for Research Environments using a Collaborative Knowledge and Open Information Approach Melanie Imming, LIBER, FutureTDM Workshop, European Parliament, Brussels 29 March 2017

FutureTDM Workshop II 29 March

Embed Size (px)

Citation preview

Page 1: FutureTDM Workshop II 29 March

OpenDataMonitor

Horizon 2020

Coordination and Support Action

GARRI-3-2014 Scientific Information in the Digital Age: Text and Data Mining (TDM)

Project number: 665940

Increasing Uptake of Text and Data Mining in the EU

FutureTDMReducing Barriers and Increasing Uptake of Text and Data Mining for Research Environments

using a Collaborative Knowledge and Open Information Approach

Melanie Imming, LIBER, FutureTDM Workshop, European Parliament, Brussels

29 March 2017

Page 2: FutureTDM Workshop II 29 March

What we are talking about today: an overview of the Workshop

A reminder of what we mean by TDM

An overview of the project and its aims

Presentations on our most recent Reports

− Economics of TDM

− Policy Recommendations

Later today:

Session 2: What stakeholders can do to improve TDM in their

sector – developing guidelines

2FutureTDM

Page 3: FutureTDM Workshop II 29 March

“the discovery by computer of new, previously unknown information, by automatically extracting and relating information from different (…)resources, to reveal otherwise hidden meanings” (Hearst, 1999)

What is TDM?

3

ICT Availability

Computing

Power

Data

16 trillion

gigabytes of

data by 2020

(236% growth)

Doubles every

2 years

(Moores Law,

1965)

Over 80% EU

citizens have

internet access

(Eurostat 2014)

Page 4: FutureTDM Workshop II 29 March

Potential of TDM

4

• Addressing grand challenges such as climate change and global

epidemics

• Improving population health, wealth and development

• Creating new jobs and employment

• Exponentially increasing the speed and progress of science through new

insights and greater efficiency of research

• Increasing transparency of governments and their actions

• Fostering innovation and collaboration and boosting the impact of open

science

• Creating tools for education and research

• Providing new and richer cultural insights

• Speeding economic and social development in all parts of the globe

(The Hague Declaration on Knowledge Discovery)

Page 5: FutureTDM Workshop II 29 March

5

TDM is not a homogeneous, self-contained, scientific domain, but rather a diverse and complex set of methods and technologies

deployed in the framework of diverse disciplines and business activities

The challenge

Page 6: FutureTDM Workshop II 29 March

FutureTDM - the opportunity

The FutureTDM project seeks to improve uptake of text and data mining

(TDM) in the EU by actively engaging with stakeholders such as

researchers, developers, publishers and SMEs.

The use of content mining is significantly lower in Europe

than in some American and Asian countries.

The partners in the FutureTDM consortium share the ambition behind the

EC’s call to develop policy and legal frameworks to reduce the barriers of

TDM uptake and with it, promote the awareness of TDM opportunities

across Europe.

6

Page 7: FutureTDM Workshop II 29 March

FutureTDM

7

ELABORATE a legal and policy framework for future TDM, define policy priorities, specify a research agenda to foster the spread of TDM in various research fields within the EU

BUILD a Collaborative Knowledge Base and an Open Information Hub combined on a web-based platform including intuitive tools

ANALYSE current application areas and trends in TDM including

statistics and key figures, collect relevant research and industrial

projects and best practices

ASSESS existing studies, legal regulations and policies on TDM within the European

Union

Main Objectives of FutureTDM

INVOLVE all key stakeholders to identify practices,

requirements, and specific challenges in the field of

TDM

INCREASE awareness of TDM to attract new target groups and science domains

Page 8: FutureTDM Workshop II 29 March

FutureTDM the importance of the community

FutureTDM: Improving uptake of text and data mining

in the EU

9

Importance of the community:

Making the project accessible

to all stakeholders

Collaboration with other

projects like OpenMinTeD

who are focusing on technical

aspects

Page 9: FutureTDM Workshop II 29 March

Reaching the TDM community – what we’ve been doing

We met with people at events, ran our knowledge cafes and held

interviews across Europe

10

Page 10: FutureTDM Workshop II 29 March

Reaching out to the TDM community - media

Perhaps you saw our videos?

We encouraged people to:

Blog on our website www.futuretdm.eu

Tweet us at @futureTDM #FutureTDM

11

Page 11: FutureTDM Workshop II 29 March

What we have produced - Expert Reports and Awareness Sheets

Awareness sheets highlighting TDM projects,

tools and services, organisations and

interviews

FutureTDM Knowledge Library

• Research report on TDM landscape in Europe

• Baseline report of policies, and barriers of TDM in Europe

• European landscape of TDM applications report

• Compendium of best practices and methodologies

• FutureTDM policy framework

Coming soon – practitioner guidelines and roadmap for TDM uptake

12

Page 12: FutureTDM Workshop II 29 March

Key barriers discussed at our first workshop

13

AWARENESS

TALKING POINTS

REWARD & INCENTIVE LEGAL CLARITY

REDUCE SKILLS GAPS ACCESS TO DATA

Knowledge Discovery

Standardisation & Harmonisation

Risk Reduction Licensing?

Privacy Protection & Ethics

Useful data formats

Interoperability - Tools & Resources

No Hidden Barriers

Discoverability

User Friendly

Inclusion Support

Better Data Management Collaboration Across Fields

Promote Innovation

Commercial & Non-Commercial Interaction

Who licenses the internet?

Main users - Industry

Skills lacking

Page 13: FutureTDM Workshop II 29 March

Workshop 1 some feedback

“Anyone with lawful access to data should be allowed to conduct

TDM”. (Advocacy stakeholder)

“The perception seems to be that legal barriers are very significant.

This is true but other issues such as skills and infrastructure are

equally if not more significant and these will take a long time to

address” (Consultant)

“Definitions of TDM may actually be the largest barrier in the

copyright/legal discussions on making progress” (Legal

Practice)

“Licensing remains for us the most certain legal solution,

achieving a good balance between different stakeholders”

(Publisher)

“We need a copyright exception for TDM for commercial and non-

commercial entities” [Science and Innovation Researcher]

14

Page 14: FutureTDM Workshop II 29 March

Now we want to help you

The purpose of the workshop today is to move on from the barriers and talk

solutions. We want positivity, best practice and recommendations to help EU

maximise the TDM opportunity

That’s why we will present:

▪ Economics and impact of TDM

▪ EU Policy recommendations

and later today...

▪ what this might look like in practice for stakeholders

In particular we welcome from you the solutions you see for TDM policy from your

sector’s perspective - how this might look at EU level and in your own environment.

A workshop blog and report will be published on the website after the event

15

Page 15: FutureTDM Workshop II 29 March

What’s next?

Please continue to make your thoughts on this visible to the project so we can continue to

shape best practice:

● tweet today using hashtag #FutureTDM

● write a guest blog on our website

● fill in our feedback cards here today in session 2

● Fill in our quick poll online at www.futuretdm.eu

Save the date 13 June 2017 for for our final Symposium in Salzburg where our

stakeholder guidelines and roadmap for TDM uptake will be made available

16

Page 16: FutureTDM Workshop II 29 March

Our engagement so far...

17

17

Join the Community!

FutureTDM Collaborative Knowledge Base and Open Information Hub

Platform: http://futuretdm.eu

Expert Blog Best Practice Catalogue

Project Database

Tool Selector

Baseline Reports

Comparative Analysis

FutureTDM Guidelines

Leading Projects Area (promoting/knowledge exchange - e.g. OpenMinTeD)

Landscape Overview

Page 17: FutureTDM Workshop II 29 March

OpenDataMonitor

Horizon 2020

Coordination and Support Action

GARRI-3-2014 Scientific Information in the Digital Age: Text and Data Mining (TDM)

Project number: 665940

TDM – Economic Perspective

FutureTDMReducing Barriers and Increasing Uptake of Text and Data Mining for Research Environments

using a Collaborative Knowledge and Open Information Approach

Jan Strycharz, FutureTDM Workshop 2, European Parliament, Brussels

29th March 2017

Page 18: FutureTDM Workshop II 29 March

TDM & Big Data

19FutureTDM

Page 19: FutureTDM Workshop II 29 March

Value is KEY

20

Page 20: FutureTDM Workshop II 29 March

The Meaning of Value

21

Page 21: FutureTDM Workshop II 29 March

22

Page 22: FutureTDM Workshop II 29 March

23

Translating data…

Under-standing

SelectingPrepro-cessing

Transfor-mation

Choosing the method

ExploringDiscovering

patternsInterpreting

…into action

TDM Process

Page 23: FutureTDM Workshop II 29 March

Types of Data Analyzed #1

24

Page 24: FutureTDM Workshop II 29 March

Types of Data #2

25

Page 25: FutureTDM Workshop II 29 March

How Much - Market

26

Europe 2016 2021

Big Data 9.4 B USD 29.4 B USD

TDM 2.5 B USD 10.3 B USD

World 2016 2019

Big Data 26 B USD 48 B USD

TDM 7 B USD 17 B USD

Page 26: FutureTDM Workshop II 29 March

How Much – Economic Impact

27

Page 27: FutureTDM Workshop II 29 March

Value

28

1:5

1:11

Page 28: FutureTDM Workshop II 29 March

How much – Economic Impact

29

Page 29: FutureTDM Workshop II 29 March

Obstacles

30

Page 30: FutureTDM Workshop II 29 March

Organizational Culture – Data Savvy

31

Page 31: FutureTDM Workshop II 29 March

Organizational Culture – Data Savvy

32

REFERENCES TO ECONOMIC PERSPECTIVES:

Barel, Yannick. Big Data. Big Opportunity, IBM: Power Systems.

Bradshaw, David, Gabriella Cattaneo, Rosanna Lifonti, John Simcox. 2014. Uptake of Cloud in Europe. Follow-up of

IDC Study on Quantitative estimates of the demand for Cloud Computing in Europe and the likely barriers to take-up,

Framingham: IDC Analyze the Future.

Brown, Brad, Jacques Bughin, Angela Hung Byers, Michael Chui, Richard Dobbs, James Manyika, Charles Roxburgh.

2011. Big data: The next frontier for innovation, competition and productivity, McKinsey Global Institute

Buchholtz, Sonia, Maciej Bukowski, Aleksander Śniegocki. 2014. Big and open data in Europe. A growth engine or a

missed opportunity?, Warszawa: demosEUROPA, Warsaw Institute for Economic Studies.

Buytendijk, Frank. 2014. Hype Cycle for Big Data, Stamford: Gartner.

Danson, Forrest et al. (red.). 2016. Analytics Trends 2016. The Next Evolution, New York: Deloitte.

European Commission, Digital Market Factsheet (2015)

European Parliament. 2016. Towards a thriving data-driven economy, Strasburg.

Forbes Insights. 2015. Betting on Big Data. How the right culture, strategy and investments can help you leapfrog the

competition. Washington.

Fujitsu. 2014. Fujitsu Market Insights. Big Data, Fujitsu Technology Solutions GmbH.

Girard, Greg. From Big Data to Real-Time Business Value, Framingham: IDC Analyze the Future.

Heudecker, Nick, Lisa Kart. 2014. Survey Analysis: Big Data Investment Grows but Deployments Remain Scarce in

2014, Stamford: Gartner.

Heudecker, Nick, Merv Adrian. 2015. Survey Analysis: Hadoop Adoption Drivers and Challenges, Stamford: Gartner.

Heudecker, Nick, Lisa Kart. 2015. Survey Analysis: Practical Challenges Mount as Big Data Moves to Mainstream,

Stamford: Gartner.

Kart, Lisa. 2015. Big Data Industry Insights, Stamford: Gartner.

Page 32: FutureTDM Workshop II 29 March

OpenDataMonitor

Horizon 2020

Coordination and Support Action

GARRI-3-2014 Scientific Information in the Digital Age: Text and Data Mining (TDM)

Project number: 665940

Overcoming legal barriers

FutureTDMReducing Barriers and Increasing Uptake of Text and Data Mining for Research Environments

using a Collaborative Knowledge and Open Information Approach

FutureTDM Workshop II | Brussels | March 29, 2017

Marco Caspers (Institute for Information Law, University of Amsterdam)

Page 33: FutureTDM Workshop II 29 March

Introduction

To discuss:

▪Our work in FutureTDM

▪Policy framework

▪Principles

▪Designing a TDM exception

34FutureTDM

Page 34: FutureTDM Workshop II 29 March

Our work

• Desk research•regulation

•policies

•economy

• Stakeholder engagement•interviews

•workshops

•conferences

• Results:•Barriers to TDM

•Legal & policies

•Skills and education

•Economy

•Technical & infrastructure

•Recommendations!

Page 35: FutureTDM Workshop II 29 March

FutureTDM Policy Framework

• Deliverable D5.1 recommendations cover:•Legal & Policies

•Skills & Education

•Economy & Incentives

•Technical & Infrastructure

• This presentation:•Only legal recos in context of TDM exception

•Underlying research•Legal research into copyright and database law

•Questionnaire on national exceptions

•Response: 16 Member States•Stakeholder engagement

•Views on legal rules

Page 36: FutureTDM Workshop II 29 March

Fundamental principles

37

Page 37: FutureTDM Workshop II 29 March

Scope and design of TDM exception

• From our policy framework:• “Ensure legal rules reflect a fair balance

between the interests of TDM

practitioners and rightsholders (of

copyright and database rights), and

reaffirm that ideas and facts as such are

not protected”

• “Create harmonised and mandatory

rules”

• “Make clear rules” minimise

borderline cases

38

Acts?

Purpose?

Beneficiary?

Overridable?

Subject matter?

Page 38: FutureTDM Workshop II 29 March

Acts

• Reproductions & Extractions• UK (only repro) | EU

• Digital copies• FR

• FutureTDM:• All reproductions and extractions

• No visibility for public• only aggregated information

• No trade on expression

extraction of facts and ideas• No limits to retention

• Reproducibility & verifiability

• Or else: other exception?

39

Acts?

Purpose?

Beneficiary?

Overridable?

Subject-matter?

Page 39: FutureTDM Workshop II 29 March

Purpose

• Scientific research• UK | DSM proposal

• Non-commercial• UK | France

• FutureTDM:• No restriction

• prevent border cases

40

Acts?

Purpose?

Beneficiary?

Overridable?

Subject-matter?

Page 40: FutureTDM Workshop II 29 March

Beneficiary

• Lawful access• UK | FR | DSM proposal

• Research organisation• DSM proposal

• trade-of with ‘non-commercial’

• FutureTDM• Lawful access

• If allowed to read, allowed to mine• trade on ideas, not expression

41

Acts?

Purpose?

Beneficiary?

Overridable?

Subject-matter?

Page 41: FutureTDM Workshop II 29 March

Overridable

• Overridable by contract• Not: UK | FR | DSM proposal

• technical overridability?

• FutureTDM• Not overridable

• Fragmented licensing landscape

• Effectiveness

42

Acts?

Purpose?

Beneficiary?

Overridable?

Subject-matter?

Page 42: FutureTDM Workshop II 29 March

Subject-matter

• No restriction• UK | EU

• Scientific literature• FR

• FutureTDM• No restriction

• Copyright principles apply horizontally

• Uncertainty borderline cases

43

Acts?

Purpose?

Beneficiary?

Overridable?

Subject-matter?

Page 43: FutureTDM Workshop II 29 March

Compensation

• UK | France | EU• No

• Considerations for compensation• purpose: missed out on exploitation of the work (expression)

• no where meant to cover investments (in services)

Page 44: FutureTDM Workshop II 29 March

Measures

• In essence no copyright issue

• Overridability by (technical) security measures?•if yes: may not change status quo

•if no: must achieve fair balance

• How to balance?•voluntary ‘agreed’ best practices?

•service-level obligations?

•transparency?

Page 45: FutureTDM Workshop II 29 March

What’s next?

Want to know more?

• See our Deliverable D5.1 “FutureTDM Policy Framework”

• and upcoming deliverables:•Guidelines for stakeholders (end April)

• You can provide your view in our next session now

Page 46: FutureTDM Workshop II 29 March

FutureTDM Workshop II, 29 March – Any Questions?

Q & A

All slides will be on FutureTDM slideshare after the event

#FutureTDM

47