33
1 Dati: la quinta rivoluzione dell’Information Technology Mario Rasetti ISI Foundation Torino ISI Global Science Foundation – New York

Rasetti fondazioneisi 29_06_2015

Embed Size (px)

Citation preview

Page 1: Rasetti fondazioneisi 29_06_2015

1

Dati: la quinta rivoluzione dell’Information Technology

Mario Rasetti

ISI Foundation – Torino ISI Global Science Foundation – New York

Page 2: Rasetti fondazioneisi 29_06_2015

The context: il mondo in cui viviamo

Page 3: Rasetti fondazioneisi 29_06_2015
Page 4: Rasetti fondazioneisi 29_06_2015
Page 5: Rasetti fondazioneisi 29_06_2015

Italy 44.5 50.8 1950 to 1990

Page 6: Rasetti fondazioneisi 29_06_2015

Le Città

Page 7: Rasetti fondazioneisi 29_06_2015

‣ process & workflow analytics ‣ location-aware services ‣ organizational science ‣ social science & health ‣ infectious disease dynamics

Internet of (Every)thing: nuove interazioni fra persone, oggetti, luoghi

Page 8: Rasetti fondazioneisi 29_06_2015
Page 9: Rasetti fondazioneisi 29_06_2015

Our world is more and more complex and data-based:

4.7 billion people own a cell phone, every day over 410 billion e-mails and 35 billion SMSs are exchanged, 700 million pictures are uploaded on Facebook. The information created and exchanged yerarly (2014) is 6 zettabyte of data (1 zettabyte = 1021 bytes: the 1250 pages of Tolstoj’s War and Peace would be contained 323 billion times in it, and the whole Library of Congress of Washinton 4 million times) and every year grows 40% (in 3 years we’ll reach a yottabyte, 1024).

A world in which every year 1.4 billion cars circulate, over 2.5 billion people fly on airplanes, the growth of population, urbanization, commercial exchanges, global migrations is more and more entangled with that of technologies, generating one interconnected ‘socio-technical’ system.

Page 10: Rasetti fondazioneisi 29_06_2015

Let’s concentrate first on the

Big Data Big Challenge,

that of extracting the large amount of information flowing in and around the complex systems we live with, itself available through huge amounts of data Big Data have a variety of diverse features:

• BD in science (e.g., Hubble, Genoma, CERN) : typically well organized in well built data-bases

• BD in society : allow for a true TOMOGRAPHY of SOCIETY and make possible ‘predictions’ not reasonable before (cf., e.g., H1N1 pandemics, 2009)

Page 11: Rasetti fondazioneisi 29_06_2015

or targets/strategies:

BD hardware challenge (HPC: high performance computing); e.g., quantum: a mix of limits & hopes

BD manipulation the greatest computer science challenges: beyond Turing, interaction-based computing, data mining:

extracting value from data

i.e., turning 'Big Data Science' into an 'ICT Big Science’, coupling methods and data with theories and models, endowing ICT with new, more efficient tools to turn

data into information,

information into knowledge,

and eventually

knowledge into wisdom.

as ICT becomes integral part of the fabric of nature and society.

Page 12: Rasetti fondazioneisi 29_06_2015

? IT is facing what is referred to as its fifth revolution; the next step after the mainframe era, the PC era, the Internet and Web 1.0 era, the mobile and Web 2.0 era: the era of Big Data.

The true essence of Big Data is Evidence-based Decision Making

Page 13: Rasetti fondazioneisi 29_06_2015
Page 14: Rasetti fondazioneisi 29_06_2015
Page 15: Rasetti fondazioneisi 29_06_2015

www.whitehouse.gov/issues/technology/big-data-review

Page 16: Rasetti fondazioneisi 29_06_2015

• Big data is saving lives. […] By collecting and analyzing millions of data points from a neonatal intensive care unit, one study was able to identify factors, like slight changes in body temperature and heart rate, that serve as early warning signs an infection may be taking root […]

• Big data is making the economy work better. […] Utility companies

are starting to use big data to predict periods of peak electric demand, adjusting the grid to be more efficient and potentially averting brown-outs.

• Big data is saving taxpayer dollars. The Centers for Medicare and

Medicaid Services have begun using predictive analytics […] to flag likely instances of reimbursement fraud before claims are paid. […] has already stopped, prevented, or identified $115 million in fraudulent payments.

opportunità

Page 17: Rasetti fondazioneisi 29_06_2015

• Big data tools can alter the balance of power between government and citizen. […] government uses of big data also have the potential to chill the exercise of free speech or free association. As more data is collected, analyzed, and stored on both public and private systems, we must be vigilant in ensuring that balance is maintained between government and citizens […]

• Big data tools could lead to discriminatory outcomes. As more

decisions about our commercial and personal lives are determined by algorithms […] we must pay careful attention that big data does not systematically disadvantage certain groups, whether inadvertently or intentionally.

• Big data tools can reveal intimate personal details. […] data

fusion can also lead to the so-called “mosaic effect,” whereby personally identifiable information can be discerned even from ostensibly anonymized data. […]

rischi hi

Page 18: Rasetti fondazioneisi 29_06_2015

MacKinsey BD Report (2013)

(USA)

IBM estimate (2014) - worldwide

Page 19: Rasetti fondazioneisi 29_06_2015

the big picture

the digital image of the world is tracking the world more and more closely:

use computation to extract patterns and establish causal inferences using tools from data mining, machine learning, statistics

mathematical modeling and forecast happen on a data-rich landscape and are fed by data streams from multiple sources

we can assess our models against reality at unprecedented speed and scale, and feed this back to models

society at large is involved, besides nature

Page 20: Rasetti fondazioneisi 29_06_2015
Page 21: Rasetti fondazioneisi 29_06_2015
Page 22: Rasetti fondazioneisi 29_06_2015
Page 23: Rasetti fondazioneisi 29_06_2015
Page 24: Rasetti fondazioneisi 29_06_2015

the role of Complexity Science

Page 25: Rasetti fondazioneisi 29_06_2015

Bevys of Starlings

Multi-agent – Multi-scale – Emergent effects

25

Page 26: Rasetti fondazioneisi 29_06_2015

www.gov.uk/government/publications/reducing-obesity-obesity-system-map

complessità

Page 27: Rasetti fondazioneisi 29_06_2015

TOPOLOGY-BASED ALGORITHMS

Page 28: Rasetti fondazioneisi 29_06_2015
Page 29: Rasetti fondazioneisi 29_06_2015
Page 30: Rasetti fondazioneisi 29_06_2015

Connection strengths

(shades of gray; 20 brain regions)

Binarization (threshold)

Reordering and modularization

Brain Network

Page 31: Rasetti fondazioneisi 29_06_2015

The data space splits into the direct sum of irreps of G. The general ‘covariance matrix’ of a generic machine learning algorithm becomes ‘block-diagonal’: all zeroes are pushed to the upper-right / lower-left corners.

Page 32: Rasetti fondazioneisi 29_06_2015
Page 33: Rasetti fondazioneisi 29_06_2015

Data Driven Approach

Theory & Models

Mathematics & Foundation of

Complex Systems

Data Science Big Data

Computational Social Science

Collective Phenomena in Physics & Materials Science

Quantum Science &

Complexity

Complexity Science

Computational Epidemiology & Public Health

Citizen Science & Smart Cities

ISI Foundation: research