29
The Future of Data-Centric Computing Babak Falsafi ecocloud.ch

The Future of Data-Centric Computing

  • Upload
    haliem

  • View
    225

  • Download
    2

Embed Size (px)

Citation preview

The Future of Data-Centric ComputingBabak Falsafiecocloud.ch

Information Technology (IT): Five Decades of Exponential Growth

IT is at the core everything we do & has become an indispensable pillar for a modern day society!

Intel 4004, 1971

Intel Xeon, 2014

92,000 ops/sec

266,000,000,000 ops/sec

A Brief History of IT

§ From computing-centric to data-centric§ Consumer Era: Internet-of-Things in the Cloud

1970s-

PC Era

Mobile Era

Mainframes1980s 1990s Today+

Consumer Era

Two Inflection Points for IT

1. Big Data§ Data growth at 1.5x/year§ Huge demand on scaling platforms

2. Platforms§ Efficiency scaling stopped ten years ago § Silicon density scaling has slowed down§ IT growth not sustainable

Data is Shaping Future of IT

§ Data growth (by 2015) = 100x in ten years [IDC 2012]§ Population growth = 10% in ten years

§ Monetizing data for commerce, health, science, services, ….§ Big Data is shaping IT & pretty much whatever we do!

Data Shaping All Science & Technology

Science entering 4th paradigm§Analytics using IT on

§ Instrument data§ Simulation data§ Sensor data§ Human data § …

Complements theory, empirical science & simulation

Data-centric science key for innovation-based economies!

Datacenters Growing Fast

BC 332.

Source: James Hamilton, 2012

Daily IT growth in 2012 = IT first five years of business!

Warning! Datacenters are not Supercomputers

• Run heterogeneous data services at massive scale• Driven for commercial use• Fundamentally different design, operation, reliability, TCO

• Density 10-25KW/rack as compared to 25-90KW/rack• Tier 3 (~2 hrs/downtime) vs. Tier 1 (upto one day/downtime)• ……and lots more

Datacenters are the IT utility plants of the future

Supercomputing Cloud Computing

But, platforms are not scaling2transistors>1xenergy1transistor=1xenergy

2yearslater

1transistor=1xenergy

2yearslater

2transistors=1xenergy

Good old days (1970~2004):– Used to make transistors smaller– Smaller transistors less electricity to operate– Chip energy consumption remained ~ same

Now (2004-):– Continue to make transistors smaller– But, they use similar electricity to operate– Chip energy consumption is shooting up

[source: ITRS]

The fundamental energy silver bullet is gone!

Today

Projections

0

0.2

0.4

0.6

0.8

1

1.2

2001 2006 2011 2016 2021 2026

Powe

r Sup

ply V

dd

2001

2013

Slope = .053

Slope = .014

Operating voltages leveling

The Rise of Parallelism to Save the Day

With voltages leveling:§Parallelism has emerged as the only silver bullet

§Use simpler cores §Prius instead of Audi R8§GPU’s instead of CPU’s

§Restructure software§Each core è

fewer joules/op

MulticoreScaling

Conven

tionalServer

CPU(e

.g.,Xe

on)

Mod

ernMulticore

CPU(e

.g.,Tilera)

The Rise of Dark Silicon:End of Multicore/GPU Scaling

But parallelism can not offset leveling voltages

Even in servers with abundant parallelism

Core complexity has leveled off too!

Need a holistic approach Hardavellas et. al. Toward Dark Silicon in Servers

IEEE Micro, 2011

1248

163264

128256512

1024

2004 2007 2010 2013 2016 2019

Num

ber o

f Cor

es

Year of Technology

Max EMB CoresEmbedded (EMB)General-Purpose (GPP)

DarkSilicon

Silicon not getting much denser….

0.01

0.1

1

130nm 90nm 65nm 45nm 32nm 22nm 14nm 10nm

$/Transis

tor

Mark Bohr’s (Intel) Keynote [ISSCC’15]

Moore’s Law: $/transistor dropping for fifty years• Intel is pushing for a bit more• TSMC hinted at prices going up for the first time in 2014

Higher Demand + Lower Efficiency: Datacenter Growth not Sustainable!

§Modern datacenters è 20 MW!§ In modern world, 6% of all electricity and growing fast!

Billio

n Ki

low

att h

our/y

ear

2001 2005 2009 2013 2017 04080120160200240280 Datacenter

Electricity DemandsIn the US(source: Energy Star)

65 million Swiss homes

A Modern Datacenter

17x football field, $3 billion

? IT’s Future

BridgingTechnologies

Big Data Big Energy

Center to bring efficiency to data§ 16 faculty, 50 researchers§ 6M CHF/year external funds

Mission:§ Energy-efficient data-centric IT§ From algorithms to infrastructure§ Maximizing value for data

ecocloud.ch

Our Vision:Holistic Optimization of Datacenters

Holistic optimization• Algorithms to

infrastructure• Cross-layer• Paradigms to monitor,

manage & reduce energy

Open technologies!

Algorithm

Infr

astr

uctu

re

Our Vision:The ISA Triangle of Efficiency

Approximation(Tailor Precision for Fewer Joules)

Holistic Optimization

Integrated Cooling: CMOSAIC

3D server chipTwo-phase liquid cooling

§ Enables higher thermals§ Dramatically better heat removal

Prototyped by IBM

PCB

Micro-HeaterLiquid

Micro-Channels

Integrated Compute in Memory

Why in-memory? • Minimize data movement & energy• Leverage DRAM's massive internal BW

Basic data services:• Scan, Join, GroupBy, Filter• Best for sequential access• Accelerators must co-exist with

conventional memory semantics

10x better efficiency for a database join operation!

Integrated Thermal Management @ Credit Suisse

• Designed fine-grain power/thermal sensors

• Deployed with real-time monitoring of 5K servers

• Software for synergistic load/thermal provisioning

50% better efficiency!

In-Memory AnalyticsRecommendation System

Media StreamingNginx, HTTP Server

Web Search Apache Solr & Nutch

Web ServingNginx, PHP server

Data ServingCassandra NoSQL

Graph AnalyticsGraphX

Data CachingMemcached

Data AnalyticsMachine learning

Cloud Benchmarking with CloudSuite 3.0 (parsa.epfl.ch/cloudsuite)

Building block for Google PerfKit, EEMBC Big Data!

Specialized Servers:Cavium ThunderX

48-core 64-bit ARM SoC[blueprinted at EPFL]:• Designed to serve data• Custom organization• Runs off-the-shelf software stack• 10x better faster than Xeon

Spatial Computing: An idea whose time has come

• One StratixV FPGA per blade• DSL’s (e.g., Scala, TensorFlow)• Analytics with spatial computing

Microsoft Unveils Catapult to Accelerate Bing!

[EcoCloud Annual Event, June 5th, 2014]

dbtoaster.org

Compilation of offline analytics into online/incremental engines Aggressive code specialization

Data Stream

Low-latency stream processing Up to 6 OOM faster than commercial systems

110100100010000100000100000010000000

Q3 Q9 Q10 Q11 Q14 Q15 Q21

Specialized Databases:Breaking Up Data Jams

Big Data AnalyticsConvex machine learning models

§ Broad set of applications: sparse SVM’s, low-rank matrix completions... § Bigger data è faster algorithms for the same statistical risk!

Lecture 0: Motivation

Role of convex analysis and optimization

I Convex models Fı = minxœRp

{F(x) := f (x) + g(x) : x œ �}

I Key advantages

1. convex geometry: provable estimation & noise stability

2. scalable algorithms: computation vs. accuracy tradeo�s

3. superb practical performance: de facto standard

key concepts: time-data tradeo�s for approximate estimation / prediction

0.3 0.4 0.5 0.6 0.7 0.8 0.90

100

200

300

400

500

Ite

ratio

ns

Number of samples/dimension (n/p)

0.3 0.4 0.5 0.6 0.7 0.8 0.90

1

2

3

4

5

µ

Iterations to solve elastic net

Elastic net parameter µ

0.3 0.4 0.5 0.6 0.7 0.8 0.90

5

10

15

20

25

30

Number of samples/dimension (n/p)

Tim

e (

s)

Prof. Volkan Cevher [email protected] Mathematics of Data: From Theory to Computation

Algorithms to simplify problem as data size gets larger!

Bringing it All Together

Two inflection points for IT§Data growing at an exponential rate§Platform scaling is a grand challenge§Future IT platforms designed around data

Holistic approaches to datacenter designIntegrate + Specialize + Approximate (ISA)

For more information please visit us at ecocloud.ch

Thank You!

7 Giants of Analytics[National Academies, 2013]

§Basic statistics§N-body problem§Graph theory§Linear algebra§Generalized Optimization§Computing integrals§Alignment problems