103
From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley National Laboratory and EECS Dept., UC Berkeley [email protected] Scicomp 16/SPXXL San Francisco, May 12, 2010

From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

From Bits to Buildings: Energy Efficiency and the Path to Exaflops

Horst D. Simon Lawrence Berkeley National Laboratory

and EECS Dept., UC Berkeley [email protected]

Scicomp 16/SPXXL San Francisco, May 12, 2010

Page 2: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Acknowledgements A large number of individuals have contributed to energy efficiency in computing at Berkeley Lab, UC Berkeley, and to this presentation: David Bailey (CRD), Michael Banda (CRD), Michael Bennett (ITD), Shoaib Kamil (CRD), Jonathan Koomey (Stanford), Randy Katz (EECS), Tsu Jae King (EECS), Chuck McParland (CRD), Juan Meza (CRD), Bruce Nordman (EETD), Lenny Oliker (CRD), Ekow Otoo (CRD), Vern Paxson (UCB/ICSI/CRD), Doron Rotem (CRD), Dale Sartor (EETD), John Shalf (NERSC), Erich Strohmaier (CRD), Bill Tschudi (EETD), Howard Walter (NERSC), Michael Wehner (CRD), Kathy Yelick (NERSC/CRD) … and many others

Check the LBNL website for more details:

http://www.lbl.gov/cs/html/energy_efficient_computing.html

Page 3: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Outline

Trends in Power Consumption and Energy Efficiency for HPC

Building and infrastructure problem -- continued increase in demand for computing (“buildings”)

Computer technology problem -- no more power density scaling (“bits”)

Page 4: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Summary of the Talk

Power has become THE dominant problem in computing

Page 5: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Energy Production and Use in the U.S.

10-8-2008 5

Page 6: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Why does saving energy matter?

Page 7: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Source: Art Rosenfeld, California Energy Commission, http://www.energy.ca.gov/commission/commissioners/rosenfeld_docs/index.html

1946 1973 2005

Page 8: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

An Honest Question?

Does the HPC community really care about reducing the carbon footprint?

NO!

Page 9: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

HPC Interests

•  Energy efficiency in computer rooms – Spend more resources on computing

than on infrastructure •  Energy efficient technology

– Maintain performance growth and get things done that could not be done before

Page 10: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Khazzoom-Brookes Postulate

•  Energy efficiency at the micro-level leads to higher energy consumption at the macro-level – cheaper energy increases use –  increased energy efficiency leads to

economic growth –  increased efficiency in one bottleneck

resource increases use of companion technologies

•  HPC follows Khazzoom-Brookes

Page 11: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Energy and IT •  “Big IT” – all electronics

–  PCs / etc., consumer electronics, telephony •  Residential, commercial, industrial

–  More than 200 TWh/year

–  $16 billion/year •  Based on .08$/KWh

–  Nearly 150 million tons of CO2 per year •  Roughly equivalent to

30 million cars!

One central baseload power plant (about 7 TWh/yr)

Numbers represent U.S. only

Page 12: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

… and IT electricity use is increasing data taken from: Jonathan Koomey, “Estimating Total Power Consumption by Servers in the U.S. and the World”

Available at: http://www.koomey.com/publications.html

Page 13: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Worldwide IT Carbon Footprint

13

820m tons CO2

360m tons CO2

260m tons CO2

2007 Worldwide IT���carbon footprint:���2% = 830 m tons CO2 Comparable to the���global aviation ���industry

Expected to grow ���to 4% by 2020

Page 14: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

2020 IT Carbon Footprint

14

“SMART 2020: Enabling the Low Carbon Economy in the Information Age”, The Climate Group

USA China Telecoms DC PCs

Datacenters: Owned by single entity interested in reducing opex

billion tons CO2

Page 15: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Don’t forget: not all energy use produces carbon

Find out the power mix from your local utility!

Page 16: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Performance  Development  

0.1  

1  

10  

100  

1000  

10000  

100000  

1000000  

10000000  

100000000  

1 Gflop/s

1 Tflop/s

100 Mflop/s

100 Gflop/s

100 Tflop/s

10 Gflop/s

10 Tflop/s

1 Pflop/s

100 Pflop/s

10 Pflop/s

59.7  GFlop/s  

400  MFlop/s  

1.17  TFlop/s  

1.75  PFlop/s  

20.05  TFlop/s  

27.9  PFlop/s  

SUM  

N=1  

N=500  

Page 17: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

0  

1000  

2000  

3000  

4000  

5000  

6000  

7000  

0   50   100   150   200   250   300   350   400   450   500  

Power  KWa>

s  

TOP500  Rank  

Absolute  Power  Levels  

Page 18: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Processor Technology Trend • 1990s - R&D computing hardware dominated by

desktop/COTS – Had to learn how to use COTS technology for HPC

• 2010 - R&D investments moving rapidly to consumer electronics/ embedded processing

– Must learn how to leverage embedded processor technology for future HPC systems

Page 19: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Consumer Electronics has Replaced PCs as the Dominant Market Force in CPU Design!!

Apple Introduces

IPod"

IPod+ITunes exceeds 50% of Appleʼs Net Profit"

Apple Introduces Cell Phone

(iPhone)"

Page 20: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Power  Efficiency  related  to  Processors  

Page 21: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Koomey’s Law

•  Computations per kWh have improved by a factor about 1.5 per year

•  “Assessing Trends in Electrical Efficiency over Time”, see IEEE Spectrum, March 2010

Page 22: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Trend Analysis

•  Processors and Systems have become more energy efficient over time –  Koomey’s Law shows factor of 1.5

improvement in kWh/computations •  Supercomputers have become more powerful

over time –  TOP500 data show factor of 1.86 increase of

computations/sec •  Consequently power/system increases by about

1.24 per year

Page 23: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Outline

Trends in Power Consumption and Energy Efficiency for HPC

Building and infrastructure problem -- continued increase in demand for computing (“buildings”)

Computer technology problem -- no more power density scaling (“bits”)

Page 24: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

The Problem

Source: Luiz André Barroso (Google), “The Price of Performance,” ACM Queue, Vol. 2, No. 7, pp. 48-53, September 2005 (Modified with permission)

Unrestrained IT power consumption could eclipse hardware costs and put great pressure on affordability, data center infrastructure, and the environment.

Page 25: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Top Challenges to Clusters

n = 96 0% 5% 10% 15% 20% 25% 30% 35%

Facility issues noise

Interconnect complexity

3rd-party software costs

I/O performance

Interconnect bandwidth

Supported data storage mechanisms

Facility issues, space, density

Interconnect latency

Complexity of purchase and deployment

Application availability/maturity

Complexity of parallel algorithms

System management capability

Facility issues power, cooling

Page 26: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Responses

•  Cloud •  Containerized data centers •  Large scale data “factories” •  Increased emphasis on computer

room and building efficiency

Page 27: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Cloud Computing is a Operations Model! Not a technology!

•  Premise: it is more cost- and energy efficient to run one large datacenter than many smaller ones

•  How do you enable one large datacenter to appear to be a set of private resources (how to outsource IT) –  IBM and HP on-demand centers: Image nodes on-

demand and long-term contracts –  Amazon EC2: Using VMs to do rapid on-the-fly

provisioning of machines with short-term (hourly) cost model

–  Both cases: technology enables the business model

Page 28: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Magellan – Exploring Cloud Computing

•  National Energy Research Scientific Computing Center (NERSC)

•  Argonne Leadership Computing Facility (ALCF)

•  IBM iDataPlex dx360 M2 –  738 nodes –  2 quad-core Nehalem 2.67 GHz / node –  24 GB DDR3 /node

Page 29: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Performance Comparison of Hadoop and Task Farming

•  Evaluated small-scale BLAST problem (2500 sequences) on multiple platforms (Limited by access and costs)

•  Similar per-core performance across platforms

0

500

1000

1500

2000

2500

3000

32 64 128

Tim

e (s

econ

ds)

Number of Cores

EC2 Hadoop

Planck Hadoop On Demand (HOD)

Planck Task Farming

Franklin Task Farming

29

Page 30: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

NERSC delivers cost-effective computing

30

0.00

0.01

0.02

0.03

0.04

0.05

0.06

0.07

0.08

0.09

0.10

Franklin Carver/Magellan Hopper

$ p

er

core

-ho

ur

Center

SysAdmin

Power & cooling Purchase & Maintenance

Amazon EC2 (Reserved large)

2x12-core 2010 150K cores

2x4-core 2009

8K cores

1x4-core 2007

38K cores

SGI Cyclone Cloud $0.95/core-hour

Page 31: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

The Dark Side of Clouds

•  Interconnect suitable only for loosely coupled applications

•  Practical limits to the size of a cluster

•  Non-uniform execution times (VM jitter)

•  Poor shared disk I/O •  Substantial data storage and

I/O costs •  Still self-supported

31

Page 32: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Containerized Datacenter Mechanical-Electrical Design

32

Page 33: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Data Center Economic Reality (2006)

Source: New York Times, June 14, 2006

•  June 2006 - Google begins building a new data center near the Columbia River on the border between Washington and Oregon

–  Because the location is “at the intersection of cheap electricity and readily accessible data networking”

•  Microsoft and Yahoo are building big data centers upstream in Wenatchee and Quincy, Wash.

–  To keep up with Google, which means they need cheap electricity and readily accessible data networking

“Hiding in Plain Sight, Google Seeks More Power” by John Markoff, NYT, June 14, 2006

Page 34: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Google Dalles Oregon Facility 68,680 Sq Ft Per Pod

Source: Levy and Snowhorn, Data Center Power Trends, February 18, 2008

Page 35: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Microsoft Quincy, Wash. 470,000 Sq Ft, 47MW!

Source: Levy and Snowhorn, Data Center Power Trends, February 18, 2008

Page 36: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Microsoft’s Chicago Modular Datacenter

36

Page 37: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

The Million Server Datacenter

•  24000 sq. m housing 400 containers – Each container contains 2500 servers –  Integrated computing, networking,

power, cooling systems •  300 MW supplied from two power

substations situated on opposite sides of the datacenter

•  Dual water-based cooling systems circulate cold water to containers, eliminating need for air conditioned rooms 37

Page 38: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

•  20-40% savings typically possible •  Aggressive strategies can yield

better than 50% savings •  Extend life and capacity of

existing data center infrastructures

•  But is my center good or bad?

Potential Benefits of Improved Data Center Energy Efficiency:

Page 39: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Benchmarking for Energy Performance Improvement:

Energy benchmarking can allow comparison to peers and help identify best practices

LBNL conducted studies of over 30 data centers:

– Found wide variation in performance

– Identified best practices

Page 40: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

High Level Metric— Data Center Infrastructure Efficiency (DCiE)

Ratio of Electricity Delivered to IT Equipment to Total

Average .57

Higher is better

Page 41: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Focus on PUE •  PUE = “power usage effectiveness” metric

promoted by “Green Grid” •  PUE = total facility power/ computer equipment

power = 1/DCiE •  Reduce PUE by consistent application of facilities

improvements •  Take all PUE claims with a grain of salt

PUE Current Trends 1.9 Improved Operations 1.7 Best Practices 1.3 State-of-the-Art 1.2

Page 42: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

•  Air management •  Right-sizing •  Central plant optimization •  Efficient air handling •  Liquid cooling •  Free cooling •  Humidity control •  Improve power chain •  On-site generation •  Design and M&O processes

Using benchmark results to find best practices:

Page 43: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

UC’s Computational Research and Theory (CRT) Facility

Page 44: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

0

50

100

150

200

250

300

350

400

450

500

Hrs/YR

28 35 42 49 56 63 70 77 84 91 98 105 112

Temperature (F)

Berkeley Weather

hr/yr

0 10 0

11 20 0

21 30 3

31 40 504

41 50 2325

51 60 3153

61 70 1542

71 80 846

81 90 297

91 100 75

101 110 15

111 120 0

Range F

Page 45: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

•  Water-side Economizers – No contamination question – Can be in series with chiller

•  Outside-Air Economizers – Can be very effective (24/7 load) – Must consider humidity

Use Free Cooling:

Page 46: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

•  Air-Side Economizer (93% of hours)

•  Direct Evaporative Cooling for Humidification/ pre-cooling

•  Low Pressure-Drop Design (1.5” total static)

System Design Approach:

Page 47: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

•  Allows multiple temperature feeds at server locations through mixing of CHW & TRW

•  Closed-loop treated cooling water from cooling towers (via heat exchanger)

•  Chilled water from chillers •  Headers, valves and caps for modularity and future

flexibility

Water Cooling: Four-pipe System

Predicted CRT Performance •  DCIE of 0.95 based on annual

energy •  DCIE of 0.88 based on peak

power

Page 48: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

•  Design Guides were developed based upon the observed best practices !!

•  Guides are available through PG&E and LBNL websites!

•  Self benchmarking protocol also available!

Design Guidelines Are Available

http://hightech.lbl.gov/datacenters.html!

Page 49: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Links to Get Started DOE Website: Sign up to stay up to date on new developments www.eere.energy.gov/datacenters

Lawrence Berkeley National Laboratory (LBNL) http://hightech.lbl.gov/datacenters.html

LBNL Best Practices Guidelines (cooling, power, IT systems) http://hightech.lbl.gov/datacenters-bpg.html

ASHRAE Data Center technical guidebooks http://tc99.ashraetcs.org/

The Green Grid Association – White papers on metrics http://www.thegreengrid.org/gg_content/

Energy Star® Program http://www.energystar.gov/index.cfm?c=prod_development.server_efficiency

Uptime Institute white papers www.uptimeinstitute.org

Page 50: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

TALK TO DALE: Join his network to share information and Pull market towards higher efficiency products

Contact Information: Dale Sartor, P.E. Lawrence Berkeley National Laboratory Applications Team MS 90-3111 University of California Berkeley, CA 94720

[email protected] (510) 486-5988 http://Ateam.LBL.gov

Page 51: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Outline

Trends in Power Consumption and Energy Efficiency for HPC

Building and infrastructure problem -- continued increase in demand for computing (“buildings”)

Computer technology problem -- no more power density scaling (“bits”)

Page 52: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

An Early Warning

•  Presented by Shekhar Borkar in Berkeley in November 2000

Page 53: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

53

Power will be a problem

5KW 18KW 1.5KW

500W

4004 8008 8080 8085 8086 286 386 486

Pentium® proc

0.1 1

10 100

1000 10000

100000

1971 1974 1978 1985 1992 2000 2004 2008 Year

Pow

er (W

atts

)

Power delivery and dissipation will be prohibitive

Page 54: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

54

Power density will increase

4004 8008 8080

8085 8086

286 386 486 Pentium® proc P6

1

10

100

1000

10000

1970 1980 1990 2000 2010 Year

Pow

er D

ensit

y (W

/cm

2)

Hot Plate

Nuclear Reactor

Rocket Nozzle

Power density too high to keep junctions at low temp

Page 55: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Traditional Sources of Performance Improvement are Flat-Lining (2004)

•  New Constraints –  15 years of exponential

clock rate growth has ended

•  Moore’s Law reinterpreted: –  How do we use all of those

transistors to keep performance increasing at historical rates?

–  Industry Response: #cores per chip doubles every 18 months instead of clock frequency!

–  multicore

Figure courtesy of Kunle Olukotun, Lance Hammond, Herb Sutter, and Burton Smith

Page 56: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Estimated Exascale Power Requirements

•  LBNL IJHPCA Study for ~1/5 Exaflop for Climate Science –  Extrapolation of Blue Gene and AMD design trends –  Estimate: 20 MW for BG and 179 MW for AMD

•  DOE E3 Report –  Extrapolation of existing design trends to exascale in 2016 –  Estimate: 130 MW

•  DARPA Study –  More detailed assessment of component technologies for

exascale system –  Estimate: more than 120 MW

•  The current approach is not sustainable!

Page 57: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

DARPA Exascale Study

•  Commissioned by DARPA to explore the challenges for Exaflop computing

•  Two model for future performance growth –  Simplistic: ITRS roadmap; power for memory

grows linear with #of chips; power for interconnect stays constant

–  Fully scaled: same as simplistic, but memory and router power grow with peak flops per chip

Page 58: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

From Peter Kogge, DARPA Exascale Study

We won’t reach Exaflops with this approach

Page 59: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

… and the power costs will still be staggering

From Peter Kogge, DARPA Exascale Study

Page 60: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

A decadal DOE plan for providing exascale applications and technologies for DOE mission

needs!

Rick Stevens and Andy White, co-chairs Pete Beckman, Ray Bair-ANL; Jim Hack, Jeff Nichols, Al Geist-

ORNL; Horst Simon, Kathy Yelick, John Shalf-LBNL; Steve Ashby, Moe Khaleel-PNNL; Michel McCoy, Mark Seager, Brent

Gorda-LLNL; John Morrison, Cheryl Wampler-LANL; James Peery, Sudip Dosanjh, Jim Ang-SNL; Jim Davenport, Tom

Schlagel, BNL; Fred Johnson, Paul Messina, ex officio

Page 61: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Process for identifying exascale applications and technology for DOE

missions ensures broad community input!

•  Town Hall Meetings April-June 2007 •  Scientific Grand Challenges

Workshops Nov, 2008 – Oct, 2009 •  Climate Science (11/08), •  High Energy Physics (12/08), •  Nuclear Physics (1/09), •  Fusion Energy (3/09), •  Nuclear Energy (5/09), •  Biology (8/09), •  Material Science and Chemistry (8/09), •  National Security (10/09) •  Cross-cutting technologies (2/10)

•  Exascale Steering Committee •  “Denver” vendor NDA visits 8/2009 •  SC09 vendor feedback meetings •  Extreme Architecture and Technology

Workshop 12/2009 •  International Exascale Software

Project •  Santa Fe, NM 4/2009; Paris, France

6/2009; Tsukuba, Japan 10/2009

MISSION IMPERATIVES

FUNDAMENTAL SCIENCE

Page 62: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Potential System Architecture Targets!

System attributes

2010 “2015” “2018”

System peak 2 Peta 200 Petaflop/sec 1 Exaflop/sec

Power 6 MW 15 MW 20 MW

System memory 0.3 PB 5 PB 32-64 PB

Node performance 125 GF 0.5 TF 7 TF 1 TF 10 TF

Node memory BW 25 GB/s 0.1 TB/sec 1 TB/sec 0.4 TB/sec 4 TB/sec

Node concurrency 12 O(100) O(1,000) O(1,000) O(10,000)

System size (nodes)

18,700 50,000 5,000 1,000,000 100,000

Total Node Interconnect BW

1.5 GB/s 20 GB/sec 200 GB/sec

MTTI days O(1day) O(1 day)

Page 63: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

What are critical exascale technology investments?!

•  System power is a first class constraint on exascale system performance and effectiveness.

•  Memory is an important component of meeting exascale power and applications goals.

•  Programming model. Early investment in several efforts to decide in 2013 on exascale programming model, allowing exemplar applications effective access to 2015 system for both mission and science.

•  Investment in exascale processor design to achieve an exascale-like system in 2015.

•  Operating System strategy for exascale is critical for node performance at scale and for efficient support of new programming models and run time systems.

•  Reliability and resiliency are critical at this scale and require applications neutral movement of the file system (for check pointing, in particular) closer to the running apps.

•  HPC co-design strategy and implementation requires a set of a hierarchical performance models and simulators as well as commitment from apps, software and architecture communities.

Page 64: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

DOE Exascale Technology Roadmap

Key Observations from DOE Exascale Architecture and Technology Workshop, San Diego, Dec. 2009, http://extremecomputing.labworks.org/hardware/index.stm

•  supercomputers are power limited •  the biggest energy delta is off-chip data movement

•  see John Shalf’s presentation after the break

Page 65: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Memory Power Consumption

•  Power Consumption with standard Technology Roadmap

•  Power Consumption with Investment in Advanced Memory Technology

10.6

48

12

FPU

Memory

Interconnect

10.6

6.4

12

FPU

Memory

Interconnect

20 Megawatts total 70 Megawatts total

Page 66: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Memory Technology Bandwidth costs power

0

10

20

30

40

50

60

70

80

90

100

0.01 0.1 0.2 0.5 1 2

Mem

ory

Pow

er C

onsu

mpt

ion

in M

egaw

atts

(MW

)

Bytes/FLOP ratio (# bytes per peak FLOP)

Stacked JEDEC 30pj/bit 2018 ($20M)

Advanced 7pj/bit Memory ($100M)

Enhanced 4pj/bit Advanced Memory ($150M cumulative) Feasible Power Envelope (20MW)

Page 67: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

•  Project by Shalf, Oliker, Wehner and others at LBNL

•  A route to exascale computing –  Target specific machine designs to answer a

scientific question –  Use of new technologies driven by the consumer

market.

Green Flash: Ultra-Efficient Climate Modeling

Page 68: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Ultra-Efficient “Green Flash” Computing at NERSC: 100x over Business as Usual

Radically change HPC system development via application-driven hardware/software co-design –  Achieve 100x power efficiency and 100x

capability of mainstream HPC approach for targeted high-impact applications

–  Accelerate development cycle for exascale HPC systems

–  Approach is applicable to numerous scientific applications

–  Proposed pilot application: Ultra-high resolution climate change simulation

Page 69: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Path to Power Efficiency Reducing Waste in Computing

•  Examine methodology of low-power embedded computing market –  optimized for low power, low cost and high computational

efficiency

“Years of research in low-power embedded computing have shown only one design technique to reduce power: reduce waste.”

⎯ Mark Horowitz, Stanford University & Rambus Inc.

•  Sources of waste –  Wasted transistors (surface area) –  Wasted computation (useless work/speculation/stalls) –  Wasted bandwidth (data movement) –  Designing for serial performance

Page 70: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Design for Low Power: More Concurrency

Intel Core2 15W

Power 5 120W

This is how iPhones and MP3 players are designed to maximize battery life and minimize cost

PPC450 3W

Tensilica DP 0.09W

•  Cubic power improvement with lower clock rate due to V2F

•  Slower clock rates enable use of simpler cores

•  Simpler cores use less area (lower leakage) and reduce cost

•  Tailor design to application to reduce waste

Page 71: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Low Power Design Principles •  IBM Power5 (server)

–  120W@1900MHz –  Baseline

•  Intel Core2 sc (laptop) : –  15W@1000MHz –  4x more FLOPs/watt than baseline

•  IBM PPC 450 (BG/P - low power) –  0.625W@800MHz –  90x more

•  Tensilica XTensa (Moto Razor) : –  0.09W@600MHz –  400x more

Intel Core2

Tensilica DP .09W

Power 5

Even if each core operates at 1/3 to 1/10th efficiency of largest chip, you can pack 100s more cores onto a chip and consume 1/20 the power

Page 72: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Customization Continuum: Green Flash

General Purpose! Special Purpose! Single Purpose!

Cray XT3 D.E. Shaw Anton

MD Grape BlueGene Green Flash

Application Driven!

•  Application-driven does NOT necessitate a special purpose machine •  MD-Grape: Full custom ASIC design

–  1 Petaflop performance for one application using 260 kW for $9M

•  D.E. Shaw Anton System: Full and Semi-custom design –  Simulate 100x–1000x timescales vs any existing HPC system (~200kW)

•  Application-Driven Architecture (Green Flash): Semicustom design –  Highly programmable core architecture using C/C++/Fortran –  Goal of 100x power efficiency improvement vs general HPC approach –  Better understand how to build/buy application-driven systems –  Potential: 1km-scale model (~200 Petaflops peak) running in O(5 years)

Page 73: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Green Flash Strawman System Design We examined three different approaches (in 2008 technology)

Computation .015oX.02oX100L: 10 PFlops sustained, ~200 PFlops peak •  AMD Opteron: Commodity approach, lower efficiency for scientific

applications offset by cost efficiencies of mass market •  BlueGene: Generic embedded processor core and customize system-on-

chip (SoC) to improve power efficiency for scientific applications •  Tensilica XTensa: Customized embedded CPU w/SoC provides further

power efficiency benefits but maintains programmability Processor Clock Peak/

Core (Gflops)

Cores/ Socket

Sockets Cores Power Cost 2008

AMD Opteron 2.8GHz 5.6 2 890K 1.7M 179 MW $1B+ IBM BG/P 850MHz 3.4 4 740K 3.0M 20 MW $1B+ Green Flash / Tensilica XTensa

650MHz 2.7 32 120K 4.0M 3 MW $75M

Page 74: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Climate System Design Concept Strawman Design Study

32 boards per rack

100 racks @ ~25KW

power + comms

32 chip + memory clusters per board (2.7

TFLOPS @ 700W

VLIW CPU: •  128b load-store + 2 DP MUL/ADD + integer op/ DMA

per cycle: •  Synthesizable at 650MHz in commodity 65nm •  1mm2 core, 1.8-2.8mm2 with inst cache, data cache

data RAM, DMA interface, 0.25mW/MHz •  Double precision SIMD FP : 4 ops/cycle (2.7GFLOPs) •  Vectorizing compiler, cycle-accurate simulator,

debugger GUI (Existing part of Tensilica Tool Set) •  8 channel DMA for streaming from on/off chip DRAM •  Nearest neighbor 2D communications grid

Proc Array

RAM RAM

RAM RAM

8 DRAM per processor chip:

~50 GB/s

CPU 64-128K D

2x128b

32K I

8 chan DMA

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

CPU

D

I DM A

Opt. 8M

B em

bedded DR

AM

External DRAM interface

External DRAM interface

External D

RA

M interface E

xter

nal D

RA

M in

terfa

ce

Master Processor

Comm Link Control

32 processors per 65nm chip 83 GFLOPS @ 7W

Page 75: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Green Flash Hardware Demo at SC08 and SC09

•  Demonstrated during SC ’08 and ‘09

•  Proof of concept –  CSU atmospheric model ported

to Tensilica Architecture –  Single Tensilica processor

running atmospheric model at 50MHz

•  Emulation performance advantage –  Processor running at 50MHz

vs. Functional model at 100 kHz

–  500x Speedup •  Actual code running - not

representative benchmark

Page 76: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Silicon Photonics for Energy-Efficient Communication

•  Silicon photonics enables optics to be integrated with conventional CMOS

•  Enables up to 27x improvement in communication energy efficiency!

Silicon Photonic Ring Resonator

Page 77: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

The Cat is out of the Bag: Cortical Simulations with 109 neurons, 1013 synapses

Gordon Bell Prize at SC09

Authors: Rajagopal Ananthanarayanan, Steven K. Esser, Horst D. Simon, and Dharmendra S. Modha

Machine: LLNL / NNSA Dawn Blue Gene/P

From bits to buildings … to brains

Page 78: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

From brain to model

Pyramidal

Fast spiking

Page 79: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

LLNL Dawn BG/P May, 2009

Human 22 x 109 220 x 1012

Rat 56 x 106 448 x 109

Mouse 16 x 106 128 x 109

N: S:

Monkey 2 x 109 20 x 1012

Cat 763 x 106 6.1 x 1012

Almaden BG/L December, 2006

Watson BG/L April, 2007

WatsonShaheen BG/P March, 2009

BlueGene Meets Brain

Latest simulations achieve unprecedented scale of 109 neurons and 1013 synapses

New results for SC09

Page 80: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Inevitable?

Page 81: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

The Power Conundrum

•  Unless there are new breakthroughs an Exaflops computer in 2019 will consume 120 MW

•  The human brain operates at 20 – 40 W •  We will need another factor of 1 M to

match the energy efficiency of the human brain

Page 82: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Summary •  Power consumption is a huge problem in HPC

– “Bits”: we may not be able to scale to Exaflops without new technologies •  In particular need technology investment in

memory •  Be prepared for low byte/Flop ratios

– “Buildings”: we may have to spend more $$ on infrastructure and less on computing •  Get ready for cloud computing as power

consumption is going to change economics of computing

Page 83: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Extra Slides

Page 84: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Outline

1. Power consumption has become an industry-wide issue for computing

2. Building and computer room energy efficiency

3. Computer architecture for energy efficiency- the Green Flash project

4. Future

Page 85: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Consumer Electronics Convergence

Page 86: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Consumer Electronics Convergence

Page 87: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Consumer Electronics has Replaced PCs as the Dominant Market Force in CPU Design!!

Page 88: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Consumer Electronics has Replaced PCs as the Dominant Market Force in CPU Design!!

Page 89: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Consumer Electronics has Replaced PCs as the Dominant Market Force in CPU Design!!

Apple Introduces

IPod"

IPod+ITunes exceeds 50% of Appleʼs Net Profit"

Apple Introduces Cell Phone

(iPhone)"

Page 90: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Power  Efficiency  related  to  Processors  

0  

100  

200  

300  

400  

500  

600  

Power  Efficien

cy  [M

Flop

s/Wa>

]  

(8+1)  core  

QC  embedded  

Quadcore  

DC  embedded  

Dualcore  

Page 91: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Extrapolating to Exaflop/s in 2018

Source: David Turek, IBM

Page 92: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Presented at STF Workshop, Sept. 2008 by Bill Camp, Intel

Page 93: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Power Ranking and How Not to do it!

•  To rank objects by “size” one needs extensive properties: –  Weight or Volume –  Rmax (TOP500)

•  A ‘larger’ system should have a larger Rmax. •  The ratio of 2 extensive properties is an intensive one:

–  (weight/volumne = density) –  Performance / Power Consumption = Power_efficiency

•  One can-not ‘rank’ objects with densities BY SIZE: –  Density does not tell anything about size of an object –  A piece of lead is not heavier or larger than one piece of wood.

•  Linpack (sub-linear) / Power (linear) will always sort smaller systems before larger ones!

31st List / June 2008

Page 94: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

The Transition to Low-Power Technology is Inevitable

•  Information “factories” are only affordable for a few government labs and large commercial companies (Google, MSN, Yahoo …)

–  Midrange installations will soon hit the 1 - 2 MW wall, requiring costly new installations

–  Economics will change if operating expenses of a server exceed acquisition cost

•  The industry will switch to low-power technology within 2 - 3 years

•  Embedded processors or game processors will be the next step (BG, Cell, Nvidia, SiCortex, Tensilica)

–  Example RR, first Petaflops system

Does it make sense to build systems that require the electric power equivalent of an aluminum smelter?

Page 95: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Absolute Power Levels

31st List / June 2008

Page 96: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Power Efficiency related to Processors

31st List / June 2008

Page 97: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Frequencies and Power Efficiency

31st List / June 2008

Power rating is 80 Watts each!

Page 98: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Most Power Efficient Systems

31st List / June 2008

Page 99: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Convergence of Platforms – Multiple parallel general-purpose processors (GPPs) – Multiple application-specific processors (ASPs)

“The Processor is the new

Transistor” [Rowen]

Intel 4004 (1971): 4-bit processor, 2312 transistors,

~100 KIPS, 10 micron PMOS,

11 mm2 chip

1000s of processor cores per

die

Sun Niagara 8 GPP cores (32 threads)

Intel® XScale

™ Core 32K IC 32K DC

MEv2 10

MEv2 11

MEv2 12

MEv2 15

MEv2 14

MEv2 13

Rbuf 64 @ 128B

Tbuf 64 @ 128B Hash

48/64/128

Scratch 16KB

QDR SRAM

2

QDR SRAM

1

RDRAM 1

RDRAM 3

RDRAM 2

G A S K E T

PCI

(64b) 66

MHz

S P I 4 or C S I X

Stripe

E/D Q E/D Q

QDR SRAM

3 E/D Q

MEv2 9

MEv2 16

MEv2 2

MEv2 3

MEv2 4

MEv2 7

MEv2 6

MEv2 5

MEv2 1

MEv2 8

CSRs -Fast_wr -UART -Timers

-GPIO -BootROM/SlowPort

QDR SRAM

4 E/D Q

Intel Network Processor 1 GPP Core

16 ASPs (128 threads)

IBM Cell 1 GPP (2 threads)

8 ASPs

Picochip DSP 1 GPP core 248 ASPs

Cisco CRS-1 188 Tensilica GPPs

Page 100: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

BG/L—the Rise of the Embedded Processor

TOP 500 Performance by Architecture

1

10

100

1000

10000

100000

1000000

10000000 MPP SMP Cluster Constellations Single Processor SIMD Others MPP embedded

Agg

rega

te R

max

(Tflo

p/s)

Page 101: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Summary (1)

•  LBNL has taken a comprehensive approach to the power in computing problem –  Component level (investigate use of low-power

components and build new system) –  System level (measuring and understanding energy

consumption of system –  Computer Room level (understand airflow and cooling

technology) –  Building Level (enforce rigorous energy standards in

new computer building and use of innovative energy savings technology)

Page 102: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Summary (2)

•  Economic factors are driving us already to more energy efficient solutions in computing

•  Incremental improvements are well on track, but we may ultimately need revolutionary new technology to reach the Exaflop/s level and beyond

Page 103: From Bits to Buildings: Energy Efficiency and the Path to Exaflops · 2010-05-13 · From Bits to Buildings: Energy Efficiency and the Path to Exaflops Horst D. Simon Lawrence Berkeley

Outline

1. Power consumption has become an industry-wide issue for computing

2. Building and computer room energy efficiency

3. Computer architecture for energy efficiency- the Green Flash project

4. Towards a better understanding of “green computing”