26
CSE 222A: Computer Communication Networks George Porter Lecture 7: Data Center Networks Thanks: Alex C. Snoeren and Nick Feamster

Lecture 7: Data Center Networkscseweb.ucsd.edu/classes/fa15/cse222A-a/lectures/222A-wi...Cloud Computing" Elastic resources Expand and contract resources Pay-per-use Infrastructure

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Lecture 7: Data Center Networkscseweb.ucsd.edu/classes/fa15/cse222A-a/lectures/222A-wi...Cloud Computing" Elastic resources Expand and contract resources Pay-per-use Infrastructure

CSE 222A: Computer Communication Networks George Porter

Lecture 7:Data Center Networks"

Thanks: Alex C. Snoeren and Nick Feamster

Page 2: Lecture 7: Data Center Networkscseweb.ucsd.edu/classes/fa15/cse222A-a/lectures/222A-wi...Cloud Computing" Elastic resources Expand and contract resources Pay-per-use Infrastructure

Lecture 7 Overview"●  Project discussion

●  Data Centers overview

●  “Fat Tree” paper discussion

2 CSE 222A – Lecture 7: Data Center Networks"

Page 3: Lecture 7: Data Center Networkscseweb.ucsd.edu/classes/fa15/cse222A-a/lectures/222A-wi...Cloud Computing" Elastic resources Expand and contract resources Pay-per-use Infrastructure

Cloud Computing"●  Elastic resources

◆  Expand and contract resources ◆  Pay-per-use ◆  Infrastructure on demand

●  Multi-tenancy ◆  Multiple independent users ◆  Security and resource isolation ◆  Amortize the cost of the (shared) infrastructure

●  Flexible service management ◆  Resiliency: isolate failure of servers and storage ◆  Workload movement: move work to other locations

CSE 222A – Lecture 7: Data Center Networks" 3

Page 4: Lecture 7: Data Center Networkscseweb.ucsd.edu/classes/fa15/cse222A-a/lectures/222A-wi...Cloud Computing" Elastic resources Expand and contract resources Pay-per-use Infrastructure

Cloud Service Models"●  Software as a Service (Saas)

◆  Provider licenses applications to users as a service ◆  e.g., customer relationship management, email, … ◆  Avoid costs of installation, maintenance, patches, …

●  Platform as a Service (Paas) ◆  Provider offers software platform for building applications ◆  e.g., Google’s App-Engine ◆  Avoid worrying about scalability of platform

●  Infrastructure as a Service (Iaas) ◆  Provider offers raw computing, storage, and network ◆  e.g., Amazon’s Elastic Computing Cloud (EC2) ◆  Avoid buying servers and estimating resource needs

CSE 222A – Lecture 7: Data Center Networks" 4

Page 5: Lecture 7: Data Center Networkscseweb.ucsd.edu/classes/fa15/cse222A-a/lectures/222A-wi...Cloud Computing" Elastic resources Expand and contract resources Pay-per-use Infrastructure

Multi-Tier Applications"●  Applications consist of tasks

◆  Many separate components ◆  Running on different machines

●  Commodity computers ◆  Many general-purpose computers ◆  Not one big mainframe ◆  Easier scaling

5

Front end Server

Aggregator

Aggregator Aggregator … …!

Aggregator

Worker

…!

Worker Worker

…!

Worker Worker CSE 222A – Lecture 7: Data Center Networks"

Page 6: Lecture 7: Data Center Networkscseweb.ucsd.edu/classes/fa15/cse222A-a/lectures/222A-wi...Cloud Computing" Elastic resources Expand and contract resources Pay-per-use Infrastructure

Host Virtualization"

●  Multiple virtual machines on one physical machine ●  Applications run unmodified as on real machine ●  VM can migrate from one computer to another

CSE 222A – Lecture 7: Data Center Networks" 6

Page 7: Lecture 7: Data Center Networkscseweb.ucsd.edu/classes/fa15/cse222A-a/lectures/222A-wi...Cloud Computing" Elastic resources Expand and contract resources Pay-per-use Infrastructure

VMM Virtual Switches"

CSE 222A – Lecture 7: Data Center Networks" 7

Page 8: Lecture 7: Data Center Networkscseweb.ucsd.edu/classes/fa15/cse222A-a/lectures/222A-wi...Cloud Computing" Elastic resources Expand and contract resources Pay-per-use Infrastructure

Top-of-Rack Architecture"●  Rack of servers

◆  Commodity servers ◆  And top-of-rack switch

●  Modular design ◆  Preconfigured racks ◆  Power, network, and

storage cabling

●  Aggregate to the next level

CSE 222A – Lecture 7: Data Center Networks" 8

Page 9: Lecture 7: Data Center Networkscseweb.ucsd.edu/classes/fa15/cse222A-a/lectures/222A-wi...Cloud Computing" Elastic resources Expand and contract resources Pay-per-use Infrastructure

Extreme Modularity"●  Containers

●  Many containers

CSE 222A – Lecture 7: Data Center Networks" 9

Page 10: Lecture 7: Data Center Networkscseweb.ucsd.edu/classes/fa15/cse222A-a/lectures/222A-wi...Cloud Computing" Elastic resources Expand and contract resources Pay-per-use Infrastructure

10"

Data Center Challenges"●  Traffic load balance ●  Support for VM migration ●  Achieving bisection bandwidth ●  Power savings / Cooling ●  Network management (provisioning) ●  Security (dealing with multiple tenants)

CSE 222A – Lecture 7: Data Center Networks"

Page 11: Lecture 7: Data Center Networkscseweb.ucsd.edu/classes/fa15/cse222A-a/lectures/222A-wi...Cloud Computing" Elastic resources Expand and contract resources Pay-per-use Infrastructure

Data Center Costs"●  James Hamilton published basic 2008 breakdown

●  Servers: 45% ◆  CPU, memory, disk

●  Infrastructure: 25% ◆  UPS, cooling, power distribution

●  Power draw: 15% ◆  Electrical utility costs

●  Network: 15% ◆  Switches, links, transit

CSE 222A – Lecture 7: Data Center Networks" 11

Page 12: Lecture 7: Data Center Networkscseweb.ucsd.edu/classes/fa15/cse222A-a/lectures/222A-wi...Cloud Computing" Elastic resources Expand and contract resources Pay-per-use Infrastructure

Traditional DC Topology"Internet

Servers

Layer-2 switch Access

Data Center

Layer-2/3 switch Aggregation

Layer-3 router Core

CSE 222A – Lecture 7: Data Center Networks" 12

Page 13: Lecture 7: Data Center Networkscseweb.ucsd.edu/classes/fa15/cse222A-a/lectures/222A-wi...Cloud Computing" Elastic resources Expand and contract resources Pay-per-use Infrastructure

DC Network Requirements"●  Scalability

◆  Incremental build out?

●  Reliability ◆  Loop free forwarding

●  VM migration

●  Reasonable management burden ◆  Humans in the loop?

CSE 222A – Lecture 7: Data Center Networks" 13

Page 14: Lecture 7: Data Center Networkscseweb.ucsd.edu/classes/fa15/cse222A-a/lectures/222A-wi...Cloud Computing" Elastic resources Expand and contract resources Pay-per-use Infrastructure

●  Over subscription of links higher up in the topology

●  Tradeoff between cost and provisioning

●  Single point of failure

Traditional Topologies"

CSE 222A – Lecture 7: Data Center Networks" 14

Page 15: Lecture 7: Data Center Networkscseweb.ucsd.edu/classes/fa15/cse222A-a/lectures/222A-wi...Cloud Computing" Elastic resources Expand and contract resources Pay-per-use Infrastructure

Capacity Bottlenecks"

CR CR

AR AR AR AR

S S

S S

A A A …

S S

A A A …

. . .

S S

S S

A A A …

S S

A A A …

~ 5:1

~ 40:1

~ 200:1

CSE 222A – Lecture 7: Data Center Networks" 15

Page 16: Lecture 7: Data Center Networkscseweb.ucsd.edu/classes/fa15/cse222A-a/lectures/222A-wi...Cloud Computing" Elastic resources Expand and contract resources Pay-per-use Infrastructure

Management: L2 vs. L3"●  Ethernet switching (layer 2)

◆  Cheaper switch equipment ◆  Fixed addresses and auto-configuration ◆  Seamless mobility, migration, and failover

●  IP routing (layer 3) ◆  Scalability through hierarchical addressing ◆  Efficiency through shortest-path routing ◆  Multipath routing through equal-cost multipath

●  Data centers often connect layer-2 islands by IP routers

CSE 222A – Lecture 7: Data Center Networks" 16

Page 17: Lecture 7: Data Center Networkscseweb.ucsd.edu/classes/fa15/cse222A-a/lectures/222A-wi...Cloud Computing" Elastic resources Expand and contract resources Pay-per-use Infrastructure

Advantages of Layer 2"●  Certain monitoring apps require server with same role

to be on the same VLAN

●  Using same IP on dual homed servers

●  Allows organic growth of server farms

●  VM migration is easier

CSE 222A – Lecture 7: Data Center Networks" 17

Page 18: Lecture 7: Data Center Networkscseweb.ucsd.edu/classes/fa15/cse222A-a/lectures/222A-wi...Cloud Computing" Elastic resources Expand and contract resources Pay-per-use Infrastructure

Layer 2 Pods w/L3 Backbone"CR CR

AR AR AR AR . . .

S S

DC-Layer 3

Internet

S S

A A A …

S S

A A A …

. . .

DC-Layer 2

Key •  CR = Core Router (L3) •  AR = Access Router (L3) •  S = Ethernet Switch (L2) •  A = Rack of app. servers

~ 1,000 servers/pod == IP subnet

S! S! S! S!

S!S!

CSE 222A – Lecture 7: Data Center Networks" 18

Page 19: Lecture 7: Data Center Networkscseweb.ucsd.edu/classes/fa15/cse222A-a/lectures/222A-wi...Cloud Computing" Elastic resources Expand and contract resources Pay-per-use Infrastructure

FAT Tree-Based Solution"●  An all Layer-3 solution

●  Connect end-host together using a “fat-tree” topology ◆  Infrastructure consist of cheap devices

»  Each port supports same speed as endhost ◆  All devices can transmit at line speed if packets are distributed

along existing paths ◆  A k-port fat tree can support k3/4 hosts

CSE 222A – Lecture 7: Data Center Networks" 19

Page 20: Lecture 7: Data Center Networkscseweb.ucsd.edu/classes/fa15/cse222A-a/lectures/222A-wi...Cloud Computing" Elastic resources Expand and contract resources Pay-per-use Infrastructure

“Fat-Tree” Topology"

CSE 222A – Lecture 7: Data Center Networks" 20

Page 21: Lecture 7: Data Center Networkscseweb.ucsd.edu/classes/fa15/cse222A-a/lectures/222A-wi...Cloud Computing" Elastic resources Expand and contract resources Pay-per-use Infrastructure

Fat-Tree Challenges"●  Layer 3 will only use one of the existing equal cost

paths ●  Packet re-ordering occurs if layer 3 blindly takes

advantage of path diversity ◆  E.g., ECMP

CSE 222A – Lecture 7: Data Center Networks" 21

Page 22: Lecture 7: Data Center Networkscseweb.ucsd.edu/classes/fa15/cse222A-a/lectures/222A-wi...Cloud Computing" Elastic resources Expand and contract resources Pay-per-use Infrastructure

Modified Fat Tree"●  Enforce special addressing scheme in DC

◆  Allows host attached to same switch to route only through switch

◆  Allows inter-pod traffic to stay within pod ◆  unused.PodNumber.switchnumber.Endhost

●  Use two level look-ups to distribute traffic and maintain packet ordering.

CSE 222A – Lecture 7: Data Center Networks" 22

Page 23: Lecture 7: Data Center Networkscseweb.ucsd.edu/classes/fa15/cse222A-a/lectures/222A-wi...Cloud Computing" Elastic resources Expand and contract resources Pay-per-use Infrastructure

Two-Level Lookups"●  First level is prefix lookup

◆  Used to route down the topology to endhost

●  Second level is a suffix lookup ◆  Used to route up towards core ◆  Diffuses and spreads out traffic ◆  Maintains packet ordering by using the same ports for the

same endhost

CSE 222A – Lecture 7: Data Center Networks" 23

Page 24: Lecture 7: Data Center Networkscseweb.ucsd.edu/classes/fa15/cse222A-a/lectures/222A-wi...Cloud Computing" Elastic resources Expand and contract resources Pay-per-use Infrastructure

Diffusion Optimizations"●  Flow classification

◆  Eliminates local congestion ◆  Assign to traffic to ports on a per-flow basis instead of a per-

host basis

●  Flow scheduling ◆  Eliminates global congestion ◆  Prevent long lived flows from sharing the same links ◆  Assign long lived flows to different links

CSE 222A – Lecture 7: Data Center Networks" 24

Page 25: Lecture 7: Data Center Networkscseweb.ucsd.edu/classes/fa15/cse222A-a/lectures/222A-wi...Cloud Computing" Elastic resources Expand and contract resources Pay-per-use Infrastructure

Discussion"●  VM mobility

●  Data center build out

●  Internet connectivity

●  Address space allocation ◆  Doing flow placement anyway?

CSE 222A – Lecture 7: Data Center Networks" 25

Page 26: Lecture 7: Data Center Networkscseweb.ucsd.edu/classes/fa15/cse222A-a/lectures/222A-wi...Cloud Computing" Elastic resources Expand and contract resources Pay-per-use Infrastructure

For Next Class…"

●  Read Jupiter Rising paper

●  Work on project proposals ◆  Details available on the

course Web page

26 CSE 222A – Lecture 7: Data Center Networks"

Rising