Relying on Autonomous Multipath Routing to Achieve Inter-Domain Load Balancing in the Internet Robert Löfman Ph.D Student


Page 1: Relying on Autonomous Multipath Routing to Achieve Inter-Domain Load Balancing in the Internet

Relying on Autonomous Multipath Routing to Achieve Inter-Domain Load Balancing in the Internet

Robert Löfman, Ph.D. Student

Page 2: Relying on Autonomous Multipath Routing to Achieve Inter-Domain Load Balancing in the Internet

The Goal of the Research

• Provide better QoS (not guaranteed), end-to-end, by traffic engineering

– Means: Multipath Routing

– Focus: Throughput

– Can improve performance of
    • apps. insensitive to packet reordering, but also
    • Video-on-demand (if there is time to wait for buffering when packets are reordered)
    • A/V conferencing apps. (if the packet reordering rate is acceptable)

Multipath routing = some or all routers can forward to more than one nexthop

Page 3: Relying on Autonomous Multipath Routing to Achieve Inter-Domain Load Balancing in the Internet

Some Other Traffic Engineering Techniques:

• Queuing policies (needed if strict QoS is to be guaranteed)
    – DiffServ: a class of packets receives preferential treatment
    – IntServ: resources reserved on demand
    – Permanent packet scheduling (Priority Q, Fair Q, Weighted FQ, ...)
    – Drop schemes (Random Early Detection)

• Source Routing
    – MPLS
        • Fast layer-2 forwarding, based on virtual circuits.
        • Enables better addressability and discrimination of routes.

• Over-provisioning
    – Have so much hardware that capacity cannot run out

Page 4: Relying on Autonomous Multipath Routing to Achieve Inter-Domain Load Balancing in the Internet

Background - The Internet’s Infrastructure

• Intra-Domain (Autonomous System, AS)
    – A network of a single organization
    – Private routing (RIP, OSPF, EIGRP, ...)
    – Spans a local region.

• Inter-Domain
    – The routes between ASes.
    – Routing domain is global.
    – Border Gateway Protocol (BGP)

Page 5: Relying on Autonomous Multipath Routing to Achieve Inter-Domain Load Balancing in the Internet

Background - The Internet’s Infrastructure

[Figure: diagram of the Internet's infrastructure with the labels Inter-Domain, Intra-Domain and AS.]

Page 6: Relying on Autonomous Multipath Routing to Achieve Inter-Domain Load Balancing in the Internet

Intra-domain Multipath Schemes

• EIGRP, proportional routing = traffic is dispersed proportionally to metrics
• OSPF Equal-Cost MPR: shares load on equal optimal routes.
• OSPF Optimized MPR: disseminates load levels and can reroute around congestion.
• MP-extended Link State and Distance Vector: calculate and address the k-best paths.
• “A survey of multipath routing for traffic engineering”, Gyu Myoung Lee, Jin Seek Choi.
• MultiPath Algorithm (MPA)
• Discount Shortest Path Algorithm (DSPA)
• Capacity Removal Algorithm (CRA)
• Multipath Distance Vector Algorithm (MDVA)
• Multipath Partial Dissemination Algorithm (MPDA)
• Quality Multiple Partial Dissemination Algorithm (QMPDA)
• Diffusing Algorithm for Shortest Multipath (DASM)
• MPATH
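The difference between proportional routing and equal-cost sharing can be sketched as below. This is only an illustration: the function names are hypothetical, metrics are assumed to be positive integers where lower is better, and the proportional split is assumed to be inversely proportional to the metric.

```python
from fractions import Fraction

def proportional_shares(metrics):
    """Traffic shares inversely proportional to each path metric.

    Assumption: a lower metric means a better path, so a path with half
    the metric receives twice the traffic.
    """
    inverse = [Fraction(1, m) for m in metrics]
    total = sum(inverse)
    return [w / total for w in inverse]

def equal_cost_shares(metrics):
    """Equal shares over the paths that tie for the lowest metric; others get 0."""
    best = min(metrics)
    winners = [m == best for m in metrics]
    count = sum(winners)
    return [Fraction(1, count) if w else Fraction(0) for w in winners]

# Two paths, the second with twice the metric of the first.
print(proportional_shares([10, 20]))
# Three paths, two of which tie for the best metric.
print(equal_cost_shares([10, 10, 20]))
```

With proportional routing every path carries some traffic, whereas equal-cost sharing uses only the paths that are tied for best.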

Page 7: Relying on Autonomous Multipath Routing to Achieve Inter-Domain Load Balancing in the Internet

Inter-domain Multipath Routing

• In use:
    – None; only BGP is used.
        • BGP installs only a single route.
        • Multihomed ASes can have some basic TE by policy

• Some proposed methods for enabling MPR in BGP:
    – Dynamic egress-router selection for a multihomed AS
    – Advertisement of multiple AS-paths, and inter-domain source routing by means of hashed IDs, which are based on the AS-paths.
    – Advertisement of the min. SLA-guaranteed BW that the path supports, so that paths can be discriminated
    – Overlay networks
        • Probe for “better” routes to destinations via other overlay nodes

Page 8: Relying on Autonomous Multipath Routing to Achieve Inter-Domain Load Balancing in the Internet

The Problem

• Initial research:
    – “Can throughput be expected to improve by letting intermediate routers disperse traffic arbitrarily on possibly joint routes, without cross-domain routing?”

• If so,
    – no changes to routing protocols would be needed in order to obtain better performance
    – the hierarchical structure of the Internet may be preserved.

• Biggest obstacle: the intra-domain routes are not visible to the inter-domain.
    – Proportional routing is impossible.
    – Routers might disperse too much traffic on a low-BW route.

• An answer was sought by simulation.
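The risk of dispersing too much traffic on a low-BW route can be shown with a small calculation. The sketch below is not the simulation model: `delivered_rate` is a hypothetical helper, the capacities are made-up numbers, and traffic beyond a path's capacity is simply assumed to be lost.

```python
def delivered_rate(send_rate, capacities, shares):
    """Rate that gets through when send_rate is split over paths by shares.

    Each path delivers at most its own capacity; the excess is lost.
    """
    return sum(min(send_rate * s, c) for s, c in zip(shares, capacities))

capacities = [100.0, 10.0]   # hypothetical residual bandwidths in Mb/s
blind = [0.5, 0.5]           # equal split, because the capacities are not visible
proportional = [c / sum(capacities) for c in capacities]

print(delivered_rate(100.0, capacities, blind))         # low-BW path is overloaded
print(delivered_rate(100.0, capacities, proportional))  # split matches capacities
print(delivered_rate(100.0, [100.0], [1.0]))            # single path for comparison
```

In this made-up case the blind split delivers less than the single best path alone, which is the concern behind the question the simulation tries to answer.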

Page 9: Relying on Autonomous Multipath Routing to Achieve Inter-Domain Load Balancing in the Internet

Simulation – Set-up

• OPNET Simulator

• Partial Internet Routes
    – Inter/Intra-domains and LANs (BGP and EIGRP)
    – Fictional (generated) and real WANs

• Full end-to-end path diversity:
    – Intra-domains may have multiple routes to multiple egress points.
    – Inter-domain routers also use MPR (tweaked BGP)

• Paths may have joint links

Page 10: Relying on Autonomous Multipath Routing to Achieve Inter-Domain Load Balancing in the Internet

Simulation – Set-up

• Studied:

    – MultiPath Routing 2 and 3 (MPR-2, MPR-3)
        • MPR-2: all routers forward packets to 2 nexthops, if possible (MPR-3: to 3 nexthops)
        • Realized by allowing 2 or 3 sub-optimal routes in EIGRP in each AS.

    – Compared to: SinglePath Routing (SPR)
        • Every router forwards to one nexthop
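How a router might end up with one, two or three nexthops can be sketched with a variance-style rule: a route is eligible if its metric is within a configured multiple of the best metric. This is a simplified illustration, not EIGRP itself; it ignores EIGRP's feasibility condition, and the function name, route table and metric values are hypothetical.

```python
def select_nexthops(routes, variance, max_paths):
    """Pick up to max_paths nexthops whose metric is within variance x best.

    routes maps a nexthop name to its metric (lower is better).
    """
    best = min(routes.values())
    eligible = [(metric, hop) for hop, metric in routes.items()
                if metric <= variance * best]
    eligible.sort()  # best metric first
    return [hop for _, hop in eligible[:max_paths]]

routes = {"A": 100, "B": 150, "C": 400}   # made-up metrics

print(select_nexthops(routes, variance=1, max_paths=1))  # SPR-like behaviour
print(select_nexthops(routes, variance=2, max_paths=2))  # MPR-2-like behaviour
print(select_nexthops(routes, variance=5, max_paths=3))  # MPR-3-like behaviour
```

A larger variance admits worse routes, which increases path diversity but also the chance of forwarding loops.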

Page 11: Relying on Autonomous Multipath Routing to Achieve Inter-Domain Load Balancing in the Internet

Simulation – Set-up

• Reordered packets can wrongly invoke fast retransmit (FR).

• Avoided by an increased Maximum Segment Size (MSS, 15000 B) => decreased packet reordering at the TCP layer

[Figure: Number of false fast retransmissions. Y-axis: number of retransmissions (0 to 45); x-axis ticks from 5K to 13K.]

Note: With fast retransmit turned off in TCP Reno, no retransmissions occurred => all invocations of FR were false.
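Why reordering alone can trigger fast retransmit can be seen from a toy model of cumulative ACKs. The sketch below is an assumption-laden simplification (whole segments numbered 0, 1, 2, ..., one ACK per arriving segment, the usual threshold of three duplicate ACKs, no loss); the function name is hypothetical.

```python
def count_fast_retransmits(arrival_order, dupack_threshold=3):
    """Count fast retransmits triggered at the sender for an arrival order.

    arrival_order lists segment numbers as they reach the receiver. The
    receiver answers each arrival with a cumulative ACK naming the next
    segment it expects; the sender retransmits when it has seen
    dupack_threshold duplicates of the same ACK. No segment is lost here,
    so every retransmit counted is a false one.
    """
    received = set()
    next_expected = 0
    last_ack = 0        # the sender already knows segment 0 is expected
    dupacks = 0
    retransmits = 0
    for seg in arrival_order:
        received.add(seg)
        while next_expected in received:
            next_expected += 1
        if next_expected == last_ack:
            dupacks += 1
            if dupacks == dupack_threshold:
                retransmits += 1
        else:
            last_ack = next_expected
            dupacks = 0
    return retransmits

print(count_fast_retransmits([0, 1, 2, 3, 4]))  # in order
print(count_fast_retransmits([1, 0, 2, 3, 4]))  # mild reordering
print(count_fast_retransmits([1, 2, 3, 0, 4]))  # segment 0 overtaken three times
```

Only the last arrival order produces three duplicate ACKs, so a segment has to be overtaken by at least three others before the sender retransmits it needlessly.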

Page 12: Relying on Autonomous Multipath Routing to Achieve Inter-Domain Load Balancing in the Internet

Simulation – Set-up

• Large MSS (15000 bytes):
    – Forces fragmentation => increased chance that every segment has fragments which traverse the “slowest” path
    – Increases buffering at the IP layer as segments “wait” for delayed fragments => smooths the segment delay deviation (the 15K MSS had an order of magnitude less deviation)
    – Now it is possible to examine TCP over MPR beyond the FR problem.

• Infinite receive buffers assumed

• Last Mile-link capacity 1Gb/s
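The effect of the large MSS on fragmentation can be estimated with a short calculation. The sketch assumes values not given in the slides: a 1500-byte MTU, 20-byte IP and TCP headers without options, and fragments that each pick a path independently and uniformly at random. The function names are hypothetical.

```python
import math

def fragments_per_segment(mss, mtu=1500, ip_header=20, tcp_header=20):
    """Number of IP fragments needed to carry one TCP segment of mss bytes."""
    # Fragment payloads (except the last) must be a multiple of 8 bytes.
    payload_per_fragment = (mtu - ip_header) // 8 * 8
    return math.ceil((mss + tcp_header) / payload_per_fragment)

def p_touches_slowest(num_fragments, num_paths):
    """Probability that at least one fragment takes the slowest of num_paths,
    assuming independent, uniformly random path choice per fragment."""
    return 1 - (1 - 1 / num_paths) ** num_fragments

k = fragments_per_segment(15000)
print(k)                        # fragments for a 15000-byte MSS
print(p_touches_slowest(k, 2))  # two paths
print(p_touches_slowest(k, 3))  # three paths
```

Under these assumptions nearly every segment has at least one fragment on the slowest path, which is consistent with the smoothing of the segment delay deviation reported above.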

Page 13: Relying on Autonomous Multipath Routing to Achieve Inter-Domain Load Balancing in the Internet

Simulation – Set-up

– Background traffic
    • Packet size of 40 B, 570 B or 1500 B, chosen randomly
    • Measurements taken at 0, 25, 50, 75, 90, 95 and 96 percent background load and averaged.

– Delay metrics set equal.
– BW metric set to the true BW of the link.

– The allowed variance of sub-optimal routes was kept low to avoid loops (loops were detected when TTL values dropped to 0)

Page 14: Relying on Autonomous Multipath Routing to Achieve Inter-Domain Load Balancing in the Internet

Simulation – Results, UDP over MPR

• Starting point: UDP, because it does not constrain its transmission.

[Figure: Throughput as a function of background load (0 % to 96 %) for SPR, MPR-2 and MPR-3. Transmission rate is 300 million pkt/h. MPR performs better.]

[Figure: Throughput as a function of background load (0 % to 96 %) for SPR, MPR-2 and MPR-3. Transmission rate is only 200 million pkt/h of size 1500 B each.]

• Indication:
    – MPR performs best when the transmission rate is high or the residual BW is low.
        • Sending must be at a rate which the SP can't handle (Trans_Rate / Res_BW_SP > 1).
        • Dispersing packets while the SP has enough BW increases the delay.
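The condition Trans_Rate / Res_BW_SP > 1 can be checked numerically. The sketch below converts the packet rates quoted for the throughput graphs into bit/s and compares them with the residual bandwidth of the single path. Several values are assumptions for illustration: a 1500-byte packet size for both rates, a 1 Gb/s single-path capacity, and residual bandwidth modelled as capacity times (1 - background load). The function names are hypothetical.

```python
def pkt_per_hour_to_bps(pkts_per_hour, pkt_size_bytes):
    """Convert a packet rate in packets/hour to bits/second."""
    return pkts_per_hour * pkt_size_bytes * 8 / 3600

def sp_is_overloaded(trans_rate_bps, sp_capacity_bps, background_load):
    """True when Trans_Rate / Res_BW_SP > 1 for the single path."""
    residual = sp_capacity_bps * (1 - background_load)
    return residual <= 0 or trans_rate_bps / residual > 1

high = pkt_per_hour_to_bps(300_000_000, 1500)
low = pkt_per_hour_to_bps(200_000_000, 1500)
print(high, low)

for load in (0.0, 0.25, 0.5, 0.75):
    print(load, sp_is_overloaded(low, 1e9, load))
```

With these assumed numbers the lower rate only exceeds the residual bandwidth once the background load is high enough, which matches the indication that MPR helps when the transmission rate is high or the residual BW is low.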

Page 15: Relying on Autonomous Multipath Routing to Achieve Inter-Domain Load Balancing in the Internet

Simulation – Results, UDP over MPR

• Throughput improvement possible,
    – Good news for transferring data which is to be used after complete reception

• but
    – This might not be the throughput perceived by real-time applications due to packet reordering

[Figure: Standard deviation of RTT as a function of background load (0 % to 96 %) for SPR, MPR-2 and MPR-3. MPR deviates more.]

Page 16: Relying on Autonomous Multipath Routing to Achieve Inter-Domain Load Balancing in the Internet

Simulation – Results, UDP over MPR

• In-sequence BW = the rate of packets/s that arrive in order.
• Out-of-sequence BW = the rate of packets/s that arrive, regardless of order.

• Especially important for real-time applications.

• If all paths in MPR have a BW greater than an nth of the BW of the SP:
    => it is possible to transmit more than n packets in parallel before n packets are transmitted on the SP.
    => the in-sequence BW of MPR would also be greater than that of SPR.
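One way to measure the two rates from a packet trace is sketched below. The definition of "in order" used here is an assumption, since the slides do not spell it out: a packet counts as in-sequence if its sequence number is higher than that of every packet that arrived before it. The function name and the trace are hypothetical.

```python
def arrival_rates(arrivals):
    """Return (in_sequence_rate, total_rate) in packets/s.

    arrivals is a list of (arrival_time_s, sequence_number) sorted by
    time and spanning a non-zero duration.
    """
    duration = arrivals[-1][0] - arrivals[0][0]
    highest = -1
    in_sequence = 0
    for _, seq in arrivals:
        if seq > highest:       # nothing newer has arrived before it
            in_sequence += 1
            highest = seq
    return in_sequence / duration, len(arrivals) / duration

# Made-up trace: packet 1 is overtaken by packet 2.
trace = [(0.0, 0), (1.0, 2), (2.0, 1), (3.0, 3), (4.0, 4)]
print(arrival_rates(trace))
```

The gap between the two numbers is the share of throughput that a real-time application may not be able to use.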

Page 17: Relying on Autonomous Multipath Routing to Achieve Inter-Domain Load Balancing in the Internet

Simulation – Results, TCP over MPR

• MPR always performs worse

[Figure: Transfer time of a file (seconds) as a function of background load (0 % to 96 %) for SPR, MPR-2 and MPR-3.]

Note, no false “fast retransmits”.

Page 18: Relying on Autonomous Multipath Routing to Achieve Inter-Domain Load Balancing in the Internet

Simulation – Results, TCP over MPR

• SPR always has faster CWIND growth, and its CWIND grows to a much greater end value.
• The reason for a slowly growing or small window must be that it takes a long time to transmit the packet bursts which the CWIND allows.

[Figure: Several congestion windows (in thousands) as a function of time (seconds) for SPR, MPR-2 and MPR-3. Graphs are clustered according to background load (0 %, 25 %, 50 %, 75 %, 90 %, 95 %, 96 %). X values begin at 0 for every cluster. This is a representative sample.]

Page 19: Relying on Autonomous Multipath Routing to Achieve Inter-Domain Load Balancing in the Internet

Simulation – Results, TCP over MPR

• The reason for the small and slowly growing MPR CWIND:
    – MPR has a longer fragment delay with heavy delay deviation.
        => Segments are passed at a slow rate to TCP as they “wait” for delayed fragments

[Figure: Average fragment delay (seconds) as a function of background load (0 % to 96 %) for SPR, MPR-2 and MPR-3.]

[Figure: Standard deviation of fragment delay as a function of background load (0 % to 96 %) for SPR, MPR-2 and MPR-3.]

Page 20: Relying on Autonomous Multipath Routing to Achieve Inter-Domain Load Balancing in the Internet

Simulation – Results, TCP over MPR

Slower transmission rate for MPR, due to the slowly growing CWIND holding the transmission rate back.

[Figure: Sent segment sequence numbers (in thousands) as a function of time (seconds) for SPR, MPR-2 and MPR-3, clustered by background load.]

Page 21: Relying on Autonomous Multipath Routing to Achieve Inter-Domain Load Balancing in the Internet

Simulation – Results, TCP over MPR

MPR experiences more pauses in transmission since the CWIND chokes the transmission more often.

[Figure: Cumulative pause time due to a full CWIND as a function of background load (0 % to 96 %) for SPR, MPR-2 and MPR-3.]

Page 22: Relying on Autonomous Multipath Routing to Achieve Inter-Domain Load Balancing in the Internet

Simulation – Interpretation of Results, TCP over MPR

1. In the beginning, the transmission is very dependent on a small delay in order to get the CWIND opened quickly (the CWIND portions the transmission into small bursts).

2. MPR: Too many fragments are put on high-delay paths, as there is no proportional forwarding. Segments “wait” for delayed fragments (although the path has a large BW, which isn't needed at this point).
    => Burst transmission takes longer => slower CWIND growth
    => Vicious circle: future allowed bursts are also smaller (compared to SPR)

– SPR: Gets the CWIND opened quickly and can therefore transmit large bursts, which allow even larger future bursts.

– Need to get the in-sequence BW higher in order for MPR to be successful.
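The dependence of early CWIND growth on delay can be illustrated with an idealized slow-start model. The sketch assumes the window doubles once per round trip and that no loss occurs; the function name and the two round-trip times are made-up values for illustration, not simulation results.

```python
def time_to_reach_window(target_segments, rtt_s, initial_segments=1):
    """Seconds of slow start before CWIND reaches target_segments,
    assuming CWIND doubles once per round trip and nothing is lost."""
    rounds = 0
    cwnd = initial_segments
    while cwnd < target_segments:
        cwnd *= 2
        rounds += 1
    return rounds * rtt_s

short_delay = time_to_reach_window(64, rtt_s=0.05)  # hypothetical single-path RTT
long_delay = time_to_reach_window(64, rtt_s=0.07)   # hypothetical multipath RTT
print(short_delay, long_delay)
```

Because the number of round trips is fixed by the target window, any extra delay per burst translates directly into a later window opening, which is the vicious circle described in point 2.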

Page 23: Relying on Autonomous Multipath Routing to Achieve Inter-Domain Load Balancing in the Internet

Future Work

• Goal: Provide inter-domain proportional forwarding without cross-domain routing

• Possible solution algorithm:
    – At the creation of a new flow, use SPR
    – Probe for the highest possible throughput
    – Raise the MPR-X value, in order to let packets also take sub-optimal routes.
    – Then probe for a new throughput high, and so on...
    – Mark the packets of every probe with an ID, and require routers to always disperse packets with the same ID to the same nexthops
        => the transmission rate increment from every probe is always put on the same paths (might not scale due to the need for per-flow record-keeping)
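The probing steps above can be sketched roughly as follows. This is one possible reading of the proposal, not a worked-out design: `probe_for_mpr_level`, the `measure_throughput` callback, the `Router` class and its record table are all hypothetical names, and stopping at the first probe that brings no improvement is an assumption.

```python
def probe_for_mpr_level(measure_throughput, max_level):
    """Start with SPR (level 1) and raise the MPR-X level while it helps.

    measure_throughput(level) is a hypothetical callback that probes with
    packets dispersed over `level` nexthops and returns the throughput.
    """
    level = 1
    best = measure_throughput(level)
    while level < max_level:
        candidate = measure_throughput(level + 1)
        if candidate <= best:
            break                      # no new throughput high, stop raising
        level, best = level + 1, candidate
    return level, best

class Router:
    """Keeps a record per probe ID so the same ID always uses the same nexthops."""

    def __init__(self, nexthops):
        self.nexthops = list(nexthops)  # ordered best first
        self.records = {}               # probe ID -> nexthops chosen for it

    def nexthops_for(self, probe_id, level):
        if probe_id not in self.records:
            self.records[probe_id] = self.nexthops[:level]
        return self.records[probe_id]

# Made-up throughput values per MPR level.
print(probe_for_mpr_level(lambda level: {1: 10, 2: 14, 3: 13}[level], 3))

router = Router(["a", "b", "c"])
print(router.nexthops_for(7, 2))
print(router.nexthops_for(7, 3))  # same ID, so the recorded nexthops are reused
```

The `records` dictionary is the per-flow state that the slide warns might not scale.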