30
Network Distance Estimation Sridhar Srinivasan Network Distance Estimation – p. 1

Network Distance Estimation

  • Upload
    others

  • View
    8

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Network Distance Estimation

Network Distance EstimationSridhar Srinivasan

Network Distance Estimation – p. 1

Page 2: Network Distance Estimation

Motivation

Network Distance Estimation – p. 2

Page 3: Network Distance Estimation

Network DistanceWhat is network distance?

Latency

Why do we need network distance?Identification of nearest server(s) for games,downloadsLocating peers in an overlay network

Why not use ping or traceroute?Need low overheadCannot estimate distance between arbitrary hosts

Network Distance Estimation – p. 3

Page 4: Network Distance Estimation

TechniquesInfrastructure based

Require some infrastructure to be setupEstimate distance based on delay measurementsExamples: IDMaps, M-Coop

Coordinate basedRequire only a small set of beacons or landmarksHosts are assigned coordinatesDistance is estimated as a function of coordinatesExamples: Global Network Positioning, InternetCoordinate System, Virtual Landmarks

Using existing infrastructureUses existing Internet-wide infrastructureExample : King, uses DNS

Network Distance Estimation – p. 4

Page 5: Network Distance Estimation

Direct Measurement Based

Network Distance Estimation – p. 5

Page 6: Network Distance Estimation

TriangulationS. Hotz, “Routing information organization to supportscalable interdomain routing with heterogeneous pathrequirements”, Ph.D. Thesis.

Select N nodes in the network to be Beacons

Coordinate of node H assigned as tuple of distances toeach of the Beacons, e.g., (dHB1

, dHB2, . . . , dHBN

)

Distance D between any two hosts H1 and H2 isbounded by

maxi∈{1,2,...,N} |dH1Bi− dH2Bi

| ≤ D

and

D ≤ mini∈{1,2,...,N}(dH1Bi+ dH2Bi

)

Network Distance Estimation – p. 6

Page 7: Network Distance Estimation

IDMapsP. Francis, S. Jamin, C. Jin„ D. Raz, Y. Shavitt, L. Zhang,“IDMaps: A Global Internet Host Distance EstimationService”, IEEE/ACM Trans. on Networking, Oct. 2001

Hosts are aggregated into Address Prefixes (APs),consecutive range of IP addresses within which allhosts are equidistant to the rest of the Internet.

Systems, called Tracers, are placed such that each APis close to one or more Tracers

Tracers measure distance between each other and totheir closest APs.

Distance computed as sum of distance of each AP toclosest Tracer and distance between these Tracers

Network Distance Estimation – p. 7

Page 8: Network Distance Estimation

IDMaps Operation

� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �

� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �

� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �

� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �

� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �

� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �

� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �

� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �

Address Prefix

T

T

T

T

T

IDMaps Clients gather distance information from Tracersand build a virtual distance map of the Internet

Network Distance Estimation – p. 8

Page 9: Network Distance Estimation

IDMaps Operation

� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �

� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �

� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �

� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �

� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �

� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �

� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �

� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �

Address Prefix

T

T

T

T

T

query (A,B)

A

B

To get distance infomation between any pair of hosts, theIDMaps Client is queried

Network Distance Estimation – p. 8

Page 10: Network Distance Estimation

IDMaps Operation

� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �

� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �

� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �

� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �

� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �

� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �

� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �

� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �

Address Prefix

T

T

T

T

T

A

B

Client runs shortest path algorithm on its distance map tocompute required distance

Network Distance Estimation – p. 8

Page 11: Network Distance Estimation

IDMaps Operation

� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �

� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �

� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �

� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �

� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �

� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �

� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �� � � � � � �

� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �� � � � � � � �

Address Prefix

T

T

T

T

T

A

B

reply (150ms)

Results are returned to querying host

Network Distance Estimation – p. 8

Page 12: Network Distance Estimation

KingK. P. Gummadi, S. Saroiu, S. D. Gribble., “King:Estimating latency between arbitrary Internet endhosts”, IMW 2002

Designed as a tool to estimate latencies betweenarbitrary Internet hosts

Uses DNS architecture for performing queries

No offline extrapolation from measured paths

Recursive DNS queries are used to measure latencybetween pairs of DNS servers closest to the hosts

Network Distance Estimation – p. 9

Page 13: Network Distance Estimation

Operation

Network Distance Estimation – p. 10

Page 14: Network Distance Estimation

Potential IssuesAssumes that most end hosts are close to their DNSname servers

Depends on Name Servers performing recursivequeries for arbitrary hosts

Closest authoritative server is picked using heuristicssuch as matching hostnames or IP addresses

Network Distance Estimation – p. 11

Page 15: Network Distance Estimation

Evaluation

Comparison of King and IDMaps from traceroute servers toset of web servers

Network Distance Estimation – p. 12

Page 16: Network Distance Estimation

Evaluation (2)

Comparison of King and IDMaps from traceroute servers toset of Napster clients

Network Distance Estimation – p. 13

Page 17: Network Distance Estimation

Coordinate System Based

Network Distance Estimation – p. 14

Page 18: Network Distance Estimation

Global Network PositioningT. S. E. Ng, H. Zhang, “Predicting Internet NetworkDistance with Coordinates-Based Approaches”,INFOCOM’02

Models the Internet as a geometric space

Assigns coordinates to each host

Distance between pair of hosts is computed as afunction of the coordinates of the hosts.

Network Distance Estimation – p. 15

Page 19: Network Distance Estimation

Landmark Operations

(x1, y1)

(x2, y2)

y

x

Internet

2−D Coordinate space

L

1L

L 3

2

L 3

1L

L 2

(x3, y3)

The N Landmarks measure the latencies to each otherusing ping

Landmarks compute coordinates by minimizing theoverall discrepancy between the measured distancesand the computed distances

This is a multi-dimensional global minimization problem

Network Distance Estimation – p. 16

Page 20: Network Distance Estimation

Host Operations

(x3, y3)

(x1, y1)

y

x

(x4, y4)

(x2, y2)

Internet2−D Coordinate space

L

L

L

C

C

L

L

L

2

1

1

2

3

3

A host measures its latencies to the N Landmarksusing ping

Host computes its own coordinates relative to theLandmarks’ coordinates

Minimizes the discrepancy between its measureddistances to the Landmarks and the computeddistances

Network Distance Estimation – p. 17

Page 21: Network Distance Estimation

Relative Error Comparison

Network Distance Estimation – p. 18

Page 22: Network Distance Estimation

Issues in GNPHow many dimensions does the Internet have?

Placement of the Landmark nodes

Network Distance Estimation – p. 19

Page 23: Network Distance Estimation

Internet Coordinate SystemH. Lim, J. Hou, C-H. Choi, “Constructing InternetCoordinate System Based on Delay Measurement”,IMC 2003.

Coordinate based approach

Uses a set of beacon nodes

Host measures its latency to the beacon nodes and thiscreates a distance vector

Distance vector is projected into a smaller dimensionspace to create the coordinates of the host

Network Distance Estimation – p. 20

Page 24: Network Distance Estimation

Principal Component AnalysisLet D be a distance matrix of four hosts with

D =

2

6

6

6

6

6

4

0 1 3 3

1 0 3 3

3 3 0 1

3 3 1 0

3

7

7

7

7

7

5

Singular value decomposition (SVD) of D is given by

D = U.W. V T

with U =

2

6

6

6

6

6

4

− 1

2− 1

2− 1√

20

− 1

2− 1

2

1√2

0

− 1

2

1

20 − 1√

2

− 1

2

1

20 1√

2

3

7

7

7

7

7

5

, W =

2

6

6

6

6

6

4

7 0 0 0

0 5 0 0

0 0 1 0

0 0 0 1

3

7

7

7

7

7

5

Network Distance Estimation – p. 21

Page 25: Network Distance Estimation

Principal Component Analysis (2)Each element of D can be expressed as

Dij = Σmk=1wkUikVjk 1 ≤ i ≤ m, 1 ≤ j ≤ m

Since the wis are decreasing, it is possible toapproximate D using only a few of them, e.g., 2 insteadof 4.

U =

2

6

6

6

6

6

4

− 1

2− 1

2

− 1

2− 1

2

− 1

2

1

2

− 1

2

1

2

3

7

7

7

7

7

5

, W =

2

4

7 0

0 5

3

5, V ′T =

2

4

− 1

2− 1

2− 1

2− 1

2

1

2

1

2− 1

2− 1

2

3

5 give

D′ =

2

6

6

6

6

6

4

1

2

1

23 3

1

2

1

23 3

3 3 1

2

1

2

3 3 1

2

1

2

3

7

7

7

7

7

5

Network Distance Estimation – p. 22

Page 26: Network Distance Estimation

Principal Component Analysis (3)The columns of U are the principal components and arethe orthogonal basis of the new subspace

Using the first n columns denoted by Un, them-dimensional space can be projected into ann-dimensional one

ci = UTn . di

From the example, the distance vector of the first host is

di = [0 1 3 3]T

This is converted into the two-dimensional vector

ci =

2

4

− 1

2− 1

2− 1

2− 1

2

− 1

2− 1

2

1

2

1

2

3

5 .

2

6

6

6

6

6

4

0

1

3

3

3

7

7

7

7

7

5

=

2

4

− 7

2

5

2

3

5

Network Distance Estimation – p. 23

Page 27: Network Distance Estimation

WorkingEach beacon nodemeasures its latencies toall other beacons

An administrative nodeaggregates the delayinformation to produce D

Using PCA, the node de-termines the dimensionof the coordinate systemand calculates Un

Network Distance Estimation – p. 24

Page 28: Network Distance Estimation

Working (2)A host measures to a set of beacon nodes and obtainsthe transformation matrix

Calculates its coordinate by multiplying the measureddistance vector with the transformation matrix

Reduction in dimensions may cause the calculateddistance in the coordinate system to be different frommeasured distance

This is fixed by scaling the calculated distance by α.The optimal scaling factor α∗(n) is found by minimizingthe error

J(α) = Σmi Σm

j (L2(αci, αcj) − dij)2

Final transformation matrix Un = α∗(n)Un

Network Distance Estimation – p. 25

Page 29: Network Distance Estimation

Effect of Dimensions

Performance of ICS unchanged for n ≥ 6

Network Distance Estimation – p. 26

Page 30: Network Distance Estimation

Open Issues in Distance EstimationHow many dimensions does the Internet have?Different datasets give different values

How to place Landmarks or Beacons to get the bestperformance? How sensitive are these techniques tothe location of landmarks?

More evaluation of these methods

Network Distance Estimation – p. 27