25
George Kola Computer Sciences Department University of Wisconsin-Madison [email protected] http://www.cs.wisc.edu/condor DiskRouter: A Mechanism for High Performance Large Scale Data Transfers

George Kola Computer Sciences Department University of Wisconsin-Madison [email protected] DiskRouter: A Mechanism for High

Embed Size (px)

Citation preview

Page 1: George Kola Computer Sciences Department University of Wisconsin-Madison kola@cs.wisc.edu  DiskRouter: A Mechanism for High

George KolaComputer Sciences DepartmentUniversity of Wisconsin-Madison

[email protected]://www.cs.wisc.edu/condor

DiskRouter: A Mechanism for High

Performance Large Scale Data Transfers

Page 2: George Kola Computer Sciences Department University of Wisconsin-Madison kola@cs.wisc.edu  DiskRouter: A Mechanism for High

2www.cs.wisc.edu/condor

Outline

› Problem

› DiskRouter Overview

› Details

› Real life DiskRouters

› Experiments

Page 3: George Kola Computer Sciences Department University of Wisconsin-Madison kola@cs.wisc.edu  DiskRouter: A Mechanism for High

3www.cs.wisc.edu/condor

ProblemSDSC to NCSABottleneck Bandwidth : 12.5 MBPSLatency 67 ms

Transfer Rate got by applications for a 1GB file

Scp : 0.66MBPSGridFTP(1 stream) : 0.85 MBPSGridFTP(10 streams) : 3.52 MBPS

Page 4: George Kola Computer Sciences Department University of Wisconsin-Madison kola@cs.wisc.edu  DiskRouter: A Mechanism for High

4www.cs.wisc.edu/condor

DiskRouter Overview

› Mechanism to efficiently move large amounts of data (order of terabytes)

› Uses disk as a buffer to aid in large scale data transfers

› Application-level overlay network used for routing

› Ability to use higher level knowledge for data movement

Page 5: George Kola Computer Sciences Department University of Wisconsin-Madison kola@cs.wisc.edu  DiskRouter: A Mechanism for High

5www.cs.wisc.edu/condor

A Simple Case

A B

A is transferring a large amount of data to B

Page 6: George Kola Computer Sciences Department University of Wisconsin-Madison kola@cs.wisc.edu  DiskRouter: A Mechanism for High

6www.cs.wisc.edu/condor

A Simple Case

A B

C

DiskRouter

C is an intermediate node between A and B

Page 7: George Kola Computer Sciences Department University of Wisconsin-Madison kola@cs.wisc.edu  DiskRouter: A Mechanism for High

7www.cs.wisc.edu/condor

A Simple Case with DiskRouter

A B

C

Improves performance when bandwidth fluctuation between A and C is independent of the bandwidth fluctuation between C and B

DiskRouter

With DiskRouter

Without DiskRouter

Page 8: George Kola Computer Sciences Department University of Wisconsin-Madison kola@cs.wisc.edu  DiskRouter: A Mechanism for High

8www.cs.wisc.edu/condor

Data Mover/Distributed Cach e

Source writes to the closest DiskRouter and Destination receives it up from its closest DiskRouter

SourceDestinationDiskRouter

Cloud

Page 9: George Kola Computer Sciences Department University of Wisconsin-Madison kola@cs.wisc.edu  DiskRouter: A Mechanism for High

9www.cs.wisc.edu/condor

Outline

› Problem

› DiskRouter Overview

› Details

› Real life DiskRouters

› Experiments

Page 10: George Kola Computer Sciences Department University of Wisconsin-Madison kola@cs.wisc.edu  DiskRouter: A Mechanism for High

10www.cs.wisc.edu/condor

Routing Between DiskRouters

C need not be in the path between A and B

DiskRouter ADiskRouter B

DiskRouter C

Page 11: George Kola Computer Sciences Department University of Wisconsin-Madison kola@cs.wisc.edu  DiskRouter: A Mechanism for High

11www.cs.wisc.edu/condor

Network Monitoring› Uses ‘Pathrate’ for estimating network

capacity

› Performs actual transfers for measurement

› Logging the data rate seen by different components

› Generate network interface stats on the machines involved in the transfers

Page 12: George Kola Computer Sciences Department University of Wisconsin-Madison kola@cs.wisc.edu  DiskRouter: A Mechanism for High

12www.cs.wisc.edu/condor

Implementation Details

› Uses multiple sockets and explicitly sets TCP buffer sizes

› Overlaps disk I/O and socket I/O

Page 13: George Kola Computer Sciences Department University of Wisconsin-Madison kola@cs.wisc.edu  DiskRouter: A Mechanism for High

13www.cs.wisc.edu/condor

Client Side

› Client library provided› Applications can call library

functions for network I/O› Functions provided for common

case file transfer (overlaps network I/O and disk I/O)

› Third party transfer support

Page 14: George Kola Computer Sciences Department University of Wisconsin-Madison kola@cs.wisc.edu  DiskRouter: A Mechanism for High

14www.cs.wisc.edu/condor

Outline

› Problem

› DiskRouter Overview

› Details

› Real life DiskRouters

› Experiments

Page 15: George Kola Computer Sciences Department University of Wisconsin-Madison kola@cs.wisc.edu  DiskRouter: A Mechanism for High

15www.cs.wisc.edu/condor

Real Life DiskRouters

UW Madison

StarLight

MCS ANL

INFN Italy90 Mbps

3.3 ms

90 Mbps

5.5 ms 514 Mbps

0.85 ms

30 Mbps

126.6 ms

411 Mbps

8 ms

SDSCUW

MilwaukeeNCSA

518 Mbps

67 ms94 Mbps

2.7 ms

Page 16: George Kola Computer Sciences Department University of Wisconsin-Madison kola@cs.wisc.edu  DiskRouter: A Mechanism for High

16www.cs.wisc.edu/condor

Outline

› Overview

› Details

› Real Life DiskRouters

› Experiments

Page 17: George Kola Computer Sciences Department University of Wisconsin-Madison kola@cs.wisc.edu  DiskRouter: A Mechanism for High

17www.cs.wisc.edu/condor

Testing Multiroute

UW Madison

StarLight

90 Mbps

3.3 ms

90 Mbps

5.5 ms

411 Mbps

8 ms

UW Milwaukee

Page 18: George Kola Computer Sciences Department University of Wisconsin-Madison kola@cs.wisc.edu  DiskRouter: A Mechanism for High

18www.cs.wisc.edu/condor

Multiroute Improves Performance

Total Data into Starlight

Data From Milwaukee

Data From Madison

Meg

abit

s/se

cond

Page 19: George Kola Computer Sciences Department University of Wisconsin-Madison kola@cs.wisc.edu  DiskRouter: A Mechanism for High

19www.cs.wisc.edu/condor

SRB to Unitree Transfer Using Stork

› Data movement from SDSC to NCSA via Starlight (3 TB of data had to be moved)

› Integrated into Stork

› Found significant performance gain

Page 20: George Kola Computer Sciences Department University of Wisconsin-Madison kola@cs.wisc.edu  DiskRouter: A Mechanism for High

20www.cs.wisc.edu/condor

Link between SDSC and NCSA

StarLight

SDSCNCSA

518 Mbps

67 ms94 Mbps

2.7 ms

Page 21: George Kola Computer Sciences Department University of Wisconsin-Madison kola@cs.wisc.edu  DiskRouter: A Mechanism for High

21www.cs.wisc.edu/condor

Starlight DiskRouter Stats

Data Inflow

Data Outflow

Memory Used

Disk Used

Page 22: George Kola Computer Sciences Department University of Wisconsin-Madison kola@cs.wisc.edu  DiskRouter: A Mechanism for High

22www.cs.wisc.edu/condor

GridFTP vs DiskRouterM

egab

ytes

/sec

ond

End-to-End Data Rate Seen by Stork(MBPS) vs. Time

GridFTP DiskRouter

Page 23: George Kola Computer Sciences Department University of Wisconsin-Madison kola@cs.wisc.edu  DiskRouter: A Mechanism for High

23www.cs.wisc.edu/condor

A Glimpse of Performance

Transfer of 1 GB file from SDSC (SanDiego) to NCSA (Urbana-Champaign)

Tool Transfer RateScp 0.66 MBPSGridFTP(1 stream) 0.85 MBPSGridFTP(10 streams) 3.52 MBPSDiskRouter 10.77 MBPS

Page 24: George Kola Computer Sciences Department University of Wisconsin-Madison kola@cs.wisc.edu  DiskRouter: A Mechanism for High

24www.cs.wisc.edu/condor

Work In Progress

› Computation on data streams in the DiskRouter

› Ability to perform computation in the nodes attached locally to the DiskRouter

› Working together with Stork to add intelligence to data movement

Page 25: George Kola Computer Sciences Department University of Wisconsin-Madison kola@cs.wisc.edu  DiskRouter: A Mechanism for High

25www.cs.wisc.edu/condor

Questions

› Thanks for listening