Upload
ce107
View
218
Download
0
Embed Size (px)
Citation preview
7/28/2019 b_eff
1/5
Effective Bandwith Benchmark (beff) Version 3.5High-Performance Computing-Center Stuttgart, HLRS
Thu Jan 24 13:21:20 2013 on CNK r00idj07 2.6.32-220.23.3.bgq.el6 V1R1M2 0.ppc64 1 BGQ
beff = 89490.849 MB/s = 43.697 * 2048 PEs with 128 MB/PE
number beff Lmax beff beff Latency Latency Latency ping-pongof pro- at Lmax at Lmax rings& rings ping- bandwidthcessors rings& rings random only pong
random onlyMByte/s MByte/s MByte/s mircosec microsec microsec MByte/s
accumulated 2048 89491 1 MB 191847 417199 8.585 8.056 4.882 2211
per process 44 94 204
Ping-Pong result (only the processes with rank 0 and 1 in MPI COMM WORLD were used):Latency: 4.882 microsec per message Bandwidth: 2210.760 MB/s (with MB/s = 106 byte/s)
0.01
0.1
1
10
100
1000
10000
1 10 100 1000 10000 100000 1e+06
bandwith[MB/s]
message length per process [Byte]
Sndrcv, ring & random patterns
ring-1024*2fixring-512*4fixring-256*8fixring-4*512fix
ring-2*1024fixring-1*2048fix
worst randomavg randombest random
0.01
0.1
1
10
100
1000
10000
1 10 100 1000 10000 100000 1e+06
ba
ndwith[MB/s]
message length per process [Byte]
Sndrcv, additional patterns
worst-cyc-1dimbest bi-section
worst bi-sectionacyclic-2dim-allacyclic-3dim-all
cyclic-2dim-xcyclic-2dim-y
cyclic-2dim-allcyclic-3dim-x
1
7/28/2019 b_eff
2/5
0.01
0.1
1
10
100
1000
10000
1 10 100 1000 10000 100000 1e+06
bandw
ith[MB/s]
message length per process [Byte]
Alltoal, ring & random patterns
ring-1024*2fixring-512*4fixring-256*8fixring-4*512fix
ring-2*1024fixring-1*2048fix
worst randomavg randombest random
0.01
0.1
1
10
100
1000
10000
1 10 100 1000 10000 100000 1e+06
bandwith[MB/s]
message length per process [Byte]
Alltoal, additional patterns
worst-cyc-1dimbest bi-section
worst bi-sectionacyclic-2dim-allacyclic-3dim-all
cyclic-2dim-xcyclic-2dim-y
cyclic-2dim-allcyclic-3dim-x
2
7/28/2019 b_eff
3/5
0.01
0.1
1
10
100
1000
10000
1 10 100 1000 10000 100000 1e+06
bandwith[MB/s]
message length per process [Byte]
non-blk, ring & random patterns
ring-1024*2fixring-512*4fixring-256*8fixring-4*512fix
ring-2*1024fix
ring-1*2048fixworst randomavg randombest random
0.01
0.1
1
10
100
1000
10000
1 10 100 1000 10000 100000 1e+06
bandwith[MB/s]
message length per process [Byte]
non-blk, additional patterns
worst-cyc-1dimbest bi-section
worst bi-sectionacyclic-2dim-allacyclic-3dim-all
cyclic-2dim-xcyclic-2dim-y
cyclic-2dim-allcyclic-3dim-x
3
7/28/2019 b_eff
4/5
0.01
0.1
1
10
100
1000
10000
1 10 100 1000 10000 100000 1e+06 1e+07
bandwith[MB/s]
message length per process [Byte]
Best transfer method, ring & random patterns
ring-1024*2fixring-512*4fixring-256*8fixring-4*512fix
ring-2*1024fix
ring-1*2048fixworst randomavg randombest random
0.01
0.1
1
10
100
1000
10000
1 10 100 1000 10000 100000 1e+06 1e+07
bandwith[MB/s]
message length per process [Byte]
Best transfer method, additional patterns
worst-cyc-1dimbest bi-section
worst bi-sectionacyclic-2dim-allacyclic-3dim-all
cyclic-2dim-xcyclic-2dim-y
cyclic-2dim-allcyclic-3dim-x
4
7/28/2019 b_eff
5/5
0.01
0.1
1
10
100
1000
10000
1 10 100 1000 10000 100000 1e+06
bandwith[MB/s]
message length per process [Byte]
Ring & random average: Sndrcv, Alltoal, non-blk
Sendrcv ringsAlltoal rings
non-blk ringsSendrcv randomAlltoal random
non-blk random
0.01
0.1
1
10
100
1000
10000
1 10 100 1000 10000 100000 1e+06 1e+07
bandwith[MB/s]
message length per process [Byte]
Best method: rings & random
rings minumumrings average
rings maximumrandom minimum
random averagerandom maximum
ring & random average
5