33
1 Rateless codes and random walks for P2P resource discovery in Grids IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, NOV. 2012. Valerio Bioglio Rossano Gaeta Marco Grangetto Matteo Sereno

Rateless codes and random walks for P2P resource discovery in Grids

  • Upload
    sakina

  • View
    47

  • Download
    0

Embed Size (px)

DESCRIPTION

Rateless codes and random walks for P2P resource discovery in Grids. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, NOV . 2012 . Valerio Bioglio Rossano Gaeta Marco Grangetto Matteo Sereno. Outline. Introduction Related Work Proposed System Analysis Simulation Results - PowerPoint PPT Presentation

Citation preview

Page 1: Rateless  codes and random walks for P2P resource discovery in Grids

1

Rateless codes and random walks for P2P resource discovery in Grids

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, NOV. 2012.

Valerio BioglioRossano Gaeta

Marco GrangettoMatteo Sereno

Page 2: Rateless  codes and random walks for P2P resource discovery in Grids

2

Outline

ه Introductionه Related Workه Proposed Systemه Analysisه Simulation Resultsه Conclusion

Page 3: Rateless  codes and random walks for P2P resource discovery in Grids

3

Introduction

ه The system is presented as a set of nodes connected to form a P2P network.ه each node contains a piece of information.ه all nodes may leave or join dynamically.

ه A peer to obtain a local view of global information defined on all peers of a P2P unstructured network.

ه Every node must communicate to all the participants so as to obtain the information of other peers.

Page 4: Rateless  codes and random walks for P2P resource discovery in Grids

4

Introduction

ه Many proposals exploiting unstructured P2P systems share a common characteristic : ه The interface peers

ى have one administrative domainى connect to other interface peersى maintain data of their local nodes

ه This paper assume ه each peer holds a piece of information.ه any peer requires to access the data of all other peers at rate λ

queries/sec.

Page 5: Rateless  codes and random walks for P2P resource discovery in Grids

5

Introduction

ه The goals to be achieved are threefold : 1. The complete global information can be collect by every node.

2. The communication overhead must be limited.

3. The processing power of each node must be used parsimoniously.

Page 6: Rateless  codes and random walks for P2P resource discovery in Grids

6

Contribution

1. A continuous flow of control packets exchanged among the nodes using the random walk principle.

2. The information combined by each node has to be the same version.

3. The proposed solution is suitable for large size data held by each node.

Page 7: Rateless  codes and random walks for P2P resource discovery in Grids

7

Outline

ه Introductionه Related Workه Proposed Systemه Analysisه Simulation Resultsه Conclusion

Page 8: Rateless  codes and random walks for P2P resource discovery in Grids

8

Related Work (1/2)

ه The flow control used by [6] on the maximum rate at which a participant can submit updates without creating a backlog and devises content reconciliation mechanisms to reduce message redundancy.

ه Algebraic Gossip, proposed in [11], in this paper a gossip algorithm based on Network Coding is presented, and it is proved that the spreading time of this algorithm is O(K).

[6] “Efficient reconciliation and flow control for anti-entropy protocols,” in Proceedings of the 2nd Workshop on Large-Scale Distributed Systems and Middleware, LADIS ’08. ACM, 2008.[11] “Algebraic gossip: a network coding approach to optimal multiple rumor mongering,” IEEE Transactions on Information Theory, vol. 52, no. 6, pp. 2486–2507, JUN 2006.

Page 9: Rateless  codes and random walks for P2P resource discovery in Grids

9

Related Work (2/2)

ه In [13] distributed fountain codes are proposed for networked storage. To create a new encoded packet, each storage node asks information to a randomly selected node of the network.

ه A similar algorithm is proposed in [14], but the coded packet formation mechanism is reversed.

ه The nodes cope with the information gathering and the encoding operations; in [16] this responsibility is assigned to the packets.

[13] “Bistributed fountain codes for networked storage,” in IEEE ICASSP, 2006.[14] “Data persistence in large-scale sensor networks with decentralized fountain codes,” in IEEE Infocom, 2007.[16] “Rateless packet approach for data gathering in wireless sensor networks,” Selected Areas in Communications, IEEE Journal on, vol. 28, no. 9, pp. 1169–1179, Sep. 2010.

Page 10: Rateless  codes and random walks for P2P resource discovery in Grids

10

Outline

ه Introductionه Related Workه Proposed Systemه Analysisه Simulation Resultsه Conclusion

Page 11: Rateless  codes and random walks for P2P resource discovery in Grids

11

System Description (1/3)

ه This paper models the interface peers of a Grid system and the connections among them as a graph G(V, E).ه V are the set of interface peers ه E are the set of edgesه is node IDه is time-stamp, i.e. generationsه is informationه m bits each information

Page 12: Rateless  codes and random walks for P2P resource discovery in Grids

12

System Description (2/3)

ه To realize a concurrent broadcasting of all the information collected by all the nodes in the network.ه all nodes should communicate with each other.

ه This paper proposes a fully distributed solution based on random walks. ه each node starts a limited number ω of packets.ه those packets are propagated by random walk in the network.ه all the nodes use the packets to solve a system of linear equations.

Page 13: Rateless  codes and random walks for P2P resource discovery in Grids

13

System Description (3/3)

ه The shortcomings of network coding1. The added computational complexity

ى Solutionي using simple combinations XORي using rateless codes, known as LT codes

2. The impossibility of asynchronous updating ى Solution

ي asynchronous updating

Node A

Node B

Page 14: Rateless  codes and random walks for P2P resource discovery in Grids

14

Random Walk and LT Coding

Header dF di v1 t1 v2 t2 v3 t3 v4 t4 ceq1eq2

Page 15: Rateless  codes and random walks for P2P resource discovery in Grids

15

Random Walk and LT Coding

ه When a packet approaches the maximum dimension DIM, the eldest equation carried by it is deleted.

ه When the acknowledgement timer reaches 0 the receiving node acknowledges the originator that its random walker is still alive.

Page 16: Rateless  codes and random walks for P2P resource discovery in Grids

16

Asynchronous Update and LT Coding (1/3)

ه The information spread by the random walkers can be recovered by any node as soon as the number of equations has been collected.

ه The decoder task can be formulated as the solution of the following system of linear equations Gx = c.ه G is an N×N binary matrix.

ى rows : N possible independent equations collected by the node

ه x is N×1 column vectors .ى N unknown pieces of information

ه c is the corresponding buffered linear combinations.

Page 17: Rateless  codes and random walks for P2P resource discovery in Grids

17

Asynchronous Update and LT Coding (2/3)

ه The nodes are allowed to update their information only when a new generation is initiated.ه the vector x is extended to the (ν+1)·N×1 vector ˜xه ˜G turns to be a (ν + 1)N×(ν + 1)N extended decoding matrix

ه The information collected in the network with a sliding window mechanism including the (ν+1) most recent generations for the information.

Page 18: Rateless  codes and random walks for P2P resource discovery in Grids

18

Asynchronous Update and LT Coding (3/3)

ه The idea is to keep the decoding as updated as possible aiming at reconstructing the last N elements of ˜x.

ه This paper proposes a strategy to manage the extended decoding matrix ˜G in order to make the decoding process robust to asynchronous updates of the information.

Page 19: Rateless  codes and random walks for P2P resource discovery in Grids

19

Asynchronous Update Algorithm

[21] V. Bioglio, M. Grangetto, R. Gaeta, and M. Sereno, “An optimal partial decoding algorithm for rateless codes,” in IEEE International Symposium on Information Theory (ISIT), aug 2011, pp. 2731 –2735.

Page 20: Rateless  codes and random walks for P2P resource discovery in Grids

20

Outline

ه Introductionه Related Workه Proposed Systemه Analysisه Simulation Resultsه Conclusion

Page 21: Rateless  codes and random walks for P2P resource discovery in Grids

21

Recovery Time (1/6)

ه The time required to spread all the local information to all the participants in the network is defined as recovery time.

ه Model the recovery time as a function of ه the size of the local information mه the number of random walkers generated per node ωه the number of nodes in the network Nه the maximum size of the random walk packets DIM.

Page 22: Rateless  codes and random walks for P2P resource discovery in Grids

22

Recovery Time (2/6)

ه Given the size DIM (in bits) of the transmission packet.ه header size is h ه other size is f = DIM − hه the pair (vl, tl) size is gه combined message ci size is m

ه So, the size of a single equation : ه Coded : eC = di g + m, di = 2lnNه Uncoded : eU = g + m

Page 23: Rateless  codes and random walks for P2P resource discovery in Grids

23

Recovery Time (3/6)

ه We can know that nU and nC the maximum number of equations storable in an uncoded and encoded packet are :

ه .

ه .

Page 24: Rateless  codes and random walks for P2P resource discovery in Grids

24

Recovery Time (4/6)

ه It is possible to predict the number of hops TC required to distribute a certain number of equations RC using the coded approach.

Page 25: Rateless  codes and random walks for P2P resource discovery in Grids

25

Recovery Time (5/6)

ه N = 1000 nodes

Page 26: Rateless  codes and random walks for P2P resource discovery in Grids

26

Recovery Time (6/6)

ه N = 1000, Nneigh = 50, ω = 1ه 95% confidence interval

Page 27: Rateless  codes and random walks for P2P resource discovery in Grids

27

Outline

ه Introductionه Related Workه Proposed Systemه Analysisه Simulation Resultsه Conclusion

Page 28: Rateless  codes and random walks for P2P resource discovery in Grids

28

Simulation Results (1/4)

ه In order to simulate the real P2P circumstances in networks : 1. at each time slot 30 random nodes shuffle their neighborhood by

exchanging one random neighbor.2. when a node joins it connects to a random set of neighboring

nodes. when a node leaves its neighbors replace it through the described shuffling mechanism.

3. keep constant the overall number of packets in the network ideal signaling is assumed

Page 29: Rateless  codes and random walks for P2P resource discovery in Grids

29

Simulation Results (2/4)

ه For each node vl we calculate the percentage of overall information retrieved by that node as a function of time T :

Page 30: Rateless  codes and random walks for P2P resource discovery in Grids

30

Simulation Results (3/4)

ه The average value of the previous index computed on the set of nodes A(T) that are active.

ه All the numerical results based on the previous definitions have been averaged over 30 independent trials so as to guarantee statistically meaningful values.

Page 31: Rateless  codes and random walks for P2P resource discovery in Grids

31

Simulation Results (4/4)

Page 32: Rateless  codes and random walks for P2P resource discovery in Grids

32

Outline

ه Introductionه Related Workه Proposed Systemه Analysisه Simulation Resultsه Conclusion

Page 33: Rateless  codes and random walks for P2P resource discovery in Grids

33

Conclusion

ه The design of a novel decoder for rateless codes that is robust to asynchronous updates of the information.

ه The development of a simple analytical model for the estimation of the time required to spread the information.

ه The encoded system scales better than the uncoded one when the number of nodes in the distributed system increases.