
Simple and Fast Distributed Multicommodity Flow Algorithm



Simple and Fast Distributed Multicommodity Flow Algorithm

Anil Kamath*   Omri Palmon†   Serge Plotkin‡

Abstract

In this paper, we present a simple and fast distributed algorithm for approximately solving the minimum-cost multicommodity flow problem. If there exists a flow that satisfies all of the demands, our algorithm runs in $O(m^2 \epsilon^{-2} \log n)$ communication rounds and produces a flow that satisfies a $(1-\epsilon)$-fraction of each demand, where $m$ denotes the number of edges in the graph. If there exists a feasible flow of cost $B$, then our algorithm terminates in $O(m^2 \epsilon^{-2} \delta^{-2} \log(nCU) \log^2(n\epsilon^{-1}))$ rounds and finds a flow that satisfies a $(1-\epsilon)$ fraction of each demand and whose cost is below $(1+\delta)B$, where $C$ and $U$ are the maximum cost and maximum edge capacity, respectively.

The algorithm is local in the sense that in one round each processor makes a local computation and exchanges information with each one of its neighbors. The only previously known local multicommodity flow approximation algorithm, due to Awerbuch and Leighton [2], cannot deal with the min-cost problem, and runs in $O(m^2 k^{1.5} n \epsilon^{-3})$ communication rounds, which can be as much as an $\Omega(n^4 \epsilon^{-1})$ factor slower than our algorithm.

Perhaps the most important property of our algorithm is its resilience to failures. For example, in the case where there are no costs, failure of an edge can delay the convergence of the algorithm by at most $O(m^2 \epsilon^{-2} \log n)$ rounds, provided that this failure does not make the problem infeasible.

The performance of our distributed algorithm should be contrasted with the performance one can achieve by taking the best sequential approximation algorithm for the min-cost multicommodity flow problem [8] and implementing it in a distributed setting. This results in a distributed algorithm that runs in $\tilde{O}(kmn\epsilon^{-2})$ communication rounds, where $k$ is the number of commodities and $n$ is the number of nodes in the graph. This algorithm is not resilient to edge failures and is slower than the algorithm presented here for constant $\epsilon$ and $\delta$ and large $k$.

* Department of Computer Science, Stanford University. Research supported by U.S. Army Research Office Grant DAAL-03-91-G-0102.
† Department of Computer Science, Stanford University. Research supported by a grant from the Stanford Office of Technology Licensing.
‡ Department of Computer Science, Stanford University. Research supported by U.S. Army Research Office Grant DAAL-03-91-G-0102 and by a grant from Mitsubishi Electric Laboratories.

1 Introduction

The multicommodity flow problem involves simultaneously shipping several different commodities from their respective sources to their sinks in a single network, so that the total amount of flow going through each edge is no more than its capacity. Associated with each commodity is a demand, which is the amount of that commodity that we wish to ship. Given a multicommodity flow problem, one often wants to know if there is a feasible flow, i.e. whether it is possible to find a flow that satisfies the demands and obeys the capacity constraints. More generally, we might wish to know the maximum percentage $z$ such that at least $z$ percent of each demand can be shipped without violating the capacity constraints. The latter problem is known as the concurrent flow problem, and is equivalent to the problem of determining the minimum ratio by which the capacities must be increased in order to ship 100% of each demand.

In this paper we present a simple and fast distributed approximation algorithm for a variant of the minimum-cost concurrent multicommodity flow problem. If there exists a feasible flow, i.e. a flow that satisfies all the demands, then in $\tilde{O}(m^2\epsilon^{-2})$ communication rounds our algorithm finds a flow that satisfies at least a $(1-\epsilon)$-fraction of each demand, where $m$ is the number of edges in the network.[1] If there exists a feasible flow of cost $B$, then our algorithm finds a flow that satisfies a $(1-\epsilon)$ fraction of the demands and whose cost is below $(1+\delta)B$ in $\tilde{O}(m^2\epsilon^{-2}\delta^{-2}\log(CU))$ rounds, where $C$ and $U$ are the maximum edge cost and maximum edge capacity, respectively.

A communication round consists of each processor performing a computation on its local data and sending the results to all of its neighbors. Our algorithm is local in the sense that it does not depend on any global computations, such as shortest paths or min-cost flow, which were the main subroutines of the previously known efficient combinatorial multicommodity flow algorithms [5, 6, 8].

[1] We say that $f(n) = \tilde{O}(g(n))$ if $f(n) = O(g(n)\log^k n)$ for some constant $k$.
The only previously known local approximate concurrent flow algorithm, due to Awerbuch and Leighton [2], does not address the min-cost problem and runs in $\tilde{O}(m^2 k^{1.5} n \epsilon^{-3})$ rounds, where $k$ is the number of commodities and $n$ is the number of nodes. This can be as much as an $\Omega(n^4\epsilon^{-1})$ factor slower than our algorithm.

The main advantage of a "local" algorithm is the ease of implementation in a distributed system where centralized control is either infeasible or very expensive. Moreover, our algorithm is resilient to failures: an edge failure (or, as a matter of fact, any change in the network topology) that does not make the problem infeasible can delay the convergence of the algorithm by at most the number of rounds stated above. The only two nodes that need to be notified about the failure are the nodes adjacent to the failed edge. This makes it unnecessary to run the complicated and costly "global reset" algorithms [1] after every failure/reappearance of an edge. Another interesting property of our algorithm is that it can be modified to run forever, maintaining an approximate solution to the flow and automatically adjusting it after every edge failure or demand change.

Multicommodity flow arises naturally in many contexts, including virtual circuit routing in a communication network, VLSI layout, scheduling, and transportation, and hence has been extensively studied [3, 4, 11, 12, 7, 9, 10]. Since it can be written as a linear program, one can use

interior point methods to solve it exactly. Unfortunately, the large number of variables causes these methods to yield rather slow algorithms. In particular, the fastest interior-point min-cost multicommodity flow algorithm [4, 11, 12] relies on a fast matrix multiplication subroutine and takes $O(k^{3.5} n^3 m^{0.5} \log(nDU))$ time for the multicommodity flow problem with integer demands, and at least $O(k^{2.5} n^2 m^{0.5} \log(n\epsilon^{-1}DU))$ time to find an approximate solution, where $D$ is the largest demand and $U$ is the largest edge capacity. Since this algorithm involves repeated matrix multiplications, it does not seem appropriate for efficient implementation in the distributed setting.

Significantly better running times were achieved by approximation algorithms. In particular, if there exists a feasible solution, then the algorithm due to Leighton, Makedon, Plotkin, Stein, Tardos, and Tragoudas [6] will produce an $\epsilon$-approximation to the feasible flow in $\tilde{O}(k^2 mn\epsilon^{-2})$ time. A flow that both satisfies at least a $(1-\epsilon)$ fraction of each demand and whose cost is within a $(1+\epsilon)$ factor of the optimum can be obtained by the algorithm due to Plotkin, Shmoys and Tardos [8] in $\tilde{O}(k^2 m^2 \epsilon^{-2} \log C)$ time, where $C$ is the maximum edge cost. The running times of these algorithms can be improved by a factor of $k$ by the use of randomization. Both of these algorithms are based on repeated computations of minimum-cost flows and shortest paths, and hence are not very easy to implement in a distributed setting, especially in networks with dynamic topology. The best performance one can hope to achieve by implementing the algorithm of [8] in a static network is $\tilde{O}(kmn\epsilon^{-2}\log C)$ rounds. This significantly exceeds the number of rounds of the algorithm presented in this paper for large $k$ and constant $\epsilon$ and $\delta$.

The algorithm presented in this paper is very simple.
The idea is to maintain a preflow, which is a flow that obeys the capacity constraints but does not obey the flow-conservation constraints. We refer to the difference between the flow into a node and the flow out of the node as the excess at this node. The algorithm starts with the zero flow, which is a valid preflow. Each iteration consists of two phases. In the first phase, each node distributes its excess among its adjacent edges. Then, independently for each edge, the flow through the edge is updated in a way that reduces a certain non-linear function of the excesses assigned to this edge. Intuitively, this operation "spreads" the excesses throughout the graph, eventually obtaining a solution with very small excesses.

Our algorithm is in some sense similar to the one in [2]. There are two main differences that allowed us to obtain much better performance. First, we use a different local optimization criterion. Our criterion can be viewed as an adaptation to the excess-minimization context of the exponential "length function" introduced by Shahrokhi and Matula [9]. Second, we use preflow instead of explicitly maintaining queues. Roughly speaking, if one interprets flow as the "amount of flow per unit time," then the fact that a node has excess implies that in each time unit there is more flow coming in than going out. The algorithm in [2] explicitly maintains the number of flow units queued at each node and tries to minimize the size of these queues. Instead, our algorithm minimizes the excess, which corresponds to the rate of growth of the queues.
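For concreteness, the concurrent flow question described above can be written as a linear program. The rendering below is a standard textbook edge-flow formulation rather than one taken from this paper, and the notation $d_i(v)$, $u(vw)$, $f_i(vw)$ anticipates the definitions of Section 2:

```latex
\begin{aligned}
\max\; z \quad \text{s.t.} \quad
& \sum_{w : vw \in E} f_i(vw) = z \, d_i(v) && \forall\, i,\ \forall\, v \in V, \\
& \sum_{i=1}^{k} |f_i(vw)| \le u(vw) && \forall\, vw \in E,
\end{aligned}
```

where $d_i(s_i) = d_i$, $d_i(t_i) = -d_i$, and $d_i(v) = 0$ otherwise. The absolute values are linearized in the usual way by splitting each $f_i(vw)$ into nonnegative forward and backward parts; all demands are simultaneously satisfiable exactly when the optimum value is $z \geq 1$.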

2 Preliminaries and Definitions

An instance of the multicommodity flow problem consists of an undirected graph $G = (V, E)$, a non-negative capacity $u(vw)$ for every edge $vw \in E$, and a specification of $k$ commodities, numbered 1 through $k$, where the specification for commodity $i$ consists of a source-sink pair $s_i, t_i \in V$ and a non-negative demand $d_i$. We will denote the number of nodes by $n$ and the number of edges by $m$. We assume that $m \geq n$, and that the graph $G$ is connected and has no parallel edges. Also, for notational convenience, we arbitrarily direct each edge. If there is an edge directed from $v$ to $w$, this edge is unique by assumption, and we denote it by $vw$.

A multicommodity flow $f$ consists of a function $f_i(vw)$ on the edges of $G$ for every commodity $i$, which represents the flow of commodity $i$ on edge $vw$. If the flow of commodity $i$ on edge $vw$ is oriented in the same direction as edge $vw$, then $f_i(vw)$ is positive; otherwise it is negative. The signs only serve to indicate the direction of the flows. Given a flow $f_i(vw)$, the excess $Ex_i(f, v)$ of commodity $i$ at node $v$ is defined by:
$$Ex_i(f, v) \stackrel{\rm def}{=} d_i(v) - \sum_{w : vw \in E} f_i(vw),$$
where $d_i(s_i) = d_i$, $d_i(t_i) = -d_i$, and $d_i(v) = 0$ otherwise.

A flow where all excesses are zero is said to satisfy the conservation constraints. A flow satisfies the capacity constraints if $\sum_i |f_i(vw)| \leq u(vw)$ for all edges $vw$. A flow that satisfies both the capacity and the conservation constraints is feasible. During the description of our algorithm it will be convenient to work with a preflow, which is a flow that satisfies the capacity constraints but does not necessarily satisfy the conservation constraints. An $\epsilon$-approximation to the multicommodity flow feasibility problem is a feasible flow that satisfies demands $(1-\epsilon)d_i$.

3 The Flow Without Costs

In this section we present the distributed multicommodity flow algorithm.
If a feasible solution satisfying all the demands exists, the algorithm will produce a flow that satisfies at least a $(1-\epsilon)$ fraction of each demand.

The Algorithm. The algorithm starts with the zero flow, which is a valid preflow. The algorithm maintains, for every node pair $v, w$ such that $vw \in E$ or $wv \in E$ and for every commodity $i$, an excess $Ex_i(f, vw)$.[2] The excess of commodity $i$ at node $v$ is the sum of $Ex_i(f, vw)$ over all $w$. Each iteration consists of two phases. In the first phase, each node recomputes $Ex_i(f, vw)$, effectively distributing its excess among its edges. Then, independently for each edge, the flow through the edge is updated in a way that reduces the potential function defined below. Intuitively, this operation "spreads" the excesses throughout the graph, eventually obtaining a solution with very small remaining excesses.

[2] Note that, in general, $Ex_i(f, vw) \neq Ex_i(f, wv)$.
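As a concrete illustration of the first phase, here is a minimal sketch on a hypothetical three-node path instance; the node names and the dict-based bookkeeping are illustrative choices, not the paper's data structures:

```python
from collections import defaultdict

# Toy instance: path a-b-c, one commodity (i = 0) with source 'a',
# sink 'c', and demand 2.
edges = [('a', 'b'), ('b', 'c')]            # arbitrarily directed edges
nodes = ['a', 'b', 'c']
demand_at = {(0, 'a'): 2.0, (0, 'c'): -2.0} # d_i(s_i) = d_i, d_i(t_i) = -d_i
flow = defaultdict(float)                   # signed flow per (commodity, edge)

def node_excess(i, v):
    """Ex_i(f, v) = d_i(v) minus the net flow of commodity i out of v."""
    net_out = 0.0
    for (a, b) in edges:
        if a == v:
            net_out += flow[(i, (a, b))]
        elif b == v:
            net_out -= flow[(i, (a, b))]
    return demand_at.get((i, v), 0.0) - net_out

def distribute_excess(i):
    """Phase 1: split Ex_i(f, v) equally among the deg(v) edges incident
    to v, producing the per-endpoint excesses Ex_i(f, vw)."""
    deg = defaultdict(int)
    for (a, b) in edges:
        deg[a] += 1
        deg[b] += 1
    ex = {}
    for v in nodes:
        share = node_excess(i, v) / deg[v]
        for (a, b) in edges:
            if v == a or v == b:
                ex[(v, (a, b))] = share
    return ex

ex = distribute_excess(0)  # with zero flow, all excess sits at source/sink
```

With the zero flow, the entire demand shows up as excess $2$ at the source and $-2$ at the sink; a node of degree two such as $b$ would split any nonzero excess equally between its two incident edges.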

Given a flow $f$, we define the following potential function:
$$\Phi(f) \stackrel{\rm def}{=} \sum_{vw \in E} \sum_{i=1}^{k} \left[ \exp\left(\alpha \frac{Ex_i(f, vw)}{d_i}\right) + \exp\left(\alpha \frac{Ex_i(f, wv)}{d_i}\right) \right], \qquad (1)$$
where $\alpha \stackrel{\rm def}{=} 4\epsilon^{-1} m \log(mk)$.

The algorithm starts from the zero flow and proceeds in iterations, where each iteration consists of two phases. The first phase computes the excess $Ex_i(f, v)$ at each node and distributes it equally among the edges incident to this node by setting $Ex_i(f, vw) = Ex_i(f, v)/\delta(v)$, where $\delta(v)$ is the degree of $v$. The second phase calculates the flow update $\Delta f_i(vw)$ on each edge $vw$ for each commodity $i$, where the goal is to minimize the potential function $\Phi(f)$. In fact, since this function can be written as a sum of functions each of which depends on the flow on a single edge, the algorithm minimizes the following function independently for each edge $vw$:
$$\sum_{i=1}^{k} \exp\left(\alpha \frac{Ex_i(f, vw) - \Delta f_i(vw)}{d_i}\right) + \exp\left(\alpha \frac{Ex_i(f, wv) + \Delta f_i(vw)}{d_i}\right). \qquad (2)$$
In order to ensure convergence to a feasible flow, the minimization is done under the following constraints:
$$\sum_{i=1}^{k} |f_i(vw) + \Delta f_i(vw)| \leq u(vw), \qquad |f_i(vw) + \Delta f_i(vw)| \leq d_i.$$
The new flow is computed by setting $f_i(vw) = f_i(vw) + \Delta f_i(vw)$. In the rest of this section we will assume that the minimization is executed as described above. In fact, one can minimize a much simpler expression, thus improving the complexity of each iteration. We will address this issue in Section 5.

Analysis of the algorithm. We will show that the existence of a feasible solution satisfying all of the demands implies that each iteration reduces the potential function. We will concentrate on proving the reduction due to the second phase of each iteration, since the first phase cannot increase the potential function. The following theorem shows that a sufficiently small value of $\Phi(f)$ implies that the sum of the excesses of any commodity does not exceed an $\epsilon$-fraction of the demand of this commodity. Note that such a preflow can be easily converted into a flow that satisfies a $(1-\epsilon)$-fraction of each demand.

Theorem 3.1 If the value of the potential function satisfies $\Phi(f) \leq \exp(\alpha\epsilon/m)$, then for each commodity $i$ we have $\sum_v |Ex_i(f, v)| \leq \epsilon d_i$.
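For a single commodity, the per-edge minimization of expression (2) has a simple closed form, since $\alpha/d_i$ scales both exponents equally: the unconstrained minimizer equalizes the two excesses, and by convexity the constrained minimizer is obtained by clamping to the feasible interval. The sketch below is a hypothetical helper illustrating this special case (the general $k$-commodity subproblem is the subject of Section 5):

```python
def edge_update_single_commodity(ex_vw, ex_wv, f, d, u):
    """Minimize exp(alpha*(ex_vw - delta)/d) + exp(alpha*(ex_wv + delta)/d)
    over delta, subject to |f + delta| <= min(d, u).  The common factor
    alpha/d drops out of the argmin: the unconstrained minimizer equalizes
    the exponents, delta = (ex_vw - ex_wv)/2.  The objective is convex in
    delta, so clamping to the feasible interval is then optimal."""
    delta = (ex_vw - ex_wv) / 2.0
    cap = min(d, u)                 # tighter of the two constraints on f+delta
    return max(-cap - f, min(cap - f, delta))

# Push 1 unit across an edge with excess 2 on one side and 0 on the other.
assert edge_update_single_commodity(2.0, 0.0, 0.0, d=2.0, u=3.0) == 1.0
# The demand/capacity bound clips an over-large step.
assert edge_update_single_commodity(10.0, 0.0, 0.0, d=2.0, u=3.0) == 2.0
```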

Theorem 3.2 For each iteration, $\Phi(f + \Delta f) \leq \Phi(f)$. Furthermore, if there exists a feasible solution to the problem, then in each iteration
$$\Phi(f + \Delta f) \leq \Phi(f)\left(1 - \frac{\log^2(\Phi(f)/(mk))}{16\alpha^2}\right).$$

Proof: As we have noted above, the first phase of each iteration cannot increase the potential function. Obviously, the second phase cannot increase the potential function value either. It remains to prove that the decrease due to the second phase is large.

Let $f^*$ denote a feasible solution that satisfies all of the demands, such that $|f_i^*(vw)| \leq d_i$ for every commodity $i$ and edge $vw$. Let $\tilde f \stackrel{\rm def}{=} f^* - f$. Observe that the flow update defined by $\sigma \tilde f$ for any $\sigma \in [0, \alpha^{-1}]$ satisfies the optimization constraints. Hence the reduction in the potential function can be bounded by computing the effect of updating the current flow by $\sigma \tilde f$.

We will use the fact that $e^{x+t} \leq e^x + t e^x + t^2 e^x$ for $|t| \leq 1$. Thus, if $|\sigma \alpha \tilde f_i(vw)| \leq d_i$ for all $i$ and $vw$, we have:
$$\Phi(f + \sigma \tilde f) \leq \sum_{i=1}^{k} \sum_{vw \in E} \exp\left(\alpha \frac{Ex_i(f,vw) - \sigma \tilde f_i(vw)}{d_i}\right) + \exp\left(\alpha \frac{Ex_i(f,wv) + \sigma \tilde f_i(vw)}{d_i}\right) \qquad (3)$$
$$\leq \Phi(f) - \sum_{i=1}^{k} \sum_{vw \in E} \frac{\sigma \alpha \tilde f_i(vw)}{d_i} \left[\exp\left(\alpha \frac{Ex_i(f,vw)}{d_i}\right) - \exp\left(\alpha \frac{Ex_i(f,wv)}{d_i}\right)\right] \qquad (4)$$
$$\;+\; \sum_{i=1}^{k} \sum_{vw \in E} \sigma^2 \alpha^2 \left(\frac{\tilde f_i(vw)}{d_i}\right)^2 \left[\exp\left(\alpha \frac{Ex_i(f,vw)}{d_i}\right) + \exp\left(\alpha \frac{Ex_i(f,wv)}{d_i}\right)\right]. \qquad (5)$$

We will evaluate each one of the terms in the above expression separately. To evaluate (4), we decompose $\tilde f$ into cycles and into paths from positive to negative excesses. Notice that the cycles do not affect the value of (4). Moreover, each flow path of value $F$ that starts with edge $vv'$ and ends with edge $u'u$ contributes
$$-\sigma \alpha \frac{F}{d_i} \left[\exp\left(\alpha \frac{Ex_i(f,vv')}{d_i}\right) - \exp\left(\alpha \frac{Ex_i(f,uu')}{d_i}\right)\right].$$
Hence, (4) is equal to
$$-\sigma \sum_i \sum_{vw \in E} \frac{\alpha\, Ex_i(f,vw)}{d_i} \exp\left(\alpha \frac{Ex_i(f,vw)}{d_i}\right) \;\leq\; -\sigma\, \Phi(f) \log\frac{\Phi(f)}{mk},$$
where the inequality follows from the fact that the sum has $mk$ terms and $\sum y_i \log y_i$ is convex. To evaluate expression (5), we use the fact that $|\tilde f_i(vw)| \leq 2 d_i$, and thus
$$\sum_i \sum_{vw \in E} \sigma^2 \alpha^2 \left(\frac{\tilde f_i(vw)}{d_i}\right)^2 \exp\left(\alpha \frac{Ex_i(f,vw)}{d_i}\right) \leq 4 \sigma^2 \alpha^2 \Phi(f).$$

Combining our bounds on (4) and (5), we get from (3) that for $\sigma \leq \alpha^{-1}$:
$$\Phi(f + \sigma \tilde f) \leq \Phi(f) - \sigma \Phi(f) \log\frac{\Phi(f)}{mk} + 4 \sigma^2 \alpha^2 \Phi(f).$$
Thus, taking
$$\sigma = \frac{\log(\Phi(f)/(km))}{8 \alpha^2},$$
we get
$$\Phi(f + \sigma \tilde f) \leq \Phi(f)\left[1 - \frac{\log^2(\Phi(f)/(mk))}{16 \alpha^2}\right].$$
This completes the proof of the theorem.

Remark: Note that the way we proved the decrease in the potential function was to express a lower bound on this decrease as a sum of quadratic functions in $\Delta f$, and then to show that there exists a flow that achieves a large decrease. We will use this fact in Section 5, where we present an efficient implementation of each iteration.

Theorem 3.3 After $O(m^2 \epsilon^{-2} \log(mk))$ iterations, we have $\sum_v |Ex_i(f,v)| \leq \epsilon d_i$ for each commodity $i$.

Proof: Theorem 3.2 implies that as long as $\Phi(f) \geq 2m^2k^2$, the value of the potential function is halved in $O(\alpha^2 / \log^2 \Phi(f))$ iterations. Thus, the number of iterations until the potential function reaches $2m^2k^2$ is bounded by $O(\alpha^2 / \log(2m^2k^2)) = O(m^2 \epsilon^{-2} \log(mk))$. The claim follows by substituting the value of $\alpha$ into the claim of Theorem 3.1.

4 Flow With Costs

In this section we show how to modify the algorithm described in Section 3 to approximately solve the minimum-cost multicommodity flow problem. More precisely, we consider the problem where every edge $vw$ has an associated cost $c(vw)$, and the goal is to find a flow $f$ that satisfies at least a $(1-\epsilon)$ fraction of each demand and whose cost satisfies $\sum_{i,vw} |f_i(vw)|\, c(vw) \leq (1+\delta)B$, where $B$ is a given budget and $\delta > 0$ is a given parameter; we refer to this as the MCFB problem (minimum-cost flow with budget).

In this section we will assume that $B$ is given and that there exists a feasible flow of cost at most $B$ that satisfies all the demands. Note that by doing bisection search, one can easily extend our approximation algorithm to the case where $B$ is unknown in advance. Details will appear in the full paper.

In order to take the costs into account, we will use the following potential function, which is a modification of the function used in the previous section:
$$\Psi(f) \stackrel{\rm def}{=} \sum_{vw \in E} \sum_{i=1}^{k} \left[\exp\left(\alpha \frac{Ex_i(f,vw)}{d_i}\right) + \exp\left(\alpha \frac{Ex_i(f,wv)}{d_i}\right)\right] + \frac{\gamma}{(1+\delta)B} \sum_i \sum_{vw \in E} c(vw)\, |f_i(vw)|,$$
where $\gamma = m^2 k^2 e^{4/\delta}/\delta$ and $\alpha = \frac{m}{\epsilon} \log\gamma$. To simplify notation, we will use $\Psi_1$ to denote the first term and $\Psi_2$ to denote the second. Also, we will write $\Psi$ instead of $\Psi(f)$ where it will not cause confusion.

Observe that we can express $\Psi(f)$ as a sum of terms such that each term depends only on the flow on a single edge and is independent of the flows on the other edges. Thus, the distributed algorithm presented in Section 3 can be based on $\Psi(f)$ instead of the potential function given by (1). In other words, the only difference is the minimization step, which updates the flow on each edge in order to minimize the corresponding term in $\Psi(f)$ instead of the corresponding term in (1).

As before, it is easy to see that the algorithm ensures that $\Psi$ does not increase. Roughly speaking, our analysis consists of two parts. First we show that a sufficiently small value of $\Psi(f)$ implies that we have a good approximation to the MCFB problem. Then we prove that $\Psi(f)$ decreases sufficiently fast.

Theorem 4.1 If a preflow $f$ obeys the capacity constraints and $\Psi(f) \leq (1+\delta)\gamma$, then the cost of $f$ is at most $(1+3\delta)B$ and the total excess of commodity $i$ is bounded by $\epsilon d_i$.

Proof: Omitted.

Observe that, using the standard technique of decomposing a preflow into paths from sources to sinks, paths from positive to negative excesses, and cycles, a preflow that satisfies the condition of the above theorem can be easily transformed into a flow of cost at most $(1+3\delta)B$ that satisfies a $(1-\epsilon)$ fraction of each demand. This transformation is done differently in the sequential and distributed settings, and we defer the details to the full paper.

Theorem 4.2 The algorithm that minimizes the potential function $\Psi(f)$ solves the MCFB problem in $O(m^2 \epsilon^{-2} \delta^{-2} \log^2(mk\delta^{-1}) \log(\gamma mCU))$ iterations, where $C$ is the maximum cost and $U$ is the maximum capacity associated with any edge in the graph.

Proof: Let $f^*$ denote the optimum flow of cost at most $B$ that satisfies all the demands.
Like in the proof of Theorem 3.2, we will use the fact that the decrease in the total potential function at each iteration is at least as large as the decrease obtained by setting the new flow to be $(1-\sigma)f + \sigma f^*$. Let $\Delta f = \sigma(f^* - f)$. Then, for $\sigma \leq 1/\alpha$, the decrease in the total potential function is at least:
$$\Psi(f) - \Psi(f + \Delta f) \geq \sigma \Psi_1(f) \log\frac{\Psi_1}{mk} - 4\sigma^2\alpha^2 \Psi_1(f) + \sigma\left(\Psi_2(f) - \frac{\gamma}{1+\delta}\right). \qquad (6)$$

We will distinguish three cases, according to the relative values of $\Psi_1$ and $\Psi_2$. First consider the case where $\Psi_1 \geq \gamma/2$ and $\Psi_2 \leq \Psi_1$. Using the fact that $\log\frac{\Psi_1}{mk} \geq 4/\delta$, we get that for
$$\sigma = \frac{\log(\Psi_1/(mk))}{8\alpha^2}$$
the decrease is at least
$$\frac{\log^2(\Psi_1/(mk))}{32\alpha^2}\, \Psi.$$
The number of iterations of this type is at most $O(\alpha^2/\log(mk))$.

The second type of iteration occurs when $\Psi_1 \geq \gamma/2$ and $\Psi_2 \geq \Psi_1$. Using the same value of $\sigma$ as above, the decrease per iteration can be bounded in this case by $\Omega(\Psi/\alpha^2)$. Since in this case $\gamma \leq \Psi \leq O(\gamma mCU)$, the number of such iterations is bounded by $O(\alpha^2 \log(mCU))$.

The third type of iteration occurs when $\Psi_1 \leq \gamma/2$. We will bound the number of such iterations for $\Psi \in [(1+\zeta)\gamma, (1+2\zeta)\gamma]$, for $\zeta \geq \delta$. When $\Psi$ is in this range, $\Psi_2 \geq (1+\zeta/2)\gamma$. Observe that the first term in expression (6) is always nonnegative. For $\sigma \leq 1$, since $\Psi_2 - \gamma/(1+\delta) \geq \zeta\Psi_2/3$, by choosing
$$\sigma = \frac{\zeta}{8\alpha^2}$$
we see that the decrease in the potential function is at least $\Omega(\zeta^2\gamma/\alpha^2)$. Hence, the number of iterations of this type for $\Psi \in [(1+\zeta)\gamma, (1+2\zeta)\gamma]$ is bounded by $O(\alpha^2/\zeta)$. Thus, the total number of such iterations for the range $\Psi \in [(1+\delta)\gamma, 2\gamma]$ is bounded by $O(\alpha^2/\delta)$. For the case where $\Psi_1 \leq \gamma/2$ and $\Psi \geq 2\gamma$, the choice $\sigma = 1/(8\alpha^2)$ implies that the decrease is at least $\Omega(\Psi/\alpha^2)$, and hence the total number of such iterations is bounded by $O(\alpha^2 \log(mCU))$.

Observe that if we are not in any of the types of iterations addressed above, then $\Psi \leq (1+\delta)\gamma$. Since $\Psi$ monotonically decreases, the claim of the theorem follows by summing up the bounds on the number of iterations of each type and applying Theorem 4.1.

5 Implementation of a single iteration

The previous sections described the algorithms in terms of iterations, where the main part of each iteration is the minimization of a set of independent functions of the flow, one for each edge. In this section we address the question of how to perform this minimization efficiently. The main idea is to minimize a quadratic approximation to the potential function instead of the function itself.
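As a small numeric sanity check of this idea (hypothetical values; `a` plays the role of $\alpha/d_i$ for one commodity on one edge), minimizing the quadratic upper bound derived from $e^{x+t} \leq e^x(1 + t + t^2)$ produces a more conservative step than exact minimization of the exponential term, but still strictly decreases it:

```python
import math

def exact_term(delta, e1, e2, a):
    """One edge, one commodity term of the potential; a stands for alpha/d_i."""
    return math.exp(a * (e1 - delta)) + math.exp(a * (e2 + delta))

def quad_step(e1, e2, a):
    """Flow update minimizing the quadratic upper bound obtained from
    exp(x + t) <= exp(x) * (1 + t + t^2), which is valid for |t| <= 1.
    Setting the derivative of the bound to zero gives this closed form."""
    g1, g2 = math.exp(a * e1), math.exp(a * e2)
    return (g1 - g2) / (2.0 * a * (g1 + g2))

# Hypothetical excesses and scale.
e1, e2, a = 1.0, -0.5, 0.3
step = quad_step(e1, e2, a)
assert abs(a * step) <= 1.0                                      # inside the bound's validity range
assert exact_term(step, e1, e2, a) < exact_term(0.0, e1, e2, a)  # true term still decreases
assert 0.0 < step < (e1 - e2) / 2.0                              # smaller than the exact equalizing step
```

For small $\alpha$ the surrogate step is roughly half of the exact equalizing step $(E_1 - E_2)/2$; this conservatism is the price of using a guaranteed upper bound, and it is what makes the simpler per-edge optimization safe.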
The proofs of Theorems 3.2 and 4.2 rely on the fact that the decrease in $\Phi$, as a function of the flow update $\Delta f$, can be bounded by a sum of independent quadratic functions in $\Delta f$ (one per edge), as long as the following constraints are satisfied for every commodity $i$

and edge $vw$:
$$|\Delta f_i(vw)| \leq d_i/\alpha, \qquad |f_i(vw) + \Delta f_i(vw)| \leq d_i, \qquad \sum_i |f_i(vw) + \Delta f_i(vw)| \leq u(vw). \qquad (7)$$

Moreover, these proofs show that there exists a $\Delta f$ that satisfies the above constraints and for which the value of the sum of these quadratic functions is sufficiently large. Hence, instead of minimizing the potential function itself, we can achieve at least as large a decrease by maximizing these quadratic functions over the region defined by (7).

The advantage of working with the quadratic approximation is that it is much easier to optimize than the original potential function, which includes exponentials. Observe that the quadratic functions that we need to maximize are concave. The following theorem shows that the optimization can be done in $O(k \log k)$ time per edge.

Theorem 5.1 Maximizing $\sum_{i=1}^{k} a_i x_i + b_i x_i^2$, where $b_i \leq 0$, subject to the constraints $\sum_{i=1}^{k} |x_i + y_i| \leq u$ and $e_i \leq x_i \leq f_i$, can be done in $O(k \log k)$ time.

Proof: Omitted.

References

[1] Y. Afek, B. Awerbuch, and E. Gafni. Applying static network protocols to dynamic networks. In Proc. 28th IEEE Annual Symposium on Foundations of Computer Science, pages 358-370, 1987.

[2] B. Awerbuch and T. Leighton. A simple local-control approximation algorithm for multicommodity flow. In Proc. 34th IEEE Annual Symposium on Foundations of Computer Science, pages 459-468, 1993.

[3] T. C. Hu. Multi-commodity network flows. J. ORSA, 11:344-360, 1963.

[4] S. Kapoor and P. M. Vaidya. Fast algorithms for convex quadratic programming and multicommodity flows. In Proc. 18th Annual ACM Symposium on Theory of Computing, pages 147-159, 1986.

[5] P. Klein, S. Plotkin, C. Stein, and É. Tardos. Faster approximation algorithms for the unit capacity concurrent flow problem with applications to routing and finding sparse cuts. SIAM Journal on Computing, 1992. Accepted for publication.

[6] T. Leighton, F. Makedon, S. Plotkin, C. Stein, É. Tardos, and S. Tragoudas. Fast approximation algorithms for the multicommodity flow problem. In Proc. 23rd ACM Symposium on the Theory of Computing, pages 101-111, May 1991.

[7] T. Leighton and S. Rao. An approximate max-flow min-cut theorem for uniform multicommodity flow problems with applications to approximation algorithms. In Proc. 29th IEEE Annual Symposium on Foundations of Computer Science, pages 422-431, 1988.

[8] S. A. Plotkin, D. Shmoys, and É. Tardos. Fast approximation algorithms for fractional packing and covering. In Proc. 32nd IEEE Annual Symposium on Foundations of Computer Science, 1991.

[9] F. Shahrokhi and D. W. Matula. The maximum concurrent flow problem. Technical Report CSR-183, Department of Computer Science, New Mexico Tech, 1988.

[10] C. Stein. Approximation algorithms for multicommodity flow and scheduling problems. PhD thesis, MIT, 1992.

[11] P. M. Vaidya. Speeding up linear programming using fast matrix multiplication. In Proc. 30th IEEE Annual Symposium on Foundations of Computer Science, 1989.

[12] P. M. Vaidya. A new algorithm for minimizing convex functions over convex sets. In Proc. 30th IEEE Annual Symposium on Foundations of Computer Science, pages 338-343, 1989.
