77
Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard Mathematical Sciences School of Sciences Royal Melbourne Institute of Technology (RMIT) University Melbourne, Victoria, Australia AMSI Optimise Melbourne 30th June 2017.

Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

  • Upload
    others

  • View
    4

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Computing high-quality Lagrangian bounds ofthe stochastic mixed-integer programming

problem

Prof. Andrew Eberhard

Mathematical SciencesSchool of Sciences

Royal Melbourne Institute of Technology (RMIT) UniversityMelbourne, Victoria, Australia

AMSI OptimiseMelbourne

30th June 2017.

Page 2: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Team on this DP ProjectCo-authors:

I At Georgia Tech:I Prof. Natashia Boland

I At RMIT University:I Dr. Brian Dandurand (Postdoc)I Mr. Jeffrey Christiansen (Ph.D. student)I Dr. Fabricio Oliveira (Postdoc)

I At University of Wisconsin-Madison:I Prof. Jeffrey LinderothI Prof. James Luedtke

Funding:I For Boland, Christiansen, Dandurand, Eberhard, Oliveira,

and Linderoth: (ARC) Grant ARC DP140100985;I For Linderoth and Luedtke: U.S. Department of Energy,

Applied Mathematics program under contract numberDE-AC02-06CH11357.

Page 3: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Outline

Stochastic Mixed-Integer Optimization: Background

Algorithm FW-PH

Computational Experiments

Generating Primal Solutions

Page 4: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Stochastic Mixed-Integer Optimization

I Provides a framework for modeling problems wheredecisions are made in stages.

I Between stages, some uncertainty in the problemparameters is unveiled, and decisions in subsequentstages may depend on the outcome of this uncertainty.

I When some decisions are modeled using discretevariables, the problem is known as a StochasticMixed-Integer Programming (SMIP) problem.

I However, the combination of both uncertainty withdiscreteness makes this class of problems particularlychallenging from a computational perspective.

Page 5: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Stochastic Mixed-Integer Optimization

I Provides a framework for modeling problems wheredecisions are made in stages.

I Between stages, some uncertainty in the problemparameters is unveiled, and decisions in subsequentstages may depend on the outcome of this uncertainty.

I When some decisions are modeled using discretevariables, the problem is known as a StochasticMixed-Integer Programming (SMIP) problem.

I However, the combination of both uncertainty withdiscreteness makes this class of problems particularlychallenging from a computational perspective.

Page 6: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Stochastic Mixed-Integer Optimization

I Provides a framework for modeling problems wheredecisions are made in stages.

I Between stages, some uncertainty in the problemparameters is unveiled, and decisions in subsequentstages may depend on the outcome of this uncertainty.

I When some decisions are modeled using discretevariables, the problem is known as a StochasticMixed-Integer Programming (SMIP) problem.

I However, the combination of both uncertainty withdiscreteness makes this class of problems particularlychallenging from a computational perspective.

Page 7: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Stochastic Mixed-Integer Optimization

I Provides a framework for modeling problems wheredecisions are made in stages.

I Between stages, some uncertainty in the problemparameters is unveiled, and decisions in subsequentstages may depend on the outcome of this uncertainty.

I When some decisions are modeled using discretevariables, the problem is known as a StochasticMixed-Integer Programming (SMIP) problem.

I However, the combination of both uncertainty withdiscreteness makes this class of problems particularlychallenging from a computational perspective.

Page 8: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Applications: SMIP models includeI unit commitment and hydro-thermal generation scheduling

[NR00, TBL96],I military operations [SWM09],I vaccination planning [OPSR11, TSN08],I air traffic flow management [AAAEP12],I forestry management and forest fire response

[BWW+15, NAS+12], andI supply chain and logistics planning [LLM92, Lou86].

Page 9: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Two-stage SMIP FormulationI A two-stage SMIP:

ζSMIP := minx

{c>x +Q(x) : x ∈ X

}where

I vector c ∈ Rnx is knownI X ⊂ Rnx is a constraint set consisting of linear constraints

and integer restrictions on some components of xI Function Q : Rnx 7→ R outputs the expected recourse value

Q(x) := Eξ[min

y

{q(ξ)>y : W (ξ)y = h(ξ)− T (ξ)x , y ∈ Y (ξ)

}]I each realization ξs of ξ, is called a scenario and encodes

the realizations observed for each of the random elements(qs,hs,Ws,Ts,Ys) for each scenario index s ∈ S

I for each s ∈ S, the set Ys ⊂ Rny is a mixed-integer setcontaining both linear constraints and integrality constraintson a subset of the variables, ys

Page 10: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Two-stage SMIP FormulationI A two-stage SMIP:

ζSMIP := minx

{c>x +Q(x) : x ∈ X

}where

I vector c ∈ Rnx is knownI X ⊂ Rnx is a constraint set consisting of linear constraints

and integer restrictions on some components of xI Function Q : Rnx 7→ R outputs the expected recourse value

Q(x) := Eξ[min

y

{q(ξ)>y : W (ξ)y = h(ξ)− T (ξ)x , y ∈ Y (ξ)

}]I each realization ξs of ξ, is called a scenario and encodes

the realizations observed for each of the random elements(qs,hs,Ws,Ts,Ys) for each scenario index s ∈ S

I for each s ∈ S, the set Ys ⊂ Rny is a mixed-integer setcontaining both linear constraints and integrality constraintson a subset of the variables, ys

Page 11: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

When we have a finite number of senarios with probabilities{ps}s∈S this problem SMIP may be reformulated as itsdeterministic equivalent

ζSMIP = minx ,y

{c>x +

∑s∈S

psq>s ys : (x , ys) ∈ Ks,∀s ∈ S},

where Ks := {(x , ys) : Wsys = hs − Tsx , x ∈ X , ys ∈ Ys}.

We assume throughout thatI problem SMIP is feasible, andI the sets Ks, s ∈ S are bounded.I These problems grow exponentially in size with the number

of scenarios. This leads to the situation that they cannot besolved using normal methods.

I This leads to the search for methods that utilise thestructure of the problem to enable decomposition intosmaller problems that can be solved in parallel.

Page 12: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

When we have a finite number of senarios with probabilities{ps}s∈S this problem SMIP may be reformulated as itsdeterministic equivalent

ζSMIP = minx ,y

{c>x +

∑s∈S

psq>s ys : (x , ys) ∈ Ks,∀s ∈ S},

where Ks := {(x , ys) : Wsys = hs − Tsx , x ∈ X , ys ∈ Ys}.

We assume throughout thatI problem SMIP is feasible, andI the sets Ks, s ∈ S are bounded.I These problems grow exponentially in size with the number

of scenarios. This leads to the situation that they cannot besolved using normal methods.

I This leads to the search for methods that utilise thestructure of the problem to enable decomposition intosmaller problems that can be solved in parallel.

Page 13: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

To induce a decomposable structure, scenario-dependentcopies xs for each s ∈ S of the first-stage variable x areintroduced to create the following split-variable reformulation ofSMIP:

ζSMIP = minx ,y ,z

∑s∈S

ps(c>xs + q>s ys) :

(xs, ys) ∈ Ks, xs = z, ∀ s ∈ S, z ∈ Rnx

.

I The constraints xs = z, s ∈ S, enforce nonanticipativity forthe first-stage decisions.

Page 14: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

To induce a decomposable structure, scenario-dependentcopies xs for each s ∈ S of the first-stage variable x areintroduced to create the following split-variable reformulation ofSMIP:

ζSMIP = minx ,y ,z

∑s∈S

ps(c>xs + q>s ys) :

(xs, ys) ∈ Ks, xs = z, ∀ s ∈ S, z ∈ Rnx

.

I The constraints xs = z, s ∈ S, enforce nonanticipativity forthe first-stage decisions.

Page 15: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Lagrangian Dual Problem

Applying Lagrangian relaxation to the nonanticipativityconstraints in the reformulated problem SMIP yields thenonanticipative Lagrangian dual function

φ(µ) := minx ,y ,z

{ ∑s∈S ps(c>xs + q>s ys)+µ>s (xs − z) :

(xs, ys) ∈ Ks, ∀s ∈ S, z ∈ Rnx

},

where µ := (µ1, . . . , µ|S|) ∈∏

s∈S Rnx is the vector of multipliersassociated with the relaxed constraints xs = z, s ∈ S.

I We assume the following dual feasibility condition:∑s∈S µ

s = 0.

Page 16: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

By setting ωs := 1psµs, the dual function φ may be rewritten as

φ(ω) := minx ,y ,z

{ ∑s∈S psLs(xs, ys, z, ωs) :

(xs, ys) ∈ Ks, ∀s ∈ S, z ∈ Rnx

},

where Ls(xs, ys, z, ωs) := c>xs + q>s ys + ω>s (xs − z).

I In order for the Lagrangian function φ(ω) to be boundedfrom below in z, the dual feasibility condition∑

s∈S psωs = 0 must be satisfied.I Under this assumption, the z term vanishes, and the

Lagrangian dual function φ(ω) decomposes into separablefunctions,

φ(ω) =∑s∈S

psφs(ωs),

where for each s ∈ S,

φs(ωs) := minx ,y

{(c + ωs)>x + q>s y : (x , y) ∈ Ks

}.

Page 17: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

By setting ωs := 1psµs, the dual function φ may be rewritten as

φ(ω) := minx ,y ,z

{ ∑s∈S psLs(xs, ys, z, ωs) :

(xs, ys) ∈ Ks, ∀s ∈ S, z ∈ Rnx

},

where Ls(xs, ys, z, ωs) := c>xs + q>s ys + ω>s (xs − z).

I In order for the Lagrangian function φ(ω) to be boundedfrom below in z, the dual feasibility condition∑

s∈S psωs = 0 must be satisfied.I Under this assumption, the z term vanishes, and the

Lagrangian dual function φ(ω) decomposes into separablefunctions,

φ(ω) =∑s∈S

psφs(ωs),

where for each s ∈ S,

φs(ωs) := minx ,y

{(c + ωs)>x + q>s y : (x , y) ∈ Ks

}.

Page 18: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

By setting ωs := 1psµs, the dual function φ may be rewritten as

φ(ω) := minx ,y ,z

{ ∑s∈S psLs(xs, ys, z, ωs) :

(xs, ys) ∈ Ks, ∀s ∈ S, z ∈ Rnx

},

where Ls(xs, ys, z, ωs) := c>xs + q>s ys + ω>s (xs − z).

I In order for the Lagrangian function φ(ω) to be boundedfrom below in z, the dual feasibility condition∑

s∈S psωs = 0 must be satisfied.I Under this assumption, the z term vanishes, and the

Lagrangian dual function φ(ω) decomposes into separablefunctions,

φ(ω) =∑s∈S

psφs(ωs),

where for each s ∈ S,

φs(ωs) := minx ,y

{(c + ωs)>x + q>s y : (x , y) ∈ Ks

}.

Page 19: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

For any choice of ω =(ω1, . . . , ω|S|

), it is well-known that the

value of the Lagrangian provides a lower bound on the optimalsolution to SMIP: φ(ω) ≤ ζSMIP .

I The ability to compute high-quality Lagrangian boundsφ(ω) ≤ ζSMIP is useful for exact, enumerative approaches,such as those in [CS99, LMPS13].

I The problem of finding the best such lower bound is theLagrangian dual problem:

ζLD := supω

{φ(ω) :

∑s∈S

psωs = 0

}.

I Computing the optimal Lagrangian bound cannot usuallybe carried out in closed form and so numerical techniquesare need.

Page 20: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

For any choice of ω =(ω1, . . . , ω|S|

), it is well-known that the

value of the Lagrangian provides a lower bound on the optimalsolution to SMIP: φ(ω) ≤ ζSMIP .

I The ability to compute high-quality Lagrangian boundsφ(ω) ≤ ζSMIP is useful for exact, enumerative approaches,such as those in [CS99, LMPS13].

I The problem of finding the best such lower bound is theLagrangian dual problem:

ζLD := supω

{φ(ω) :

∑s∈S

psωs = 0

}.

I Computing the optimal Lagrangian bound cannot usuallybe carried out in closed form and so numerical techniquesare need.

Page 21: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Outline

Stochastic Mixed-Integer Optimization: Background

Algorithm FW-PH

Computational Experiments

Generating Primal Solutions

Page 22: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Introducing Algorithm FW-PH

To this end, we contribute the method FW-PH:I integrates Frank-Wolfe (FW) like iterations and progressive

hedging (PH) iterations;I parallelizable;I theoretical support provides mild conditions under which

(dual) optimal convergence is realized.I can be easily extended to multi-stage problems.

Page 23: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

To motivate our approach, we first consider the application ofPH to the following well-known primal characterization of ζLD:

ζLD = minx ,y ,z

{ ∑s∈S ps(c>xs + q>s ys) :

(xs, ys) ∈ conv(Ks), xs = z, ∀s ∈ S

},

where conv(Ks) denotes the convex hull of Ks for each s ∈ S.(See, for example, Theorem 6.2 of [NW88].)

I We refer to the above primal characterization problem asConv-SMIP.

I The sequence of Lagrangian bounds{φ(ωk )

}generated

by the application of PH to Conv-SMIP is known to beconvergent (see Rockafellar-Wetts et al).

I Thus, the value of the Lagrangian dual, ζLD, may, in theory,be approached by applying PH to Conv-SMIP.

However, an explicit polyhedral description of conv(Ks), s ∈ S isnot available, and so this is not directly implementable.

Page 24: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

To motivate our approach, we first consider the application ofPH to the following well-known primal characterization of ζLD:

ζLD = minx ,y ,z

{ ∑s∈S ps(c>xs + q>s ys) :

(xs, ys) ∈ conv(Ks), xs = z, ∀s ∈ S

},

where conv(Ks) denotes the convex hull of Ks for each s ∈ S.(See, for example, Theorem 6.2 of [NW88].)

I We refer to the above primal characterization problem asConv-SMIP.

I The sequence of Lagrangian bounds{φ(ωk )

}generated

by the application of PH to Conv-SMIP is known to beconvergent (see Rockafellar-Wetts et al).

I Thus, the value of the Lagrangian dual, ζLD, may, in theory,be approached by applying PH to Conv-SMIP.

However, an explicit polyhedral description of conv(Ks), s ∈ S isnot available, and so this is not directly implementable.

Page 25: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

To motivate our approach, we first consider the application ofPH to the following well-known primal characterization of ζLD:

ζLD = minx ,y ,z

{ ∑s∈S ps(c>xs + q>s ys) :

(xs, ys) ∈ conv(Ks), xs = z, ∀s ∈ S

},

where conv(Ks) denotes the convex hull of Ks for each s ∈ S.(See, for example, Theorem 6.2 of [NW88].)

I We refer to the above primal characterization problem asConv-SMIP.

I The sequence of Lagrangian bounds{φ(ωk )

}generated

by the application of PH to Conv-SMIP is known to beconvergent (see Rockafellar-Wetts et al).

I Thus, the value of the Lagrangian dual, ζLD, may, in theory,be approached by applying PH to Conv-SMIP.

However, an explicit polyhedral description of conv(Ks), s ∈ S isnot available, and so this is not directly implementable.

Page 26: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Progressive Hedging (PH)In the following, we consider how to modify the PH algorithm toobtain the required implementability.

I The Augmented Lagrangian (AL) function:

Lρ((xs, ys)s∈S , z, ω) :=∑s∈S

psLρs(xs, ys, z, ωs),

where

Lρs(xs, ys, z, ωs) := c>xs + q>s ys + ω>s (xs − z) +ρ

2‖xs − z‖22

and ρ > 0 is a penalty parameter.I The augmented Lagrangian method (a.k.a, the method of

mulipliers) [Hes69, Pow69, Roc73, Ber82, EY14]: ForDs = Ks we solve with a QMIP.

I At iteration k ≥ 1, compute:1) ((xk

s , y ks )s∈S , zk ) ∈

argminx,y,z

{Lρ((xs, ys)s∈S , z, ωk ) :

(xs, ys) ∈ Ds ∀s ∈ S, z ∈ Rn

}2) ωk+1

s ← ωks + ρ(xk

s − zk ) for all s ∈ S

Page 27: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Progressive Hedging (PH)In the following, we consider how to modify the PH algorithm toobtain the required implementability.

I The Augmented Lagrangian (AL) function:

Lρ((xs, ys)s∈S , z, ω) :=∑s∈S

psLρs(xs, ys, z, ωs),

where

Lρs(xs, ys, z, ωs) := c>xs + q>s ys + ω>s (xs − z) +ρ

2‖xs − z‖22

and ρ > 0 is a penalty parameter.I The augmented Lagrangian method (a.k.a, the method of

mulipliers) [Hes69, Pow69, Roc73, Ber82, EY14]: ForDs = Ks we solve with a QMIP.

I At iteration k ≥ 1, compute:1) ((xk

s , y ks )s∈S , zk ) ∈

argminx,y,z

{Lρ((xs, ys)s∈S , z, ωk ) :

(xs, ys) ∈ Ds ∀s ∈ S, z ∈ Rn

}2) ωk+1

s ← ωks + ρ(xk

s − zk ) for all s ∈ S

Page 28: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Progressive Hedging (PH)In the following, we consider how to modify the PH algorithm toobtain the required implementability.

I The Augmented Lagrangian (AL) function:

Lρ((xs, ys)s∈S , z, ω) :=∑s∈S

psLρs(xs, ys, z, ωs),

where

Lρs(xs, ys, z, ωs) := c>xs + q>s ys + ω>s (xs − z) +ρ

2‖xs − z‖22

and ρ > 0 is a penalty parameter.I The augmented Lagrangian method (a.k.a, the method of

mulipliers) [Hes69, Pow69, Roc73, Ber82, EY14]: ForDs = Ks we solve with a QMIP.

I At iteration k ≥ 1, compute:1) ((xk

s , y ks )s∈S , zk ) ∈

argminx,y,z

{Lρ((xs, ys)s∈S , z, ωk ) :

(xs, ys) ∈ Ds ∀s ∈ S, z ∈ Rn

}2) ωk+1

s ← ωks + ρ(xk

s − zk ) for all s ∈ S

Page 29: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Progressive Hedging (PH)In the following, we consider how to modify the PH algorithm toobtain the required implementability.

I The Augmented Lagrangian (AL) function:

Lρ((xs, ys)s∈S , z, ω) :=∑s∈S

psLρs(xs, ys, z, ωs),

where

Lρs(xs, ys, z, ωs) := c>xs + q>s ys + ω>s (xs − z) +ρ

2‖xs − z‖22

and ρ > 0 is a penalty parameter.I The augmented Lagrangian method (a.k.a, the method of

mulipliers) [Hes69, Pow69, Roc73, Ber82, EY14]: ForDs = Ks we solve with a QMIP.

I At iteration k ≥ 1, compute:1) ((xk

s , y ks )s∈S , zk ) ∈

argminx,y,z

{Lρ((xs, ys)s∈S , z, ωk ) :

(xs, ys) ∈ Ds ∀s ∈ S, z ∈ Rn

}2) ωk+1

s ← ωks + ρ(xk

s − zk ) for all s ∈ S

Page 30: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Progressive Hedging (PH)In the following, we consider how to modify the PH algorithm toobtain the required implementability.

I The Augmented Lagrangian (AL) function:

Lρ(x , y , z, ω) :=∑s∈S

psLρs(xs, ys, z, ωs),

where

Lρs(xs, ys, z, ωs) := c>xs + q>s ys + ω>s (xs − z) +ρ

2‖xs − z‖22

and ρ > 0 is a penalty parameter.I The k th iteration, k ≥ 1, of progressive hedging (PH):

1) For all s ∈ S, take(xk

s , yks ) ∈ argminx,y

{Lρs (x , y , zk−1, ωk

s ) : (x , y) ∈ Ds}

2) Compute zk ←∑

s∈S psxks

3) Compute ωk+1s ← ωk

s + ρ(xks − zk ) for all s ∈ S

I PH is a specialization of the alternating direction method ofmultipliers (ADMM) [GM76, EB92, BPC+11] to thedecomposed SMIP.

Page 31: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Progressive Hedging (PH)In the following, we consider how to modify the PH algorithm toobtain the required implementability.

I The Augmented Lagrangian (AL) function:

Lρ(x , y , z, ω) :=∑s∈S

psLρs(xs, ys, z, ωs),

where

Lρs(xs, ys, z, ωs) := c>xs + q>s ys + ω>s (xs − z) +ρ

2‖xs − z‖22

and ρ > 0 is a penalty parameter.I The k th iteration, k ≥ 1, of progressive hedging (PH):

1) For all s ∈ S, take(xk

s , yks ) ∈ argminx,y

{Lρs (x , y , zk−1, ωk

s ) : (x , y) ∈ Ds}

2) Compute zk ←∑

s∈S psxks

3) Compute ωk+1s ← ωk

s + ρ(xks − zk ) for all s ∈ S

I PH is a specialization of the alternating direction method ofmultipliers (ADMM) [GM76, EB92, BPC+11] to thedecomposed SMIP.

Page 32: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Progressive Hedging (PH)In the following, we consider how to modify the PH algorithm toobtain the required implementability.

I The Augmented Lagrangian (AL) function:

Lρ(x , y , z, ω) :=∑s∈S

psLρs(xs, ys, z, ωs),

where

Lρs(xs, ys, z, ωs) := c>xs + q>s ys + ω>s (xs − z) +ρ

2‖xs − z‖22

and ρ > 0 is a penalty parameter.I The k th iteration, k ≥ 1, of progressive hedging (PH):

1) For all s ∈ S, take(xk

s , yks ) ∈ argminx,y

{Lρs (x , y , zk−1, ωk

s ) : (x , y) ∈ Ds}

2) Compute zk ←∑

s∈S psxks

3) Compute ωk+1s ← ωk

s + ρ(xks − zk ) for all s ∈ S

I PH is a specialization of the alternating direction method ofmultipliers (ADMM) [GM76, EB92, BPC+11] to thedecomposed SMIP.

Page 33: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

PropositionAssume that problem Conv-SMIP is feasible with conv(Ks) boundedfor each s ∈ S, and let Algorithm PH be applied to problemConv-SMIP (so that Ds = conv(Ks) for each s ∈ S) with toleranceε = 0 for each k ≥ 1. Then, the limit limk→∞ ωk = ω∗ exists, andfurthermore,

1. limk→∞∑

s∈S ps(c>xks + q>s yk

s ) = ζLD,

2. limk→∞ φ(ωk ) = ζLD,

3. limk→∞(xks − zk ) = 0 for each s ∈ S,

and each limit point (((x∗s , y∗s )s∈S , z∗) is an optimal solution forConv-SMIP.

Proof.Since the constraint sets Ds = conv(Ks), s ∈ S, are bounded andpolyhedral, and the objective is linear (and thus convex), problemConv-SMIP has a saddle point ((x∗s , y∗s )s∈S , z∗, ω∗) and the classical(continuous convex) convergence analysis applied.

Page 34: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

PropositionAssume that problem Conv-SMIP is feasible with conv(Ks) boundedfor each s ∈ S, and let Algorithm PH be applied to problemConv-SMIP (so that Ds = conv(Ks) for each s ∈ S) with toleranceε = 0 for each k ≥ 1. Then, the limit limk→∞ ωk = ω∗ exists, andfurthermore,

1. limk→∞∑

s∈S ps(c>xks + q>s yk

s ) = ζLD,

2. limk→∞ φ(ωk ) = ζLD,

3. limk→∞(xks − zk ) = 0 for each s ∈ S,

and each limit point (((x∗s , y∗s )s∈S , z∗) is an optimal solution forConv-SMIP.

Proof.Since the constraint sets Ds = conv(Ks), s ∈ S, are bounded andpolyhedral, and the objective is linear (and thus convex), problemConv-SMIP has a saddle point ((x∗s , y∗s )s∈S , z∗, ω∗) and the classical(continuous convex) convergence analysis applied.

Page 35: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Remark:

PH can be applied to the original split-variable SMIP in practicewhere one solves MIP when the structure of conv Ks is notknown.

I Although there is not guarantee of optimal convergence intheory or practice, reasonable (but very slow) apparentconvergence can nevertheless be observed [WW11] forsmall values of penalty parameters ρ.

I As a means to generate (lower) Lagrangian bounds, PHcan also be applied directly to the original split-variableSMIP [GHR+15] to obtain bounds using a QMIP solve.

I However, optimal dual convergence is not realized, andimprovement in PH as a tool to generate strong Lagrangianbounds is needed.

Page 36: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Remark:

PH can be applied to the original split-variable SMIP in practicewhere one solves MIP when the structure of conv Ks is notknown.

I Although there is not guarantee of optimal convergence intheory or practice, reasonable (but very slow) apparentconvergence can nevertheless be observed [WW11] forsmall values of penalty parameters ρ.

I As a means to generate (lower) Lagrangian bounds, PHcan also be applied directly to the original split-variableSMIP [GHR+15] to obtain bounds using a QMIP solve.

I However, optimal dual convergence is not realized, andimprovement in PH as a tool to generate strong Lagrangianbounds is needed.

Page 37: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

The Frank-Wolfe (FW) method and the simplicialdecomposition method (SDM)

To use Algorithm PH to solve Conv-SMIP requires a method forsolving the subproblem

(xks , y

ks ) ∈ argmin

x,y

{Lρs (x , y , zk−1, ωk

s ) : (x , y) ∈ conv(Ks)}.

I We apply an iterative approach similar to the well-knownFrank-Wolfe method [FW56], known as the simplicialdecomposition method (SDM) [Hol74, VH77].

I The following summarizes iteration t of the SDM:

for s ∈ S do1) Compute:

(xs, ys) ∈ argminx,y

∇(x,y)Lρs (x t−1, y t−1, zk−1, ωk

s )

[xy

]:

(x, y) ∈ V(conv(Ks))

2) Construct: V t

s ← V t−1s ∪

{(xs, ys)

}3) Compute:(x t , y t ) ∈ argminx,y

{Lρs (x, y, zk−1, ωk

s ) : (x, y) ∈ conv(V ts)}

end for

Page 38: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

The Frank-Wolfe (FW) method and the simplicialdecomposition method (SDM)

To use Algorithm PH to solve Conv-SMIP requires a method forsolving the subproblem

(xks , y

ks ) ∈ argmin

x,y

{Lρs (x , y , zk−1, ωk

s ) : (x , y) ∈ conv(Ks)}.

I We apply an iterative approach similar to the well-knownFrank-Wolfe method [FW56], known as the simplicialdecomposition method (SDM) [Hol74, VH77].

I The following summarizes iteration t of the SDM:

for s ∈ S do1) Compute:

(xs, ys) ∈ argminx,y

∇(x,y)Lρs (x t−1, y t−1, zk−1, ωk

s )

[xy

]:

(x, y) ∈ V(conv(Ks))

2) Construct: V t

s ← V t−1s ∪

{(xs, ys)

}3) Compute:(x t , y t ) ∈ argminx,y

{Lρs (x, y, zk−1, ωk

s ) : (x, y) ∈ conv(V ts)}

end for

Page 39: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

The Frank-Wolfe (FW) method and the simplicialdecomposition method (SDM)

To use Algorithm PH to solve Conv-SMIP requires a method forsolving the subproblem

(xks , y

ks ) ∈ argmin

x,y

{Lρs (x , y , zk−1, ωk

s ) : (x , y) ∈ conv(Ks)}.

I We apply an iterative approach similar to the well-knownFrank-Wolfe method [FW56], known as the simplicialdecomposition method (SDM) [Hol74, VH77].

I The following summarizes iteration t of the SDM:

for s ∈ S do1) Compute:

(xs, ys) ∈ argminx,y

∇(x,y)Lρs (x t−1, y t−1, zk−1, ωk

s )

[xy

]:

(x, y) ∈ V(conv(Ks))

2) Construct: V t

s ← V t−1s ∪

{(xs, ys)

}3) Compute:(x t , y t ) ∈ argminx,y

{Lρs (x, y, zk−1, ωk

s ) : (x, y) ∈ conv(V ts)}

end for

Page 40: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Precondition: V 0s ⊂ conv(Ks), z ∈ argminz

{∑s∈S ps

∥∥x0s − z

∥∥22

}function SDM(V 0

s , x0s , ωs, z, tmax , τ )

for t = 1, . . . , tmax doωt

s ← ωs + ρ(x t−1s − z)

(xs, ys) ∈ argminx,y

{(c + ωt

s)>x + q>s y :(x , y) ∈ V(Ks)

}if t = 1 then

φs ← (c + ωts)>xs + q>s ys

end ifΓt ← −[(c + ωt

s)>(xs − x t−1s ) + q>s (ys − y t−1

s )]V t

s ← V t−1s ∪

{(xs, ys)

}(x t

s, y ts) ∈ argminx,y

{Lρs (x , y , z, ωs) : (x , y) ∈ conv(V t

s)}

if Γt ≤ τ thenreturn

(x t

s, y ts,V t

s , φs)

end ifend forreturn

(x tmax

s , y tmaxs ,V tmax

s , φs)

end function

Page 41: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Precondition:∑

s∈S psω0s = 0, ρ > 0

function FW-PH((V 0s )s∈S , ω0, ρ, α, ε, kmax , tmax , {τk})

z0 ←∑

s∈S psx0s

ω1s ← ω0

s + ρ(x0s − z0), for s ∈ S

for k = 1, . . . , kmax dofor s ∈ S do

xs ← (1− α)zk−1 + αxk−1s (average the concesus and current primal value for s ∈ S)

[xks , yk

s , V ks , φ

ks ]← SDM(V k−1

s , xs, ωks , zk−1, tmax , τk ) (the FW - SDM step)

end forφk ←

∑s∈S psφ

ks (the lower bound on the dual value)

zk ←∑

s∈S psxks (the G-S step - minimises the dispersion on z)

if∑

s∈S ps

∥∥∥xks − zk−1

∥∥∥2

2< ε then

return ((xks , yk

s )s∈S , zk , ωk , φk )end ifωk+1

s ← ωks + ρ(xk

s − zk ), for s ∈ Send forreturn

((xkmax

s , ykmaxs )s∈S , zkmax ), ωkmax , φkmax

)end function

Optimal convergence limk→∞ φk = φ∗ can be established when

I the augmented Lagrangian Lρs is modified to be strongly convex (add a quadratic proximal term in y );

I the sequence {τk} of SDM convergence tolerances is generated a priori so that∑∞

k=1√τk <∞.

It can be shown that these above two conditions imply the satisfaction of assumptions used in the convergenceanalysis of inexact ADMM in Theorem 8 of [EB92].

I However, requiring the SDM termination condition to be satisfied for each such τk ≥ 0 is not needed inpractice (and is undesirable); nor will it be required in theory, as we shall soon see.

Page 42: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Precondition:∑

s∈S psω0s = 0, ρ > 0

function FW-PH((V 0s )s∈S , ω0, ρ, α, ε, kmax , tmax , {τk})

z0 ←∑

s∈S psx0s

ω1s ← ω0

s + ρ(x0s − z0), for s ∈ S

for k = 1, . . . , kmax dofor s ∈ S do

xs ← (1− α)zk−1 + αxk−1s (average the concesus and current primal value for s ∈ S)

[xks , yk

s , V ks , φ

ks ]← SDM(V k−1

s , xs, ωks , zk−1, tmax , τk ) (the FW - SDM step)

end forφk ←

∑s∈S psφ

ks (the lower bound on the dual value)

zk ←∑

s∈S psxks (the G-S step - minimises the dispersion on z)

if∑

s∈S ps

∥∥∥xks − zk−1

∥∥∥2

2< ε then

return ((xks , yk

s )s∈S , zk , ωk , φk )end ifωk+1

s ← ωks + ρ(xk

s − zk ), for s ∈ Send forreturn

((xkmax

s , ykmaxs )s∈S , zkmax ), ωkmax , φkmax

)end function

Optimal convergence limk→∞ φk = φ∗ can be established when

I the augmented Lagrangian Lρs is modified to be strongly convex (add a quadratic proximal term in y );

I the sequence {τk} of SDM convergence tolerances is generated a priori so that∑∞

k=1√τk <∞.

It can be shown that these above two conditions imply the satisfaction of assumptions used in the convergenceanalysis of inexact ADMM in Theorem 8 of [EB92].

I However, requiring the SDM termination condition to be satisfied for each such τk ≥ 0 is not needed inpractice (and is undesirable); nor will it be required in theory, as we shall soon see.

Page 43: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Precondition:∑

s∈S psω0s = 0, ρ > 0

function FW-PH((V 0s )s∈S , ω0, ρ, α, ε, kmax , tmax , {τk})

z0 ←∑

s∈S psx0s

ω1s ← ω0

s + ρ(x0s − z0), for s ∈ S

for k = 1, . . . , kmax dofor s ∈ S do

xs ← (1− α)zk−1 + αxk−1s (average the concesus and current primal value for s ∈ S)

[xks , yk

s , V ks , φ

ks ]← SDM(V k−1

s , xs, ωks , zk−1, tmax , τk ) (the FW - SDM step)

end forφk ←

∑s∈S psφ

ks (the lower bound on the dual value)

zk ←∑

s∈S psxks (the G-S step - minimises the dispersion on z)

if∑

s∈S ps

∥∥∥xks − zk−1

∥∥∥2

2< ε then

return ((xks , yk

s )s∈S , zk , ωk , φk )end ifωk+1

s ← ωks + ρ(xk

s − zk ), for s ∈ Send forreturn

((xkmax

s , ykmaxs )s∈S , zkmax ), ωkmax , φkmax

)end function

Optimal convergence limk→∞ φk = φ∗ can be established when

I the augmented Lagrangian Lρs is modified to be strongly convex (add a quadratic proximal term in y );

I the sequence {τk} of SDM convergence tolerances is generated a priori so that∑∞

k=1√τk <∞.

It can be shown that these above two conditions imply the satisfaction of assumptions used in the convergenceanalysis of inexact ADMM in Theorem 8 of [EB92].

I However, requiring the SDM termination condition to be satisfied for each such τk ≥ 0 is not needed inpractice (and is undesirable); nor will it be required in theory, as we shall soon see.

Page 44: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

LemmaAt each iteration, k, of Algorithm FW-PH, the value,φk =

∑s∈S psφ

ks , is the value of the Lagrangian relaxation φ(·)

evaluated at a feasible Lagrangian dual feasible point ωk , andhence provides a lower bound on ζLD.

Proof.I In iteration k , the problem solved, for each s ∈ S, at Line 5 in the first

iteration (t = 1) of Algorithm SDM, corresponds to the evaluation of theLagrangian bound φ(ωk ), where

ωks := ω1

s = ωks + ρ(xs − zk−1)

= ωks + ρ((1− α)zk−1 + αxk−1

s − zk−1)

= ωks + αρ(xk−1

s − zk−1).

I By construction, the points ((xk−1s )s∈S , zk−1) always satisfy∑

s∈S ps(xk−1s − zk−1) = 0 and

∑s∈S psω

ks = 0.

I Thus,∑

s∈S psωks = 0, so ωk is feasible for the Lagrangian dual

problem, and φ(ωk ) =∑

s∈S psφks ≤ ζLD .

Page 45: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

However we consider a convergence analysis of FW-PH that isnot based on the optimal convergence (approximate orotherwise) of SDM.

I Convergence will depend instead on the SDM expansionof the inner approximations conv(Vs) “as needed”.

Page 46: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

LemmaFor any given scenario s ∈ S, let Algorithm SDM be applied tothe iteration k ≥ 1 PH subproblem

minx ,y

{Lρs(x , y , zk−1, ωk

s ) : (x , y) ∈ conv(Ks)}

(1)

For 1 ≤ t < tmax , if (x ts, y t

s) is not optimal for (1), then

conv(V t+1s ) ⊃ conv(V t

s)

.

Page 47: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Optimal Convergence of FW-PH

PropositionLet Algorithm FW-PH with kmax =∞, ε = 0, α ∈ R, andtmax ≥ 1 be applied to the convexified separable deterministicequivalent SMIP, which is assumed to have an optimal solution.If either tmax ≥ 2 or ∩s∈SProjx (conv(V 0

s )) 6= ∅ holds, thenlimk→∞ φ

k = ζLD.Proof: (Basic ideas: See [BCD+16] for detail.)First note that for any tmax ≥ 1, the sequence of innerapproximations conv(V k

s ), s ∈ S, will stabilize, in that, for somethreshold 0 ≤ ks, we have for all k ≥ ks

conv(V ks ) =: Ds ⊆ conv(Ks). (2)

This follows due to V ks ← V k−1

s ∪{

(xs, ys)}

, where (xs, ys) is avertex of conv(Ks). Since each polyhedron conv(Ks), s ∈ S hasonly a finite number of such vertices, the stabilization (2) mustoccur at some ks <∞.

Page 48: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

The stabilizations (2), s ∈ S, are reached at some iterationk := maxs∈S

{ks}

. Noting that Ds = conv(V ks ) for k > k we

must have

(xks , y

ks ) ∈ argmin

x ,y

{Lρs(x , y , zk−1, ωk

s ) : (x , y) ∈ conv(Ks)}. (3)

Otherwise, due to Lemma 2, the call to SDM on Line 8 mustreturn V k

s ⊃ V k−1s , contradicting the stabilization (2).

Therefore, the k ≥ k iterations of Algorithm FW-PH areidentical to Algorithm PH iterations applied to Conv-SMIP, andso Proposition 1 implies that

1. limk→∞ xks − zk = 0, s ∈ S, and

2. limk→∞∑

s∈S φs(ωks + α(xk−1

s − zk−1)) =limk→∞

∑s∈S φs(ωk

s ) = limk→∞ φ(ωk ) = ζLD for all α ∈ R.In the case tmax = 1 does need some consideration of extraissues (which we omit here).

Page 49: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

The stabilizations (2), s ∈ S, are reached at some iterationk := maxs∈S

{ks}

. Noting that Ds = conv(V ks ) for k > k we

must have

(xks , y

ks ) ∈ argmin

x ,y

{Lρs(x , y , zk−1, ωk

s ) : (x , y) ∈ conv(Ks)}. (3)

Otherwise, due to Lemma 2, the call to SDM on Line 8 mustreturn V k

s ⊃ V k−1s , contradicting the stabilization (2).

Therefore, the k ≥ k iterations of Algorithm FW-PH areidentical to Algorithm PH iterations applied to Conv-SMIP, andso Proposition 1 implies that

1. limk→∞ xks − zk = 0, s ∈ S, and

2. limk→∞∑

s∈S φs(ωks + α(xk−1

s − zk−1)) =limk→∞

∑s∈S φs(ωk

s ) = limk→∞ φ(ωk ) = ζLD for all α ∈ R.In the case tmax = 1 does need some consideration of extraissues (which we omit here).

Page 50: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Outline

Stochastic Mixed-Integer Optimization: Background

Algorithm FW-PH

Computational Experiments

Generating Primal Solutions

Page 51: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Computational Experiments

I We performed computations using a C++ implementation ofAlgorithms PH (Ds = Ks, s ∈ S) and FW-PH using CPLEX12.5 [IBM] as the solver.

I Computations run on the Raijin cluster:

I high performance computing (HPC) environment;I maintained by Australia’s National Computing Infrastructure

(NCI) and supported by the Australian Government [NCI];

I In the experiments with Algorithms PH and FW-PH, we set theconvergence tolerance ε = 10−3 and the maximum number ofouter loop iterations at kmax = 200.

I For Algorithm FW-PH, we set tmax = 1.

I Also, for all experiments performed, we set ω0 = 0.

Page 52: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Computational Experiments

I We performed computations using a C++ implementation ofAlgorithms PH (Ds = Ks, s ∈ S) and FW-PH using CPLEX12.5 [IBM] as the solver.

I Computations run on the Raijin cluster:

I high performance computing (HPC) environment;I maintained by Australia’s National Computing Infrastructure

(NCI) and supported by the Australian Government [NCI];

I In the experiments with Algorithms PH and FW-PH, we set theconvergence tolerance ε = 10−3 and the maximum number ofouter loop iterations at kmax = 200.

I For Algorithm FW-PH, we set tmax = 1.

I Also, for all experiments performed, we set ω0 = 0.

Page 53: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Computational Experiments

I We performed computations using a C++ implementation ofAlgorithms PH (Ds = Ks, s ∈ S) and FW-PH using CPLEX12.5 [IBM] as the solver.

I Computations run on the Raijin cluster:

I high performance computing (HPC) environment;I maintained by Australia’s National Computing Infrastructure

(NCI) and supported by the Australian Government [NCI];

I In the experiments with Algorithms PH and FW-PH, we set theconvergence tolerance ε = 10−3 and the maximum number ofouter loop iterations at kmax = 200.

I For Algorithm FW-PH, we set tmax = 1.

I Also, for all experiments performed, we set ω0 = 0.

Page 54: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Computational Experiments

I We performed computations using a C++ implementation ofAlgorithms PH (Ds = Ks, s ∈ S) and FW-PH using CPLEX12.5 [IBM] as the solver.

I Computations run on the Raijin cluster:

I high performance computing (HPC) environment;I maintained by Australia’s National Computing Infrastructure

(NCI) and supported by the Australian Government [NCI];

I In the experiments with Algorithms PH and FW-PH, we set theconvergence tolerance ε = 10−3 and the maximum number ofouter loop iterations at kmax = 200.

I For Algorithm FW-PH, we set tmax = 1.

I Also, for all experiments performed, we set ω0 = 0.

Page 55: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

I Two sets of Algorithm FW-PH experiments correspond tovariants considering α = 1 and α = 0.

I Computations were performed on four problems:1. the CAP (capacitated facility locations) instance 101 with

the first 250 scenarios (CAP-101-250) [BDGL14],2. the DCAP (dynamic capacity allocations) instance

DCAP233_500 with 500 scenarios,3. the SSLP (server location under uncertainty) instances

SSLP5.25.50 with 50 scenarios (SSLP-5-25-50) and4. SSLP10.50.100 with 100 scenarios (SSLP-10-50-100).

I The latter three problems are described in detailin [Nta04, GTU] and accessible at [GTU].

I All computational experiments were allowed to run for amaximum of two hours in wall clock time.

Page 56: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Percentage gap # Iterations Termination

Penalty PH FW-PH PH FW-PH PH FW-PHα = 0 α = 1 α = 0 α = 1 α = 0 α = 1

20 0.08% 0.10% 0.11% 466 439 430 T T T100 0.01% 0.00% 0.00% 178 406 437 C T T500 0.07% 0.00% 0.00% 468 92 93 T C C

1000 0.15% 0.00% 0.00% 516 127 130 T C C2500 0.34% 0.00% 0.00% 469 259 274 T C C5000 0.66% 0.00% 0.00% 33 431 464 C T T7500 0.99% 0.00% 0.00% 28 18 19 C C C

15000 1.59% 0.00% 0.00% 567 28 33 T C C

Table: Result summary for CAP-101-250, with the absolutepercentage gap based on the known optimal value 733827.3

Page 57: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Percentage gap # Iterations Termination

Penalty PH FW-PH PH FW-PH PH FW-PHα = 0 α = 1 α = 0 α = 1 α = 0 α = 1

2 0.13% 0.12% 0.12% 1717 574 600 T T T5 0.22% 0.09% 0.09% 2074 589 574 T T T

10 0.23% 0.07% 0.07% 2598 592 587 T T T20 0.35% 0.07% 0.07% 1942 590 599 T T T50 1.25% 0.06% 0.06% 2718 597 533 T T T

100 1.29% 0.06% 0.06% 2772 428 438 T C C200 2.58% 0.06% 0.06% 2695 256 262 T C C500 2.58% 0.07% 0.07% 2871 244 246 T C C

Table: Result summary for DCAP-233-500, with the absolutepercentage gap based on the best known lower bound 1737.7.

Page 58: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Percentage gap # Iterations Termination

Penalty PH FW-PH PH FW-PH PH FW-PHα = 0 α = 1 α = 0 α = 1 α = 0 α = 1

1 0.30% 0.00% 0.00% 105 115 116 C C C2 0.73% 0.00% 0.00% 51 56 56 C C C5 0.91% 0.00% 0.00% 25 26 27 C C C

15 3.15% 0.00% 0.00% 12 16 17 C C C30 6.45% 0.00% 0.00% 12 18 18 C C C50 9.48% 0.00% 0.00% 18 25 26 C C C

100 9.48% 0.00% 0.00% 8 45 45 C C C

Table: Result summary for SSLP-5-25-50, with the absolutepercentage gap based on the known optimal value -121.6

Page 59: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Percentage gap # Iterations Termination

Penalty PH FW-PH PH FW-PH PH FW-PHα = 0 α = 1 α = 0 α = 1 α = 0 α = 1

1 0.57% 0.22% 0.23% 126 234 233 T T T2 0.63% 0.03% 0.03% 127 226 228 T T T5 1.00% 0.00% 0.00% 104 219 220 C T T

15 2.92% 0.00% 0.00% 33 45 118 C C C30 4.63% 0.00% 0.00% 18 21 22 C C C50 4.63% 0.00% 0.00% 11 26 27 C C C

100 4.63% 0.00% 0.00% 9 43 45 C C C

Table: Result summary for SSLP-10-50-100, with the absolutepercentage gap based on the known optimal value -354.2

Page 60: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

0 100 200 300 400 500 600 700 800 900 1000Wall clock time (seconds)

7.1

7.15

7.2

7.25

7.3

7.35

Lagr

angi

an d

ual b

ound

105 CAP-101-250

Best Known Objective Value=500-PH=500-FW-PH ( = 0)=2500-PH=2500-FW-PH ( = 0)=7500-PH=7500-FW-PH ( = 0)=15000-PH=15000-FW-PH ( = 0)

Figure: Convergence profile for CAP-101-250

Page 61: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

0 100 200 300 400 500 600 700 800 900 1000Wall clock time (seconds)

1690

1695

1700

1705

1710

1715

1720

1725

1730

1735

1740

Lagr

angi

an d

ual b

ound

DCAP-233-500

Best Known Objective Value=5-PH=5-FW-PH ( = 0)=10-PH=10-FW-PH ( = 0)=20-PH=20-FW-PH ( = 0)=50-PH=50-FW-PH ( = 0)

Figure: Convergence profile for DCAP-233-500

Page 62: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

0 10 20 30 40 50 60 70 80 90 100Wall clock time (seconds)

-135

-130

-125

-120

Lagr

angi

an d

ual b

ound

SSLP-5-25-50

Best Known Objective Value=2-PH=2-FW-PH ( = 0)=5-PH=5-FW-PH ( = 0)=15-PH=15-FW-PH ( = 0)=30-PH=30-FW-PH ( = 0)

Figure: Convergence profile for SSLP-5-25-50 (chosen penaltiesmatch those of Figure 2 of [GHR+15])

Page 63: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

0 200 400 600 800 1000 1200 1400 1600 1800 2000Wall clock time (seconds)

-385

-380

-375

-370

-365

-360

-355

-350

Lagr

angi

an d

ual b

ound

SSLP-10-50-100

Best Known Objective Value=2-PH=2-FW-PH ( = 0)=5-PH=5-FW-PH ( = 0)=15-PH=15-FW-PH ( = 0)=30-PH=30-FW-PH ( = 0)

Figure: Convergence profile for SSLP-10-50-100

Page 64: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Outline

Stochastic Mixed-Integer Optimization: Background

Algorithm FW-PH

Computational Experiments

Generating Primal Solutions

Page 65: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Generating Primal SolutionsI The FW-FW solves the problem involving the convexified

feasible region. Thus the concensus z contains fractionalvalues for integer variables. Thus we try three types ofadditional steps to extract an optimal solutuion.

I H1: Use the last xs (i.e., when termination of FW-PH isobserved) added to Vs, s ∈ S, as candidate first-stagesolutions, solve for the corresponding second stagevariables and reporting that with the best objective value.

I H2 and H3: The second and third strategies consist ofsolving the MIQPs that would have been solved in PH forthe current value of z obtained from FW-PH, which returnsintegral solutions that can be evaluated in the samemanner as before.

I In H2 we keep the penalty term unchanged while in H3 weuse for this last step the smallest ρ considered for eachproblem.

Page 66: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Generating Primal SolutionsI The FW-FW solves the problem involving the convexified

feasible region. Thus the concensus z contains fractionalvalues for integer variables. Thus we try three types ofadditional steps to extract an optimal solutuion.

I H1: Use the last xs (i.e., when termination of FW-PH isobserved) added to Vs, s ∈ S, as candidate first-stagesolutions, solve for the corresponding second stagevariables and reporting that with the best objective value.

I H2 and H3: The second and third strategies consist ofsolving the MIQPs that would have been solved in PH forthe current value of z obtained from FW-PH, which returnsintegral solutions that can be evaluated in the samemanner as before.

I In H2 we keep the penalty term unchanged while in H3 weuse for this last step the smallest ρ considered for eachproblem.

Page 67: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Generating Primal SolutionsI The FW-FW solves the problem involving the convexified

feasible region. Thus the concensus z contains fractionalvalues for integer variables. Thus we try three types ofadditional steps to extract an optimal solutuion.

I H1: Use the last xs (i.e., when termination of FW-PH isobserved) added to Vs, s ∈ S, as candidate first-stagesolutions, solve for the corresponding second stagevariables and reporting that with the best objective value.

I H2 and H3: The second and third strategies consist ofsolving the MIQPs that would have been solved in PH forthe current value of z obtained from FW-PH, which returnsintegral solutions that can be evaluated in the samemanner as before.

I In H2 we keep the penalty term unchanged while in H3 weuse for this last step the smallest ρ considered for eachproblem.

Page 68: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Generating Primal SolutionsI The FW-FW solves the problem involving the convexified

feasible region. Thus the concensus z contains fractionalvalues for integer variables. Thus we try three types ofadditional steps to extract an optimal solutuion.

I H1: Use the last xs (i.e., when termination of FW-PH isobserved) added to Vs, s ∈ S, as candidate first-stagesolutions, solve for the corresponding second stagevariables and reporting that with the best objective value.

I H2 and H3: The second and third strategies consist ofsolving the MIQPs that would have been solved in PH forthe current value of z obtained from FW-PH, which returnsintegral solutions that can be evaluated in the samemanner as before.

I In H2 we keep the penalty term unchanged while in H3 weuse for this last step the smallest ρ considered for eachproblem.

Page 69: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

ρ PH FW-PHH1 H2 H3

20 0.14% 0.15% 0.15% 0.15%100 0.01% 0.00% 0.00% 0.00%500 0.07% 0.00% 0.00% 0.00%

1000 0.15% 0.00% 0.00% 0.00%2500 0.34% 0.00% 0.00% 0.00%5000 0.65% 0.00% 0.00% 0.00%7500 0.98% 0.00% 0.00% 0.00%

15000 1.66% 0.00% 0.00% 0.00%

Table: CAP-101-250

Page 70: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

ρ PH FW-PHH1 H2 H3

2 0.64% 0.71% 0.31% 0.31%5 0.63% 0.29% 0.59% 0.59%

10 0.64% 0.28% 0.49% 0.49%20 0.51% 0.11% 0.48% 0.48%50 1.23% 0.26% 0.48% 0.26%100 1.68% 0.47% 0.48% 0.46%200 2.92% 2.82% 0.48% 0.47%500 2.51% 0.19% 0.48% 0.08%

Table: DCAP-233-500

Page 71: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

ρ PH FW-PHH1 H2 H3

1 0.31% 0.00% 0.00% 0.00%2 0.73% 0.00% 0.00% 0.00%5 0.92% 0.00% 0.00% 0.00%15 3.25% 0.00% 0.00% 0.00%30 6.90% 0.00% 0.00% 0.00%50 10.48% 0.00% 0.00% 0.00%

100 12.91% 0.00% 0.00% 0.00%

Table: SSLP-5-25-50

Page 72: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

ρ PH FW-PHH1 H2 H3

1 0.59% 0.25% 0.25% 0.25%2 0.57% 0.03% 0.03% 0.03%5 1.01% 0.00% 0.00% 0.00%

15 3.01% 0.00% 0.00% 0.00%30 4.86% 0.00% 0.00% 0.00%50 4.86% 0.00% 0.00% 0.00%100 4.86% 0.00% 0.00% 0.00%

Table: SSLP-10-50-100

Page 73: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Thank you!

Page 74: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

A. Agustín, A. Alonso-Ayuso, L. F. Escudero, and C. Pizarro.On air traffic flow management with rerouting. Part II: Stochastic case.European Journal of Operational Research, 219(1):167–177, 2012.

N.L. Boland, J. Christiansen, B. Dandurand, A. Eberhard, J. Linderoth, J. Luedtke, and F. Oliveira.Progressive hedging with a Frank-Wolfe based method for computing stochastic mixed-integer programmingLagrangian dual bounds.Optimization Online, 2016.

M. Bodur, S. Dash, O. Günlük, and J. Luedtke.Strengthened benders cuts for stochastic integer programs with continuous recourse.2014.Last accessed on 13 January 2015.

D.P. Bertsekas.Constrained Optimization and Lagrange Multiplier Methods.Academic Press, 1982.

S. Boyd, N. Parikh, E. Chu, B. Peleato, and J. Eckstein.Distributed optimization and statistical learning via the alternating direction method of multipliers.Foundation and Trends in Machine Learning, 3(1):1–122, 2011.

Fernando Badilla Veliz, Jean-Paul Watson, Andres Weintraub, Roger J.-B. Wets, and David L. Woodruff.Stochastic optimization models in forest planning: a progressive hedging solution approach.Annals of Operations Research, 232:259–274, 2015.

Claus C Carøe and Rüdiger Schultz.Dual decomposition in stochastic integer programming.Operations Research Letters, 24(1):37–45, 1999.

J. Eckstein and D.P. Bertsekas.On the Douglas-Rachford splitting method and the proximal point algorithm for maximal monotone operators.Mathematical Programming, 55(1-3):293–318, 1992.

J. Eckstein and W. Yao.

Page 75: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

Understanding the convergence of the alternating direction method of multipliers: Theoretical andcomputational perspectives.Technical report, Rutgers University, 2014.

M. Frank and P. Wolfe.An algorithm for quadratic programming.Naval Research Logistics Quarterly, 3:149–154, 1956.

D. Gade, G. Hackebeil, S.M. Ryan, J.-P. Watson, R. J-B Wets, and D.L. Woodruff.Obtaining lower bounds from the progressive hedging algorithm for stochastic mixed-integer programs.Technical report, Graduate School of Management, UC Davis, 2015.

D. Gabay and B. Mercier.A dual algorithm for the solution of nonlinear variational problems via finite element approximation.Computers and Mathematics with Applications, 2:17–40, 1976.

Last accessed 23 December, 2015.

M. R. Hestenes.Multiplier and gradient methods.Journal of Optimization Theory and Applications, pages 303–320, 1969.

C.A. Holloway.An extension of the Frank and Wolfe method of feasible directions.Mathematical Programming, 6(1):14–27, 1974.

IBM Corporation.IBM ILOG CPLEX V12.5.Last accessed 28 Jan 2016.

G. Laporte, F. V. Louveaux, and H. Mercure.The vehicle routing problem with stochastic travel times.Transportation Science, 26:161–170, 1992.

M. Lubin, K. Martin, C.G. Petra, and B. Sandıkçı.On parallelizing dual decomposition in stochastic integer programming.Operations Research Letters, 41(3):252–258, 2013.

Page 76: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

F. V. Louveaux.Discrete stochastic location models.Annals of Operations Research, 6:23–34, 1986.

A. Løkketangen and D. Woodruff.Progressive hedging and tabu search applied to mixed integer (0,1) multi-stage stochastic programming.Journal of Heuristics, 2(2):111–128, 1996.

L. Ntaimo, J. A. Gallego Arrubla, C. Stripling, J. Young, and T. Spencer.A stochastic programming standard response model for wildfire initial attack planning.Canadian Journal of Forest Research, 42(6):987–1001, 2012.

Last accessed 20 April, 2016.

M. P. Nowak and W. Römisch.Stochastic Lagrangian relaxation applied to power scheduling in a hydro-thermal system under uncertainty.Annals of Operations Research, 100:251–272, 2000.

Lewis Ntaimo.Decomposition Algorithms for Stochastic Combinatorial Optimization: Computational Experiments andExtensions.PhD thesis, 2004.

G.L. Nemhauser and L.A. Wolsey.Integer and combinatorial optimization.Wiley-Interscience series in discrete mathematics and optimization. Wiley, 1988.

O. Y. Özaltın, O. A. Prokopyev, A. J. Schaefer, and M. S. Roberts.Optimizing the societal benefits of the annual influenza vaccine: A stochastic programming approach.Operations Research, 59:1131–1143, 2011.

M. J. D. Powell.A method for nonlinear constraints in minimization problems.In R. Fletcher, editor, Optimization. New York: Academic Press, 1969.

R.T. Rockafellar.

Page 77: Computing high-quality Lagrangian bounds of the stochastic ... · Computing high-quality Lagrangian bounds of the stochastic mixed-integer programming problem Prof. Andrew Eberhard

The multiplier method of hestenes and powell applied to convex programming.Journal of Optimization Theory and Applications, 12(6):555–562, 1973.

J. Salmeron, R. K. Wood, and D. P. Morton.A stochastic program for optimizing military sealift subject to attack.Military Operations Research, 14:19–39, 2009.

S. Takriti, J. Birge, and E. Long.A stochastic model for the unit commitment problem.IEEE Transactions on Power Systems, 11(3):1497–1508, 1996.

M. W. Tanner, L. Sattenspiel, and L. Ntaimo.Finding optimal vaccination strategies under parameter uncertainty using stochastic programming.Mathematical Biosciences, 215(2):144–151, 2008.

B. Von Hohenbalken.Simplicial decomposition in nonlinear programming algorithms.Mathematical Programming, 13(1):49–68, 1977.

J.P. Watson and D. Woodruff.Progressive hedging innovations for a class of stochastic resource allocation problems.Computational Management Science, 8(4):355–370, 2011.