
CS672: Approximation Algorithms, Spring 2020

Intro to Semidefinite Programming

Instructor: Shaddin Dughmi

Outline

1 Basics of PSD Matrices

2 Semidefinite Programming

3 Max Cut

Symmetric Matrices

A matrix A ∈ R^{n×n} is symmetric if and only if it is square and A_ij = A_ji for all i, j. We denote the cone of n×n symmetric matrices by S^n.

Fact
A matrix A ∈ R^{n×n} is symmetric if and only if it is orthogonally diagonalizable, i.e. A = QDQᵀ where Q is an orthogonal matrix and D = diag(λ_1, …, λ_n).

The columns of Q are the (normalized) eigenvectors of A, with corresponding eigenvalues λ_1, …, λ_n.
Equivalently: as a linear operator, A scales the space along an orthonormal basis Q.
The scaling factor λ_i along direction q_i may be negative, positive, or 0.
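For concreteness, here is a minimal numpy sketch verifying this decomposition on a random symmetric matrix (the matrix itself is an arbitrary example):

```python
import numpy as np

rng = np.random.default_rng(0)
M = rng.standard_normal((4, 4))
A = (M + M.T) / 2                 # symmetrize: A is in S^n

# eigh is numpy's eigensolver for symmetric matrices; it returns the
# eigenvalues in ascending order and orthonormal eigenvectors as columns of Q.
lam, Q = np.linalg.eigh(A)

assert np.allclose(A, Q @ np.diag(lam) @ Q.T)   # A = Q D Q^T
assert np.allclose(Q.T @ Q, np.eye(4))          # Q is orthogonal
```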


Positive Semi-Definite Matrices

A matrix A ∈ R^{n×n} is positive semi-definite if it is symmetric and, moreover, all its eigenvalues are nonnegative.

We denote the cone of n×n positive semi-definite matrices by S^n_+.
We use A ⪰ 0 as shorthand for A ∈ S^n_+.
A = QDQᵀ where Q is an orthogonal matrix and D = diag(λ_1, …, λ_n), where each λ_i ≥ 0.
As a linear operator, A performs nonnegative scaling along an orthonormal basis Q.

Note
Positive definite, negative semi-definite, and negative definite matrices are defined similarly.
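Testing membership in S^n_+ is therefore one symmetric eigenvalue computation. A small sketch (the tolerance for floating-point error is our own choice):

```python
import numpy as np

def is_psd(A, tol=1e-9):
    """Return True iff A is symmetric with all eigenvalues >= -tol."""
    if not np.allclose(A, A.T):
        return False
    return np.linalg.eigvalsh(A).min() >= -tol   # eigvalsh: symmetric eigenvalues

B = np.random.default_rng(1).standard_normal((3, 3))
print(is_psd(B @ B.T))      # True: Gram matrices are PSD
print(is_psd(-(B @ B.T)))   # False for any nonzero B
```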


Geometric Intuition for PSD Matrices

For A ⪰ 0, let q_1, …, q_n be the orthonormal eigenbasis for A, and let λ_1, …, λ_n ≥ 0 be the corresponding eigenvalues.

The linear operator x → Ax scales the q_i component of x by λ_i.
When applied to every x in the unit ball, the image of A is an ellipsoid centered at the origin with principal directions q_1, …, q_n and corresponding diameters 2λ_1, …, 2λ_n.
When A is positive definite (i.e. λ_i > 0), and therefore invertible, the ellipsoid is the set {y : yᵀ(AAᵀ)^{−1}y ≤ 1}.
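This description is easy to sanity-check numerically: map unit vectors through a positive definite A and confirm they land on the boundary of the stated ellipsoid (the test matrix below is an arbitrary choice):

```python
import numpy as np

rng = np.random.default_rng(2)
M = rng.standard_normal((3, 3))
A = M @ M.T + np.eye(3)          # positive definite, hence invertible

S = np.linalg.inv(A @ A.T)       # matrix defining the ellipsoid
for _ in range(100):
    x = rng.standard_normal(3)
    x /= np.linalg.norm(x)       # x on the unit sphere
    y = A @ x
    assert np.isclose(y @ S @ y, 1.0)   # y^T (A A^T)^{-1} y = 1 on the boundary
```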

Useful Properties of PSD Matrices

If A ⪰ 0, then:

xᵀAx ≥ 0 for all x.
A has a positive semi-definite square root A^{1/2} = Q diag(√λ_1, …, √λ_n) Qᵀ.
A = BᵀB for some matrix B.
  Interpretation: PSD matrices encode the "pairwise similarity" relationships of a family of vectors; A_ij is the dot product of the i-th and j-th columns of B.
  Interpretation: the quadratic form xᵀAx is the squared length of a linear transformation of x, namely ||Bx||_2^2.
The quadratic function xᵀAx is convex.
A can be expressed as a sum of vector outer products, e.g. A = Σ_{i=1}^{n} v_i v_iᵀ for v_i = √λ_i q_i.

As it turns out, each of the above is also sufficient for A ⪰ 0 (assuming A is symmetric).
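The square root and the Gram factorization are both immediate from the eigendecomposition. A numpy sketch (clipping tiny negative eigenvalues caused by floating-point error is our own precaution):

```python
import numpy as np

rng = np.random.default_rng(3)
M = rng.standard_normal((4, 4))
A = M.T @ M                       # A = B^T B with B = M, so A ⪰ 0

lam, Q = np.linalg.eigh(A)
lam = np.clip(lam, 0.0, None)     # guard against -1e-16-style noise

sqrtA = Q @ np.diag(np.sqrt(lam)) @ Q.T      # A^{1/2} = Q diag(√λ) Q^T
assert np.allclose(sqrtA @ sqrtA, A)

# Sum of outer products: A = Σ_i λ_i q_i q_iᵀ = Σ_i v_i v_iᵀ with v_i = √λ_i q_i
outer_sum = sum(l * np.outer(q, q) for l, q in zip(lam, Q.T))
assert np.allclose(outer_sum, A)
```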


Properties of PSD Matrices Relevant for Computation

The set of PSD matrices is convex.
  Follows from the characterization: xᵀAx ≥ 0 for all x.
The set of PSD matrices admits an efficient separation oracle.
  Given A that is not PSD, find an eigenvector v with negative eigenvalue, so that vᵀAv < 0.
A PSD matrix A ∈ R^{n×n} implicitly encodes the "pairwise similarities" of a family of vectors b_1, …, b_n ∈ R^n.
  Follows from the characterization A = BᵀB for some B, with A_ij = ⟨b_i, b_j⟩.
We can convert between A and B efficiently.
  B to A: matrix multiplication.
  A to B: B can be expressed in terms of the eigenvectors/eigenvalues of A, which can be computed to arbitrary precision via powering methods. Alternatively: Cholesky decomposition, SVD, etc.
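Both bullets translate directly into code. Below is a sketch: the separation oracle returns an eigenvector certifying non-membership (the violated linear inequality is ⟨vvᵀ, X⟩ ≥ 0), and the A-to-B conversion uses the eigendecomposition so that singular A is handled too; the function names are our own:

```python
import numpy as np

def psd_separation_oracle(A, tol=1e-9):
    """Return None if A ⪰ 0; else an eigenvector v with v^T A v < 0."""
    lam, Q = np.linalg.eigh(A)     # eigenvalues in ascending order
    if lam[0] >= -tol:
        return None
    return Q[:, 0]                 # direction of the most negative eigenvalue

def gram_factor(A):
    """Given A ⪰ 0, return B with A = B^T B, so A_ij = <b_i, b_j>."""
    lam, Q = np.linalg.eigh(A)
    lam = np.clip(lam, 0.0, None)  # clip floating-point noise
    return np.diag(np.sqrt(lam)) @ Q.T

A = np.array([[2.0, 1.0], [1.0, 2.0]])
B = gram_factor(A)
assert np.allclose(B.T @ B, A)
print(psd_separation_oracle(np.diag([1.0, -1.0])))  # eigenvector for eigenvalue -1
```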


Convex Optimization

min (or max)  f(x)
subject to    x ∈ X

Convex Optimization Problem
A generalization of LP where:

The feasible set X is convex: αx + (1−α)y ∈ X for all x, y ∈ X and α ∈ [0, 1].
The objective function f is convex in the case of minimization: f(αx + (1−α)y) ≤ αf(x) + (1−α)f(y) for all x, y ∈ X and α ∈ [0, 1].
The objective function f is concave in the case of maximization.

Convex optimization problems are solvable efficiently (i.e. in polynomial time) to arbitrary precision under mild conditions, given:

A separation oracle for X.
A first-order oracle for evaluating f(x) and ∇f(x).

For more detail, take CSCI 675!

Semidefinite Programs

These are optimization problems where the feasible set is the PSD cone, possibly intersected with linear constraints.

Generalization of LP.
Special case of convex optimization.

maximize    cᵀx
subject to  Ax ≤ b
            x_1 F_1 + x_2 F_2 + ⋯ + x_n F_n + G is PSD

F_1, …, F_n, G, and A are given matrices, and c, b are given vectors.

Examples
Fitting a distribution, say a Gaussian, to observed data; the variable is a positive semi-definite covariance matrix.
As a relaxation of combinatorial problems that encode pairwise relationships, e.g. finding the maximum cut of a graph.

Fact
SDPs can be solved in polynomial time to arbitrary precision, since PSD constraints admit a polynomial-time separation oracle.
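As a toy instance of the standard form above, the following sketch uses the cvxpy modeling library (our choice of tool; `>>` is cvxpy's syntax for a PSD constraint, and an SDP-capable solver such as SCS is assumed to be installed):

```python
import cvxpy as cp
import numpy as np

# 2x2 symmetric coefficient matrices: the PSD constraint forces
# [[x1, -1], [-1, x2]] ⪰ 0, i.e. x1, x2 >= 0 and x1 * x2 >= 1.
F1 = np.array([[1.0, 0.0], [0.0, 0.0]])
F2 = np.array([[0.0, 0.0], [0.0, 1.0]])
G  = np.array([[0.0, -1.0], [-1.0, 0.0]])
c  = np.array([-1.0, -1.0])

x = cp.Variable(2)
constraints = [
    x <= 3,                            # linear constraints Ax <= b
    x[0] * F1 + x[1] * F2 + G >> 0,    # x_1 F_1 + x_2 F_2 + G is PSD
]
prob = cp.Problem(cp.Maximize(c @ x), constraints)
prob.solve()
print(prob.value, x.value)             # optimum -2 at x = (1, 1)
```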


The Max Cut Problem

Given an undirected graph G = (V, E), find a partition of V into (S, V \ S) maximizing the number of edges with exactly one endpoint in S.

maximize    Σ_{(i,j)∈E} (1 − x_i x_j)/2
subject to  x_i ∈ {−1, 1}, for i ∈ V.

Instead of requiring x_i to lie on the 1-dimensional unit sphere {−1, 1}, we relax and permit it to lie on the unit sphere in R^n, where n = |V|.

Vector Program relaxation

maximize    Σ_{(i,j)∈E} (1 − v_i · v_j)/2
subject to  ||v_i||_2 = 1, for i ∈ V.
            v_i ∈ R^n, for i ∈ V.
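Read literally, the ±1 formulation yields an exponential-time brute-force baseline, since each term (1 − x_i x_j)/2 is 1 exactly when edge (i, j) is cut. A sketch on a toy 4-cycle (the graph is an arbitrary example):

```python
from itertools import product

edges = [(0, 1), (1, 2), (2, 3), (3, 0)]   # a 4-cycle
n = 4

def cut_value(x):
    # (1 - x_i x_j) is 2 when the edge is cut and 0 otherwise
    return sum((1 - x[i] * x[j]) // 2 for i, j in edges)

best = max(product([-1, 1], repeat=n), key=cut_value)
print(best, cut_value(best))   # e.g. (-1, 1, -1, 1) cuts all 4 edges
```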


SDP Relaxation

Recall: a symmetric n×n matrix Y is PSD iff Y = VᵀV for some n×n matrix V.
Equivalently: PSD matrices encode the pairwise dot products of the columns of V.
When the diagonal entries of Y are 1, V has unit-length columns.
Recall: Y and V can be recovered from each other efficiently.

Vector Program relaxation

maximize    Σ_{(i,j)∈E} (1 − v_i · v_j)/2
subject to  ||v_i||_2 = 1, for i ∈ V.
            v_i ∈ R^n, for i ∈ V.

SDP Relaxation

maximize    Σ_{(i,j)∈E} (1 − Y_ij)/2
subject to  Y_ii = 1, for i ∈ V.
            Y ∈ S^n_+


The Goemans-Williamson Algorithm for Max Cut

1 Solve the SDP relaxation (below) to get Y ⪰ 0.
2 Decompose Y as VᵀV.
3 Draw a random vector r on the unit sphere.
4 Place the nodes i with v_i · r ≥ 0 on one side of the cut, and the rest on the other side.

SDP Relaxation

maximize    Σ_{(i,j)∈E} (1 − Y_ij)/2
subject to  Y_ii = 1, ∀i
            Y ∈ S^n_+
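An end-to-end sketch of the algorithm, again using cvxpy for step 1 (our choice; an SDP-capable solver is assumed) and an eigendecomposition for step 2; the example graph is arbitrary:

```python
import cvxpy as cp
import numpy as np

def goemans_williamson(n, edges, seed=0):
    # Step 1: solve the SDP relaxation for Y ⪰ 0 with Y_ii = 1.
    Y = cp.Variable((n, n), PSD=True)
    objective = cp.Maximize(sum((1 - Y[i, j]) / 2 for i, j in edges))
    cp.Problem(objective, [cp.diag(Y) == 1]).solve()

    # Step 2: decompose Y as VᵀV (column i of V is the unit vector v_i);
    # clip tiny negative eigenvalues introduced by floating-point error.
    lam, Q = np.linalg.eigh(Y.value)
    V = np.diag(np.sqrt(np.clip(lam, 0.0, None))) @ Q.T

    # Steps 3-4: random hyperplane rounding.
    r = np.random.default_rng(seed).standard_normal(n)   # uniform direction
    side = V.T @ r >= 0                                  # v_i · r >= 0
    return {i for i in range(n) if side[i]}

edges = [(0, 1), (1, 2), (2, 3), (3, 4), (4, 0)]   # 5-cycle: max cut is 4
print(goemans_williamson(5, edges))
```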

We will prove the following lemma.

Lemma
The random hyperplane cuts each edge (i, j) with probability at least 0.878 · (1 − Y_ij)/2.

The theorem below then follows by linearity of expectation, together with the fact that OPT_SDP ≥ OPT (since the SDP is a relaxation).

Theorem
The Goemans-Williamson algorithm outputs a random cut of expected size at least 0.878 · OPT.


We use the following fact.

Fact
For all angles θ ∈ [0, π]:  θ/π ≥ 0.878 · (1 − cos θ)/2.
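The constant is essentially tight: minimizing (θ/π) / ((1 − cos θ)/2) over (0, π] gives ≈ 0.87856, attained near θ ≈ 2.33. A quick grid check (the grid resolution is our own choice):

```python
import numpy as np

theta = np.linspace(1e-6, np.pi, 1_000_000)
ratio = (theta / np.pi) / ((1 - np.cos(theta)) / 2)
print(ratio.min(), theta[ratio.argmin()])   # ~0.87856 near theta ~ 2.3311
assert ratio.min() >= 0.878
```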

Lemma
The random hyperplane cuts each edge (i, j) with probability at least 0.878 · (1 − Y_ij)/2.

Proof sketch:

(i, j) is cut iff sign(r · v_i) ≠ sign(r · v_j).
We can zoom in on the 2-dimensional plane containing v_i and v_j: discard the component of r perpendicular to that plane, leaving r̂. The direction of r̂ is uniform in the plane.
Let θ_ij be the angle between v_i and v_j, and note Y_ij = v_i · v_j = cos(θ_ij).
r̂ cuts (i, j) with probability 2θ_ij/(2π) = θ_ij/π ≥ 0.878 · (1 − cos θ_ij)/2 = 0.878 · (1 − Y_ij)/2.
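The θ_ij/π cut probability is easy to check by Monte Carlo: since the argument reduces to the 2-d plane spanned by v_i and v_j, a 2-d simulation suffices (the angle and sample count are arbitrary choices):

```python
import numpy as np

rng = np.random.default_rng(4)
theta = 2.0                                    # angle between v_i and v_j
vi = np.array([1.0, 0.0])
vj = np.array([np.cos(theta), np.sin(theta)])  # unit vector at angle theta to vi

# Sample random directions r; edge (i, j) is cut when the signs differ.
r = rng.standard_normal((1_000_000, 2))
cut = np.sign(r @ vi) != np.sign(r @ vj)
print(cut.mean(), theta / np.pi)               # both ~0.6366
```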
