The satisfiability threshold and clusters of solutions in the 3-SAT problem Elitza Maneva IBM Almaden Research Center

The satisfiability threshold The satisfiability threshold and clusters of solutions and clusters of solutions

in the 3-SAT problemin the 3-SAT problem

Elitza ManevaElitza Maneva

IBM Almaden Research CenterIBM Almaden Research Center

3-SAT3-SAT

Variables: x1, x2, …, xn take values {TRUE, FALSE}

Constraints: (x1 or x2 or not x3) , (not x2 or x4 or not x6), …

(x1 x2 x3) ( x2 x4 x6) …

x1

x2

x3

x4

x5

x6x7

x8

_ _ _

1

1

0

0

0

PLRPLR

Random walkRandom walk

Belief propagationBelief propagation

Survey propagationSurvey propagation Not Not satisfiablesatisfiable

SatisfiableSatisfiable

SatisfiableSatisfiable Not Not satisfiablesatisfiable

Random 3-SATRandom 3-SAT

00 1.631.63 3.953.953.523.52 4.274.27 4.514.51

MyopicMyopic

x1 x2 x3 x4 x5 x6 x7 x8 n

m = n

Red = proved, green = unproved

Rigorous bounds for random 3-SATRigorous bounds for random 3-SAT

1999: [Friedgut] there is a sharp threshold of satisfiability c(n)

2002

KaporisKirousisLalas

2002

HajiaghayiSorkin

3.52


00 1.631.63 3.523.52 4.514.51 5.195.19

Pure Literal Rule Algorithm:If any variable appears only positive or only negative

assign it 1 or 0 respectivelySimplify the formula by removing the satisfied clausesRepeat

(x1 x2 x3) ( x2 x4 x5) (x1 x2 x4) (x3 x4 x5)

1 1

_ _ _ __

0

1


00 1.631.63 3.523.52 4.514.51 5.195.19

Myopic Algorithms:Choose a variable according to # positive and negative occurrencesAssign the variable the more popular valueSimplify the formula by 1. removing the satisfied clauses

2. removing the FALSE literals 3. assigning variables in unit clauses 4. assigning pure variables

Repeat

Best rule: maximum |# positive occurr. – # negative occurr.|


00 1.631.63 3.523.52 4.514.51

E [# solutions] = 2n Pr [00…0 is a solution] = = 2n (1-1/8)m = = (2 (7/8))n

For >5.191, E [# solutions] 0, so Pr [satisfiable] 0

5.195.19


00 1.631.63 3.523.52 4.514.51

E [# positively prime solutions] 0

Positively prime solution: a solution in which no variable assigned 1 can be converted to 0.

Fact: If there exists a solution, there exists a positively prime solution.

5.195.194.674.67


00 1.631.63 3.523.52 4.514.51

E [# symmetrically prime solutions] 0

5.195.194.674.67

PLRPLR



Survey propagationSurvey propagation



00 1.631.63 3.953.953.523.52 4.274.27 4.514.51

MyopicMyopic

x1 x2 x3 x4 x5 x6 x7 x8 n

m = n


Random Walk AlgorithmsRandom Walk Algorithms

[Alekhnovich, Ben-Sasson `03]

Simple Random Walk:

Pick an unsatisfied clausePick a variable in the clause

Flip the variable

Theorem: Finds a solution in O(n) steps for < 1.63.

[Seitz, Alava, Orponen `05][Ardelius, Aurell `06]ASAT:

Pick an unsatisfied clausePick a variable in the clause

Flip it only with prob. p if number of unsatisfied clauses does not increase

Experiment: Takes O(n) steps for < 4.21.

PLRPLR






00 1.631.63 3.953.953.523.52 4.274.27 4.514.51

MyopicMyopic

x1 x2 x3 x4 x5 x6 x7 x8 n

m = n


We can find solutions via inferenceWe can find solutions via inference

Suppose the formula is satisfiable.Suppose the formula is satisfiable.

Consider the uniform distribution Consider the uniform distribution

over satisfying assignments:over satisfying assignments:

Pr[xPr[x11, x, x22, …, x, …, xnn] ] (x (x11, x, x22, …, x, …, xnn))

Simple ClaimSimple Claim: : If we can compute Pr[xIf we can compute Pr[xii=1], then we =1], then we

can find a solution fast.can find a solution fast.

DecimationDecimation: : Assign variables one by one to a value Assign variables one by one to a value that has highest probability. that has highest probability.

Fact:Fact: We cannot hope to compute Pr[x We cannot hope to compute Pr[x ii=1] exactly=1] exactly

Heuristics for guessing the best variable to assign:Heuristics for guessing the best variable to assign:

1.1. Pure Literal Rule (PLR)Pure Literal Rule (PLR): Choose a variable that appears : Choose a variable that appears always positive / always negative.always positive / always negative.

2.2. Myopic RuleMyopic Rule: Choose a variable based on number of : Choose a variable based on number of positive and negative occurrences.positive and negative occurrences.

3. Belief Propagation3. Belief Propagation: Estimate Pr[x: Estimate Pr[xii=1] by belief propagation =1] by belief propagation

and choose variable with largest estimated bias.and choose variable with largest estimated bias.

Computing Pr[xComputing Pr[x11=0] on a tree formula=0] on a tree formula

x1

108108192192

1111

1111

111111

111111

1111

1111

3344

4433

3344

12121212

36364848

#Solutions with 0#Solutions with 0#Solutions with 1#Solutions with 1

#Solns with 0#Solns with 0#Solns with 1#Solns with 1

Vectors can be normalizedVectors can be normalized

x1

.36.36

.64.64

.5.5

.5.5

.43.43

.57.57

.5.5

.5.5

.5.5

.5.5 .5.5.5.5

.5.5

.5.5

.5.5

.5.5.5.5.5.5

.5.5

.5.5

.43.43

.57.57

.43.43

.57.57

.57.57

.43.43

… … and thought of as messagesand thought of as messagesx1

Vectors can be normalizedVectors can be normalized

What if the graph is not a tree?What if the graph is not a tree?



x11

x5

x1

x4

x10

x6

x9 x8 x7

x3

x2

Pr[xPr[x11, …, x, …, xnn] ] ΠΠaa aa(x(xN(N(aa))) )

(x(x11, x, x22 , x , x33))

Belief Propagation [Pearl ’88]Belief Propagation [Pearl ’88]

x1 x2 x3 x4 x5 x6 x7 nn

mm

Given:Given: Pr[xPr[x1 1 …x…x77]] aa(x(x11, x, x33) ) bb(x(x11, x, x22) ) cc(x(x11, x, x44) ) ……

Goal: Goal: Compute Pr[xCompute Pr[x11] (i.e. ] (i.e. marginalmarginal))

Message passing rules:M i c (xi) = Π M b i (xi)

M c i (xi) = Σ c(x N(c) ) Π M j c (xj)

Estimated marginals:i(xi) = Π M c i (xi)

xj: j N(c)\i j N(c)\i

cN(i)

bN(i)/c

i.e. Markov Random Field (MRF)i.e. Markov Random Field (MRF)

Belief propagation is a dynamic programming algorithm.It is exact only when the recurrence relation holds, i.e.:1. if the graph is a tree.2. if the graph behaves like a tree: large cycles

Applications of belief propagationApplications of belief propagation

• Statistical learning theoryStatistical learning theory• VisionVision• Error-correcting codes (Turbo, LDPC, LT)Error-correcting codes (Turbo, LDPC, LT)• Lossy data-compressionLossy data-compression• Computational biologyComputational biology• Sensor networksSensor networks

PLRPLR

Random walksRandom walks




Limitations of BPLimitations of BP

00 1.631.63 3.953.953.523.52 4.274.27 4.514.51

MyopicMyopic

x1 x2 x3 x4 x5 x6 x7 x8 n

m = n

Reason for failure of Belief PropagationReason for failure of Belief Propagation

• Messages from different neighbors are assumed to be almost Messages from different neighbors are assumed to be almost independent independent i.e. there are no long-range correlations i.e. there are no long-range correlations

PLRPLR

Random walksRandom walks



00 1.631.63 3.953.953.523.52 4.274.27 4.514.51

MyopicMyopic

No long-range correlations

Long-range correlations exist



Fix:Fix: 1-step Replica Symmetry Breaking Ansatz 1-step Replica Symmetry Breaking Ansatz

• The distribution can be decomposed into “phases”The distribution can be decomposed into “phases”• There are no long-range correlations within a phaseThere are no long-range correlations within a phase• Each phase consists of similar assignments – “clusters”Each phase consists of similar assignments – “clusters”• Messages become distributions of distributionsMessages become distributions of distributions• An approximation yields 3-dimensional messages: An approximation yields 3-dimensional messages:

Survey Propagation Survey Propagation [Mezard, Parisi, Zecchina ‘02][Mezard, Parisi, Zecchina ‘02]• Survey propagation finds a phase, then WalkSAT is used to find a Survey propagation finds a phase, then WalkSAT is used to find a

solution in the phasesolution in the phase



Fix:Fix: 1-step Replica Symmetry Breaking Ansatz 1-step Replica Symmetry Breaking Ansatz

• The distribution can be decomposed into “phases”The distribution can be decomposed into “phases”

Pr[xPr[x11, x, x22, …, x, …, xnn] = ] = p p Pr Pr [x [x11, x, x22, …, x, …, xnn]]

fixed variables

Space of solutionsSpace of solutions

Satisfying assignments in {0, 1}Satisfying assignments in {0, 1}nn

01011100

101011110110101101

phases


.12.12

.81.81

.07.07

0011


Mci= ————————

Muic = (1- (1- Mbi )) (1-Mbi)

Msic = (1- (1- Mbi )) (1-Mbi)

Mic = (1- Mbi )

Mujc

Muj c+Ms

j c+Mjc

jN(c)\i

b Nsa (i)b Nu

a (i)

b Nsc (i) b Nu

c (i)

b N(i)\c

x1 x2 x3 x4 x5 x6 x7 x8

You have to satisfy me

with prob. 60%

I’m 0 with prob 10%,1 with prob 70%,

whichever (i.e. ) 20%

Combinatorial interpretation

• Can survey propagation be thought of as inference on cluster assignments?

Not precisely, but close.• We define a related concept of core/cover assignments• Assignments in the same cluster share the same core• However, different cluster may have the same core

Finding the core of a solutionFinding the core of a solution

0

0

01

1

1

0

0


0

0

01

1

1

0

0


0

0

01

0

1

0

0

unconstrained variables


0

0

01

1

0

0


0

0

01

1

0


0

0

01

0


0

0

01

Such a fully constrained partial assignment is called a cover.

Partial assignments {0,1,Partial assignments {0,1,}}nn{0, 1}{0, 1}nn assignments assignments

01011100

101011110110101101

# st

ars

# st

ars

corecore

corecore

Extending the space of assignmentsExtending the space of assignments

Theorem:Theorem: Survey propagation is Survey propagation is equivalentequivalent to belief to belief propagation on the uniform distribution over coverpropagation on the uniform distribution over cover assignments.assignments.

Survey propagation is a Survey propagation is a belief propagation algorithmbelief propagation algorithm

[Maneva, Mossel, Wainwright ‘05][Maneva, Mossel, Wainwright ‘05][Braunstein, Zecchina ‘05][Braunstein, Zecchina ‘05]

But, we still need to look at all partial assignments.

Peeling Experiment for 3-SAT, Peeling Experiment for 3-SAT, n n =10=1055

0

10000

20000

30000

40000

50000

60000

70000

80000

90000

100000

0 20000 40000 60000 80000 100000

# stars

# u

nc

on

str

ain

ed

2

2.5

3

3.5

4

4.1

4.2

Clusters and partial assignmentsClusters and partial assignments

Partial assignmentsPartial assignments{0, 1}{0, 1}nn assignments assignments

# st

ars

# st

ars

0110101101

01011100

101011110110101101

0101111

4

23

nn(())

3. 3. A family of belief propagation algorithms:A family of belief propagation algorithms:00 11

Vanilla BPVanilla BP SPSP

Pr[Pr[] ] (1- (1- ))nn(()) nnoo(())

Definition of the new distributionDefinition of the new distribution

FormulaFormula

11111111

111111 111111

1111 1111

11

1111

11 11 11

1010101001110111

011011 010010 101000

Partial assignmentsPartial assignments

2. 2. Weight of partial assignments:Weight of partial assignments:

nnoo(())

1. 1. Includes all assignments without contradictions or implicationsIncludes all assignments without contradictions or implications

Partial assignmentsPartial assignments{0, 1}{0, 1}nn assignments assignments

01011100

101011110110101101

# st

ars

# st

ars

corecore

corecore

=0=0

=1=1

Pr[Pr[] ] (1- (1- ))nn(()) nnoo(())

00 11

Vanilla BPVanilla BP SPSP

This is the correct picture for 9-SAT and above.This is the correct picture for 9-SAT and above.[Achlioptas, Ricci-Tersenghi ‘06]

Clustering for k-SATClustering for k-SAT

What is known?What is known?2-SAT: a single cluster 2-SAT: a single cluster

3-SAT to 7-SAT: not known3-SAT to 7-SAT: not known

8-SAT and above: exponential number of clusters 8-SAT and above: exponential number of clusters (with second moment method) (with second moment method) [Mezard, Mora, Zecchina `05] [Mezard, Mora, Zecchina `05] [Achlioptas, Ricci-Tersenghi `06][Achlioptas, Ricci-Tersenghi `06]

9-SAT and above: clusters have non-trivial cores 9-SAT and above: clusters have non-trivial cores (with differential equations method)(with differential equations method) [Achlioptas, Ricci-Tersenghi `06][Achlioptas, Ricci-Tersenghi `06]

11111111

111111 111111

1111 1111

11

1111

11 11 11

010111

1010101001110111

011011 010010 101000

Convex geometry / AntimatroidConvex geometry / Antimatroid

Total weight is 1 for every Total weight is 1 for every


00 3.523.52 4.514.51 5.195.19

E [total weight of partial assignments] 0 ( = 0.8)

Fact: If there exists a solution, the weight of partial assignments is at least 1.

4.94.9


Theorem [ Maneva, Sinclair ] For > 4.453 one of the following holds:1. there are no satisfying assignments with high probability;2. the core of every satisfying assignment is (,,…,).

4.4534.453

00 3.523.52 4.514.51 5.195.19

PLRPLR






00 1.631.63 3.953.953.523.52 4.274.27 4.514.51

MyopicMyopic

x1 x2 x3 x4 x5 x6 x7 x8 n

m = n


Challenges

• Improve the bounds on the threshold

• Prove algorithms work with high probability

• Find an algorithm for certifying that a formula with n clauses for large has no solution

Thank youThank you

Documents

The satisfiability threshold and clusters of solutions in the 3-SAT problem Elitza Maneva IBM Almaden Research Center