58
Average Case Analysis of Java 7’s Dual Pivot Quicksort Sebastian Wild Markus E. Nebel [s_wild, nebel] @cs.uni-kl.de Computer Science Department University of Kaiserslautern September 11, 2012 20th European Symposium on Algorithms Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 1 / 15

Average Case Analysis of Java 7’s Dual Pivot Quicksort

Embed Size (px)

DESCRIPTION

I gave this talk at the European Symposium on Algorithms 2012 in Ljubljana (Slowenia). The corresponding paper won the best paper award. Find my other talks and all corresponding papers on my web page: http://wwwagak.cs.uni-kl.de/sebastian-wild.html

Citation preview

Page 1: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Average Case Analysis ofJava 7’s Dual Pivot Quicksort

Sebastian Wild Markus E. Nebel[s_wild, nebel] @cs.uni-kl.de

Computer Science DepartmentUniversity of Kaiserslautern

September 11, 201220th European Symposium on Algorithms

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 1 / 15

Page 2: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Classic Quicksort

Classic Quicksort with Hoare’s Crossing Pointer Technique

2 9 5 4 1 7 8 3 6

. . . by example

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 2 / 15

Page 3: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Classic Quicksort

Classic Quicksort with Hoare’s Crossing Pointer Technique

2 9 5 4 1 7 8 3 6

Select one element as pivot.

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 2 / 15

Page 4: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Classic Quicksort

Classic Quicksort with Hoare’s Crossing Pointer Technique

2 9 5 4 1 7 8 3 6

Only value relative to pivot counts.

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 2 / 15

Page 5: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Classic Quicksort

Classic Quicksort with Hoare’s Crossing Pointer Technique

2 9 5 4 1 7 8 3 6

Left pointer scans until first large element.

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 2 / 15

Page 6: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Classic Quicksort

Classic Quicksort with Hoare’s Crossing Pointer Technique

2 9 5 4 1 7 8 3 6

Right pointer scans until first small element.

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 2 / 15

Page 7: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Classic Quicksort

Classic Quicksort with Hoare’s Crossing Pointer Technique

2 9 5 4 1 7 8 3 6

Swap out-of-order pair.

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 2 / 15

Page 8: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Classic Quicksort

Classic Quicksort with Hoare’s Crossing Pointer Technique

2 3 5 4 1 7 8 9 6

Swap out-of-order pair.

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 2 / 15

Page 9: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Classic Quicksort

Classic Quicksort with Hoare’s Crossing Pointer Technique

2 3 5 4 1 7 8 9 6

Left pointer scans until first large element.

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 2 / 15

Page 10: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Classic Quicksort

Classic Quicksort with Hoare’s Crossing Pointer Technique

2 3 5 4 1 7 8 9 6

Right pointer scans until first small element.

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 2 / 15

Page 11: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Classic Quicksort

Classic Quicksort with Hoare’s Crossing Pointer Technique

2 3 5 4 1 7 8 9 6

The pointers have crossed!

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 2 / 15

Page 12: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Classic Quicksort

Classic Quicksort with Hoare’s Crossing Pointer Technique

2 3 5 4 1 7 8 9 6

Swap pivot to final position.

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 2 / 15

Page 13: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Classic Quicksort

Classic Quicksort with Hoare’s Crossing Pointer Technique

2 3 5 4 1 6 8 9 7

Partitioning done!

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 2 / 15

Page 14: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Classic Quicksort

Classic Quicksort with Hoare’s Crossing Pointer Technique

2 3 5 4 1 6 8 9 7

Recursively sort two sublists.

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 2 / 15

Page 15: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Classic Quicksort

Classic Quicksort with Hoare’s Crossing Pointer Technique

1 2 3 4 5 6 7 8 9

Done.

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 2 / 15

Page 16: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Dual Pivot Quicksort

“new” idea: use two pivots p < q

3 5 1 8 4 7 2 9 6p q

How to do partitioning?

1 For each element x, determine its class

small for x < p

medium for p < x < q

large for q < x

by comparing x to p and/or q

2 Arrange elements according to classes p q

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 3 / 15

Page 17: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Dual Pivot Quicksort

“new” idea: use two pivots p < q

3 5 1 8 4 7 2 9 6p q

How to do partitioning?

1 For each element x, determine its class

small for x < p

medium for p < x < q

large for q < x

by comparing x to p and/or q

2 Arrange elements according to classes p q

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 3 / 15

Page 18: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Dual Pivot Quicksort – Previous Work

Robert Sedgewick, 1975in-place dual pivot Quicksort implementationmore comparisons and swaps than classic Quicksort

Pascal Hennequin, 1991comparisons for list-based Quicksort with r pivots

r = 2 same #comparisons as classic Quicksortin one partitioning step: 5

3comparisons per element

r > 2 very small savings, but complicated partitioning

Using two pivots does not pay.

Vladimir Yaroslavskiy, 2009new implementation of dual pivot Quicksortnow used in Java 7’s runtime libraryruntime studies, no rigorous analysis

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 4 / 15

Page 19: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Dual Pivot Quicksort – Previous Work

Robert Sedgewick, 1975in-place dual pivot Quicksort implementationmore comparisons and swaps than classic Quicksort

Pascal Hennequin, 1991comparisons for list-based Quicksort with r pivots

r = 2 same #comparisons as classic Quicksortin one partitioning step: 5

3comparisons per element

r > 2 very small savings, but complicated partitioning

Using two pivots does not pay.

Vladimir Yaroslavskiy, 2009new implementation of dual pivot Quicksortnow used in Java 7’s runtime libraryruntime studies, no rigorous analysis

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 4 / 15

Page 20: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Dual Pivot Quicksort – Previous Work

Robert Sedgewick, 1975in-place dual pivot Quicksort implementationmore comparisons and swaps than classic Quicksort

Pascal Hennequin, 1991comparisons for list-based Quicksort with r pivots

r = 2 same #comparisons as classic Quicksortin one partitioning step: 5

3comparisons per element

r > 2 very small savings, but complicated partitioning

Using two pivots does not pay.

Vladimir Yaroslavskiy, 2009new implementation of dual pivot Quicksortnow used in Java 7’s runtime libraryruntime studies, no rigorous analysis

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 4 / 15

Page 21: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Dual Pivot Quicksort – Comparison Costs

How many comparisons to determine classes ( small , medium or large ) ?

Assume, we first compare with p. small elements need 1, others 2 comparisons

on average: 13 of all elements are small

13 · 1+

23 · 2 =

53 comparisons per element

if inputs are uniform random permutations,classes of x and y are independent

Any partitioning method needs at least53(n− 2) ∼ 20

12n comparisons on average?

No! (Stay tuned . . . )

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 5 / 15

Page 22: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Dual Pivot Quicksort – Comparison Costs

How many comparisons to determine classes ( small , medium or large ) ?

Assume, we first compare with p. small elements need 1, others 2 comparisons

on average: 13 of all elements are small

13 · 1+

23 · 2 =

53 comparisons per element

if inputs are uniform random permutations,classes of x and y are independent

Any partitioning method needs at least53(n− 2) ∼ 20

12n comparisons on average?

No! (Stay tuned . . . )

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 5 / 15

Page 23: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Beating the “Lower Bound”

∼ 2012n comparisons only needed,

if there is one comparison location,then checks for x and y independent

But: Can have several comparison locations!

Here: Assume two locations C1 and C2 s. t.

C1 first compares with p.

C2 first compares with q.

C1 executed often, iff p is large.

C2 executed often, iff q is small.

C1 executed ofteniff many small elementsiff good chance that C1 needs only one comparison

(C2 similar)

less comparisons than 53 per elements on average

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 6 / 15

Page 24: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Yaroslavskiy’s Quicksort – Example

Yaroslavskiy’s Dual Pivot Quicksort(used in Oracle’s Java 7 Arrays.sort(int[]))

p q

3 5 1 8 4 7 2 9 6

Select two elements as pivots.

Invariant: < p `→

> qg

←p 6 ◦ 6 q k

→?

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 7 / 15

Page 25: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Yaroslavskiy’s Quicksort – Example

Yaroslavskiy’s Dual Pivot Quicksort(used in Oracle’s Java 7 Arrays.sort(int[]))

p q

3 5 1 8 4 7 2 9 6

Only value relative to pivot counts.

Invariant: < p `→

> qg

←p 6 ◦ 6 q k

→?

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 7 / 15

Page 26: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Yaroslavskiy’s Quicksort – Example

Yaroslavskiy’s Dual Pivot Quicksort(used in Oracle’s Java 7 Arrays.sort(int[]))

3 5 1 8 4 7 2 9 6

k

A[k] is medium go on

Invariant: < p `→

> qg

←p 6 ◦ 6 q k

→?

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 7 / 15

Page 27: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Yaroslavskiy’s Quicksort – Example

Yaroslavskiy’s Dual Pivot Quicksort(used in Oracle’s Java 7 Arrays.sort(int[]))

3 5 1 8 4 7 2 9 6

` k

A[k] is small Swap to left

Invariant: < p `→

> qg

←p 6 ◦ 6 q k

→?

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 7 / 15

Page 28: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Yaroslavskiy’s Quicksort – Example

Yaroslavskiy’s Dual Pivot Quicksort(used in Oracle’s Java 7 Arrays.sort(int[]))

3 5 1 8 4 7 2 9 6

` k

Swap small element to left end.

Invariant: < p `→

> qg

←p 6 ◦ 6 q k

→?

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 7 / 15

Page 29: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Yaroslavskiy’s Quicksort – Example

Yaroslavskiy’s Dual Pivot Quicksort(used in Oracle’s Java 7 Arrays.sort(int[]))

3 1 5 8 4 7 2 9 6

` k

Swap small element to left end.

Invariant: < p `→

> qg

←p 6 ◦ 6 q k

→?

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 7 / 15

Page 30: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Yaroslavskiy’s Quicksort – Example

Yaroslavskiy’s Dual Pivot Quicksort(used in Oracle’s Java 7 Arrays.sort(int[]))

3 1 5 8 4 7 2 9 6

` k

A[k] is large Find swap partner.

Invariant: < p `→

> qg

←p 6 ◦ 6 q k

→?

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 7 / 15

Page 31: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Yaroslavskiy’s Quicksort – Example

Yaroslavskiy’s Dual Pivot Quicksort(used in Oracle’s Java 7 Arrays.sort(int[]))

3 1 5 8 4 7 2 9 6

g` k

A[k] is large Find swap partner:g skips over large elements.

Invariant: < p `→

> qg

←p 6 ◦ 6 q k

→?

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 7 / 15

Page 32: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Yaroslavskiy’s Quicksort – Example

Yaroslavskiy’s Dual Pivot Quicksort(used in Oracle’s Java 7 Arrays.sort(int[]))

3 1 5 8 4 7 2 9 6

g` k

A[k] is large Swap

Invariant: < p `→

> qg

←p 6 ◦ 6 q k

→?

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 7 / 15

Page 33: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Yaroslavskiy’s Quicksort – Example

Yaroslavskiy’s Dual Pivot Quicksort(used in Oracle’s Java 7 Arrays.sort(int[]))

3 1 5 2 4 7 8 9 6

g` k

A[k] is large Swap

Invariant: < p `→

> qg

←p 6 ◦ 6 q k

→?

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 7 / 15

Page 34: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Yaroslavskiy’s Quicksort – Example

Yaroslavskiy’s Dual Pivot Quicksort(used in Oracle’s Java 7 Arrays.sort(int[]))

3 1 5 2 4 7 8 9 6

g` k

A[k] is old A[g], small Swap to left

Invariant: < p `→

> qg

←p 6 ◦ 6 q k

→?

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 7 / 15

Page 35: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Yaroslavskiy’s Quicksort – Example

Yaroslavskiy’s Dual Pivot Quicksort(used in Oracle’s Java 7 Arrays.sort(int[]))

3 1 2 5 4 7 8 9 6

g` k

A[k] is old A[g], small Swap to left

Invariant: < p `→

> qg

←p 6 ◦ 6 q k

→?

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 7 / 15

Page 36: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Yaroslavskiy’s Quicksort – Example

Yaroslavskiy’s Dual Pivot Quicksort(used in Oracle’s Java 7 Arrays.sort(int[]))

3 1 2 5 4 7 8 9 6

g` k

A[k] is medium go on

Invariant: < p `→

> qg

←p 6 ◦ 6 q k

→?

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 7 / 15

Page 37: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Yaroslavskiy’s Quicksort – Example

Yaroslavskiy’s Dual Pivot Quicksort(used in Oracle’s Java 7 Arrays.sort(int[]))

3 1 2 5 4 7 8 9 6

g` k

A[k] is large Find swap partner.

Invariant: < p `→

> qg

←p 6 ◦ 6 q k

→?

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 7 / 15

Page 38: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Yaroslavskiy’s Quicksort – Example

Yaroslavskiy’s Dual Pivot Quicksort(used in Oracle’s Java 7 Arrays.sort(int[]))

3 1 2 5 4 7 8 9 6

g` k

A[k] is large Find swap partner:g skips over large elements.

Invariant: < p `→

> qg

←p 6 ◦ 6 q k

→?

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 7 / 15

Page 39: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Yaroslavskiy’s Quicksort – Example

Yaroslavskiy’s Dual Pivot Quicksort(used in Oracle’s Java 7 Arrays.sort(int[]))

3 1 2 5 4 7 8 9 6

g` k

g and k have crossed!Swap pivots in place

Invariant: < p `→

> qg

←p 6 ◦ 6 q k

→?

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 7 / 15

Page 40: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Yaroslavskiy’s Quicksort – Example

Yaroslavskiy’s Dual Pivot Quicksort(used in Oracle’s Java 7 Arrays.sort(int[]))

2 1 3 5 4 6 8 9 7

g` k

g and k have crossed!Swap pivots in place

Invariant: < p `→

> qg

←p 6 ◦ 6 q k

→?

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 7 / 15

Page 41: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Yaroslavskiy’s Quicksort – Example

Yaroslavskiy’s Dual Pivot Quicksort(used in Oracle’s Java 7 Arrays.sort(int[]))

2 1 3 5 4 6 8 9 7

Partitioning done!

Invariant: < p `→

> qg

←p 6 ◦ 6 q k

→?

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 7 / 15

Page 42: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Yaroslavskiy’s Quicksort – Example

Yaroslavskiy’s Dual Pivot Quicksort(used in Oracle’s Java 7 Arrays.sort(int[]))

2 1 3 5 4 6 8 9 7

Recursively sort three sublists.

Invariant: < p `→

> qg

←p 6 ◦ 6 q k

→?

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 7 / 15

Page 43: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Yaroslavskiy’s Quicksort – Example

Yaroslavskiy’s Dual Pivot Quicksort(used in Oracle’s Java 7 Arrays.sort(int[]))

1 2 3 4 5 6 7 8 9

Done.

Invariant: < p `→

> qg

←p 6 ◦ 6 q k

→?

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 7 / 15

Page 44: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Yaroslavskiy’s Quicksort

DUALPIVOTQUICKSORTYAROSLAVSKIY(A, left, right)

1 if right − left > 12 p := A[left]; q := A[right]3 if p > q then Swap p and q end if4 ` := left + 1; g := right − 1; k := `5 while k 6 g

6 if A[k] < p

7 Swap A[k] and A[`] ; ` := `+ 1

8 else if A[k] > q

9 while A[g] > q and k < g do g := g− 1 end while10 Swap A[k] and A[g] ; g := g− 1

11 if A[k] < p

12 Swap A[k] and A[`] ; ` := `+ 113 end if14 end if15 k := k+ 116 end while17 ` := `− 1; g := g+ 118 Swap A[left] and A[`] ; Swap A[right] and A[g]19 DUALPIVOTQUICKSORTYAROSLAVSKIY(A, left , `− 1)20 DUALPIVOTQUICKSORTYAROSLAVSKIY(A, `+ 1,g− 1)21 DUALPIVOTQUICKSORTYAROSLAVSKIY(A,g+ 1, right )22 end if

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 8 / 15

Page 45: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Yaroslavskiy’s QuicksortDUALPIVOTQUICKSORTYAROSLAVSKIY(A, left, right)

1 if right − left > 12 p := A[left]; q := A[right]3 if p > q then Swap p and q end if4 ` := left + 1; g := right − 1; k := `5 while k 6 g6 Ck if A[k] < p

7 Swap A[k] and A[`] ; ` := `+ 18 elseC′

k if A[k] > q

9 Cg while A[g] > q and k < g do g := g− 1 end while10 Swap A[k] and A[g] ; g := g− 111 C′

g if A[k] < p

12 Swap A[k] and A[`] ; ` := `+ 113 end if14 end if15 k := k+ 116 end while17 ` := `− 1; g := g+ 118 Swap A[left] and A[`] ; Swap A[right] and A[g]19 DUALPIVOTQUICKSORTYAROSLAVSKIY(A, left , `− 1)20 DUALPIVOTQUICKSORTYAROSLAVSKIY(A, `+ 1,g− 1)21 DUALPIVOTQUICKSORTYAROSLAVSKIY(A,g+ 1, right )22 end if

2 comparison locations

Ck handles pointer k

Cg handles pointer g

Ck first checks < p

C ′k if needed > q

Cg first checks > q

C ′g if needed < p

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 8 / 15

Page 46: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Analysis of Yaroslavskiy’s Algorithm

In this talk:only number of comparisons (swaps similar)only leading term asymptotics

all exact resultsin the paper

some marginal cases excluded

Cn expected #comparisons to sort random permutation of {1, . . . , n}

Cn satisfies recurrence relation

Cn = cn + 2n(n−1)

∑16p<q6n

(Cp−1 + Cq−p−1 + Cn−q

),

with cn expected #comparisons in first partitioning step

recurrence solvable by standard methods

linear cn ∼ a · n yields Cn ∼ 65a · n lnn.

need to compute cnSebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 9 / 15

Page 47: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Analysis of Yaroslavskiy’s Algorithm

first comparison for all elements (at Ck or Cg ) ∼ n comparisons

second comparison for some elements at C′k resp. C′

g

. . . but how often are C ′k resp. C ′g reached?

C ′k : all non- small elements reached by pointer k.

C ′g : all non- large elements reached by pointer g.

second comparison for medium elements not avoidable ∼ 1

3n comparisons in expectation

it remains to count:large elements reached by k andsmall elements reached by g.

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 10 / 15

Page 48: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Analysis of Yaroslavskiy’s Algorithm

Second comparisons for small and large elements?Depends on location!

C ′k l@K: number of large elements at positions K.

C ′g s@G: number of small elements at positions G.

Recall invariant: < p `→

> qg

←p 6 ◦ 6 q k

→?

k and g cross at (rank of) q

p q

positions K = {2, . . . , q− 1} G = {q, . . . , n− 1}

l@K = 3 s@G = 2

for given p and q, l@K hypergeometrically distributed E [l@K |p, q] = (n− q)q−2

n−2

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 11 / 15

Page 49: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Analysis of Yaroslavskiy’s Algorithm

law of total expectation:

E [l@K] =∑

16p<q6n

Pr[pivots (p, q)] · (n− q)q−2n−2 ∼ 1

6n

Similarly: E [s@G] ∼ 112n.

Summing up contributions:

cn ∼ n first comparisons

+ 13n medium elements

+ 16n large elements at C ′k

+ 112n small elements at C ′g

= 1912 n

Recall: “lower bound” was 2012n.

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 12 / 15

Page 50: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Results

Comparisons:

Yaroslavskiy needs ∼ 65· 1912n lnn = 1.9n lnn on average.

Classic Quicksort needs ∼ 2n lnn comparisons!

Swaps:

∼ 0.6n lnn swaps for Yaroslavskiy’s algorithm vs.

∼ 0.3n lnn swaps for classic Quicksort

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 13 / 15

Page 51: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Summary

We can exploit asymmetries to save comparisons!Many extra swaps might hurt.

However, runtime studies favor dual pivot Quicksort:more than 10% faster!

0 0.5 1 1.5 2

·106

7

7.5

8

n

time

10−6·n

lnn

Classic QuicksortYaroslavskiy

Normalized Java runtimes (in ms).Average and standard deviationof 1000 random permutationsper size.

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 14 / 15

Page 52: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Open Questions

Closer look at runtime: Why is Yaroslavskiy so fast in practice?

experimental studies?

Input distributions other than random permutations

equal elementspresorted lists

Variances of Costs, Limiting Distributions?

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 15 / 15

Page 53: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Lower Bound on Comparisons

How clever can dual pivot paritioning be?

For lower bound, assume

random permutation modelpivots are selected uniformlyan oracle tells us, whether more small or more large elements occur

1 comparison for frequent extreme elements2 comparisons for middle and rare extreme elements

(n− 2) + 2n(n−1)

∑16p<q6n

((q− p− 1) + min{p− 1, n− q}

)∼ 3

2n = 1812n

Even with unrealistic oracle, not much better than Yaroslavskiy

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 16 / 15

Page 54: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Counting Primitive Instructions à la Knuth

for implementations MMIX and Java bytecodedetermine exact expected overall costs

MMIX: processor cycles “oops” υ and memory accesses “mems” µBytecode: #executed instructions

divide program code into basic blocks

count cost contribution for blocks

determine expected execution frequencies of blocks

in first partitioning step total frequency via recurrence relation

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 17 / 15

Page 55: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Counting Primitive Instructions à la Knuth

Results:

Algorithm total expected costs

MMIX Classic (11υ+2.6µ)(n+1)Hn+(11υ+3.7µ)n+(−11.5υ−4.5µ)

MMIX Yaroslavskiy(13.1υ+2.8µ)(n+ 1)Hn + (−1.695υ+ 1.24µ)n

+ (−1.6783υ− 1.793µ)

Bytecode Classic 18(n+ 1)Hn + 2n− 15

BytecodeYaroslavskiy

23.8(n+ 1)Hn − 8.71n− 4.743

Classic Quicksort significantly better in both measures . . .

Why is Yaroslavskiy faster in practice?

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 18 / 15

Page 56: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Pivot Sampling

Idea: choose pivots from random sample of list

median for classic Quicksort

tertiles for dual pivot Quicksort

or asymmetric order statistics?

Here: sample of constant size k

choose pivots, such that t1 elements < p,t2 elements between p and q,t3 = k− 2− t1 − t2 larger > q

Allows to “push” pivot towards desired order statistic of list

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 19 / 15

Page 57: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Pivot Sampling

Idea: choose pivots from random sample of list

median for classic Quicksort

tertiles for dual pivot Quicksort?

or asymmetric order statistics?

Here: sample of constant size k

choose pivots, such that t1 elements < p,t2 elements between p and q,t3 = k− 2− t1 − t2 larger > q

Allows to “push” pivot towards desired order statistic of list

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 19 / 15

Page 58: Average Case Analysis of Java 7’s Dual Pivot Quicksort

Pivot Sampling

leading n lnn term coefficient of

Comparisons for k = 11

tertiles(t1, t2, t3) = (3, 3, 3)∼ 1.609n lnnminimum(t1, t2, t3) = (4, 2, 3)∼ 1.585n lnn

asymmetric orderstatistics are better!

Sebastian Wild Java 7’s Dual Pivot Quicksort 2012/09/11 20 / 15