156
CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

Embed Size (px)

Citation preview

Page 1: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826: Multimedia Databasesand Data Mining

Lecture #27: Time series mining and forecasting

Christos Faloutsos

Page 2: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 2

Must-Read Material• Byong-Kee Yi, Nikolaos D. Sidiropoulos,

Theodore Johnson, H.V. Jagadish, Christos Faloutsos and Alex Biliris, Online Data Mining for Co-Evolving Time Sequences, ICDE, Feb 2000.

• Chungmin Melvin Chen and Nick Roussopoulos, Adaptive Selectivity Estimation Using Query Feedbacks, SIGMOD 1994

Page 3: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 3

Thanks

Deepay Chakrabarti (UT-Austin)

Spiros Papadimitriou (Rutgers)

Prof. Byoung-Kee Yi (Samsung)

Page 4: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 4

Outline

• Motivation

• Similarity search – distance functions

• Linear Forecasting

• Bursty traffic - fractals and multifractals

• Non-linear forecasting

• Conclusions

Page 5: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 5

Problem definition

• Given: one or more sequences x1 , x2 , … , xt , …

(y1, y2, … , yt, …

… )

• Find – similar sequences; forecasts– patterns; clusters; outliers

Page 6: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 6

Motivation - Applications• Financial, sales, economic series

• Medical

– ECGs +; blood pressure etc monitoring

– reactions to new drugs

– elderly care

Page 7: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 7

Motivation - Applications (cont’d)

• ‘Smart house’

– sensors monitor temperature, humidity, air quality

• video surveillance

Page 8: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 8

Motivation - Applications (cont’d)

• civil/automobile infrastructure

– bridge vibrations [Oppenheim+02]

– road conditions / traffic monitoring

Page 9: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 9

Motivation - Applications (cont’d)

• Weather, environment/anti-pollution

– volcano monitoring

– air/water pollutant monitoring

Page 10: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 10

Motivation - Applications (cont’d)

• Computer systems

– ‘Active Disks’ (buffering, prefetching)

– web servers (ditto)

– network traffic monitoring

– ...

Page 11: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 11

Stream Data: Disk accesses

time

#bytes

Page 12: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 12

Problem #1:

Goal: given a signal (e.g.., #packets over time)

Find: patterns, periodicities, and/or compress

year

count lynx caught per year(packets per day;temperature per day)

Page 13: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 13

Problem#2: ForecastGiven xt, xt-1, …, forecast xt+1

0102030405060708090

1 3 5 7 9 11

Time Tick

Nu

mb

er o

f p

ack

ets

sen

t

??

Page 14: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 14

Problem#2’: Similarity searchE.g.., Find a 3-tick pattern, similar to the last one

0102030405060708090

1 3 5 7 9 11

Time Tick

Nu

mb

er o

f p

ack

ets

sen

t

??

Page 15: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 15

Problem #3:• Given: A set of correlated time sequences• Forecast ‘Sent(t)’

0102030405060708090

1 3 5 7 9 11

Time Tick

Nu

mb

er o

f p

ack

ets

sent

lost

repeated

Page 16: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 16

Important observations

Patterns, rules, forecasting and similarity indexing are closely related:

• To do forecasting, we need– to find patterns/rules– to find similar settings in the past

• to find outliers, we need to have forecasts– (outlier = too far away from our forecast)

Page 17: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 17

Outline

• Motivation

• Similarity Search and Indexing

• Linear Forecasting

• Bursty traffic - fractals and multifractals

• Non-linear forecasting

• Conclusions

Page 18: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 18

Outline

• Motivation

• Similarity search and distance functions– Euclidean– Time-warping

• ...

Page 19: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 19

Importance of distance functions

Subtle, but absolutely necessary:

• A ‘must’ for similarity indexing (-> forecasting)

• A ‘must’ for clustering

Two major families– Euclidean and Lp norms– Time warping and variations

Page 20: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 20

Euclidean and Lp

n

iii yxyxD

1

2)(),(x(t) y(t)

...

n

i

piip yxyxL

1

||),(

•L1: city-block = Manhattan

•L2 = Euclidean

•L

Page 21: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 21

Observation #1

• Time sequence -> n-d vector

...

Day-1

Day-2

Day-n

Page 22: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 22

Observation #2

Euclidean distance is closely related to – cosine similarity– dot product– ‘cross-correlation’

function

...

Day-1

Day-2

Day-n

Page 23: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 23

Time Warping

• allow accelerations - decelerations– (with or w/o penalty)

• THEN compute the (Euclidean) distance (+ penalty)

• related to the string-editing distance

Page 24: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 24

Time Warping

‘stutters’:

Page 25: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 25

Time warping

Q: how to compute it?A: dynamic programming D( i, j ) = cost to match prefix of length i of first sequence x with prefix

of length j of second sequence y

Page 26: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

Copyright: C. Faloutsos (2014)

Full-text scanning

• Approximate matching - string editing distance:

d( ‘survey’, ‘surgery’) = 2 = min # of insertions, deletions,

substitutions to transform the first string into the second SURVEY SURGERY

15-826 26

Page 27: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 27

Thus, with no penalty for stutter, for sequences

x1, x2, …, xi,; y1, y2, …, yj

),1(

)1,(

)1,1(

min][][),(

jiD

jiD

jiD

jyixjiD x-stutter

y-stutter

no stutter

Time warping

Page 28: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 28

VERY SIMILAR to the string-editing distance

),1(

)1,(

)1,1(

min][][),(

jiD

jiD

jiD

jyixjiD x-stutter

y-stutter

no stutter

Time warping

Page 29: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

Copyright: C. Faloutsos (2014)

Full-text scanning

if s[i] = t[j] then

cost( i, j ) = cost(i-1, j-1)

else

cost(i, j ) = min (

1 + cost(i, j-1) // deletion

1 + cost(i-1, j-1) // substitution

1 + cost(i-1, j) // insertion

)

15-826 29

Page 30: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 30

VERY SIMILAR to the string-editing distance

),1(

)1,(

)1,1(

min][][),(

jiD

jiD

jiD

jyixjiD

Time warping

1 + cost(i-1, j-1) // sub.1 + cost(i, j-1) // del.1 + cost(i-1, j) // ins.

cost(i,j) = min

Time-warping String editing

Page 31: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 31

Time warping

• Complexity: O(M*N) - quadratic on the length of the strings

• Many variations (penalty for stutters; limit on the number/percentage of stutters; …)

• popular in voice processing [Rabiner + Juang]

Page 32: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 32

Other Distance functions

• piece-wise linear/flat approx.; compare pieces [Keogh+01] [Faloutsos+97]

• ‘cepstrum’ (for voice [Rabiner+Juang])– do DFT; take log of amplitude; do DFT again!

• Allow for small gaps [Agrawal+95]

See tutorial by [Gunopulos + Das, SIGMOD01]

Page 33: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 33

Other Distance functions

• In [Keogh+, KDD’04]: parameter-free, MDL based

Page 34: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 34

Conclusions

Prevailing distances: – Euclidean and – time-warping

Page 35: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 35

Outline

• Motivation

• Similarity search and distance functions

• Linear Forecasting

• Bursty traffic - fractals and multifractals

• Non-linear forecasting

• Conclusions

Page 36: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 36

Linear Forecasting

Page 37: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 37

Forecasting

"Prediction is very difficult, especially about the future." - Nils Bohr

http://www.hfac.uh.edu/MediaFutures/thoughts.html

Page 38: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 38

Outline

• Motivation

• ...

• Linear Forecasting– Auto-regression: Least Squares; RLS– Co-evolving time sequences– Examples– Conclusions

Page 39: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 39

Reference

[Yi+00] Byoung-Kee Yi et al.: Online Data Mining for Co-Evolving Time Sequences, ICDE 2000. (Describes MUSCLES and Recursive Least Squares)

Page 40: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 40

Problem#2: Forecast• Example: give xt-1, xt-2, …, forecast xt

0102030405060708090

1 3 5 7 9 11

Time Tick

Nu

mb

er o

f p

ack

ets

sen

t

??

Page 41: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 41

Forecasting: PreprocessingMANUALLY:

remove trends spot periodicities

0

1

2

3

4

5

6

1 2 3 4 5 6 7 8 9 10

0

0.5

1

1.5

2

2.5

3

3.5

1 3 5 7 9 11 13

timetime

7 days

Page 42: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 42

Problem#2: Forecast• Solution: try to express

xt

as a linear function of the past: xt-2, xt-2, …, (up to a window of w)

Formally:

0102030405060708090

1 3 5 7 9 11Time Tick

??

Page 43: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 43

(Problem: Back-cast; interpolate)• Solution - interpolate: try to express

xt

as a linear function of the past AND the future: xt+1, xt+2, … xt+wfuture; xt-1, … xt-wpast

(up to windows of wpast , wfuture)

• EXACTLY the same algo’s

0102030405060708090

1 3 5 7 9 11Time Tick

??

Page 44: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 44

Linear Regression: idea

40

45

50

55

60

65

70

75

80

85

15 25 35 45

Body weight

patient weight height

1 27 43

2 43 54

3 54 72

……

N 25 ??

• express what we don’t know (= ‘dependent variable’)• as a linear function of what we know (= ‘indep. variable(s)’)

Body height

Page 45: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 45

Linear Auto Regression:Time Packets

Sent (t-1)PacketsSent(t)

1 - 43

2 43 54

3 54 72

……

N 25 ??

Page 46: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 46

Linear Auto Regression:

40

45

50

55

60

65

70

75

80

85

15 25 35 45

Number of packets sent (t-1)N

um

ber

of

pac

ket

s se

nt

(t)

Time PacketsSent (t-1)

PacketsSent(t)

1 - 43

2 43 54

3 54 72

……

N 25 ??

• lag w=1• Dependent variable = # of packets sent (S [t])• Independent variable = # of packets sent (S[t-1])

‘lag-plot’

Page 47: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 47

Outline

• Motivation

• ...

• Linear Forecasting– Auto-regression: Least Squares; RLS– Co-evolving time sequences– Examples– Conclusions

Page 48: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 48

More details:

• Q1: Can it work with window w>1?

• A1: YES!

xt-2

xt

xt-1

Page 49: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 49

More details:

• Q1: Can it work with window w>1?

• A1: YES! (we’ll fit a hyper-plane, then!)

xt-2

xt

xt-1

Page 50: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 50

More details:

• Q1: Can it work with window w>1?

• A1: YES! (we’ll fit a hyper-plane, then!)

xt-2

xt-1

xt

Page 51: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 51

More details:

• Q1: Can it work with window w>1?

• A1: YES! The problem becomes:

X[N w] a[w 1] = y[N 1]

• OVER-CONSTRAINED– a is the vector of the regression coefficients

– X has the N values of the w indep. variables

– y has the N values of the dependent variable

Page 52: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 52

More details:• X[N w] a[w 1] = y[N 1]

N

w

NwNN

w

w

y

y

y

a

a

a

XXX

XXX

XXX

2

1

2

1

21

22221

11211

,,,

,,,

,,,

Ind-var1 Ind-var-w

time

Page 53: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 53

More details:• X[N w] a[w 1] = y[N 1]

N

w

NwNN

w

w

y

y

y

a

a

a

XXX

XXX

XXX

2

1

2

1

21

22221

11211

,,,

,,,

,,,

Ind-var1 Ind-var-w

time

Page 54: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 54

More details

• Q2: How to estimate a1, a2, … aw = a?

• A2: with Least Squares fit

• (Moore-Penrose pseudo-inverse)

• a is the vector that minimizes the RMSE from y

• <identical math with ‘query feedbacks’>

a = ( XT X )-1 (XT y)

Page 55: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 55

More details

• Q2: How to estimate a1, a2, … aw = a?

• A2: with Least Squares fit

Identical to earlier formula (proof?)

Where

a = ( XT X )-1 (XT y)

a = V x x UT y

X = U x x VT

Page 56: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 56

More details• Straightforward solution:

• Observations:– Sample matrix X grows over time

– needs matrix inversion

– O(Nw2) computation

– O(Nw) storage

a = ( XT X )-1 (XT y)

a : Regression Coeff. VectorX : Sample Matrix

XN:

w

N

Page 57: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 57

Even more details

• Q3: Can we estimate a incrementally?

• A3: Yes, with the brilliant, classic method of ‘Recursive Least Squares’ (RLS) (see, e.g., [Yi+00], for details).

• We can do the matrix inversion, WITHOUT inversion! (How is that possible?!)

Page 58: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 58

Even more details

• Q3: Can we estimate a incrementally?

• A3: Yes, with the brilliant, classic method of ‘Recursive Least Squares’ (RLS) (see, e.g., [Yi+00], for details).

• We can do the matrix inversion, WITHOUT inversion! (How is that possible?!)

• A: our matrix has special form: (XT X)

Page 59: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 59

More details

XN:

w

NXN+1

At the N+1 time tick:

xN+1

Page 60: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 60

More details

• Let GN = ( XNT XN )-1 (``gain matrix’’)

• GN+1 can be computed recursively from GN

GN

w

w

Page 61: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 61

EVEN more details:

NNT

NNNN GxxGcGG

111

1 ][][

]1[ 11T

NNN xGxc

1 x w row vector

Let’s elaborate (VERY IMPORTANT, VERY VALUABLE!)

Page 62: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 62

EVEN more details:

][][ 111

11

NT

NNT

N yXXXa

Page 63: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 63

EVEN more details:

][][ 111

11

NT

NNT

N yXXXa

[w x 1]

[w x (N+1)]

[(N+1) x w]

[w x (N+1)]

[(N+1) x 1]

Page 64: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 64

EVEN more details:

][][ 111

11

NT

NNT

N yXXXa

[w x (N+1)]

[(N+1) x w]

Page 65: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 65

EVEN more details:

NNT

NNNN GxxGcGG

111

1 ][][

]1[ 11T

NNN xGxc

][][ 111

11

NT

NNT

N yXXXa

1111 ][

NT

NN XXG1 x w row vector‘gain

matrix’

Page 66: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 66

EVEN more details:

NNT

NNNN GxxGcGG

111

1 ][][

]1[ 11T

NNN xGxc

Page 67: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 67

EVEN more details:

NNT

NNNN GxxGcGG

111

1 ][][

]1[ 11T

NNN xGxc

wxw wxw wxw wx11xw

wxw

1x1

SCALAR!

Page 68: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 68

Altogether:

NNT

NNNN GxxGcGG

111

1 ][][

]1[ 11T

NNN xGxc

][][ 111

11

NT

NNT

N yXXXa

1111 ][

NT

NN XXG

Page 69: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 69

Altogether:

IG 0

where I: w x w identity matrix: a large positive number (say, 104)

IMPORTANT!

Page 70: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 70

Comparison:

• Straightforward Least Squares– Needs huge matrix

(growing in size) (O(N×w)

– Costly matrix operation O(N×w2)

• Recursive LS– Need much smaller, fixed

size matrix O(w×w)

– Fast, incremental computation (O(1×w2)

– no matrix inversion

N = 106, w = 1-100

Page 71: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 71

Pictorially:

• Given:

Independent Variable

Dep

ende

nt V

aria

ble

Page 72: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 72

Pictorially:

Independent Variable

Dep

ende

nt V

aria

ble

.

new point

Page 73: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 73

Pictorially:

Independent Variable

Dep

ende

nt V

aria

ble

RLS: quickly compute new best fit

new point

Page 74: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 74

Even more details

• Q4: can we ‘forget’ the older samples?

• A4: Yes - RLS can easily handle that [Yi+00]:

Page 75: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 75

Adaptability - ‘forgetting’

Independent Variableeg., #packets sent

Dep

ende

nt V

aria

ble

eg.,

#byt

es s

ent

Page 76: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 76

Adaptability - ‘forgetting’

Independent Variableeg. #packets sent

Dep

ende

nt V

aria

ble

eg.,

#byt

es s

ent

Trend change

(R)LSwith no forgetting

Page 77: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 77

Adaptability - ‘forgetting’

Independent Variable

Dep

ende

nt V

aria

ble

Trend change

(R)LSwith no forgetting

(R)LSwith forgetting

• RLS: can *trivially* handle ‘forgetting’

Page 78: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 89

Outline

• Motivation

• ...

• Linear Forecasting– Auto-regression: Least Squares; RLS– Co-evolving time sequences– Examples– Conclusions

Page 79: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 90

Co-Evolving Time Sequences• Given: A set of correlated time sequences• Forecast ‘Repeated(t)’

0102030405060708090

1 3 5 7 9 11

Time Tick

Nu

mb

er o

f p

ack

ets

sent

lost

repeated

??

Page 80: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 91

Solution:

Q: what should we do?

Page 81: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 92

Solution:

Least Squares, with

• Dep. Variable: Repeated(t)

• Indep. Variables: Sent(t-1) … Sent(t-w); Lost(t-1) …Lost(t-w); Repeated(t-1), ...

• (named: ‘MUSCLES’ [Yi+00])

Page 82: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 93

Forecasting - Outline

• Auto-regression

• Least Squares; recursive least squares

• Co-evolving time sequences

• Examples

• Conclusions

Page 83: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 94

Examples - Experiments• Datasets

– Modem pool traffic (14 modems, 1500 time-ticks; #packets per time unit)

– AT&T WorldNet internet usage (several data streams; 980 time-ticks)

• Measures of success– Accuracy : Root Mean Square Error (RMSE)

Page 84: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 95

Accuracy - “Modem”

MUSCLES outperforms AR & “yesterday”

0

0.5

1

1.5

2

2.5

3

3.5

4

RMSE

1 2 3 4 5 6 7 8 9 10 11 12 13 14

Modems

AR

yesterday

MUSCLES

Page 85: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 96

Accuracy - “Internet”

0

0.2

0.4

0.6

0.8

1

1.2

1.4

RMSE

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15

Streams

AR

yesterday

MUSCLES

MUSCLES consistently outperforms AR & “yesterday”

Page 86: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 97

Linear forecasting - Outline

• Auto-regression

• Least Squares; recursive least squares

• Co-evolving time sequences

• Examples

• Conclusions

Page 87: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 98

Conclusions - Practitioner’s guide

• AR(IMA) methodology: prevailing method for linear forecasting

• Brilliant method of Recursive Least Squares for fast, incremental estimation.

• See [Box-Jenkins]

• (AWSOM: no human intervention)

Page 88: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 99

Resources: software and urls

• free-ware: ‘R’ for stat. analysis (clone of Splus) http://cran.r-project.org/

• python script for RLShttp://www.cs.cmu.edu/~christos/SRC/rls-all.tar

Page 89: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 100

Books

• George E.P. Box and Gwilym M. Jenkins and Gregory C. Reinsel, Time Series Analysis: Forecasting and Control, Prentice Hall, 1994 (the classic book on ARIMA, 3rd ed.)

• Brockwell, P. J. and R. A. Davis (1987). Time Series: Theory and Methods. New York, Springer Verlag.

Page 90: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 101

Additional Reading

• [Papadimitriou+ vldb2003] Spiros Papadimitriou, Anthony Brockwell and Christos Faloutsos Adaptive, Hands-Off Stream Mining VLDB 2003, Berlin, Germany, Sept. 2003

• [Yi+00] Byoung-Kee Yi et al.: Online Data Mining for Co-Evolving Time Sequences, ICDE 2000. (Describes MUSCLES and Recursive Least Squares)

Page 91: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 102

Outline

• Motivation

• Similarity search and distance functions

• Linear Forecasting

• Bursty traffic - fractals and multifractals

• Non-linear forecasting

• Conclusions

Page 92: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 103

Bursty Traffic& Multifractals

Page 93: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 104

Outline

• Motivation

• ...

• Linear Forecasting

• Bursty traffic - fractals and multifractals– Problem– Main idea (80/20, Hurst exponent)– Results

Page 94: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 105

Reference:

[Wang+02] Mengzhi Wang, Tara Madhyastha, Ngai Hang Chang, Spiros Papadimitriou and Christos Faloutsos, Data Mining Meets Performance Evaluation: Fast Algorithms for Modeling Bursty Traffic, ICDE 2002, San Jose, CA, 2/26/2002 - 3/1/2002.

Full thesis: CMU-CS-05-185Performance Modeling of Storage Devices using Machine Learning Mengzhi Wang, Ph.D. ThesisAbstract, .ps.gz, .pdf

Page 95: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 106

Recall: Problem #1:

Goal: given a signal (eg., #bytes over time)

Find: patterns, periodicities, and/or compress

time

#bytes Bytes per 30’(packets per day;earthquakes per year)

Page 96: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 107

Problem #1

• model bursty traffic

• generate realistic traces

• (Poisson does not work)

time

# bytes

Poisson

Page 97: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 108

Motivation

• predict queue length distributions (e.g., to give probabilistic guarantees)

• “learn” traffic, for buffering, prefetching, ‘active disks’, web servers

Page 98: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 109

But:

• Q1: How to generate realistic traces; extrapolate; give guarantees?

• Q2: How to estimate the model parameters?

Page 99: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 110

Outline

• Motivation

• ...

• Linear Forecasting

• Bursty traffic - fractals and multifractals– Problem– Main idea (80/20, Hurst exponent)– Results

Page 100: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 111

Approach

• Q1: How to generate a sequence, that is– bursty– self-similar– and has similar queue length distributions

Page 101: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 112

Approach

• A: ‘binomial multifractal’ [Wang+02]

• ~ 80-20 ‘law’:– 80% of bytes/queries etc on first half– repeat recursively

• b: bias factor (eg., 80%)

Page 102: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 113

Binary multifractals20 80

Page 103: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 114

Binary multifractals20 80

Page 104: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 115

Could you use IFS?

To generate such traffic?

Page 105: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 116

Could you use IFS?

To generate such traffic?

A: Yes – which transformations?

Page 106: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 117

Could you use IFS?

To generate such traffic?

A: Yes – which transformations?

A:

x’ = x / 2 (p = 0.2 )

x’ = x / 2 + 0.5 (p = 0.8 )

Page 107: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 118

Parameter estimation

• Q2: How to estimate the bias factor b?

Page 108: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 119

Parameter estimation

• Q2: How to estimate the bias factor b?

• A: MANY ways [Crovella+96]– Hurst exponent– variance plot– even DFT amplitude spectrum! (‘periodogram’)– Fractal dimension (D2)

• Or D1 (‘entropy plot’ [Wang+02])

Page 109: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 133

Fractal dimension

• Real (and 80-20) datasets can be in-between: bursts, gaps, smaller bursts, smaller gaps, at every scale

Dim = 1

Dim=0

0<Dim<1

Page 110: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

Estimating ‘b’

• Exercise: Show that

D2 = - log2 ( b2 + (1-b)2 )

Sanity checks:

- b =1.0 D2 = ??

- b =0.5 D2 = ??15-826 (c) C. Faloutsos, 2014 134

b 1-b

log ( r )

Log (#pairs(<r))D2

Page 111: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 135

(Fractals, again)

• What set of points could have behavior between point and line?

Page 112: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 136

Cantor dust

• Eliminate the middle third

• Recursively!

Page 113: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 137

Cantor dust

Page 114: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 138

Cantor dust

Page 115: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 139

Cantor dust

Page 116: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 140

Cantor dust

Page 117: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 141

Dimensionality?(no length; infinite # points!)Answer: log2 / log3 = 0.6

Cantor dust

Page 118: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 149

Conclusions

• Multifractals (80/20, ‘b-model’, Multiplicative Wavelet Model (MWM)) for analysis and synthesis of bursty traffic

Page 119: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 150

Books

• Fractals: Manfred Schroeder: Fractals, Chaos, Power Laws: Minutes from an Infinite Paradise W.H. Freeman and Company, 1991 (Probably the BEST book on fractals!)

Page 120: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 151

Further reading:

• Crovella, M. and A. Bestavros (1996). Self-Similarity in World Wide Web Traffic, Evidence and Possible Causes. Sigmetrics.

• [ieeeTN94] W. E. Leland, M.S. Taqqu, W. Willinger, D.V. Wilson, On the Self-Similar Nature of Ethernet Traffic, IEEE Transactions on Networking, 2, 1, pp 1-15, Feb. 1994.

Page 121: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 152

Further reading

• [Riedi+99] R. H. Riedi, M. S. Crouse, V. J. Ribeiro, and R. G. Baraniuk, A Multifractal Wavelet Model with Application to Network Traffic, IEEE Special Issue on

Information Theory, 45. (April 1999), 992-1018. • [Wang+02] Mengzhi Wang, Tara Madhyastha, Ngai Hang

Chang, Spiros Papadimitriou and Christos Faloutsos, Data Mining Meets Performance Evaluation: Fast Algorithms for Modeling Bursty Traffic, ICDE 2002, San Jose, CA, 2/26/2002 - 3/1/2002.

Ent

ropy

plo

ts

Page 122: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 153

Outline

• Motivation

• ...

• Linear Forecasting

• Bursty traffic - fractals and multifractals

• Non-linear forecasting

• Conclusions

Page 123: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 154

Chaos and non-linearforecasting

Page 124: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 155

Reference:

[ Deepay Chakrabarti and Christos Faloutsos F4: Large-Scale Automated Forecasting using Fractals CIKM 2002, Washington DC, Nov. 2002.]

Page 125: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 156

Detailed Outline

• Non-linear forecasting– Problem– Idea– How-to– Experiments– Conclusions

Page 126: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 157

Recall: Problem #1

Given a time series {xt}, predict its future course, that is, xt+1, xt+2, ...

Time

Value

Page 127: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 158

Datasets

Logistic Parabola: xt = axt-1(1-xt-1) + noise Models population of flies [R. May/1976]

time

x(t

)

Lag-plot

ARIMA: fails

Page 128: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 159

How to forecast?

• ARIMA - but: linearity assumption

Lag-plot

ARIMA: fails

Page 129: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 160

How to forecast?

• ARIMA - but: linearity assumption

• ANSWER: ‘Delayed Coordinate Embedding’ = Lag Plots [Sauer92]

~ nearest-neighbor search, for past incidents

Page 130: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 161

General Intuition (Lag Plot)

xt-1

xxtt

4-NNNew Point

Interpolate these…

To get the final prediction

Lag = 1,k = 4 NN

Page 131: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 162

Questions:

• Q1: How to choose lag L?• Q2: How to choose k (the # of NN)?• Q3: How to interpolate?• Q4: why should this work at all?

Page 132: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 163

Q1: Choosing lag L

• Manually (16, in award winning system by [Sauer94])

Page 133: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 164

Q2: Choosing number of neighbors k

• Manually (typically ~ 1-10)

Page 134: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 165

Q3: How to interpolate?

How do we interpolate between the k nearest neighbors?

A3.1: Average

A3.2: Weighted average (weights drop with distance - how?)

Page 135: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 166

Q3: How to interpolate?

A3.3: Using SVD - seems to perform best ([Sauer94] - first place in the Santa Fe forecasting competition)

Xt-1

xt

Page 136: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 167

Q4: Any theory behind it?

A4: YES!

Page 137: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 168

Theoretical foundation

• Based on the ‘Takens theorem’ [Takens81]

• which says that long enough delay vectors can do prediction, even if there are unobserved variables in the dynamical system (= diff. equations)

Page 138: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 169

Theoretical foundation

Example: Lotka-Volterra equations

dH/dt = r H – a H*P dP/dt = b H*P – m P

H is count of prey (e.g., hare)P is count of predators (e.g., lynx)

Suppose only P(t) is observed (t=1, 2, …).

H

P

Skip

Page 139: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 170

Theoretical foundation

• But the delay vector space is a faithful reconstruction of the internal system state

• So prediction in delay vector space is as good as prediction in state space

Skip

H

P

P(t-1)

P(t)

Page 140: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 171

Detailed Outline

• Non-linear forecasting– Problem– Idea– How-to– Experiments– Conclusions

Page 141: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 172

Datasets

Logistic Parabola: xt = axt-1(1-xt-1) + noise Models population of flies [R. May/1976]

time

x(t

)

Lag-plot

Page 142: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 173

Datasets

Logistic Parabola: xt = axt-1(1-xt-1) + noise Models population of flies [R. May/1976]

time

x(t

)

Lag-plot

ARIMA: fails

Page 143: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 174

Logistic Parabola

Timesteps

Value

Our Prediction from here

Page 144: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 175

Logistic Parabola

Timesteps

Value

Comparison of prediction to correct values

Page 145: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 176

Datasets

LORENZ: Models convection currents in the airdx / dt = a (y - x) dy / dt = x (b - z) - y dz / dt = xy - c z

Value

Page 146: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 177

LORENZ

Timesteps

Value

Comparison of prediction to correct values

Page 147: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 178

Datasets

Time

Value

• LASER: fluctuations in a Laser over time (used in Santa Fe competition)

Page 148: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 179

Laser

Timesteps

Value

Comparison of prediction to correct values

Page 149: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 180

Conclusions

• Lag plots for non-linear forecasting (Takens’ theorem)

• suitable for ‘chaotic’ signals

Page 150: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 181

References

• Deepay Chakrabarti and Christos Faloutsos F4: Large-Scale Automated Forecasting using Fractals CIKM 2002, Washington DC, Nov. 2002.

• Sauer, T. (1994). Time series prediction using delay coordinate embedding. (in book by Weigend and Gershenfeld, below) Addison-Wesley.

• Takens, F. (1981). Detecting strange attractors in fluid turbulence. Dynamical Systems and Turbulence. Berlin: Springer-Verlag.

Page 151: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 182

References

• Weigend, A. S. and N. A. Gerschenfeld (1994). Time Series Prediction: Forecasting the Future and Understanding the Past, Addison Wesley. (Excellent collection of papers on chaotic/non-linear forecasting, describing the algorithms behind the winners of the Santa Fe competition.)

Page 152: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 183

Overall conclusions

• Similarity search: Euclidean/time-warping; feature extraction and SAMs

Page 153: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 184

Overall conclusions

• Similarity search: Euclidean/time-warping; feature extraction and SAMs

• Signal processing: DWT is a powerful tool

Page 154: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 185

Overall conclusions

• Similarity search: Euclidean/time-warping; feature extraction and SAMs

• Signal processing: DWT is a powerful tool

• Linear Forecasting: AR (Box-Jenkins) methodology; AWSOM

Page 155: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 186

Overall conclusions

• Similarity search: Euclidean/time-warping; feature extraction and SAMs

• Signal processing: DWT is a powerful tool

• Linear Forecasting: AR (Box-Jenkins) methodology; AWSOM

• Bursty traffic: multifractals (80-20 ‘law’)

Page 156: CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #27: Time series mining and forecasting Christos Faloutsos

CMU SCS

15-826 (c) C. Faloutsos, 2014 187

Overall conclusions

• Similarity search: Euclidean/time-warping; feature extraction and SAMs

• Signal processing: DWT is a powerful tool

• Linear Forecasting: AR (Box-Jenkins) methodology; AWSOM

• Bursty traffic: multifractals (80-20 ‘law’)

• Non-linear forecasting: lag-plots (Takens)