15
1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of the 15th International Workshop on Research Issues in Data Engineering: Stream Data Mining and Applications (RIDE-SDMA’05) Advisor Jia-Ling Koh Speaker Chun-Wei Hsieh

1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of

Embed Size (px)

Citation preview

Page 1: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of

1

Maintaining Knowledge-Bases of Navigational Patterns from Streams of

Navigational Sequences

Ajumobi Udechukwu, Ken Barker, Reda Alhajj

Proceedings of the 15th International Workshop on Research Issues in Data Engineering: Stream Data Mining and Applications (RIDE-SDMA’05)

Advisor : Jia-Ling Koh

Speaker : Chun-Wei Hsieh

Page 2: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of

2

Introduction

Navigational patterns: traversal patterns

Two broad techniques for mining

navigational patterns– 1. level-wise, apriori-based techniques– 2. tree-based techniques

Page 3: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of

3

Methodology

Sliding window Batch-update strategy

– Batch: the web log in the base time unit

Example

B1

4

B1 B2B1 B2 B3B1 B2 B3 B4B1 B2 B3 B4 B5B1 B2 B3 B4 B5 B6

Page 4: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of

4

Adapted GST

Adapted generalized suffix tree Appending a stop symbol to all strings Mining without thresholds

Page 5: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of

5

1,12,1

1,22,2

1,32,3

LQ R$$R Q

R$

1,42,43,3

$

3,1

$

3,1

$

1,11,2

1,3

LQR$$RQ

R$

1,4

$

1,12,1

1,22,2

1,32,3

LQR$$RQ

R$

1,42,4

$

Adapted GST

LQR

1,1LQR$

1,11,2LQR$$RQ

1,11,2

1,3

LQR$$RQ

R$

LQR LQ

Page 6: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of

6

Adapted GST

Page 7: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of

7

The Challenge of Adapted GST

”LQ” occurs in B1 with support count of 4

and “L” occurs independently in B2 with support count of 2

Total count of “L” should be 4 + 2

Page 8: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of

8

AC-NAP tree 1

Page 9: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of

9

AC-NAP tree 2

Output all node labels and counts to a database

Page 10: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of

10

Maintaining patterns within a window

Page 11: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of

11

Maintaining patterns within a window

Count total support

Remove out_of_date patterns

Page 12: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of

12

Experiments

OS: Microsoft Windows XP professional edition CPU: 2GHz Intel Pentium 4 RAM: 512MB Program language: Java DBMS: MySQL Data: real-world web logs of ”msnbc.com”

Page 13: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of

13

Experiments

Page 14: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of

14

Experiments

Page 15: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of

15

Experiments