25
831-656-7582 http:// movesinstitute.org Approaches to Event Prediction in Complex Environments Terence Tan (PhD Candidate) Advisors: Prof Christian Darken, PhD Prof Neil Rowe , PhD Prof Arnold Buss , PhD Prof Ralucca Gera , PhD Prof John Hiles

Approaches to Event Prediction in Complex Environments

  • Upload
    kamana

  • View
    56

  • Download
    0

Embed Size (px)

DESCRIPTION

Approaches to Event Prediction in Complex Environments . Terence Tan (PhD Candidate) Advisors: Prof Christian Darken, PhD Prof Neil Rowe , PhD Prof Arnold Buss , PhD Prof Ralucca Gera , PhD Prof John Hiles. 831-656-7582 http:// movesinstitute.org. Scope of Presentation. - PowerPoint PPT Presentation

Citation preview

Page 1: Approaches to Event Prediction in Complex Environments

831-656-7582http://movesinstitute.org

Approaches to Event Prediction in Complex Environments

Terence Tan (PhD Candidate)

Advisors:Prof Christian Darken, PhD

Prof Neil Rowe , PhDProf Arnold Buss , PhDProf Ralucca Gera , PhD

Prof John Hiles

Page 2: Approaches to Event Prediction in Complex Environments

Scope of Presentation

• What is Relational Time Series?• Previous Approaches• New Learning and Prediction Approaches• Conclusions

2

Page 3: Approaches to Event Prediction in Complex Environments

Network Intrusion Detection Alerts

12/02/11-17:32:21.984133 2924 NETBIOS SMB-DS repeated logon failure TCP 78.45.215.210 63.205.26.80

What is Relational Time Series?Previous ApproachesNew Learning and Prediction ApproachesConclusions

3

Page 4: Approaches to Event Prediction in Complex Environments

Network Intrusion Detection Alerts

12/02/11-17:32:21.984133 2924 NETBIOS SMB-DS repeated logon failure TCP 78.45.215.210 63.205.26.8012/02/11-17:32:24.712867 2924 NETBIOS SMB-DS repeated logon failure TCP 78.45.215.210 63.205.26.80

What is Relational Time Series?Previous ApproachesNew Learning and Prediction ApproachesConclusions

4

Page 5: Approaches to Event Prediction in Complex Environments

Network Intrusion Detection Alerts

12/02/11-17:32:21.984133 2924 NETBIOS SMB-DS repeated logon failure TCP 78.45.215.210 63.205.26.8012/02/11-17:32:24.712867 2924 NETBIOS SMB-DS repeated logon failure TCP 78.45.215.210 63.205.26.8012/02/11-18:50:13.575037 648 SHELLCODE x86 NOOP TCP 84.0.158.110 63.205.26.70

What is Relational Time Series?Previous ApproachesNew Learning and Prediction ApproachesConclusions

5

Page 6: Approaches to Event Prediction in Complex Environments

Network Intrusion Detection Alerts

12/02/11-17:32:21.984133 2924 NETBIOS SMB-DS repeated logon failure TCP 78.45.215.210 63.205.26.8012/02/11-17:32:24.712867 2924 NETBIOS SMB-DS repeated logon failure TCP 78.45.215.210 63.205.26.8012/02/11-18:50:13.575037 648 SHELLCODE x86 NOOP TCP 84.0.158.110 63.205.26.7012/02/11-18:50:13.575356 648 SHELLCODE x86 NOOP TCP 84.0.158.110 63.205.26.70

What is Relational Time Series?Previous ApproachesNew Learning and Prediction ApproachesConclusions

6

Page 7: Approaches to Event Prediction in Complex Environments

Time Series of Network Intrusion Detection Alerts

12/02/11-17:32:21.984133 2924 NETBIOS SMB-DS repeated logon failure TCP 78.45.215.210 63.205.26.8012/02/11-17:32:24.712867 2924 NETBIOS SMB-DS repeated logon failure TCP 78.45.215.210 63.205.26.8012/02/11-18:50:13.575037 648 SHELLCODE x86 NOOP TCP 84.0.158.110 63.205.26.7012/02/11-18:50:13.575356 648 SHELLCODE x86 NOOP TCP 84.0.158.110 63.205.26.7012/02/11-18:50:13.575356 3397 NETBIOS DCERPC NCACN-IP-TCP TCP 84.0.158.110 63.205.26.7012/02/11-18:50:15.443929 648 SHELLCODE x86 NOOP TCP 84.0.158.110 63.205.26.7312/02/11-18:50:15.444255 648 SHELLCODE x86 NOOP TCP 84.0.158.110 63.205.26.7312/02/11-18:50:15.444255 3397 NETBIOS DCERPC NCACN-IP-TCP TCP 84.0.158.110 63.205.26.7312/02/11-18:50:19.048303 648 SHELLCODE x86 NOOP TCP 84.0.158.110 63.205.26.7712/02/11-18:50:19.048624 648 SHELLCODE x86 NOOP TCP 84.0.158.110 63.205.26.7712/02/11-18:50:19.048624 3397 NETBIOS DCERPC NCACN-IP-TCP TCP 84.0.158.110 63.205.26.7712/02/11-18:50:20.346232 648 SHELLCODE x86 NOOP TCP 84.0.158.110 63.205.26.7412/02/11-18:50:22.656974 648 SHELLCODE x86 NOOP TCP 84.0.158.110 63.205.26.7912/02/11-18:50:22.657291 648 SHELLCODE x86 NOOP TCP 84.0.158.110 63.205.26.7912/02/11-18:50:22.657291 3397 NETBIOS DCERPC NCACN-IP-TCP TCP 84.0.158.110 63.205.26.7912/02/11-19:12:38.913940 384 ICMP PING ICMP 66.235.66.233 63.205.26.8012/02/11-19:12:38.914642 408 ICMP Echo Reply ICMP 63.205.26.80 66.235.66.23312/02/11-19:12:38.959461 384 ICMP PING ICMP 66.235.66.233 63.205.26.8012/02/11-19:12:38.959672 408 ICMP Echo Reply ICMP 63.205.26.80 66.235.66.233

What is Relational Time Series?Previous ApproachesNew Learning and Prediction ApproachesConclusions

7

Page 8: Approaches to Event Prediction in Complex Environments

Relational Time Series: Time Series of Relational Atoms

• 0.0, lookA(spock84)• 0.0, place+(Paperville3)• 0.0, location+(pitchfork74, Paperville3)• 0.0, pitchfork+(pitchfork74)• 0.0, location+(spock84, Paperville3)• 0.0, spock+(spock84)• 2.75, getA(pitchfork74, spock84)• 2.75, getE(spock84, pitchfork74)• 2.75, location-(pitchfork74, Paperville3)• 2.75, location+(pitchfork74, spock84)• 5.5, wA(spock84)• 5.5, goE(spock84, west)• 5.5, location-(spock84, Paperville3)• 5.5, spock-(spock84)• 5.5, place-(Paperville3)

P = (t, r(c1, c2, … cn))where

P: perceptt: timer: relationcx: constant

What is Relational Time Series?Previous ApproachesNew Learning and Prediction ApproachesConclusions

8

Page 9: Approaches to Event Prediction in Complex Environments

Relational Time Series: Time Series of Relational Atoms

• 0.0, lookA(spock84)• 0.0, place+(Paperville3)• 0.0, location+(pitchfork74, Paperville3)• 0.0, pitchfork+(pitchfork74)• 0.0, location+(spock84, Paperville3)• 0.0, spock+(spock84)• 2.75, getA(pitchfork74, spock84)• 2.75, getE(spock84, pitchfork74)• 2.75, location-(pitchfork74, Paperville3)• 2.75, location+(pitchfork74, spock84)• 5.5, wA(spock84)• 5.5, goE(spock84, west)• 5.5, location-(spock84, Paperville3)• 5.5, spock-(spock84)• 5.5, place-(Paperville3)

P = (t, r(c1, c2, … cn))where

P: perceptt: timer: relationcx: constant

What is the next percept?

What is Relational Time Series?Previous ApproachesNew Learning and Prediction ApproachesConclusions

9

Page 10: Approaches to Event Prediction in Complex Environments

Characteristics• No Background knowledge

– Eg. In a unknown domain, we do not know the behaviors of any entity

• Relational Atoms– Multi-dimension proposition

• High variability in predicates & constants– Too many to predefine

• Moving Context– Needs online Learning

10

What is Relational Time Series?Previous ApproachesNew Learning and Prediction ApproachesConclusions

Page 11: Approaches to Event Prediction in Complex Environments

Motivations

• Anticipation• Early Warning

What is Relational Time Series?Previous ApproachesNew Learning and Prediction ApproachesConclusions

11

Page 12: Approaches to Event Prediction in Complex Environments

Previous Approaches• Production Rules• Finite State Machines• Bayesian Network• Markov Chain• Statistical Relational Learning

What is Relational Time Series?Previous ApproachesNew Learning and Prediction ApproachesConclusions

12

Page 13: Approaches to Event Prediction in Complex Environments

Recent Interest in IDS Alerts Predictions• Recent Interests

– 2011 Nexat a history-based approach to predict attacker actions – 2011 A Novel Probabilistic Matching Algorithm for Multi-Stage Attack

Forecast– 2010 Multi stage attack Detection system for Network Administrators

using Data Mining (UTN, Oak Ridge NL)– 2008 Alert Fusion Based on Cluster and Correlation– 2007 Using Network Attack Graph to Predict the Future Attacks– 2007 Discovering Novel Multistage Attack Strategies

• Common Approaches– Mine Behavior from past data, such as DARPA challenge (1998, 1999 and

2000– No Online learning

What is Relational Time Series?Previous ApproachesNew Learning and Prediction ApproachesConclusions

13

Page 14: Approaches to Event Prediction in Complex Environments

• 0.0, lookA(spock84)• 0.0, place+(Paperville3)• 0.0, location+(pitchfork74, Paperville3)• 0.0, pitchfork+(pitchfork74)• 0.0, location+(spock84, Paperville3)• 0.0, spock+(spock84)• 2.75, getA(pitchfork74, spock84)• 2.75, getE(spock84, pitchfork74)• 2.75, location-(pitchfork74, Paperville3)• 2.75, location+(pitchfork74, spock84)• 5.5, wA(spock84)• 5.5, goE(spock84, west)• 5.5, location-(spock84, Paperville3)• 5.5, spock-(spock84)• 5.5, place-(Paperville3)

Situation Learning

• Situation Learning (Darken, 2005) – A sliding time window identifies

“Situations”– Forms a simple lookup table– Able to model trending and high

variability

14

∆ 𝒕Predictive atom

What is Relational Time Series?Previous ApproachesNew Learning and Prediction ApproachesConclusions

Page 15: Approaches to Event Prediction in Complex Environments

Situation Learning (SL) + Current Approaches

15

What is Relational Time Series?Previous ApproachesNew Learning and Prediction ApproachesConclusions

Page 16: Approaches to Event Prediction in Complex Environments

Conceptual Blending

Outer Space Mappings

ProjectionEmergent Frame

(From Fauconnier & Turner, 2002)

Constitution PrinciplesVital Relation Mapping

Construct Generic SpaceCompositionCompletionElaboration

Back Projection

Optimality PrinciplesCompression

TopologyPattern Completion

IntegrationPromoting Vital Relation

WebUnpackingrelevance

Vital RelationChange, Cause-Effect, Time,

Space, Identity, Change, Uniqueness, Part-Whole,

Representation, Role, Analogy, Disanalogy, Property,

Similarity, Category, and Intentionality

Data, information, organizer

Previous Domains

CurrentDomain

Back projection

={?c/?d}

Subst(,)

={?a/?b}

Subst(,)

What is Relational Time Series?Previous ApproachesNew Learning and Prediction ApproachesConclusions

16

Page 17: Approaches to Event Prediction in Complex Environments

Single Scope Blending Example• Dragon-1 in Location-1• Agent-1 enter Location-1• Dragon-1 Kill Agent-1• …• …• Goblin-2 in location-2• Dragon-2 in location-2• Agent-2 enter location-2• ?

Dragon-1 Agent-1

Location-1

kill

in Enter

Dragon-2 Agent-2

Location-2

in Enter

Previous Situation

Current Situation

inGoblin-2

One Possible Substitution:Dragon-1 to Goblin-2Agent-1 to Agent-2

Prediction: Goblin-2 Kill Agent-2

Analogy

What is Relational Time Series?Previous ApproachesNew Learning and Prediction ApproachesConclusions

17

Page 18: Approaches to Event Prediction in Complex Environments

Prediction Accuracies from a Agent Simulator

LUT VAR VOMM MSB SBM SSB(DFS)0.00

0.05

0.10

0.15

0.20

0.25

0.30

0.35

0.40

Average Prediction Accuracy, 40 batches of 100 percepts

LUT VAR VOMM MSB SBM SSB0.00

2.00

4.00

6.00

8.00

10.00

12.00

Time to run 40 batches of 100 percepts

What is Relational Time Series?Previous ApproachesNew Learning and Prediction ApproachesConclusions

18

Page 19: Approaches to Event Prediction in Complex Environments

Network Intrusion Alerts Predictions

Greedy Previous LUT VAR MSB VOMM SSB0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

Final Average Prediction Accuracy

Dataset1Dataset2

What is Relational Time Series?Previous ApproachesNew Learning and Prediction ApproachesConclusions

19

Page 20: Approaches to Event Prediction in Complex Environments

Why is SSB Better?• Dataset

– 6482 alerts – 1590 unique alerts

• Detection Rate

• Effect of Frequency on Detection RateFrequency Number of Alerts SSB Detects MSB detects VOMM detects

1 643 163 0 02 751 621 230 2423 52 34 27 144 88 80 77 745 5 0 0 06 11 10 8 87 3 3 1 18 2 2 2 29 4 4 3 3

10 3 3 3 3

SSB MSB VOMMUnique Alert Detected 947 379 375% 59.56% 23.84% 23.58%

What is Relational Time Series?Previous ApproachesNew Learning and Prediction ApproachesConclusions

20

Page 21: Approaches to Event Prediction in Complex Environments

Complexity Reduction: From Exponential to Linear

• Default Method: Backtracking– Subgraph Isomorphism– NP-Complete

• 1st Improvement– Formulate the problem as graph search– Depth First Search to Greedy Best First Search– Best First Search to 2-tier ASTAR

• 2nd Improvement– Attention Based Search

What is Relational Time Series?Previous ApproachesNew Learning and Prediction ApproachesConclusions

21

Page 22: Approaches to Event Prediction in Complex Environments

Attention Based Search

Dragon-1 Agent-1

Location-1

in Enter

Dragon-2 Agent-2

Location-2

in Enter

Previous Situation Current Situation

node1 node2 Difference

Dragon - 1 Dragon - 2 [1, 0, 1, 1, 0]

Agent - 2 [0, 0, 1, 1, 0]

Location - 2 [0, 0, 0, 0, -3]

Agent - 1 Dragon - 2 [0, 0, 1, 1, 0]

Agent - 2 [1, 0, 1, 1, 0]

Location - 2 [0, 0, 0, 0, -3]

Location - 1 Dragon - 2 [0, 0, 0, 0, -3]

Agent - 2 [0, 0, 0, 0, -3]

Location - 2 [1, 0, 1, 1, 0]

nodes In Degree

Out Degree

Type

Dragon – 1 0 1 D

Agent – 1 0 1 A

Location – 1 2 0 L

Dragon – 2 0 1 D

Agent – 2 0 1 A

Location – 2 2 0 L

score = [Type, ExactNameMatch, BothExactDegreeMatch, AtLeastOneDegreeMatch, DegreeDiff]22

Page 23: Approaches to Event Prediction in Complex Environments

Scalability Test

1 2 3 4 5 6 7 8 9 100

10

20

30

40

50

60

Processing Time over number of atom in each situation (Pymud)

ASTARAttentionBacktrack

time

(sec

)

1 2 3 4 5 6 7 8 9 100

0.2

0.4

0.6

0.8

1

1.2

1.4

1.6

Processing Time over number of atom in each situation (Pymud)

ASTARAttention

Axis Title

What is Relational Time Series?Previous ApproachesNew Learning and Prediction ApproachesConclusions

23

Page 24: Approaches to Event Prediction in Complex Environments

Conclusions & Future Works

• Conclusions– Single Scope Blending Prediction Approach predicts better– Reduces NP-Complete complexity to Linear through Greedy ASTAR and

Attention based search

• Future Works– 2-tier memory system (Short term and long term memory)– Flexible windows based on event segment theory– Letter prediction

24

Page 25: Approaches to Event Prediction in Complex Environments

Thank you

25