Markov Chains for the Web - SEO, Usability, Search Engine Scoring, and More

Using Markov Chains to Predict User Behavior

Rivka Fogel

Markov Chains: Probability without History

Andrey Markov

Rivka Fogel

What Are Probability Spaces?

Focal Object / Function Co-Domain

Function/Possibility 1

• Also known as stochastic processes

Rivka Fogel

Type 1: Time Series

First Event

Also called “states”

Rivka Fogel

Application: Personalization

• To return more accurate SERPs (E) for that user

Identifying user-specific authorities

User E B A

Rivka Fogel

Type 2: Spatial Field

Shared Event

• Variable interactions are often statistically correlated

Rivka Fogel

Addition of The Markov Property

E because of B or D, not because of A

• The probability of B causing E, as opposed to D causing E, is calculated by the Bayesian Theorem

The Next State Depends Only on the Current State:

Rivka Fogel

Application: (not provided)

• The Markov Property enables the marketer to model paths without knowing every state.

• While some keyphrase data is known, it can also identify the keyphrase based on other users’ paths where the keyphrase is known.

Homepage

Keyphrase?

Bounce

Model Landing Page

Homepage Video View

Inventory

Gallery Page Video View

Rivka Fogel

Application: Multichannel Attribution

Monitoring and prediction can be based on probability of a user’s path given other users’ paths

Probability of B A Probability of C

B 1 C Known Path 1

B 2 C Known Path 2

Rivka Fogel

Application: Audience Segmentation

Probability of B A

Probability of C

B 1 C Known Path 1

Landing Page

Known Path 2

Referral Paths On-Site Paths

Rivka Fogel

Relational Markov Properties

• Relational Markov Models group multiple types of objects – relations – and calculate the probability of the relation’s appearance in a state.

• They work off of Dynamic Bayesian Networks

Relational Markov Models allow states to be of different types.

E because of B or D’s type, not because of A or C’s type

State B

State D

Type 2 Type 1

State A

State C

Rivka Fogel

Application: Audience Segmentation 2

Organic

Rivka Fogel

Application: User Experience

Homepage Bounce

Model Landing Page

Homepage Video View

Inventory

Gallery Page Video View

Page Visit Video View Bounce

Types:

Rivka Fogel

Application: Social Network Modeling

• This function will answer: if the user ended up converting/visiting the landing page, which [type(s)] of social interaction[s] came into play?

Site Landing

Rich Media Play Rich Media

Host Page

User Share

Influencer

News Feed

Brand Social Profile

Rivka Fogel

Application: HTTP Service Request Prediction

• Prefetch Page A given the probability that the user will want to see it. • The keyphrase cluster is predicted by the function with co-domain B and

is then used to predict the incidence of B where the first state isn’t known.

Probability of 3 A Keyphrase 1

Known Paths

Keyphrase Cluster

Keyphrase 2

Rivka Fogel

Application: Agent Suggestion

• Auto-suggests searches (Search C) and links (URL E) that the user is likely to want to access, based on user history and other users’ history

URL A URL B

URL C URL D URL E

Keyphrase Cluster or Authority

First words of Query

Search A

Search B Search C

Rivka Fogel

Application: Search Engine Scoring

• The function identifies hubs of authority that are probable next steps in many systems (each with individual focus objects).

Identifying Authority 2:

Page A Keyphrase Cluster

Page B

Link 2

Page C Link 1

Authority 1 Authority 2

Rivka Fogel

Appendix: Formal Definitions

Where, Probability Spaces: • The measurable space (S, Σ) and an object on the

measurable space X • The probability space is defined by the function P, the

assignment of probabilities to events, and where Ω is the set of possible outcomes, and F is set of events in which each event has 0 or more outcomes P(x) = Σ(t1-tk)P(t1) for all X on Ω

• The finite dimensional distribution X: Xt1 Ω -> Xk

• That arrow, or the push forward measures, or the random distribution of events, or the matrix of transition probabilities P PT1(.)=PT1(.)/x = Sk

– Where the Bayesian theorem allows for: P (H|E old) = P(H)*P(H|E new)/P(E entire set)

Rivka Fogel

• P(Xl+1=S | Xl=St | Xl-1 = St-1 … X0 = S0) = P(Xl+1=S | Xl = Sl) | Xl=I – The random distribution of events is defined because the

system is finite. • So, in the matrix of transition probabilities [defined

as Pl, l+1 over ij = P(Xl+1 = j | Xl=i)], Pl is independent of l.

• That is, s^(t) = s^

(t-1)A – s is the state space, A is the matrix of transition

probabilities, and ^ is the initial probability distribution of the states in s. s(t) is the probability vector for states at time “t.”

Then, Markov Property:

Rivka Fogel

Markov Restatement 1: When a User’s History is Available

• A(s, s’)=C(s,s’)/Σs’’ C(s,s’’) and ^(s)=C(s)/Σs’ C(s’) – C(s,s’) counts the instances where s’ follows s – This can be applied to HTTP prediction and agent

suggestion

Rivka Fogel

Markov Restatement 2: When the Evidence Comes from a User Pool • The Markov function becomes a generative chain

link system that can store counts and probabilities • s^(t) = a0i^(t-1)A+a1i^(t-2)A2+a2i^(t-3)A3… and

= Max(a0i^(t-1)A+a1i^(t-2)A2+a2i^(t-3)A3…) – s(t) is normalized to select a list of probable states. – Where probabilities are used:

This can be applied to authority hubs as well, where collected user path traversal patterns are represented in a traversal connectivity matrix.

Rivka Fogel

Markov Restatement 3: When Groupings of States Are Estimated • These are Relational Markov Models • These groupings are also seen as abstractions. A(Q) forms a

lattice of abstractions. – {D, R, Q, A, π} where D ∈ D is the tree and a hierarchy of values. R is a

set of relations. Each relation is defined by nodes on leaves of D. Q is the set of states. A is the transition probability matrix. Π is the initial probability, that is the initial state in the chain. States are defined as abstractions on Q.

– The rank of an abstraction a=R(d1, …., dk) in the lattice is defined as 1+ Σk

1 depth(dk). Depth is a node’s depth on the tree, and increases with the abstraction’s rank. The rank of Q (the most general) is 0.

• States that have nodes on common leaves will more frequently appear in abstractions together.

Rivka Fogel

Further Reading • Anderson, Corin R., Domingos, Pedro, and Weld, Daniel S.

“Relational Markov Models and their Application to Adaptive Web Navigation.” Proceedings of the eighth ACM SIGKDD international conference on knowledge discovery and data mining. (2002): 143-152. Electronic. http://homes.cs.washington.edu/~pedrod/papers/kdd02a.pdf

• Downey, Allen. “Bayesian statistics made (as) simple (as possible).” Pycon US. 7 March 2012. http://pyvideo.org/video/608/bayesian-statistics-made-as-simple-as-possible

• Ildiko, Flesch and Lucas, Peter. “Markov Equivalence in Bayesian Networks.” Electronic. http://www.cs.ru.nl/P.Lucas/markoveq.pdf

• Sarukkai, Ramesh R. “Link prediction and path analysis using Markov chains.” Computer Networks 3 (June 2000): 377-386. Electronic. http://www.sciencedirect.com/science/article/pii/S138912860000044X

Rivka Fogel

Questions?

Markov Chains for the Web - SEO, Usability, Search Engine Scoring, and More

Marketing

Usability, Affordance, and Usability Principlesuser.ceng.metu.edu.tr/~tcan/se542_f1516/Schedule/week1... · 2015-09-30 · Usability, Affordance, and Usability Principles Visual affordances

Pengujian Usability Website Menggunakan System Usability

Hidden Markov Models - AUusers-cs.au.dk/cstorm/courses/PRiB_f12/slides/hidden-markov-model… · Hidden Markov Models Markov Model Hidden Markov Model If the latent variables are

Markov Chains Regular Markov Chains Absorbing Markov Chains

PepHMM: A Hidden Markov Model Based Scoring Function for Mass Spectrometry Database Search Laxman Yetukuri T-61.6070: Modeling of Proteomics Data

USABILITY S10-B1: Powerpoint-Presentation »Usability«

Chapter 9: Markov Chain Regular Markov Chains Section 9…momran/m118videos/notes/sec92.pdf · Chapter 9: Markov Chain Section 9.2: Regular Markov Chains • Irreducible Markov Chain:

Usability Analysis with Markov Models · Markov models can be built into design tools, and can therefore be made very convenient for designers to utilise. One would hope that in the

Geoportal Usability Evaluation Geoportal Usability Evaluation

PepHMM: A Hidden Markov Model Based Scoring Function for Mass

Usability 2009 J T Burns1 Usability & Usability Engineering

M-Outpatient Health Care Usability Profile v4OR Health Care Usability Profile v4OR (M-OHCUP) ... Overall Scoring Sheet ... wall adjacent to the latch

Usability, Affordance, and Usability Principles

Usability 2 - Usability Heuristiken Und Usability Engineering - 2012-04-19

2 1 Discrete Markov Processes (Markov Chains) 3 1 First-Order Markov Models

Chapter 9: Markov Chain Regular Markov Chains …momran/m118videos/notes/sec92.pdf · Chapter 9: Markov Chain Section 9.2: Regular Markov Chains • Irreducible Markov Chain: When

Markov Chains - 1 Markov Chains Chapter 16. Markov Chains - 2 Overview Stochastic Process Markov Chains Chapman-Kolmogorov Equations State classification

Markov Models and Hidden Markov Models (HMMs)

Usability Engineering. 2 Contents Usability Engineering Know the user Task analysis Usability metrics The usability engineering process

Usability, Affordance, and Usability Principlesuser.ceng.metu.edu.tr/~tcan/se705_s0910/Schedule/se705_week2.pdf · Usability, Affordance, and Usability Principles Visual affordances