Relational Factor Graphs

Lin Liao

Joint work with Dieter Fox

A Running Example

Collective classification of a person’s significant places

Features to Consider Local features:

Temporal: time of day, day of week, duration

Geographic: near restaurants, near stores Pair-wise features:

Transitions: which place follows which place Global features:

Aggregates: number of homes or workplaces

Which Graphical Model?

Option 1: Bayesian networks and Probabilistic Relational Models But the pair-wise relations may introduce

cycles

Place 1

Place 3 Place 4

Place 2

Which Graphical Model?

Option 2: Markov networks and Relational Markov Networks But aggregations can introduce huge

cliques and lose independence relations.

Place 1

Place 3 Place 4

Place 2

Number of homes

Motivation

We want a relational probabilistic model that is Suitable to represent both undirected

relations (e.g., pair-wise features) and directed relations (e.g., deterministic aggregation)

Able to address some of the computational issues at the template level

Outline Representation

Factor graphs [Kschischang et al. 2001, Frey 2003]

Relational factor graphs Inference

Belief propagation Inference templates

Summation template based on FFT Experiments

Factor Graph Undirected factor graph [Kschischang et al.

2001] Bipartite graph that includes both variable

nodes (x1,…,xN) and factor nodes (f1,…,fM)

Joint distribution of variables is proportional to the product of factor functions

Factor Graph Directed factor graph [Frey 2003]

Allow some edges to be directed so as to unify Bayesian networks and Markov networks

A valid graph should have no directed cycles

Markov Network to Factor Graph

Factors represent the potential functions

Markov network Factor graph

Bayesian Network to Factor Graph

Factors represent the conditional probability table

Bayesian network Factor graph

Unify MN and BN

Local features

Place labels

Aggregation factor

Number of homes

Aggregate features

Relational Factor Graph

A set of factor templates that can be used to instantiate (directed) factor graphs given data Representation template

Use SQL (similar to RMN) Guarantee no directed cycles

Inference template Optimization within a factor (discussed

later)

Place Labeling: Schema

Place Labeling: Transition Features

Label1 Label2 Label3

Pair-wise factor

Place Labeling: Aggregate Features

Label1 Label2 Label3

=Home? =Home? =Home?

Bool variables

Num of homes

Aggregate feature

Outline Representation

Factor graphs [Kschischang et al. 2001, Frey 2003]

Relational factor graphs Inference

Belief propagation Inference templates

Summation template based on FFT Experiments

Inference in Factor Graph Belief propagation: two types of messages

Message from variable x to factor f

Message from factor f to variable x

nx: factors adjacent to x; nf: variables adjacent to f

Inference Templates Simplest case: specify the function f(nf) and

use the above formula to compute message f -> x Problem: complexity is exponential in the

number of factor arguments. This can be very expensive for aggregation factors

Inference templates allow users to specify optimized algorithms at the template level Be in general form and easy to be shared Support template level complexity analysis

Summation Templates

xin1 xin

2 xin7 xin

Summation: Forward Message

xin1 xin

2 xin7 xin

Compute the distribution of the sum of independent variables xin

1, …. , xin8

Summation: Forward Message

Convolution tree: each node can be computed using FFT; total complexity O(nlog2n)

Summation: Backward Message

xin1 xin

2 xin7 xin

Message from xout defines a prior distribution of the sum. For each value of xin

2, compute the distribution of sum and weighted by the prior

Summation: Backward Message

If we reuse the results cached for the forward message, complexity becomes O(nlogn)

Summation Templates

By using convolution tree, FFT, and caching, the average complexity of passing a message through summation factor is O(nlogn), instead of exponential.

Learning

Estimate the weights for probabilistic factors (local features, pair-wise features, and aggregate features)

Optimize the weights to maximize the conditional likelihood of the labeled training data The same algorithm as RMN

Experiments Two data sets:

“Single” data set: one person’s GPS data for 4 months

“Multiple” data set: one-week GPS data from 5 subjects

Six candidate labels: Home, Work, Shopping, Dining, Friend, Others

Get the geographic knowledge from Microsoft MapPoint Web Service

How Much Aggregates Help

Error rate Multiple Single

No aggregate 28% 9%

With aggregate 18% 6%

Test on “multiple” data set: leave-one-subject-crossvalidation

Test on “single” data set: crossvalidation (train on 1 month, test on 3 months)

How Efficient the Optimized BP

Summary

Relational factor graph is SQL + (directed) factor graph

It is Suitable to represent both undirected

relations and directed relations Convenient to use: no directed cycles Able to address computation issues at the

template level

Relational Factor Graphs

Documents

An Introduction to Factor Graphs

Virtualizing Relational Databases as Graphs: a multi-model approach

Automated Generation of Factor Graphs for Security Attacks Detectionpublish.illinois.edu/science-of-security-lablet/files/... · 2016-11-15 · Factor Graphs are general probabilistic

Incremental Export of Relational Database Contents into RDF Graphs

An OLAP Endpoint for RDF Data Analysis Using Analysis Graphs · An OLAP Endpoint for RDF Data Analysis Using Analysis Graphs 3 We extend Analysis Graphs, proposed for relational data

Extracting and Analyzing Hidden Graphs from Relational ... · Extracting and Analyzing Hidden Graphs from Relational Databases Konstantinos Xirogiannopoulos University of Maryland,

GLoMo: Unsupervisedly Learned Relational Graphs as ... · GLoMo: Unsupervisedly Learned Relational Graphs as Transferable Representations Zhilin Yang 1, Jake (Junbo) Zhao 23, Bhuwan

Variational Bayesian Image Processing on Stochastic Factor Graphs

Boolean Factor Analysis of Multi-Relational Data

Factor Graphs for Quantum Probabilities

Probabilistic Color-by-Numbers: Suggesting Pattern Colorizations Using Factor Graphs

Factor Graphs and GTSAM: A Hands-on Introduction

Inference in Factor Graphs - College of Computingdellaert/pub/2013-05-10-ICRA-Tutorial.pdf · Frank Dellaert: Inference in Factor Graphs, Tutorial at ICRA 2013 4.1 Representation

RELATIONAL DATA MODEL 1. 2 What is a Data Model? 1.Mathematical representation of data. wExamples: relational model = tables; semistructured model = trees/graphs

Social Action Tracking via Noise Tolerant Time-varying Factor Graphs

Loop Corrections for Approximate Inference on Factor Graphs Loop Corrections for Approximate Inference on Factor Graphs Joris M. Mooij J.MOOIJ@SCIENCE.RU.NL Hilbert J. Kappen B.KAPPEN@SCIENCE.RU.NL

Scalable Probabilistic Databases with Factor Graphs and MCMC

Relational Algebra-Relational Calculus-SQL€¦ · Sample Queries in Tuple Relational Calculus . 41 Notation for Query Graphs . 42 Transforming the Universal and Existential Quantifiers

Kschischang Frey Loeliger - Factor Graphs and the Sum Product Algorithm

FACTOR GRAPHS AND GRAPH ENSEMBLESmontanar/RESEARCH/BOOK/partC.pdf · 2007-11-21 · FACTOR GRAPHS AND GRAPH ENSEMBLES {ch:Graphs} Systems involving a large number of simple variables