32
An Information Geometry Perspective on Estimation of Distribution Algorithms: Boundary Analysis Luigi Malagò , Matteo Matteucci, Bernardo Dal Seno OBUPM 2008, July 13 GECCO’08, Atlanta, Georgia, USA DEPARTMENT OF ELECTRONICS AND INFORMATION http:/www.dei.polimi.it AIRLab Artificial Intelligence and Robotics Laboratory http:/www.airlab.elet.polimi.it

An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

Embed Size (px)

DESCRIPTION

Presentation by Luigi Malago at the Optimization by Building and Using Probabilistic Models (OBUPM 2008) workshop at the Genetic and Evolutionary Computation Conference (GECCO-2008)

Citation preview

Page 1: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

An Information Geometry Perspective on Estimation of Distribution Algorithms: Boundary Analysis

Luigi Malagò, Matteo Matteucci, Bernardo Dal Seno

OBUPM 2008, July 13GECCO’08, Atlanta, Georgia, USA

DEPARTMENT OF ELECTRONICS AND INFORMATION http:/www.dei.polimi.it

AIRLab Artificial Intelligence and Robotics Laboratoryhttp:/www.airlab.elet.polimi.it

Page 2: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

Agenda

Motivations and Context

Estimation of Distribution Algorithms (EDAs)

Notions of Information Geometry (IG)

Different parametrization for a statistical manifold

Interpretation of the behavior of an EDA

Simple examples

Conclusions and future work

Page 3: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

Context

Amari, Shun-ichi (2001). Information geometry on hierarchy of probability distributions. IEEE Transactions on Information Theory, 47(5), 1701-1711

Toussaint, Marc (2004). Notes on information geometry and evolutionary processes. Los Alamos pre-print nlin.AO/0408040.

Does Information Geometry provide (useful) insightsin the study of EDAs?

Page 4: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

Estimation of Distribution Algorithms

a.k.a. Probabilistic Model-Building Optimization Genetic Algorithms (PMBGAs)

make use of a probabilistic model

replace crossover and mutation with estimation and sampling

Page 5: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

Estimation of Distribution Algorithms

a.k.a. Probabilistic Model-Building Optimization Genetic Algorithms (PMBGAs)

make use of a probabilistic model

replace crossover and mutation with estimation and sampling

Notation

: Population of candidate solutions

: Random vector of binary variables

: Parametrized probability model

Page 6: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

Estimation of Distribution Algorithms

Many different algorithms, according to

choice of the model

model building and model fitting

model evaluation

sampling techniques

Page 7: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

Estimation of Distribution Algorithms

Many different algorithms, according to

choice of the model

model building and model fitting

model evaluation

sampling techniques

In the literature, EDAs are often classified as

Univariate: no dependencies

Bivariate: pairwise dependencies

Multivariate: higher order dependencies

The choice of can determine convergenceto local optimum!

Page 8: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

EDA Classification

INDEPENDENT VARIABLES(PBIL, UMDA, cGA, DEUM)

CHAIN MODEL(COMIT)

TREE MODEL(MIMIC)

FOREST MODEL(BMDA)

UNDIRECTED GRAPH(FDA, LFDA, DEUM, MN-EDA)

INDEPENDENT CLUSTERS(eCGA)

BAYESIAN NETWORKS(BOA, EBNA)

Univariate Bivariate Multivariate

Page 9: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

Information Geometry

studies properties of families of probability distributions by means of differential geometry

is a rather theoretical framework with an increasing number of applications

A parametric statistical model can be regarded as a manifold of distributions, where the Fisher Information matrix plays the role of metric tensor

Page 10: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

Information Geometry

studies properties of families of probability distributions by means of differential geometry

is a rather theoretical framework with an increasing number of applications

A parametric statistical model can be regarded as a manifold of distributions, where the Fisher Information matrix plays the role of metric tensor

Page 11: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

Contingency tables and log-linear models

Consider a vector of binary variables

Page 12: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

Contingency tables and log-linear models

Consider a vector of binary variables

Page 13: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

Contingency tables and log-linear models

Consider a vector of binary variables

Mean parameters

Page 14: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

Contingency tables and log-linear models

Consider a vector of binary variables

Natural parameters Mean parameters

Page 15: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

Contingency tables and log-linear models

Consider a vector of binary variables

Natural parameters

Due to orthogonality among mean and natural parameters, we can employ a k-cut mixed parametrization

Mean parameters

Page 16: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

k-cut mixed parametrization

Page 17: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

k-cut mixed parametrization

Page 18: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

k-cut mixed parametrization

Page 19: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

k-cut mixed parametrization

Page 20: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

k-cut mixed parametrization

Page 21: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

k-cut mixed parametrization

Page 22: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

k-cut mixed parametrization

Page 23: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

Analysis of the border of the manifold

Page 24: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

Analysis of the border of the manifold

Page 25: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

Analysis of the border of the manifold

Page 26: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

Interpretation of EDA operatorswithin the manifold of distributions

Page 27: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

A Simple Example 1/2

Independence model

Page 28: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

A Simple Example 1/2

Independence model Model

Page 29: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

A Simple Example 2/2

Independence model

Page 30: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

A Simple Example 2/2

Independence model Model

Page 31: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

Conclusions and Future Work

Information Geometry provides an interesting perspective on the study of the behavior of EDAs

Direction of research

Formalization of the ideas presented by means of mathematical proofs

Convergence results for specific classes of problems according to the model used by the EDA

Page 32: An-Information-Geometry-Perspective-on-Estimation-of-Distribution-Algorithms:-Boundary-Analysis

L. Malagò, OBUPM 2008

Conclusions and Future Work

Information Geometry provides an interesting perspective on the study of the behavior of EDAs

Direction of research

Formalization of the ideas presented by means of mathematical proofs

Convergence results for specific classes of problems according to the model used by the EDA

From discrete to continuous EDAs

Proposal of new meta-heuristics based on IG principles

Extension of the framework to other meta-heuristics that employ statistical models