17
Binary Symbol Recognition from Local Dissimilarity Map GREC - 2009 F. Morain-Nicolier, J. Landr´ e, S. Ruan [email protected] http://pixel-shaker.fr CRESTIC - URCA/IUT Troyes 22 juillet 2009 1

Binary Symbol Recognition from Local Dissimilarity Map

Embed Size (px)

DESCRIPTION

In previous works, we have proposed a local dissimilarity map (LDM) in order to compare images. In this research, we show how the LDM can be applied in the field of symbol recognition. A global dissimilarity measure (GDM) is obtained from the LDM. This versatile allow to measure symetric as well as asymetric similarities. A matcher is derived by summing the values of the LDM. The obtained matcher is compared to the chamfer matching. Its properties are related to the human similarity judgement from Tversky results. It is tested on the grec2005 symbol recognition database. Good to excellent results are obtained without any knowledge on images, and no pre-processing nor segmentation involved.

Citation preview

Page 1: Binary Symbol Recognition from Local Dissimilarity Map

Binary Symbol Recognition from Local Dissimilarity MapGREC - 2009

F. Morain-Nicolier, J. Landre, S. Ruan

[email protected]://pixel-shaker.fr

CRESTIC - URCA/IUT Troyes

22 juillet 2009

1

Page 2: Binary Symbol Recognition from Local Dissimilarity Map

2/14

Goal

Symbol Recognition using raw pixel information

Not a statistical approach : no learning

Not a structural approach : no primitive

⇒ Build a good (di)similarity measure between images

2

Page 3: Binary Symbol Recognition from Local Dissimilarity Map

3/14

Local to Global Dissimilarity

Local Dissimilarity Map

LDMA,B = |A− B|max(dtA,dt(p)) (1)

= BdtA + AdtB . (2)

Global Dissimilarity Measure : sum the (squared) values :

GDM(A,B) = α∑

p

B(p)dt2A(p) + β∑

p

A(p)dt2B(p). (3)

3

Page 4: Binary Symbol Recognition from Local Dissimilarity Map

3/14

Local to Global Dissimilarity

Local Dissimilarity Map

LDMA,B = |A− B|max(dtA,dt(p)) (1)

= BdtA + AdtB . (2)

Global Dissimilarity Measure : sum the (squared) values :

GDM(A,B) = α∑

p

B(p)dt2A(p) + β∑

p

A(p)dt2B(p). (3)

4

Page 5: Binary Symbol Recognition from Local Dissimilarity Map

4/14

Local to Global Dissimilarity

Why α and β ?

Chamfer Matching [Borgefors] : sum of the distances of each translatedmodel’s pixel to the nearest images’s pixel.

GDM is a double Chamfer Matching :

GDMA,B ∼ αCS(A,B) + βCS(B,A) (4)

CS : how is M similar to I ?

GDM : + how is I similar to M ?

symetric or asymetric matching (links to psychological models of similarity[Tversky]) ?

5

Page 6: Binary Symbol Recognition from Local Dissimilarity Map

5/14

First Algorithm

1 I contains an unknow symbol.2 Foreach model M :

compute the dissimilarity GDM(I ,M) between I and M.

3 Keep the model with the lowest dissimilarity as winner.

Tested on the GREC2005 international symbol recognition contest database(electronic and architecture) : six degradations

25 symbols to be recognized in 50 images.

α = 1 and β = 0.

6

Page 7: Binary Symbol Recognition from Local Dissimilarity Map

5/14

First Algorithm

1 I contains an unknow symbol.2 Foreach model M :

compute the dissimilarity GDM(I ,M) between I and M.

3 Keep the model with the lowest dissimilarity as winner.

Tested on the GREC2005 international symbol recognition contest database(electronic and architecture) : six degradations

25 symbols to be recognized in 50 images.

α = 1 and β = 0.

7

Page 8: Binary Symbol Recognition from Local Dissimilarity Map

6/14

Results (no deformation)

Degradation 1

100 %

8

Page 9: Binary Symbol Recognition from Local Dissimilarity Map

7/14

Results (no deformation)

Degradation 2

100 %

9

Page 10: Binary Symbol Recognition from Local Dissimilarity Map

8/14

Results (no deformation)

Degradation 3

100 %

10

Page 11: Binary Symbol Recognition from Local Dissimilarity Map

9/14

Results (no deformation)

Degradation 4

100 %

11

Page 12: Binary Symbol Recognition from Local Dissimilarity Map

10/14

Results (no deformation)

Degradation 5

100 %

12

Page 13: Binary Symbol Recognition from Local Dissimilarity Map

11/14

Results (no deformation)

Degradation 6

54 % (but best is 59.81 %)

13

Page 14: Binary Symbol Recognition from Local Dissimilarity Map

12/14

And with deformations ?

Transform rotation and scale into translations⇒ log-polar transform

Given two images I and M to be registrated :

Compute Ilp and Mlp

Which translation produces the most similar images ?⇒ compute the global dississimilarity between Ilp and each translation of Mlp.Very fast with cross-correlations (in Fourier Domain) :

LocI ,M = αdt2I ?M + βI ? dt2

M . (5)

Find the minimum of LocI ,M and find the rotation and scale parameters fromits position.

14

Page 15: Binary Symbol Recognition from Local Dissimilarity Map

12/14

And with deformations ?

Transform rotation and scale into translations⇒ log-polar transform

Given two images I and M to be registrated :

Compute Ilp and Mlp

Which translation produces the most similar images ?⇒ compute the global dississimilarity between Ilp and each translation of Mlp.Very fast with cross-correlations (in Fourier Domain) :

LocI ,M = αdt2I ?M + βI ? dt2

M . (5)

Find the minimum of LocI ,M and find the rotation and scale parameters fromits position.

15

Page 16: Binary Symbol Recognition from Local Dissimilarity Map

13/14

Complete Algorithm

1 Compute Ilp, log-polar representation of I .2 Foreach model M :

1 compute Mlp, log-polar representation of M2 find the global minimum of LocIlp ,Mlp (fast computation)3 find (σ, θ) the scaling and rotation parameters of M.4 compute Mcorrected from (ρ, θ), by applying reverse scale and rotation.5 compute the dissimilarity GDM(I ,Mcorrected) between Mcorrected and I .

3 Keep the model with the lowest dissimilarity as winner.

16

Page 17: Binary Symbol Recognition from Local Dissimilarity Map

14/14

Advantages and Drawbacks

rot/scale m1 m2 m3 m4 m5 m6without 100% 100% 100% 100% 100% 54%

with 96% 40% 94% 70% 42% 12%

Quite good performances (lowered by 5%-10% with 100 images/150symbols)

without any a-priori knowledge, pre-processing, nor primitive extraction

Fast but still needs parameters estimations

Next :

Gray level images ⇒ Localization with rotation/scale variations.Balance the asymetry, e.g. α = 0.9 and β = 0.1 (take into account noise)Direct measure in log-polar domain.Apply preprocessing to boost performances.

17