16
Prénom Nom Oriented Local Binary Patterns for Offline Writer Identification Handwriting as oriented texture Anguelos Nicolaou, Marcus Liwicki, Rolf Ingolf DIVA Group Fribourg Switzerland

Prénom Nom Oriented Local Binary Patterns for Offline Writer Identification Handwriting as oriented texture Anguelos Nicolaou, Marcus Liwicki, Rolf Ingolf

Embed Size (px)

Citation preview

Prénom Nom

Oriented Local Binary Patterns for Offline Writer

IdentificationHandwriting as oriented texture

Anguelos Nicolaou, Marcus Liwicki, Rolf IngolfDIVA Group Fribourg Switzerland

Prénom Nom

Offline writer identification

Image to feature vector

Vector should “uniquely” identify the handwriting style

Measure similarity between handwriting styles

Identify writer as the most similar

2

LBP

LBP

LBP

LBP

LBP

LBP

LBP

LBP

LBP

DB

All image samples in this presentation are taken from Icfhr 2012 competition on writer identification challenge

Features

sample

Prénom Nom

Local Binary Patterns Introduced by Ojala et al. Each pixel has a circular

neighborhood Encode the binary

relationship of each pixel in the neighborhood borders

Each pixel’s neighborhood is encoded as an integer

Bag of words approach Total disregard towards

structure between individual patterns

3

Prénom Nom

LBP parameters

Radius:

Sample count:

Binary operator:

4

TextureInformation

StructureInformation

Smaller LargerRadius

Frequent eventsLow detail

High detailScarce events

Lesssamples

MoreSamples

Sample count

eg binary : center ≠ edge, center = edge,center∧edge

eg grayscale : center > edge, center < edge,center ≥edge

Prénom Nom

Proposed feature set:LBP tuning

8 samples in the periphery 3 pixel radius Binary function: center equals periphery Histogram of all occurring patterns except

of “all white or all black” pattern normalized to a sum of 1

5

Prénom Nom

Proposed feature-set:High-order features

Derived from the LBP histogram

Histogram of rotation invariant hashes

Histogram of rotation phases Histogram of border orientations Histogram of border

aggregations Histogram of true bit count Histogram of 0-1 transitions (beta

function)

6

255 72

High-orderFeatures

327

Histogram

LBP

Prénom Nom

Transforming the feature space:Pipeline Overview

7

Train setRaw Features

Train setLabeled Features

Component Matrix

Scale Matrix

Feature extraction

Rebase to components

Scale to optimal

PCA

EvolutionaryAlgorithm

Raw Features

Rebased Features

Scaled Features

NN Classification

Only Training

Prénom Nom

Transforming the feature space:PCA

Changing the basis of the feature space

Preserving 99.9% of the information

Reducing 327 to 125 dimensions

Should be perceived as training

With very good generalizing properties

8

Prénom Nom

Transforming the feature space:Scale vector optimization

Scale each dimension Evolutionary optimization Fitness:

minimize

Fitness inspired by LDA 2000 Generations 50 individuals per population

9

distance of same class samples∑distance of different class samples∑

⎝ ⎜ ⎜

⎠ ⎟ ⎟

each sample

Prénom Nom

Performance measurement

Train-set: ICFHR 2012 (100 writers x 4 samples) “Icfhr 2012 com- petition on writer identification challenge”

Test-set: ICDAR 2011 (26 writers x 8 samples) “Icdar 2011 writer identification contest”

NN classification:

10

Method Soft 10 Hard 7NN

accuracy

Tsinghua (2011 winner)

100% 44.1% 99.5%

Untrained 99.04% 28.84% 96.63%

Trained 99.04% 50.48% 98.56%

Prénom Nom

Qualitative experiments:Rotation robustness

11

• Rotation correction destroys slant information

• Tolerance +- 5 degrees• Conclusion: Quite robust

to rotation

-20o

-5o

0o

Prénom Nom

Qualitative experiments:Robustness against scale

Typically, methods are sensitive to scale

NN classification Dataset: ICHFR 2012 Intolerant of comparing

samples at different scales from the db

Discriminative ability preserved when rescaling samples and db

12

Prénom Nom

Qualitative experiments:Signal quantity requirements

NN classification Dataset: 2011

ICDAR Gradually removed

connected components from top to bottom

For the specific dataset, the method requires 3-4 text-lines.

13

25% 100%

Prénom Nom

Qualitative Experiments:Writer vs Writing Style

SigWiComp2013 train set 55 writers 3 samples

Performance of winner 28% SigWiComp2013. Our system 19.4%

Split in left and right halves Task: assign left half to right half Assigning correctly two pieces of

the same sample: 86.06% Assigning correctly two pieces of

the same writer: 87.27%

Conclusion: Our method practically treats

different writing styles as different writers

14

Prénom Nom

Conclusions:

Powerful feature set Good behavior in trained and untrained

modalities Perceives handwriting as texture Reasonably robust against geometric

distortions As is, probably unfit for biometric

applications

15

Prénom Nom

Use cases of LBP feature set

Thank you for your attention

16