Upload
millicent-gallagher
View
233
Download
1
Tags:
Embed Size (px)
Citation preview
Prénom Nom
Oriented Local Binary Patterns for Offline Writer
IdentificationHandwriting as oriented texture
Anguelos Nicolaou, Marcus Liwicki, Rolf IngolfDIVA Group Fribourg Switzerland
Prénom Nom
Offline writer identification
Image to feature vector
Vector should “uniquely” identify the handwriting style
Measure similarity between handwriting styles
Identify writer as the most similar
2
LBP
LBP
LBP
LBP
LBP
LBP
LBP
LBP
LBP
DB
All image samples in this presentation are taken from Icfhr 2012 competition on writer identification challenge
Features
sample
Prénom Nom
Local Binary Patterns Introduced by Ojala et al. Each pixel has a circular
neighborhood Encode the binary
relationship of each pixel in the neighborhood borders
Each pixel’s neighborhood is encoded as an integer
Bag of words approach Total disregard towards
structure between individual patterns
3
Prénom Nom
LBP parameters
Radius:
Sample count:
Binary operator:
4
TextureInformation
StructureInformation
Smaller LargerRadius
Frequent eventsLow detail
High detailScarce events
Lesssamples
MoreSamples
Sample count
€
eg binary : center ≠ edge, center = edge,center∧edge
€
eg grayscale : center > edge, center < edge,center ≥edge
Prénom Nom
Proposed feature set:LBP tuning
8 samples in the periphery 3 pixel radius Binary function: center equals periphery Histogram of all occurring patterns except
of “all white or all black” pattern normalized to a sum of 1
5
Prénom Nom
Proposed feature-set:High-order features
Derived from the LBP histogram
Histogram of rotation invariant hashes
Histogram of rotation phases Histogram of border orientations Histogram of border
aggregations Histogram of true bit count Histogram of 0-1 transitions (beta
function)
6
255 72
High-orderFeatures
327
Histogram
LBP
Prénom Nom
Transforming the feature space:Pipeline Overview
7
Train setRaw Features
Train setLabeled Features
Component Matrix
Scale Matrix
Feature extraction
Rebase to components
Scale to optimal
PCA
EvolutionaryAlgorithm
Raw Features
Rebased Features
Scaled Features
NN Classification
Only Training
Prénom Nom
Transforming the feature space:PCA
Changing the basis of the feature space
Preserving 99.9% of the information
Reducing 327 to 125 dimensions
Should be perceived as training
With very good generalizing properties
8
Prénom Nom
Transforming the feature space:Scale vector optimization
Scale each dimension Evolutionary optimization Fitness:
minimize
Fitness inspired by LDA 2000 Generations 50 individuals per population
9
€
distance of same class samples∑distance of different class samples∑
⎛
⎝ ⎜ ⎜
⎞
⎠ ⎟ ⎟
each sample
∑
Prénom Nom
Performance measurement
Train-set: ICFHR 2012 (100 writers x 4 samples) “Icfhr 2012 com- petition on writer identification challenge”
Test-set: ICDAR 2011 (26 writers x 8 samples) “Icdar 2011 writer identification contest”
NN classification:
10
Method Soft 10 Hard 7NN
accuracy
Tsinghua (2011 winner)
100% 44.1% 99.5%
Untrained 99.04% 28.84% 96.63%
Trained 99.04% 50.48% 98.56%
Prénom Nom
Qualitative experiments:Rotation robustness
11
• Rotation correction destroys slant information
• Tolerance +- 5 degrees• Conclusion: Quite robust
to rotation
-20o
-5o
0o
Prénom Nom
Qualitative experiments:Robustness against scale
Typically, methods are sensitive to scale
NN classification Dataset: ICHFR 2012 Intolerant of comparing
samples at different scales from the db
Discriminative ability preserved when rescaling samples and db
12
Prénom Nom
Qualitative experiments:Signal quantity requirements
NN classification Dataset: 2011
ICDAR Gradually removed
connected components from top to bottom
For the specific dataset, the method requires 3-4 text-lines.
13
25% 100%
Prénom Nom
Qualitative Experiments:Writer vs Writing Style
SigWiComp2013 train set 55 writers 3 samples
Performance of winner 28% SigWiComp2013. Our system 19.4%
Split in left and right halves Task: assign left half to right half Assigning correctly two pieces of
the same sample: 86.06% Assigning correctly two pieces of
the same writer: 87.27%
Conclusion: Our method practically treats
different writing styles as different writers
14
Prénom Nom
Conclusions:
Powerful feature set Good behavior in trained and untrained
modalities Perceives handwriting as texture Reasonably robust against geometric
distortions As is, probably unfit for biometric
applications
15