1
Berkley edge detection [1] followed by contour grouping [2] to obtain a set of salient contours for an image Create meaningful long sequences of contour segments (Chains) a-priori to avoid costly on-line matching Maximum Spanning Tree-based approach to find chains Efficient Dynamic Programming-based Matching Algorithm Joint Similarity Scale, translation, rotation invariant matching based on similarity of segment-length ratios and angles. Chain descriptor: Chain Matching Similarity-invariant Sketch-based Image Retrieval in Large Databases Sarthak Parui and Anurag Mittal Computer Vision Lab, Indian Institute of Technology Madras, India Contributions: Application Domain: Sketch-based image retrieval on touch based devices Image tag improvement Problem Statement: Open Issues Addressed: Given a hand-drawn sketch, retrieve the most relevant images from a database of millions of images Similarity-invariant image representation by a small number of variable-length compact descriptors Efficient Dynamic Programming-based approximate substring matching algorithm to match chain descriptors Hierarchical k-medoid index structure based on pure chain matching for fast online retrieval 1. Maximum Spanning Tree-based Contour Chain Extraction 2. Matching Two Chains [1] Arbelaez, P., Maire, M., Fowlkes, C., Malik, J.: Contour detection and hierarchical image segmentation. PAMI 2011 [2] Zhu, Q., Song, G., Shi, J.: Untangling cycles for contour grouping. In: ICCV 2007 [3] Eitz, M., Hildebrand, K., Boubekeur, T., Alexa, M.: A descriptor for large scale image retrieval based on sketched feature lines. In: Eurographics Symposium on Sketch-Based Interfaces and Modeling 2009 [4] Cao, Y., Wang, C., Zhang, L., Zhang, L.: Edgel index for large-scale sketch- based image search. In: CVPR 2011 [5] Schindler, K., Suter, D.: Object detection by global contour shape. Pattern Recognition,41 2008 Efficient Retrieval from web-scale dataset invariant to Scale, Translation and Rotation (Similarity) and/or small deformations Entire process of Chain creation Ψ = < = +1 , | ∈ 1, 2 > Assumption Object boundary is captured by a more or less contiguous portion of the chain and is not split by large gaps Approach Partial Matching strategy that can be integrated with the indexing structure Measure similarity between two chains by determining the maximum almost-contiguous matching portions of the sequences while leaving out non-matching portions from either side Allow slack while matching individual joints to allow small deformations 1 , 1 , 2 , 2 = Match score in the interval 1 , 1 in chain 1 and 2 , 2 in chain 2 = max ( 1 , 1 ) ( 2 , 2 ) ( , , ∈ 1 1 1 \X 2 2 2 \Y() ) J 1 and J 2 denote the set of joints of chain 1 & chain 2 and JM denotes matching between J 1 and J 2 in the given interval = , = , ∈ } = Skip penalty for joint = ( + +1 ) 1 , 2 = max 1 , 2 1 , 1 , 2 , 2 [value≥0] 3. Indexing and Retrieval Hierarchical divisive k-medoid based indexing structure formed by probabilistically choosing k cluster centroids (k-means++) at every level Distance is measured by partially matching pairs of chains using efficient Dynamic Programming-based chain matching algorithm Leaf node contains an image if at least one chain of the image matches with the medoid chain of the node Multiple leaves may contain the same image Given a query sketch, find images with similar chains by partially matching each query chain with the medoid chains at every level of the hierarchical k-medoid tree 4. Geometric Consistency Necessary as actual object boundary may be split across multiple chains. Geometric Consistency of the matched portions of a chain-pair with respect to another chain-pair: a. Closeness of distances between centroids b. Average angle difference of individually matched joints Measure the similarity of a database image with respect to the query sketch based on pairwise chain matching scores which are weighted by the geometric consistency scores of the corresponding chain-pairs Effectively only geometrically consistent chains are given weight for scoring 5. Experiments and Results A match when fragmented skips are allowed A match when only almost-contiguous matches are allowed Pairwise Geometric consistency Top retrieved results from 1.2 million images. Green: correct, Yellow: similar, Red: false match Maximum Spanning Tree captures a substantial portion of the outer contour of an object Salient contours [2] Longest chain Extracted chains for some Images Retrieval Framework 1 , 2 = 0, 1 , 2 =0 max 1 − 1, 2 −1 + ( 1 , 2 ) 1 − 1, 2 1 1 1 1 , 2 −1 − 2 2 2 0 , ℎ Solved using Dynamic Programming Time complexity: Ο( 1 2 ) Precision (% of true positives) at different ranks on a dataset of 1.2 million images. B: Best, W: Worst, A: Average performances are computed among sketches for each category and then averaged. CS+GC and CS indicate the performances with and without geometric verification respectively. Comparison of % of true positive retrievals in top 20 on ETHZ extended shape dataset [5] Unmatched joint

Similarity-invariant Sketch-based Image Retrieval in Large … · 2015. 1. 20. · 𝑀 1, 1, 2, 2 =Match score in the interval 1, 1 in chain 1 and 2, 2 in chain 2 = max 𝐽𝑀∈𝑂

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Similarity-invariant Sketch-based Image Retrieval in Large … · 2015. 1. 20. · 𝑀 1, 1, 2, 2 =Match score in the interval 1, 1 in chain 1 and 2, 2 in chain 2 = max 𝐽𝑀∈𝑂

• Berkley edge detection [1] followed by contour grouping

[2] to obtain a set of salient contours for an image

• Create meaningful long sequences of contour segments

(Chains) a-priori to avoid costly on-line matching

• Maximum Spanning Tree-based approach to find chains

Efficient Dynamic Programming-based Matching

Algorithm

Joint Similarity

Scale, translation, rotation invariant matching based on

similarity of segment-length ratios and angles.

Chain descriptor:

Chain Matching

Similarity-invariant Sketch-based Image Retrieval in Large Databases Sarthak Parui and Anurag Mittal

Computer Vision Lab, Indian Institute of Technology Madras, India

Contributions:

Application

Domain:

• Sketch-based image retrieval on touch

based devices

• Image tag improvement

Problem

Statement:

Open Issues

Addressed:

• Given a hand-drawn sketch, retrieve the

most relevant images from a

database of millions of images

• Similarity-invariant image representation

by a small number of variable-length

compact descriptors

• Efficient Dynamic Programming-based

approximate substring matching

algorithm to match chain descriptors

• Hierarchical k-medoid index structure

based on pure chain matching for fast

online retrieval

1. Maximum Spanning Tree-based

Contour Chain Extraction

2. Matching Two Chains

[1] Arbelaez, P., Maire, M., Fowlkes, C., Malik, J.: Contour detection and

hierarchical image segmentation. PAMI 2011

[2] Zhu, Q., Song, G., Shi, J.: Untangling cycles for contour grouping.

In: ICCV 2007

[3] Eitz, M., Hildebrand, K., Boubekeur, T., Alexa, M.: A descriptor for large scale

image retrieval based on sketched feature lines. In: Eurographics Symposium

on Sketch-Based Interfaces and Modeling 2009

[4] Cao, Y., Wang, C., Zhang, L., Zhang, L.: Edgel index for large-scale sketch-

based image search. In: CVPR 2011

[5] Schindler, K., Suter, D.: Object detection by global contour shape. Pattern

Recognition,41 2008

• Efficient Retrieval from web-scale

dataset invariant to Scale, Translation

and Rotation (Similarity) and/or small

deformations

Entire process of Chain creation

Ψ = < 𝛾𝑖 =𝑙𝑠𝑒𝑔𝑖

𝑙𝑠𝑒𝑔𝑖+1, 𝜃𝑖 | 𝑖 ∈ 1, 2 >

Assumption • Object boundary is captured by a more or less

contiguous portion of the chain and is not split by

large gaps

Approach • Partial Matching strategy that can be integrated with

the indexing structure

• Measure similarity between two chains by

determining the maximum almost-contiguous

matching portions of the sequences while leaving out

non-matching portions from either side

• Allow slack while matching individual joints to allow

small deformations

𝑀 𝑝1, 𝑞1, 𝑝2, 𝑞2 = Match score in the interval 𝑝1, 𝑞1 in chain 1 and 𝑝2, 𝑞2 in

chain 2

= max

𝐽𝑀∈𝑂𝑟𝑑𝑒𝑟𝑒𝑑 𝑚𝑎𝑡𝑐ℎ𝑖𝑛𝑔𝑠 𝑖𝑛𝑖𝑛𝑡𝑒𝑟𝑣𝑎𝑙 (𝑝1,𝑞1)𝑎𝑛𝑑 (𝑝2,𝑞2)

( 𝑆𝑗𝑛𝑡 𝑥, 𝑦

𝑥,𝑦 ∈𝐽𝑀

− 𝜔𝑥1𝛼1

𝑥∈𝐽1\X 𝐽𝑀

− 𝜔𝑦2𝛼2

𝑦∈𝐽2\Y(𝐽𝑀)

)

J1 and J2 denote the set of joints of chain 1 & chain 2 and JM

denotes matching between J1 and J2 in the given interval

𝑋 𝐽𝑀 = 𝑥 𝑥, 𝑦 ∈ 𝐽𝑀 𝑌 𝐽𝑀 = 𝑦 𝑥, 𝑦 ∈ 𝐽𝑀}

𝜔𝑥 = Skip penalty for joint 𝑥 = ( 𝑙𝑠𝑒𝑔𝑥 + 𝑙𝑠𝑒𝑔𝑥+1 )

𝑀 𝑞1, 𝑞2 = max

𝑝1,𝑝2𝑀 𝑝1, 𝑞1, 𝑝2, 𝑞2 [value≥0]

3. Indexing and Retrieval

• Hierarchical divisive k-medoid based indexing structure

formed by probabilistically choosing k cluster centroids

(k-means++) at every level

• Distance is measured by partially matching pairs of

chains using efficient Dynamic Programming-based

chain matching algorithm

• Leaf node contains an image if at least one chain of the

image matches with the medoid chain of the node

• Multiple leaves may contain the same image

• Given a query sketch, find images with similar chains by

partially matching each query chain with the medoid

chains at every level of the hierarchical k-medoid tree

4. Geometric Consistency

Necessary as actual object boundary may be split across

multiple chains.

• Geometric Consistency of the matched portions of a

chain-pair with respect to another chain-pair:

a. Closeness of distances between centroids

b. Average angle difference of individually matched joints

• Measure the similarity of a database image with respect

to the query sketch based on pairwise chain matching

scores which are weighted by the geometric consistency

scores of the corresponding chain-pairs

• Effectively only geometrically consistent chains are given

weight for scoring

5. Experiments and Results

A match when fragmented skips are

allowed A match when only almost-contiguous

matches are allowed

Pairwise Geometric consistency

Top retrieved results from 1.2 million images. Green: correct, Yellow: similar, Red: false match

Maximum Spanning Tree

captures a substantial portion

of the outer contour of an

object

Salient contours [2] Longest chain

Extracted chains for some Images

Retrieval Framework

𝑀 𝑞1, 𝑞2

=

0, 𝑖𝑓 𝑞1, 𝑞2 = 0

max

𝑀 𝑞1 − 1, 𝑞2−1 + 𝑆𝑗𝑛𝑡(𝑞1, 𝑞2)

𝑀 𝑞1 − 1, 𝑞2 − 𝜔𝑞11 𝛼1

𝑀 𝑞1, 𝑞2 − 1 − 𝜔𝑞22 𝛼2

0

, 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒

• Solved using Dynamic Programming

• Time complexity: Ο(𝑛𝐶1 ⋅ 𝑛𝐶2)

Precision (% of true positives) at different ranks on a dataset of 1.2 million images. B: Best, W: Worst,

A: Average performances are computed among sketches for each category and then averaged.

CS+GC and CS indicate the performances with and without geometric verification respectively.

Comparison of % of true positive retrievals in top 20 on ETHZ extended shape dataset [5]

Unmatched joint