20 cv mil_models_for_words

Computer vision: models, learning and inference

Chapter 20 Models for visual words

Please send errata to s.prince@cs.ucl.ac.uk

Visual words

• Most models treat data as continuous
• Likelihood based on normal distribution
• Visual words = discrete representation of image
• Likelihood based on categorical distribution
• Useful for difficult tasks such as scene recognition and object recognition

Motivation: scene recognition

Structure

• Computing visual words
• Bag of words model
• Latent Dirichlet allocation
• Single author-topic model
• Constellation model
• Scene model
• Applications

Computing dictionary of visual words

1. For every one of the I training images, select a set of J_i spatial locations.
   • Interest points
   • Regular grid

2. Compute a descriptor at each spatial location in each image

3. Cluster all of these descriptor vectors into K groups using a method such as the K-Means algorithm

4. The means of the K clusters are used as the K prototype vectors in the dictionary.
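The four steps above can be sketched as follows. This is a minimal illustration, not the method used for any result in these slides: it assumes the descriptors have already been computed and stacked into an N x D array, it uses a plain K-means loop written in NumPy, and the function name `build_dictionary` and the toy data are hypothetical.

```python
import numpy as np

def build_dictionary(descriptors, K, n_iters=20, seed=0):
    """Cluster descriptor vectors (N x D) into K groups with plain K-means.

    Returns the K cluster means, used as the prototype vectors (K x D).
    """
    rng = np.random.default_rng(seed)
    # Initialise the prototypes with K distinct descriptors chosen at random.
    start = rng.choice(len(descriptors), size=K, replace=False)
    prototypes = descriptors[start].astype(float)
    for _ in range(n_iters):
        # Squared Euclidean distance from every descriptor to every prototype.
        d2 = ((descriptors[:, None, :] - prototypes[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        for k in range(K):
            members = descriptors[labels == k]
            if len(members) > 0:  # an empty cluster keeps its old prototype
                prototypes[k] = members.mean(axis=0)
    return prototypes

# Toy stand-in for real descriptors: two well-separated blobs in 4 dimensions.
rng = np.random.default_rng(1)
descriptors = np.vstack([rng.normal(0.0, 0.1, (50, 4)),
                         rng.normal(5.0, 0.1, (50, 4))])
dictionary = build_dictionary(descriptors, K=2)
```

In practice the descriptors from all I training images would be pooled before clustering, and K would be much larger than in this toy example.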

Encoding images as visual words

1. Select a set of J spatial locations in the image using the same method as for the dictionary

2. Compute the descriptor at each of the J spatial locations.
3. Compare each descriptor to the set of K prototype descriptors in the dictionary.
4. Assign a discrete index to this location that corresponds to the index of the closest word in the dictionary.

End result:

A discrete feature index plus an x and y position for each location
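The encoding step can be sketched as a nearest-prototype lookup. The sketch assumes Euclidean distance between descriptors and dictionary entries; the function name `encode_visual_words` and the small arrays are hypothetical and chosen only for illustration.

```python
import numpy as np

def encode_visual_words(descriptors, positions, dictionary):
    """Map each descriptor (J x D) to the index of its closest dictionary word.

    Returns a J x 3 array whose rows are (word index, x, y).
    """
    # Squared distance from each of the J descriptors to each of the K prototypes.
    d2 = ((descriptors[:, None, :] - dictionary[None, :, :]) ** 2).sum(axis=2)
    words = d2.argmin(axis=1)
    return np.column_stack([words, positions])

dictionary = np.array([[0.0, 0.0], [1.0, 1.0], [4.0, 0.0]])    # K = 3 prototypes
descriptors = np.array([[0.1, -0.1], [3.8, 0.2], [0.9, 1.2]])  # J = 3 descriptors
positions = np.array([[10, 20], [30, 40], [50, 60]])           # (x, y) per location
encoded = encode_visual_words(descriptors, positions, dictionary)
```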

Structure

Bag of words model

Key idea:

• Abandon all spatial information
• Just represent image by relative frequency (histogram) of words from dictionary
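As a small illustration of the key idea, the relative-frequency histogram can be computed directly from the word indices of one image. The function name `bag_of_words` and the example indices are hypothetical.

```python
import numpy as np

def bag_of_words(word_indices, K):
    """Relative frequency of each of the K dictionary words in one image."""
    counts = np.bincount(word_indices, minlength=K)
    return counts / counts.sum()

# Six visual words observed in one image, from a dictionary of size K = 4.
words = np.array([0, 2, 2, 1, 2, 0])
hist = bag_of_words(words, K=4)
```

The positions of the words play no role here, which is exactly the spatial information the model abandons.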

Bag of words

Structure

Learning (MAP solution):

Inference:
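The equations for these two slides did not survive in this transcript, so the sketch below is an assumption about their form rather than a copy of them. It assumes one categorical distribution over words per object class, a symmetric Dirichlet prior with parameter `alpha` (taken greater than 1 so that the MAP estimate is strictly positive), and Bayes' rule for inference. All function and variable names are hypothetical.

```python
import numpy as np

def learn_map(histograms, labels, n_classes, alpha=2.0):
    """MAP estimate of per-class categorical parameters (assumed form).

    histograms: I x K word counts; labels: length-I class indices.
    """
    K = histograms.shape[1]
    lam = np.zeros((n_classes, K))
    for n in range(n_classes):
        counts = histograms[labels == n].sum(axis=0)
        # Mode of the Dirichlet posterior: (N_k + alpha - 1) / sum_k(N_k + alpha - 1).
        lam[n] = (counts + alpha - 1.0) / (counts.sum() + K * (alpha - 1.0))
    return lam

def infer_class(counts, lam, prior):
    """Posterior over classes for one image, given its word counts."""
    # Work in log space: sum_k counts_k * log(lam_nk) + log prior_n.
    log_post = counts @ np.log(lam).T + np.log(prior)
    log_post -= log_post.max()  # subtract the max for numerical stability
    post = np.exp(log_post)
    return post / post.sum()

histograms = np.array([[8, 1, 1], [7, 2, 1], [1, 1, 8], [2, 1, 7]])
labels = np.array([0, 0, 1, 1])
lam = learn_map(histograms, labels, n_classes=2)
post = infer_class(np.array([5, 1, 0]), lam, prior=np.array([0.5, 0.5]))
```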

Bag of words for object recognition

Problems with bag of words

Structure

Latent Dirichlet allocation

• Describes relative frequency of visual words in a single image (no world term)

• Words not generated independently (connected by hidden variable)

• Analogy to text documents
  – Each image contains mixture of several topics (parts)
  – Each topic induces a distribution over words

Generative equations

Marginal distribution over features

Conjugate priors over parameters
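The generative equations are missing from this transcript. The sketch below follows the standard form of latent Dirichlet allocation and is an assumption about what the slides showed: each image has its own categorical distribution over M parts, each part has a categorical distribution over K words, and both sets of parameters are drawn from Dirichlet priors. The names `sample_lda_images`, `alpha`, and `beta` are hypothetical.

```python
import numpy as np

def sample_lda_images(n_images, n_words, M, K, alpha=1.0, beta=0.5, seed=0):
    """Draw images from an LDA-style generative model (assumed form)."""
    rng = np.random.default_rng(seed)
    # One categorical distribution over the K words for each of the M parts.
    lam = rng.dirichlet(np.full(K, beta), size=M)
    images = []
    for _ in range(n_images):
        # Per-image distribution over parts, drawn from its Dirichlet prior.
        pi = rng.dirichlet(np.full(M, alpha))
        # Hidden part label for every word, then the word given its part.
        parts = rng.choice(M, size=n_words, p=pi)
        words = np.array([rng.choice(K, p=lam[m]) for m in parts])
        images.append((parts, words))
    return lam, images

lam, images = sample_lda_images(n_images=5, n_words=20, M=3, K=10)
```

Words in the same image share the per-image part distribution, which is the hidden variable that couples them.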

Learning LDA model

• Part labels p are hidden variables
• If we knew them, it would be easy to estimate the parameters
• How about the EM algorithm? Unfortunately, parts within an image are not independent

Learning

Strategy:

1. Write an expression for posterior distribution over part labels

2. Draw samples from posterior using MCMC

3. Use samples to estimate parameters

1. Posterior over part labels

Can compute two terms in numerator in closed form

Denominator intractable

2. Draw samples from posterior

Gibbs sampling: fix all part labels except one and sample from the conditional distribution

This can be computed in closed form

3. Use samples to estimate parameters

Samples substitute in for real part labels in update equations
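The three-step strategy can be sketched as a collapsed Gibbs sampler. The conditional distribution and update equations are not preserved in this transcript, so the sketch uses the standard collapsed form for LDA with symmetric Dirichlet priors as an assumption; the names `gibbs_lda`, `alpha`, and `beta` are hypothetical, and the parameters are estimated from the final sample only, which is one simple choice among several.

```python
import numpy as np

def gibbs_lda(images, M, K, alpha=1.0, beta=1.0, n_sweeps=50, seed=0):
    """Collapsed Gibbs sampling over part labels (assumed standard form).

    images: list of 1-D integer arrays of word indices in [0, K).
    """
    rng = np.random.default_rng(seed)
    parts = [rng.integers(M, size=len(f)) for f in images]  # random initial labels
    n_im = np.zeros((len(images), M))  # how often part m is used in image i
    n_mk = np.zeros((M, K))            # how often word k is assigned to part m
    for i, f in enumerate(images):
        for j, k in enumerate(f):
            n_im[i, parts[i][j]] += 1
            n_mk[parts[i][j], k] += 1
    for _ in range(n_sweeps):
        for i, f in enumerate(images):
            for j, k in enumerate(f):
                # Remove the current label so the counts exclude this word.
                m = parts[i][j]
                n_im[i, m] -= 1
                n_mk[m, k] -= 1
                # Conditional over this label with all other labels fixed. The
                # per-image normaliser is constant in m, so it is omitted.
                cond = ((n_mk[:, k] + beta) / (n_mk.sum(axis=1) + K * beta)
                        * (n_im[i] + alpha))
                cond /= cond.sum()
                m = rng.choice(M, p=cond)
                parts[i][j] = m
                n_im[i, m] += 1
                n_mk[m, k] += 1
    # Sampled labels stand in for the true labels when estimating parameters.
    pi = (n_im + alpha) / (n_im + alpha).sum(axis=1, keepdims=True)
    lam = (n_mk + beta) / (n_mk + beta).sum(axis=1, keepdims=True)
    return parts, pi, lam

images = [np.array([0, 1, 0, 1, 1]), np.array([2, 3, 3, 2]), np.array([0, 1, 2, 3])]
parts, pi, lam = gibbs_lda(images, M=2, K=4, n_sweeps=20)
```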

Structure

Single author topic model

Single author-topic model

Learning

Likelihood same as before, prior becomes

Learning

Inference

Compute posterior over categories

Likelihood that words in this image are due to category n
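A sketch of this inference step is given below. It assumes that each category n has its own distribution `pi[n]` over parts, that the word distributions `lam` are shared across categories, and that the likelihood of an image marginalises over the hidden part label of each word. These modelling details and all names are assumptions, since the slide equations are not preserved here.

```python
import numpy as np

def author_topic_posterior(words, pi, lam, prior):
    """Posterior over categories for one image (assumed model form).

    words: word indices in the image; pi: N x M part probabilities per
    category; lam: M x K word probabilities per part; prior: N priors.
    """
    # Pr(word k | category n), with the hidden part label summed out.
    word_given_cat = pi @ lam  # N x K
    # Log likelihood that the words in this image are due to category n.
    log_lik = np.log(word_given_cat[:, words]).sum(axis=1)
    log_post = log_lik + np.log(prior)
    log_post -= log_post.max()  # subtract the max for numerical stability
    post = np.exp(log_post)
    return post / post.sum()

pi = np.array([[0.9, 0.1], [0.1, 0.9]])
lam = np.array([[0.7, 0.2, 0.1], [0.1, 0.2, 0.7]])
prior = np.array([0.5, 0.5])
post = author_topic_posterior(np.array([0, 0, 1, 0]), pi, lam, prior)
```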

Structure

Constellation model

Learning

Prior same as before, likelihood becomes

Learning

Part and word probabilities as before

Inference

Compute posterior over categories

Likelihood that words in this image are due to category n

Learning

Structure

Scene model

Structure

Video Google

Action recognition

Spatio-temporal bag of words model 91.8% classification

Action recognition
