Large-Scale Content-Based Image Retrieval

Project Presentation
CMPT 880: Large Scale Multimedia Systems and Cloud Computing

Under the supervision of Dr. Mohamed Hefeeda
By: Ahmed Abdelsadek (aabdelsa@sfu.ca)

Outline
• Introduction
• Project Scope
• Work Flow
• Image Features
• Indexing and Retrieval
• Matching
• Evaluation
• Conclusion

Introduction
• Current image search engines rely heavily on text to retrieve images.
▫ The user provides keywords, and images that have those keywords in the filename or in nearby HTML are candidates for retrieval.
• In this project we try content-based retrieval techniques, where the query is itself an image.

Project Scope
• Similarity using local features.
• Extract features from the reference images.
• Index these features in an efficient data structure in a scalable, large-scale environment.
• Process query images.
• Search and match.
• This project is NOT:
▫ Recognition, classification, or categorization

Work Flow
[Diagram labels: Reference Multimedia Object; Query Multimedia Object; Generate Feature Points; Direct to KD-Tree Index Bin; Build KD-Tree Index Bins; Distributed Storage (Save, Load); Searching for Nearest Neighbors; Matching Objects; Sorting and Reporting Results; Results. Phases: Building, Querying, Matching.]

Image Features
• Using SIFT (scale-invariant feature transform) features.
▫ A SIFT feature is a selected image region (also called a keypoint) with an associated descriptor.
▫ A SIFT descriptor is a histogram of the image gradients surrounding a keypoint.
▫ Using PCA for dimension reduction.
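
The PCA step named above can be sketched as follows. This is a minimal illustration, not the project's implementation: the class name `PcaSketch`, the use of power iteration with deflation, and the fixed iteration count are all assumptions made for this example. It finds the top k principal axes of a set of descriptors and projects a descriptor onto them.

```java
import java.util.Arrays;

/** Hypothetical PCA sketch: power iteration with deflation on the covariance matrix. */
class PcaSketch {
    /** Per-dimension mean of the rows. */
    static double[] mean(double[][] rows) {
        double[] m = new double[rows[0].length];
        for (double[] r : rows)
            for (int j = 0; j < m.length; j++) m[j] += r[j] / rows.length;
        return m;
    }

    /** Returns the top k principal axes (unit vectors), one per row. */
    static double[][] principalAxes(double[][] rows, int k) {
        int d = rows[0].length;
        double[] mu = mean(rows);
        double[][] cov = new double[d][d];
        for (double[] r : rows)
            for (int a = 0; a < d; a++)
                for (int b = 0; b < d; b++)
                    cov[a][b] += (r[a] - mu[a]) * (r[b] - mu[b]) / rows.length;
        double[][] axes = new double[k][];
        for (int c = 0; c < k; c++) {
            double[] v = new double[d];
            for (int j = 0; j < d; j++) v[j] = 1.0 / (j + 1 + c); // fixed start vector
            double lambda = 0;
            for (int it = 0; it < 500; it++) { // assumed iteration budget
                double[] w = new double[d];
                for (int a = 0; a < d; a++)
                    for (int b = 0; b < d; b++) w[a] += cov[a][b] * v[b];
                double norm = 0;
                for (double x : w) norm += x * x;
                norm = Math.sqrt(norm);
                if (norm < 1e-12) break; // no variance left in this direction
                for (int a = 0; a < d; a++) v[a] = w[a] / norm;
                lambda = norm; // converges to the dominant eigenvalue
            }
            axes[c] = v;
            // Deflation: remove the found component so the next pass finds the next axis.
            for (int a = 0; a < d; a++)
                for (int b = 0; b < d; b++) cov[a][b] -= lambda * v[a] * v[b];
        }
        return axes;
    }

    /** Projects x onto the axes after subtracting the mean. */
    static double[] project(double[] x, double[] mu, double[][] axes) {
        double[] y = new double[axes.length];
        for (int c = 0; c < axes.length; c++)
            for (int j = 0; j < x.length; j++) y[c] += (x[j] - mu[j]) * axes[c][j];
        return y;
    }

    public static void main(String[] args) {
        double[][] rows = {{0, 0}, {1, 2}, {2, 4}, {3, 6}}; // toy 2-D "descriptors"
        double[][] axes = principalAxes(rows, 1);
        System.out.println(Arrays.toString(project(rows[1], mean(rows), axes)));
    }
}
```

For 128-dimensional SIFT descriptors the same code applies with a larger `d`; a production system would more likely rely on a linear-algebra library for the eigendecomposition.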

KD-Tree
• Using KD-trees
▫ Each tree level represents one dimension of a feature.
▫ Searching the index for the K nearest neighbours.
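
The two points above (one dimension per tree level, then a K-nearest-neighbour search) can be sketched as below. This is an in-memory illustration under assumptions: the class name `KdTreeSketch`, the median split, and the exact (non-approximate) search are choices made for this example and are not taken from the slides.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;
import java.util.PriorityQueue;

/** Hypothetical KD-tree node: splits on axis = depth mod dimensions. */
class KdTreeSketch {
    final double[] point;
    final int axis;
    KdTreeSketch left, right;

    KdTreeSketch(double[] point, int axis) { this.point = point; this.axis = axis; }

    /** Builds a tree over pts[lo, hi); note that it reorders pts in place. */
    static KdTreeSketch build(double[][] pts, int lo, int hi, int depth) {
        if (lo >= hi) return null;
        final int axis = depth % pts[0].length; // each level handles one dimension
        Arrays.sort(pts, lo, hi, Comparator.comparingDouble((double[] p) -> p[axis]));
        int mid = (lo + hi) >>> 1; // median becomes the splitting node
        KdTreeSketch node = new KdTreeSketch(pts[mid], axis);
        node.left = build(pts, lo, mid, depth + 1);
        node.right = build(pts, mid + 1, hi, depth + 1);
        return node;
    }

    static double dist2(double[] a, double[] b) {
        double s = 0;
        for (int i = 0; i < a.length; i++) s += (a[i] - b[i]) * (a[i] - b[i]);
        return s;
    }

    /** Keeps the k closest points seen so far in a max-heap (farthest on top). */
    static void search(KdTreeSketch n, double[] q, int k, PriorityQueue<double[]> heap) {
        if (n == null) return;
        heap.add(n.point);
        if (heap.size() > k) heap.poll(); // drop the current farthest
        double diff = q[n.axis] - n.point[n.axis];
        KdTreeSketch near = diff < 0 ? n.left : n.right;
        KdTreeSketch far = diff < 0 ? n.right : n.left;
        search(near, q, k, heap);
        // Cross the splitting plane only if it is closer than the k-th neighbour so far.
        if (heap.size() < k || diff * diff < dist2(q, heap.peek())) search(far, q, k, heap);
    }

    /** Returns the k nearest points to q, nearest first. */
    static List<double[]> nearest(KdTreeSketch root, double[] q, int k) {
        PriorityQueue<double[]> heap =
            new PriorityQueue<>(Comparator.comparingDouble((double[] p) -> -dist2(q, p)));
        search(root, q, k, heap);
        List<double[]> out = new ArrayList<>(heap);
        out.sort(Comparator.comparingDouble((double[] p) -> dist2(q, p)));
        return out;
    }

    public static void main(String[] args) {
        double[][] pts = {{2, 3}, {5, 4}, {9, 6}, {4, 7}, {8, 1}, {7, 2}};
        KdTreeSketch root = build(pts, 0, pts.length, 0);
        for (double[] p : nearest(root, new double[]{9, 2}, 2))
            System.out.println(Arrays.toString(p));
    }
}
```

The pruning test in `search` is what makes the tree faster than a linear scan; in high dimensions it prunes less often, which is one reason to reduce the descriptor dimension first.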

Logical View
[Diagram labels: Reference Features Points; Query Features Points; Similar Features; Multimedia Objects Matcher; Similar Objects; Results.]

Physical View
[Diagram labels: Directing; Block 1, Block 2, Block 3, ..., Block n; Physical Files on HDFS; Computing Distances Tasks: B1 vs B1, B2 vs B2, B3 vs B3, ..., Bn vs Bn; Map Phase; Reduce Phase; DistributedCache; Queries; KD-Tree.]
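
The slides do not say how the "Directing" step assigns a feature to a block. One plausible reading, shown here purely as an assumption, is that the top levels of a KD-tree act as a router: each internal node compares one coordinate against a threshold, and the leaf reached is the bin. The class `BinRouterSketch` and its heap-ordered layout are hypothetical, and the grouping below is an in-memory stand-in for the map phase rather than Hadoop code.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** Hypothetical router: a complete binary tree of (axis, threshold) splits in heap order. */
class BinRouterSketch {
    final int[] axis;          // axis[i]: coordinate tested at internal node i
    final double[] threshold;  // threshold[i]: split value at internal node i

    /** Both arrays must have length 2^h - 1 so that the tree is complete. */
    BinRouterSketch(int[] axis, double[] threshold) {
        this.axis = axis;
        this.threshold = threshold;
    }

    /** Walks from the root to a leaf and returns the bin index in [0, axis.length]. */
    int binOf(double[] feature) {
        int node = 0;
        while (node < axis.length) { // children of node i are 2i+1 and 2i+2
            node = feature[axis[node]] < threshold[node] ? 2 * node + 1 : 2 * node + 2;
        }
        return node - axis.length; // leaves are numbered after the internal nodes
    }

    /** Groups features by bin, as a stand-in for emitting (bin, feature) pairs. */
    Map<Integer, List<double[]>> group(List<double[]> features) {
        Map<Integer, List<double[]>> bins = new TreeMap<>();
        for (double[] f : features)
            bins.computeIfAbsent(binOf(f), b -> new ArrayList<>()).add(f);
        return bins;
    }

    public static void main(String[] args) {
        BinRouterSketch router =
            new BinRouterSketch(new int[]{0, 1, 1}, new double[]{5, 3, 3});
        System.out.println(router.binOf(new double[]{7, 1})); // bin of one feature
    }
}
```

Routing reference and query features with the same tree sends each query feature to the block that holds its likely neighbours, which matches the "B1 vs B1, ..., Bn vs Bn" pairing in the diagram.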

Matching
• For each query, we extract the features and then search the index for the K-NN features.
• For each query feature, each of its neighbouring features votes for the image it belongs to, with a score based on its rank.
• The 10 images with the highest totals in the voting array are reported as the most similar images.
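
The voting scheme above can be sketched as follows. The slides do not give the exact score formula, so this example assumes that the neighbour at rank r (0 = nearest) among K contributes K - r points; the class name `VotingSketch` and the tie-break by image id are also assumptions.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical rank-weighted voting over K-NN results. */
class VotingSketch {
    /**
     * neighbours.get(i) holds the image ids of the K nearest reference features
     * of query feature i, nearest first. Returns up to topN image ids, best first.
     */
    static List<Integer> topImages(List<int[]> neighbours, int topN) {
        Map<Integer, Integer> votes = new HashMap<>();
        for (int[] ranked : neighbours)
            for (int rank = 0; rank < ranked.length; rank++)
                // Assumed scoring: nearest neighbour gets K points, the next K-1, and so on.
                votes.merge(ranked[rank], ranked.length - rank, Integer::sum);
        List<Integer> ids = new ArrayList<>(votes.keySet());
        ids.sort((a, b) -> {
            int c = Integer.compare(votes.get(b), votes.get(a)); // higher score first
            return c != 0 ? c : Integer.compare(a, b);           // ties: smaller id first
        });
        return new ArrayList<>(ids.subList(0, Math.min(topN, ids.size())));
    }

    public static void main(String[] args) {
        List<int[]> neighbours = Arrays.asList(
            new int[]{7, 3, 5}, new int[]{3, 7, 9}, new int[]{3, 5, 7});
        System.out.println(topImages(neighbours, 10));
    }
}
```

Calling `topImages(neighbours, 10)` corresponds to reporting the 10 best-scoring images.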

Evaluation
• Core KNN
▫ Experiments on a local machine.
▫ Our results vs. brute force.
• Image retrieval
▫ Caltech and TRECVID datasets.
▫ On the AWS cloud.
▫ We use 8 machines (dual core, 4 GB RAM).
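
The comparison against brute force in the Core KNN experiments can be sketched as below. The metric definition is an assumption for this example: precision is taken as the fraction of the returned neighbours that also appear in the exact brute-force K-NN set. The class name `KnnPrecisionSketch` is hypothetical.

```java
import java.util.Arrays;
import java.util.Comparator;
import java.util.HashSet;
import java.util.Set;

/** Hypothetical evaluation helper: exact K-NN by linear scan, plus a precision score. */
class KnnPrecisionSketch {
    static double dist2(double[] a, double[] b) {
        double s = 0;
        for (int i = 0; i < a.length; i++) s += (a[i] - b[i]) * (a[i] - b[i]);
        return s;
    }

    /** Returns the indices of the k reference points closest to q, nearest first. */
    static int[] bruteForce(double[][] refs, double[] q, int k) {
        Integer[] idx = new Integer[refs.length];
        for (int i = 0; i < idx.length; i++) idx[i] = i;
        Arrays.sort(idx, Comparator.comparingDouble((Integer i) -> dist2(refs[i], q)));
        int[] out = new int[Math.min(k, idx.length)];
        for (int i = 0; i < out.length; i++) out[i] = idx[i];
        return out;
    }

    /** Fraction of the returned (approximate) neighbours that are true neighbours. */
    static double precision(int[] approx, int[] exact) {
        Set<Integer> truth = new HashSet<>();
        for (int i : exact) truth.add(i);
        int hits = 0;
        for (int i : approx) if (truth.contains(i)) hits++;
        return approx.length == 0 ? 0.0 : (double) hits / approx.length;
    }

    public static void main(String[] args) {
        double[][] refs = {{0, 0}, {1, 0}, {5, 5}, {0, 2}};
        int[] exact = bruteForce(refs, new double[]{0, 0}, 2);
        int[] approx = {0, 3}; // pretend output of an approximate index
        System.out.println(precision(approx, exact));
    }
}
```

When both result lists have length K, this value coincides with recall against the exact set.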

Precision of KNN

Scanned Bins Size

Effect of Data Size

Image Recall @ K

First Correct @ K

Implementation Details
• The system is implemented in Java.
• We use Hadoop 1.0.3.
• We run cloud experiments on AWS services:
▫ S3
▫ EMR
• We use some open-source libraries:
▫ For image preprocessing we use FFmpeg.
▫ For extracting SIFT features we use VLFeat.

Conclusion
• We implement a full pipeline for the image retrieval problem.
▫ The framework can easily support different types of features and different indexing methods.
• We show how a big cloud system can be built from small components.

Conclusion
• Intersection with my research
• Contributions
▫ Feature selection and extraction
▫ Implement dimension reduction
▫ Design and implement Map/Reduce index
▫ Implement image matching and ranking

Questions?

Thank you!
