ITI-CERTH participation in TRECVID 2016 · 2016-11-29 · Supported by: ITI-CERTH participation in...

Supported by:

ITI-CERTH participation in TRECVID 2016

Instance Search

Foteini Markatopoulou1,2, Anastasia Moumtzidou1, Damianos Galanopoulos1, Theodoros Mironidis1, Vagia Kaltsa1, Anastasia Ioannidou1, Spyridon Symeonidis1, Konstantinos Avgerinakis1, Stelios Andreadis1, Ilias Gialampoukidis1, Stefanos Vrochidis1, Alexia Briassouli1, Vasileios Mezaris1, Ioannis Kompatsiaris1, Ioannis Patras2

2Queen Mary University of London, Mile end Campus, UK i.patras@qmul.ac.uk

Ad-hoc Video Search

Multimedia Event Detection

1Information Technologies Institute (ITI), Centre for Research and Technology Hellas (CERTH), Thessaloniki, Greece {markatopoulou, moumtzid, dgalanop, mironidis, vagiakal, ioananas, spyridons, koafgeri, andreadisst, heliasgj, stefanos, abria, bmezaris, ikom} @iti.gr

Query processing: Selecting concepts which are the most related to a query, exploiting NLP procedures • Concept selection using the ESA distance between

the query and each of the concepts • String matching between concepts and any part of

the query • Query’s linguistic analysis for sub-queries creation • Concept selection using the ESA distance between

sub-queries and each of the concepts • Our MED16 000Ex pipeline is used, if none of the

previous steps is able to select concepts

Runs ITI-CERTH 1: Annotation using DCNN outputs for 1000 ImageNet concepts and SVM-based scores for 345 TRECVID SIN concepts ITI-CERTH 2: Annotation using DCNN outputs for 1000 ImageNet concepts and the direct output of the FT network for 345 SIN concepts ITI-CERTH 3: ITI-CERTH 1 ignoring sub-queries which do not present high semantic relatedness with any of the concepts ITI-CERTH 4: ITI-CERTH 1 without string matching Submitted run: ITI-CERTH 1 ITI-CERTH 2 ITI-CERTH 3 ITI-CERTH 4

Fully-automatic 0.051 0.042 0.051 0.051

010Ex and 100Ex

DCNN-based features • GoogLeNet trained with 12988 ImageNet

concepts • The best FT network on the 345 TRECVID SIN

concepts (the same used for the AVS task) • GoogLeNet trained with 500 EventNet concepts • GoogLeNet trained with 5055 ImageNet

concepts and FT on 487 Sports-1M concepts • GoogLeNet trained with Places 205 concepts

Run ID MAP% mInfAP@200%

p-1DCNN13K_1 14.6 12.2

c-1DCNN14K_1 14.5 11.9

c-1DCNN05K_1 02.5 01.4

c-3Train_1 16.2 14.2

Run ID MAP% mInfAP@200%

010Ex p-1KDALSVM 31.8 34.2

100Ex p-1KDALSVM 46.2 47.5

• Kernel Subclass Discriminant Analysis for dimensionality reduction and fast linear SVM for training (KSDA+LSVM)

• Both DCNN-based and motion features

AVS system overview

Motion features • HOG, HOF, MBHx, MBHy • An improved version of our MED15 zero-example

event detection framework is used • Extensive visual concept pool: up to 14k concepts • Pseudo-relevance feedback using KSDA+LSVM and

retrieved videos from the MED16–EvalSub set

DCNN-based features • Direct output of 5 DCNNs (from 010Ex and 100Ex) • Direct output of GoogLeNet trained with 5055

ImageNet concepts

Conclusion • Plentiful DCNN-based features lead to

better results

Runs p-1DCNN13K_1 • ImageNet 12988 concepts • EventNet 500 concepts

c-1DCNN14K_1 • ImageNet 12988 concepts • EventNet 500 concepts • TRECVID SIN 345 concepts • Sports-1M 478 concepts • Places 205 concepts

c-1DCNN05K_1 • ImageNet 5055 concepts • EventNet 500 concepts

c-3Train_1 • KSDA+LSVM using as

positive samples the 10 top retrieved videos of p-1DCNN13K_1 run

Conclusion • Pseudo-relevance feedback has a significant impact to our

performance (the relative improvement is 16.39%) • Using a large number of visual concepts gives a boost to

the performance compared with our MED15 submission

Video Representation • Concatenate motion and DCNN-based

feature vectors • New feature vector in ℝ153781

• KSDA drastically reduces the feature vector dimensionality

Future work • Combination of the different modules for improving the

video search results

VERGE retrieval and presentation modules High Level Visual Concept Retrieval • 346 TRECVID SIN concepts using pre-trained DCNNs &

training with Linear SVMs • 205 Places scene concepts using GoogLeNet for landscape

recognition Visual Similarity Search module • Use of pre-trained DCNN on 5055 ImageNet categories &

selection of last pooling layer for keyframe representation • Nearest Neighbour search realized using Asymmetric

Distance Computation

Face detection and Face Retrieval Module • Face detection: detect facial landmarks using cascade of

DCNNs • Face feature extraction: extract DCNN descriptors using

VGG-Very-Deep-16 architecture & use of last FC layer as feature vector

• Construction of an IVFADC index for fast face retrieval

VERGE GUI http://mklab-services.iti.gr/trec2015/

Evaluation of INS results

Run ID MAP Recall

Run 1 0.114 1000/11197

VERGE video search engine

KSDA+LSVM software (executables and sources) available at: http://mklab.iti.gr/project/aksda

Video shot processing: Video shot annotation with concepts from two concept pools: A) 1000 ImageNet concepts - Late fusion of 5 different pre-trained Deep Convolutional Neural Nets (DCNNs) B) 345 TRECVID SIN concepts - 3 pre-trained ImageNet networks fine-tuned (FT) on these concepts • The best performing FT network was chosen • Two different approaches for video shot annotation • Direct output of the FT network • Linear SVM training with DCNN-based features

ITI-CERTH participation in TRECVID 2016 · 2016-11-29 · Supported by: ITI-CERTH participation in...

Documents

IRIM at TRECVID 2016: Instance Search › projects › tvpubs › tv16.papers › ... · 2016-11-29 · The TRECVID 2016 instance search task is described in the TRECVID 2016 overview

ITI-CERTH participation in TRECVID 2018 · Face detection and Face Retrieval Module: • Face detection: detect faces using Tiny face detection algorithm. • Face feature extraction:

TRECVID Evaluations

ITI-CERTH participation to TRECVID 2011

ITI-CERTH participation in TRECVID 2020 - NIST

Informatics and Telematics Institute Centre for Research and Technology Hellas ITI-CERTH Amsterdam, Multimedia Semantics XG, 10-11 July 2006 Vasileios

ITI-CERTH participation in TRECVID 2017bmezaris/publications/trecvid2017.pdf · This paper provides an overview of the runs submitted to TRECVID 2017 by ITI-CERTH. ITI-CERTH participated

ITI-CERTH in TRECVID 2016 Ad-hoc Video Search (AVS)ITI-CERTH in TRECVID 2016 Ad-hoc Video Search (AVS) Foteini Markatopoulou, Damianos Galanopoulos, Ioannis Patras, ... •A five-step

ITI-CERTH participation in TRECVID 2017bmezaris/publications/trecvid2017.pdf · 2018-02-21 · ITI-CERTH participation in TRECVID 2017 Foteini Markatopoulou1,2, Anastasia Moumtzidou

Automatic monitoring and detection of activities of daily ... · Data collection and annotation from CERTH/ITI Smart-home (power consumption from fridge, oven, cooking hood, dishwasher,

1. Pericles Mitkas (CERTH/ITI) - Welcome & Introduction to Cassandra Project

TRECVID 2016 : Instance Search

Pleros-AUTH CERTH Welcome

NISTInformation ITI-CERTH participation to TRECVID 2015 Technologies Foteini Markatopoulou1'2 Anastasia Ioannidoul Christos Tzelepis1'2 Theodoros Mironidisl Damianos Galanopoulosl

D7.1.3 Final dissemination report - CORDIS...Tasks: T7.1: VIS-SENSE knowledge dissemination (led by CERTH/ITI), T7.2: Dissem-ination material production (led by FhG/IGD), T7.3: Networking

Informedia at TRECVID 2003

TRECVID 2016 POSTER CERTH-ITI

TRECVID-2009: Search Task

ITI-CERTH participation in TRECVID 2016 participation in TRECVID 2016 Foteini Markatopoulou1,2, Anastasia Moumtzidou 1, Damianos Galanopoulos , Theodoros Mironidis 1, Vagia Kaltsa

December 2, 2013 Thessaloniki, Greece GNORASI WORKSHOP Charalampos Doulaverakis CERTH/ITI Knowledge and processing algorithms for remote sensing data Reasoning