Video Search Engines and Content-Based Retrieval Steven C.H. Hoi CUHK, CSE 18-Sept, 2006

Video Search Engines and

Content-Based Retrieval

Steven C.H. Hoi

CUHK, CSE

18-Sept, 2006

Outline

Video Search Engines

Content-Based Video Retrieval

Video Search Engines

A survey of state-of-the-arts

Introduction

Who are doing video search engines?

Top text search engines5.6 billion searches

07/2006

Introduction Google

Introduction Yahoo

Introduction MSN/Live Search

Introduction YouTube

Business Models Web Advertising

Site Volume, or keyword customized Video Ads

Disable controls (MSN) Subscription

MLB, Real Download to own

iTunes, Movie Rental

Limited time, number of plays Other

Desktop Media Search Media player (jukebox) Media Monitoring Media Asset Management

Types of video Sites Content Originators

Major Broadcasters Affiliates, Local News Major League Baseball

Syndication, Aggregation, “Internet Broadcasters” Rental, purchase, advertising, subscription MSN, Google, iTunes ROO Media, FeedRoom

Movie and Video Download Share portals

Consumer content, blogs YouTube, Putfile, Vsocial, Google, Akimbo

Traditional Search Engines (Crawl) / “RSS” Yahoo, Blinkx

Other Public (Internet Archive) Media Monitoring, asset management systems

Video Search Challenges

Current Video Search Engines

Metadata File type and context Media file attributes

Size, length Structured global metadata

RSS content description

Content Content Indexing

Search within a video Full text of dialog Image or video content

Automated Content Indexing

Current Video Search Engines

Content Search Engines

Keyword search with transcripts from speech recognition

Content-Based Video Search Engine

Architecture

Video Processing

Research ChallengesSpeech RecognitionShot Boundary DetectionVideo Story Segmentation Concept DetectionMulti-modal Fusion for Ranking

Text/ASR, Audio/Speech, Visual, etc.

Content-Based Retrieval

Our Research ProblemLearning to rank video shots for automatic

content-based search tasks !

ChallengesMulti-Modal Information FusionSmall Sample Learning (a few pos. & no neg.)Learning on large-scale datasets

Multi-modal and Multi-scale Ranking Framework

Main IdeasRepresenting video structures by graphsUsing semi-supervised learning to address

small labeled sample learning problemFusing Multi-modal information by Harmonic

learning over graphsMulti-scale ranking for achieving efficient

performance on large-scale datasets

Graph-based Modeling

StoryText

Semi-Supervised Learning on GraphTo find an optimal real-valued function

g: VR on the graph GTo minimize a quadratic energy function:

Using Gaussian field and Harmonic property of Spectral Graph Theory (J. Zhu’s ICML’03), a harmonic function g can be found:

Semi-Supervised Learning on GraphLet

The solution of the harmonic function g can be expressed in matrix operations:

Multi-Modal Fusion over GraphTo combine text information into SSL on visual

modality, we consider the text inputs as the attached nodes on the visual graph:

Visual - g

Text - f

ChallengesNumber of examples in database: N is large

For examples:TRECVID 2005: Rep. Key-Frames N = 45,765TRECVID 2006: Rep. Key-Frames N = 79,487

How to do Semi-Supervised Learning?!

Multi-Scale RankingLearning ranking through multi-scale rerankingEach stage is associated with different

computational costsIn our solution, four ranking stages include:

Ranking by Text Retrieval using Language ModelsRe-ranking by NN fusing Text and VisualRe-ranking by SVM fusing Text and VisualRe-ranking by multi-modal Semi-supervised Learning

Video Search Engines and Content-Based Retrieval Steven C.H. Hoi CUHK, CSE 18-Sept, 2006

Documents

C.H PLATANAL

CUHK Mathematics

Cm. Wojskowy C.H. ARKADIA Rondo „Radosława” C.H. KLIF · C.H. ARKADIA C.H. KLIF Rondo „Radosława” opiełuszki a c Wilsona da awła II a Słomińskiego awki Cm. Wojskowy

Abstract - CUHK CSE

NEAR-DUPLICATE KEYFRAME RETRIEVAL BY …lyu/presentation/acm_mm08.pdf · NEAR-DUPLICATE KEYFRAME RETRIEVAL BY NONRIGID IMAGE MATCHING Jianke Zhu, CUHK Steven C.H. Hoi, NTU Michael

CUHK FAA TCSS

M.PHIL.-PH.D. - CUHK

Ba - CUHK CSE

Large-Scale Text Categorization By Batch Mode Active Learning Steven C.H. Hoi †, Rong Jin ‡, Michael R. Lyu † † CSE Department, Chinese University of Hong

Catalogo c.h

* BvmMJ - CUHK Mathematics

Round 03/ - CUHK Business School Master's - CUHK Business

Cuhk system 14oct_2

Online Multiple Kernel Classification Steven C.H. Hoi, Rong Jin, Peilin Zhao, Tianbao Yang Machine Learning (2013) Presented by Audrey Cheong Electrical

Tom Z.J. Fu, CUHK W. T. Leung, CUHK P. Y. Lam, CUHK Dah Ming Chiu, CUHK Zhibin Lei, ASTRI

C.H Enterprise

AITT CUHK T

Cuhk brochure oct2013

Homepage - CUHK CSE

CUHK booklet 1