33
VisualLabel – Video Analytics Made Open in a Big Way! Jarno Kallio / PacketVideo Finland Oy on behalf of VisualLabel team

FORGE 1 year: Jarno Kallio, PacketVideo

  • Upload
    digile

  • View
    71

  • Download
    2

Embed Size (px)

Citation preview

Page 1: FORGE 1 year: Jarno Kallio, PacketVideo

VisualLabel –Video Analytics Made Open

in a Big Way!

Jarno Kallio / PacketVideo Finland Oyon behalf of VisualLabel team

Page 2: FORGE 1 year: Jarno Kallio, PacketVideo

Motivation 1/2

• Applications utilizing “Big data” computing are expected to provide an ICT breakthrough in the next years.

• Big data computing started from Web search engines 15 years ago.

• Now “googling” has become the standard of searching and analyzing textual information.

• Extending this to media items: images and videos is still incomplete and existing in-house solutions only available to big IT giants.

Page 3: FORGE 1 year: Jarno Kallio, PacketVideo

Motivation 2/2

• Implementing such an media analytics service has not been possible until the recent developments in cloud technology

• Cloud platform’s such as Forge provide computational resources for resource intensive content analysis services.

• Additionally, Forge has very unique characteristics of being shared sandbox for open collaboration between companies & universities

Page 4: FORGE 1 year: Jarno Kallio, PacketVideo

Demo:

Video Analysis

using PicSOM system

You will see authentic unprocessed

object recognition results as green

subtitles over the video..

Page 5: FORGE 1 year: Jarno Kallio, PacketVideo

Demo about Video Analysis Capabilities:

Example clip (2:19) is an trailer from open source movie “Vaalkama”

Page 6: FORGE 1 year: Jarno Kallio, PacketVideo

Achieved Goals So Far:

1. Design a media analysis service for photos and videos

2. Design integration specification for content analysis back-ends and cloud storage • Publicly available REST APIs • data formats…

3. Implement prototype applications that demonstrate the basic system features

Page 7: FORGE 1 year: Jarno Kallio, PacketVideo

Target for 2015:

4. Refine reference service and platform to be easily available for testing by 3rd party

5. Design and validate self-learning algorithms for media content analysis

Page 8: FORGE 1 year: Jarno Kallio, PacketVideo

Challenges.. and how Forge came into rescue!

Before Forge we run couple of pilots with test users. The results itself look promising but...• Getting all 3 services running smoothly on separate

university and company clouds was a pain because..1. We didn’t have separate stable and dev env2. University cloud environment where sometimes

running other services -> offline for test users

• Now when both the frontend and backend services are running in Forge – problems where solved!

Page 9: FORGE 1 year: Jarno Kallio, PacketVideo

How it All Works? 1/3:FORGE Deployment

Page 10: FORGE 1 year: Jarno Kallio, PacketVideo

How it All Works? 2/3:Task Based Analysis

Page 11: FORGE 1 year: Jarno Kallio, PacketVideo

How it All Works? 3/3: Additional Details

• Performed for content stored on third party providers• No media storage, only metadata is synchronized• Support for popular content providers

(e.g. Google, Facebook, Twitter)• On-demand analysis for directly uploaded content• Generate tags (keywords) from media content• Generate sentences that describe the image in the

English language• Facial recognition• Detect similarities for extended search functionality

Page 12: FORGE 1 year: Jarno Kallio, PacketVideo

Back-end

Analysis Systems:

PicSOM (Aalto),

Tag-Engine (Arcada),

CMUVIS (TUT-SGN)

Page 13: FORGE 1 year: Jarno Kallio, PacketVideo

PicSOM media analysis system

• Developed at Aalto University since 1998• Content analysis of images and videos with a large variety of

visual features, including many types of SIFT, Fisher Vector and deep convolutional neural network features

• Provides also interactive search with relevance feedback based iterative query refinement

• Uses very fast linear Support Vector Machines for content classification and visual category detection

• Self-Organizing Maps are used for iterative search and class distribution analysis

• Participated in NIST's annual video search evaluation TRECVID since 2005 and ranked 2nd in the Semantic Indexing task in 2014

Page 14: FORGE 1 year: Jarno Kallio, PacketVideo

Automatic Tag Extraction from Social Media for Visual Labeling (Arcada)

1. FacebookAnalyzer• Retrieves basic Facebook profile content and generates

from it tags of user’s interest and hobbies.

2. TwitterAnalyzer• Retrieves tweet-image pairs from public Twitter

accounts • Analyze the Tweet text to extract hashtags, named

entities, keywords and phrases• Post-processing to remove noisy tags

Page 15: FORGE 1 year: Jarno Kallio, PacketVideo

Core of the Tag-Engine

• Make use of existing content structure information to identify relevant parts of the content

• Allow preferences in content weighting to customize the system for different types of profiles

• Heuristic rules, Statistical term weighting method Term TF-IDF weighting, with adjustment

N-grams Named entities Hashtags

• Simple, generic, handling multiple languages

Page 16: FORGE 1 year: Jarno Kallio, PacketVideo

Twitter Analysis: English TweetPresident Obama meets with @VP Biden and members of his National Security Council in the Situation Room. pic.twitter.com/SDWvZ1sSnL

"Neil Armstrong, Buzz Aldrin and Michael Collins took the 1st small steps of our giant leap into the future." —Obama pic.twitter.com/OVwaxP1kgm

Text Tagssituation roompresident obamameetsnational security councilmembers of his nationalcouncil in the situationobama meets with @vpmeets with @vp biden@vp biden and members

Named Entity TagsObama

HashtagsVP

Text Tagsneil armstronggiant leap1st small stepssteps of our giantbuzz aldrin and michaelcollins took the 1staldrin and michaelcollinsleap into the future

Named Entity TagsObamaNeil ArmstrongBuzz AldrinMichael Collins

Page 17: FORGE 1 year: Jarno Kallio, PacketVideo

Twitter Analysis: Finnish TweetOlympialaisten avajaisissa nähtiin koreita kuvioita– ja osasi se Akukin jo 20 vuotta sitten... #Sotshi#Lillehammer pic.twitter.com/ZRwl9PfLZw

#Facebook täyttää tänään 10. Vuodesta 2010 mukana ollut Aku Ankka onnittelee. #some#pärstäpankki pic.twitter.com/tO74l0Jc7o

Text Tagsopening ceremonyolympic gameskoreita patterns -20 years agoknew it akukinpatterns - and knewceremony of the olympicakukin already 20 yearsgames was seen koreita

Named Entity TagsAkukinHashtagsSotshiLillehammerText Tags in Original Language

Text Tagstoday 10donald duck congratulatesinvolved had donaldduck2010 involved had donald

tänään 10aku ankka onnitteleemukana oli aku ankka2010 mukana oli aku

Named Entity TagsDonald Duckaku ankka

HashtagsFacebooksomepärstäpankki

Page 18: FORGE 1 year: Jarno Kallio, PacketVideo

Motivation and Goals

• Goal(s):

• Detection of different indoor furniture types in user uploaded images.

• Automatically recognize different furniture types in vender provider

images.

Page 19: FORGE 1 year: Jarno Kallio, PacketVideo

Cloud MUVIS: Visual Label Project

Page 20: FORGE 1 year: Jarno Kallio, PacketVideo

Cloud MUVIS Architecture

Images/Audio

s/Videos/Text

Data

Partitio

n

Machine Learning

(Feature Extraction)

Offline

Processing

Images/Audios

/Videos/Text

Online

ProcessingMatch

score/Recommendation

Page 21: FORGE 1 year: Jarno Kallio, PacketVideo

Challenges

• Unlimited Object

Categories

• Appearance Variation

• Object Pose

• View Angle

• Illumination Variation

• Occlusions

• Image Quality Variation

Page 22: FORGE 1 year: Jarno Kallio, PacketVideo

Prototype

Applications

Page 23: FORGE 1 year: Jarno Kallio, PacketVideo

Prototype Application 1#Smart Photo Service

Page 24: FORGE 1 year: Jarno Kallio, PacketVideo

Prototype Application 1#Smart Photo Service

Page 25: FORGE 1 year: Jarno Kallio, PacketVideo

Prototype Application 1#Smart Photo Service

Page 26: FORGE 1 year: Jarno Kallio, PacketVideo

Prototype Application 2#Facebook Profile Summarization

Page 27: FORGE 1 year: Jarno Kallio, PacketVideo

Prototype Application 2#Facebook Profile Summarization

Page 28: FORGE 1 year: Jarno Kallio, PacketVideo

Partners:

Page 29: FORGE 1 year: Jarno Kallio, PacketVideo

Thank You!

Q & A

Page 30: FORGE 1 year: Jarno Kallio, PacketVideo

Backup

Slides

Page 31: FORGE 1 year: Jarno Kallio, PacketVideo

Summarization

•Analysis of user’s social media accounts•Generate tags from profile content (posts, tweets, timelines...)•Provide tags as suggestions for content modifications

Page 32: FORGE 1 year: Jarno Kallio, PacketVideo

Search

•Text-based search using pre-generated keywords•Similarity search using features extracted from analyzed content•On-demand similarity search

–Direct file upload–URL content

•Search targeted to user’s synchronized content (accounts), not public Internet

Page 33: FORGE 1 year: Jarno Kallio, PacketVideo

Feedback

•Indirectly from user’s actions–Content modifications

•Directly from users–Feedback for search results

•Delivered to back-ends to facilitate self-learning algorithms