Annotating Music Collections: How Content-Based Similarity Helps to Propagate Tags

Annotating Music Collections: How Content-Based Similarity Helps to Propagate Labels

Mohamed Sordo, Cyril Laurier and Òscar Celma

(Music Technology Group, UPF)

Outline

Introduction

Motivation and goals

Evaluation

Demonstration

Conclusions

introduction

Massive increase in the volume of online music

Long Tail Economics

Make everything available

Help me find it

introduction:: last year's work...

Searchsounds (ISMIR, 2006)

Exploits MP3-blogs info

Natural Lang. Processing

Audio similarity (more like this)

motivation:: annotating music collections

Tag sparsity

Postal Service - Such Great Heights

2.4 million plays

100s of tags

Mike Shupp - All Over Town

3 plays

0 tags

Cold start problem for new songs

goals

Ease the process of annotating large music collections

No learning from audio to tags, but...

Content-based similarity to propose tags among songs

Timbre, rhythm, tonality, etc.
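
The content-based step can be pictured as a nearest-neighbour lookup over audio descriptors. The sketch below is only an illustration: the slides do not say which features or distance the authors use, so the three-value vectors, the Euclidean distance and the name `nearest_songs` are all assumptions.

```python
import math

def nearest_songs(query, collection, k=10):
    """Return the ids of the k songs whose feature vectors are closest to query.

    collection maps song id -> feature vector (a list of floats). The vectors
    stand in for timbre/rhythm/tonality descriptors; Euclidean distance is an
    assumption made for this sketch, not the measure used in the talk.
    """
    ranked = sorted(collection, key=lambda sid: math.dist(query, collection[sid]))
    return ranked[:k]

# Hypothetical 3-dimensional descriptors (one value each for timbre, rhythm, tonality).
songs = {
    "song_a": [0.9, 0.8, 0.1],
    "song_b": [0.8, 0.9, 0.2],
    "song_c": [0.1, 0.2, 0.9],
}
similar = nearest_songs([0.9, 0.8, 0.15], songs, k=2)
```

Here `similar` holds the two closest songs, most similar first; their tags are what would be propagated to the query song.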

our proposal

Annotated

wisdom of (song) crowds

FOR FREE!!!

evaluation...? wisdom of crowds & feedback


Relevance feedback from the users

Spencer Tunick's wisdom of crowds

http://www.flickr.com/photos/guacamoleproject/487376758/

our proposal

new song, no tags

rock, Indie, Cute, guitar, Drums, 90s, Fast, Weird, twee, Quirky

noise pop, Cute, playful, Drums, 80s, Fast, Weird, Sweet, Quirky

Cute, rock, pop, 90s, Fun

metal, rock, Edgy, guitar, thrash, 90s, Fierce, Weird, concert, Loud

??????????????????

thrash, loud, metal, death

thrash, rock, Edgy, gothic, heavy metal, 90s, Weird, concert, Loud

our proposal

propose tags (wisdom of song crowds)

rock, Indie, Cute, guitar, Drums, 90s, Fast, Weird, twee, Quirky

noise pop, Cute, playful, Drums, 80s, Fast, Weird, Sweet, Quirky

Cute, rock, pop, 90s, Fun

metal, rock, Edgy, guitar, thrash, 90s, Fierce, Weird, concert, Loud

thrash, loud, metal, death

thrash, rock, Edgy, gothic, heavy metal, 90s, Weird, concert, Loud

thrash, rock, 90s, Loud, metal, Weird
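
One simple way to turn the tags of similar songs into proposals is to score each tag by the share of neighbours that carry it and keep those above a threshold. This is a minimal sketch under that assumption, not the scoring used in the paper; `propose_tags`, `min_share` and the toy tag lists (loosely based on this slide) are hypothetical.

```python
from collections import Counter

def propose_tags(neighbour_tags, min_share=0.5):
    """Propose tags for an untagged song from the tags of its similar songs.

    neighbour_tags is a list of tag lists, one per similar song. A tag's score
    is the share of neighbours that carry it; tags scoring at least min_share
    are returned as (tag, score) pairs, highest score first, ties broken
    alphabetically. The threshold is a hypothetical choice.
    """
    counts = Counter()
    for tags in neighbour_tags:
        counts.update({t.lower() for t in tags})  # count each tag once per song
    n = len(neighbour_tags)
    scored = [(tag, c / n) for tag, c in counts.items() if c / n >= min_share]
    return sorted(scored, key=lambda item: (-item[1], item[0]))

# Toy neighbour tags, abridged for illustration.
neighbours = [
    ["metal", "rock", "thrash", "Loud"],
    ["thrash", "loud", "metal", "death"],
    ["thrash", "rock", "gothic", "Loud"],
]
proposed = propose_tags(neighbours, min_share=0.6)
```

Tags carried by only one of the three neighbours ("death", "gothic") fall below the threshold and are not proposed.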

our proposal

users validate/add tags

rock, Indie, Cute, guitar, Drums, 90s, Fast, Weird, twee, Quirky

noise pop, Cute, playful, Drums, 80s, Fast, Weird, Sweet, Quirky

Cute, rock, pop, 90s, Fun

metal, rock, Edgy, guitar, thrash, 90s, Fierce, Weird, concert, Loud

thrash, loud, metal, death

thrash, rock, Edgy, gothic, heavy metal, 90s, Weird, concert, Loud

thrash, rock, 90s, Loud, metal, Weird

experiments

Ground Truth

5481 annotated songs from Magnatune (John Buckman)

29 different labels (rock, instrumental, classical, relaxing, etc.)

Avg. 3 tags per song

Only CB similarity, no user feedback

2 experiments

All tags

Only mood tags (smaller collection)

experiments:: results

Some basic comments...

Increasing number of similar songs (10, 20, 30,...)

Bad recall, precision

Best results: ~10 similar songs

Using album, artist filtering

Need to increase the number of similar songs

Best results: ~20 similar songs
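
The slides do not spell out what album and artist filtering does. A common reading, assumed here, is that songs sharing the query's artist or album are dropped from the similar-song list, which would explain why more similar songs are then needed. The function name and the dictionary keys below are hypothetical.

```python
def filter_same_artist_album(query, candidates):
    """Drop candidates that share the query song's artist or album.

    Songs are dicts with hypothetical keys "id", "artist" and "album";
    candidates is assumed to be ordered from most to least similar, and
    that order is preserved in the result.
    """
    return [
        song for song in candidates
        if song["artist"] != query["artist"] and song["album"] != query["album"]
    ]

query = {"id": "q", "artist": "Artist A", "album": "Album 1"}
ranked = [
    {"id": "s1", "artist": "Artist A", "album": "Album 1"},
    {"id": "s2", "artist": "Artist A", "album": "Album 2"},
    {"id": "s3", "artist": "Artist B", "album": "Album 3"},
    {"id": "s4", "artist": "Artist C", "album": "Album 4"},
]
kept = filter_same_artist_album(query, ranked)
```

In this toy list the two closest candidates are removed, so the similar-song list has to reach further down the ranking to collect the same number of neighbours.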

Metrics

Precision / Recall (F-measure, alpha = 2)

Spearman Rho

(Details on the paper!)

demo

Searchsounds, with users

~250,000 full songs

~48% of songs annotated, avg. 2 tags per song

from CDBaby, Magnatune, etc.

conclusions

Extend annotations in a music collection

General experiment

10 similar songs, recall > 0.4

40%+38% = 78%

Mood experiment

P@1 = 0.5

30%+35% = 65%

Avoid cold start problem (new song)

Limitations

General experiment: low precision, because some (valid) proposed tags were not in the ground truth (GT)

Mood experiment: Mysterious label

questions?

Preguntes?

Preguntas?

Questions ?

Fragen ?

experiments:: metrics

SongId usana31605

Manually annotated (ground truth)

Classical, Instrumental, Baroque, Piano

Proposed labels (and frequency)

Instrumental (0.55), Classical (0.40), Baroque (0.26), Harpsichord (0.24)

Precision = TP / (TP + FP)

3 / 4 (added Harpsichord)

Recall = TP / (TP + FN)

3 / 4 (Piano is missing)

F-measure (alpha = 2 , more weight to Recall)

Spearman-Rho
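
The precision and recall figures for this song can be checked with a few lines of code. The sketch below reads the slide's "alpha = 2" as the weight beta = 2 in the standard F-beta formula, F = (1 + beta^2) P R / (beta^2 P + R); that reading is an assumption, and `precision_recall_f` is a hypothetical helper. Spearman's rho is not covered here.

```python
def precision_recall_f(ground_truth, proposed, beta=2.0):
    """Return (precision, recall, F-beta) for one song's proposed labels.

    beta > 1 gives more weight to recall; beta = 2 is assumed to match the
    slide's "alpha = 2".
    """
    gt, prop = set(ground_truth), set(proposed)
    tp = len(gt & prop)   # proposed labels that are in the ground truth
    fp = len(prop - gt)   # proposed labels that are not in the ground truth
    fn = len(gt - prop)   # ground-truth labels that were not proposed
    precision = tp / (tp + fp) if prop else 0.0
    recall = tp / (tp + fn) if gt else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    b2 = beta ** 2
    f = (1 + b2) * precision * recall / (b2 * precision + recall)
    return precision, recall, f

ground_truth = ["Classical", "Instrumental", "Baroque", "Piano"]
proposed = ["Instrumental", "Classical", "Baroque", "Harpsichord"]
p, r, f = precision_recall_f(ground_truth, proposed)
```

Harpsichord counts as the one false positive and Piano as the one false negative, so precision and recall are both 3/4; because they are equal, the F-measure is also 3/4 whatever the value of beta.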

experiments:: results

How does CB perform...?

experiments:: results

With album, artist filtering...

experiments:: results

Recall the recall

ismir2007 // mohamed sordo & cyril laurier & òscar celma

ISMIR 2007 / Vienna, Austria / Sept. 27th 2007