Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Special Indices for LaaLaLaa...

Preview:

Citation preview

Special Indices for LaaLaLaa Lyric Special Indices for LaaLaLaa Lyric Analysis & Generation FrameworkAnalysis & Generation Framework

Dr. V. Madhan Karky,Tamil Computing Lab (TaCoLa),College of Engineering Guindy,

Anna University, Chennai.

OverviewOverview• Objective• Introduction• Background• Rhyme Schemes in Tamil• Meter Pattern• System Architecture• Indexing Structure• Indexing Algorithm• Results and Analysis

ObjectiveObjective

• To build special indices for the LaaLaLaa lyric analysis and generation framework to facilitate faster retrieval based on – Meter Pattern – Rhyme

IntroductionIntroduction

• Tamil is a vibrant language with a rich grammar,

vocabulary, an inherent poetic flavor.

• About 1000 lyrics are being written every year as

private albums, jingles and as original soundtracks of

mainstream movies.

BackgroundBackground

• WASP (Pablo Grevas (2000)) splits a given block of

text, identifies patterns and fits words from the

vocabulary to get verses of similar pattern.

• COLIBRI (Agudo, Grevas, Calero (2002)) follows a

case based approach to poem generation.

BackgroundBackground

• Tra-la-Lyrics (Oliviera, Cardoso, Pereira (2005))

finds out the beat pattern of the midi file and

places words with similar syllabic division and

stress pattern.

• Automatic generation of tamil lyrics for melodies

(Ramakrishnan, Kuppan, Devi (2009)) converts the

midi file to KNM reprsn and fits words to it from a

corpus. Phrases are meaningful as parts only. No

edhugai, monai or iyaybu.

BackgroundBackground

• LaaLaLaa (Sowmiya, Karky (2010)) talks of splitting raw

text from midi file to templates and filling them with

words from a wordnet according to the pattern mined

from an existing corpus of lyrics with due consideration

to rhyme, meaning and flow.

Rhyme Schemes in TamilRhyme Schemes in Tamil

• Two words are said to rhyme in

– Monai (மோ��னை�) - first letters are the same.– Edhugai (எதுனை�) - second letters are the same.– Iyaibu (இனைபு) - last letters are the same.

Rhyme Schemes in TamilRhyme Schemes in Tamil

• Examples:– பறனை� and பச்னை� rhyme in monai.– அரு�� and ��ருப்பு rhyme in edhugai.– �க்னை� and ��ழ்க்னை� rhyme in iyaibu.–அரு�� and குரு�� rhyme in edhugai and

iyaibu.– ���னை��ள் and ���ஞர்�ள் rhyme in all

the three schemes.

Meter PatternMeter Pattern

• Maathirai (��த்��னை�) - time taken to wink an eyelid.

• Maathirai based classification of Tamil alphabets.– Nedil (N) (நெ டில்) - Those alphabets which are pronounced for the

time interval of 2 maathirai.– Kuril (K) (குற#ல்) - Alphabets which take 1 maathirai to be

pronounced.– Mei (M) (நெ�ய்) - Alphabets which are pronounced for 0.5 maathirai.

• Meter pattern of a word refers to its Kuril Nedil Mei pattern. • For example, the Meter pattern of the word ப�டல் is NKM as ப�

is a Nedil(N), ட is a Kuril(K) and ல் is a Mei(M).

System ArchitectureSystem Architecture

Lyric DB

Lyric DB

Word Object Convertor

Rhyme PatternExtractor

Rhyme Extractor

Index Builder

Rhyme Meter Index

Indexing StructureIndexing StructurePart of Speech

மோ��னை�

MeterPattern1Letter1 Words

Letter2 Words

MeterPattern2Letter1 Words

Letter2 Words

எதுனை�

MeterPattern1Letter1 Words

Letter2 Words

MeterPattern2Letter1 Words

Letter2 Words

இனைபு

MeterPattern1Letter1 Words

Letter2 Words

MeterPattern2Letter1 Words

Letter2 Words

Indexing AlgorithmIndexing Algorithm

Result and AnalysisResult and Analysis

Word

Retrieval Time(in milliseconds)

Results and AnalysisResults and Analysis

• Retrieval complexity of both the approaches tested using a dataset of 500 tamil words.

• The average retrieval time in – Word indexed approach - 875.47millisecond – Meter Rhyme Indexed approach – 1.90millisecond

Results and AnalysisResults and Analysis

• The drastic decrease in retrieval time from O(α) to O(1) [α is the number of words in the database] is due to

– The use of hash-tables which are efficient for retrieval.

– Having separate hash-tables for the மோ��னை�, எதுனை� and இனைபு of each POS.

Thank You!!!Thank You!!!