Final Assignment Demo 11th Nov, 2012


Deepak Suyel
Geetanjali Rakshit
Sachin Pawar

CS 626 – Speech, NLP and the Web


Assignments

• POS Tagger
  – Bigram Viterbi
  – Trigram Viterbi
  – A-Star
  – Bigram Discriminative Viterbi

• Language Model (Word Prediction)
  – Bigram
  – Trigram

• Yago Explorer
• Parser Projection and NLTK


POS Tagger


Viterbi: Generative Model

• Most probable tag sequence given word sequence:

• Bigram Model:

• Trigram Model:
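The equations on this slide appear to have been images lost in extraction. A standard reconstruction of the generative objectives, for tag sequence T = t_1…t_n and word sequence W = w_1…w_n, is:

```latex
% Most probable tag sequence given the word sequence (Bayes rule, generative):
T^{*} = \arg\max_{T} P(T \mid W) = \arg\max_{T} P(W \mid T)\, P(T)

% Bigram model (first-order tag Markov assumption):
T^{*} \approx \arg\max_{T} \prod_{i=1}^{n} P(w_i \mid t_i)\, P(t_i \mid t_{i-1})

% Trigram model (second-order tag Markov assumption):
T^{*} \approx \arg\max_{T} \prod_{i=1}^{n} P(w_i \mid t_i)\, P(t_i \mid t_{i-1}, t_{i-2})
```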


Discriminative Bigram Model

• Most probable tag sequence given word sequence:
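The formula here was also an image in the original slide; the usual discriminative bigram formulation, consistent with the generative one above, is:

```latex
% Discriminative bigram: condition tag transitions on the word directly
T^{*} = \arg\max_{T} P(T \mid W) \approx \arg\max_{T} \prod_{i=1}^{n} P(t_i \mid t_{i-1}, w_i)
```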


A-star Heuristic

• A: Highest transition probability
  – Static score, found directly from the learned model
• B: Highest lexical probability in the given sentence
  – Dynamic score
• Min_cost = -log(A) - log(B)
• h(n) = Min_cost * (no. of hops till goal state)
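A minimal sketch of the heuristic as described above. Function and parameter names are illustrative, not from the original code; A and B would come from the learned transition and lexical models.

```python
import math

def astar_heuristic(hops_to_goal, max_transition_prob, max_lexical_prob):
    """Cheapest imaginable cost per hop (best transition * best lexical
    probability, in negative-log space), times the remaining hops."""
    min_cost = -math.log(max_transition_prob) - math.log(max_lexical_prob)
    return min_cost * hops_to_goal

# e.g. best transition prob 0.5 and best lexical prob 0.25:
# each remaining hop costs at least -log(0.5) - log(0.25)
h = astar_heuristic(3, 0.5, 0.25)
```

Because no actual path can be cheaper per hop than this bound, the heuristic never overestimates the remaining cost.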


Comparison of different flavours of POS Taggers

POS Tagger                       Correct    Total      Accuracy (%)
Bigram Generative Viterbi        812188     862785     94.14
Trigram Generative Viterbi       814505     862785     94.40
A-Star                           793441     862785     91.96
Bigram Discriminative Viterbi    796890     862785     92.36


Language Model


Next word prediction: Bigram Model

• Using language model on raw text

• Using language model on POS tagged text
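As a sketch of the raw-text variant: train bigram counts on a corpus and predict the most frequent successor of the previous word. The toy corpus and function names are illustrative, not the assignment's actual code or data.

```python
from collections import Counter, defaultdict

def train_bigram(tokens):
    """Count bigrams: following[w] is a Counter of words seen after w."""
    following = defaultdict(Counter)
    for prev, cur in zip(tokens, tokens[1:]):
        following[prev][cur] += 1
    return following

def predict_next(following, word):
    """Predict the most frequent word observed after `word`."""
    if not following[word]:
        return None
    return following[word].most_common(1)[0][0]

corpus = "the cat sat on the mat and the cat ran".split()
model = train_bigram(corpus)
print(predict_next(model, "the"))  # "cat" follows "the" twice, "mat" once
```

The POS-tagged variant conditions on the (tag, word) pair of the previous token instead of the word alone, which is what the AJ0_/NN1_ prefixes in the later example slides show.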


Next word prediction: Trigram Model

• Using language model on raw text

• Using language model on POS tagged text


Metrics: Comparing Language Models

• We have used “Perplexity” for comparing the two language models:
  – Language model using only the previous word
  – Language model using the previous word as well as the POS tag of the previous word
• Perplexity is the weighted average branching factor, calculated as:
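The formula itself was an image in the original slide; the standard definition for a bigram model over a held-out sequence of N words is:

```latex
% Perplexity: inverse probability of the test set, normalized by length
PP(W) = P(w_1 w_2 \ldots w_N)^{-1/N}
      = \sqrt[N]{\prod_{i=1}^{N} \frac{1}{P(w_i \mid w_{i-1})}}
```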


Results

• Raw text LM:
  – Word Prediction Accuracy: 12.97%
  – Perplexity: 5451

• POS tagged text LM:
  – Word Prediction Accuracy: 13.24%
  – Perplexity: 5002


Examples
Raw Text - Incorrect vs. POS tagged Text - Correct

• porridgy liquid is : fertiliser
• AJ0_porridgy NN1_liquid is : is

• malt dissolve into : terms
• NN1_malt VVB_dissolve into : into

• also act as : of
• AV0_also VVB_act as : as


Examples (Contd.)

• about english literature : and
• PRP_about AJ0_english literature : literature

• spoken english was : literature
• AJ0_spoken NN1_english was : was


Yago Explorer


Yago Explorer

• Made use of:
  – WikipediaCategories
  – WordNetCategories, and
  – YagoFacts

• Modified Breadth First Search (BFS).


Algorithm

• Input: Entities E1, E2
• Output: Paths between E1 and E2
• Procedure:
  1. Find WikipediaCategories for E1 and E2. If any category matches, return.
  2. Find WordNetCategories for E1 and E2. If any match found, return.
  3. Find YagoFacts for E1 and E2. If any match found, return.
  4. Expand YagoFacts for E1 and E2. For each pair of entities from E1 and E2, repeat steps 1-4.
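The procedure above can be sketched as a bounded pairwise expansion. The data structures here are illustrative stand-ins, not the actual Yago API: `categories` collapses steps 1-3 into one entity-to-categories map, and `facts` maps an entity to its related entities.

```python
def find_connection(e1, e2, categories, facts, depth=0, max_depth=2):
    """Modified BFS: return shared categories of e1 and e2, expanding
    each entity's facts pairwise up to max_depth if none match."""
    shared = categories.get(e1, set()) & categories.get(e2, set())
    if shared:
        return sorted(shared)
    if depth >= max_depth:
        return None
    # Step 4: expand related entities of both sides and recurse pairwise.
    for n1 in facts.get(e1, set()):
        for n2 in facts.get(e2, set()):
            found = find_connection(n1, n2, categories, facts,
                                    depth + 1, max_depth)
            if found:
                return found
    return None

# Toy data mirroring Ex. 1 from the slides:
categories = {
    "Gandhinagar": {"Indian_capital_cities"},
    "New_Delhi": {"Indian_capital_cities"},
}
facts = {
    "Narendra_Modi": {"Gandhinagar"},
    "Indian_National_Congress": {"New_Delhi"},
}
print(find_connection("Narendra_Modi", "Indian_National_Congress",
                      categories, facts))  # ['Indian_capital_cities']
```

A real implementation would also record the edge labels (livesIn, isLocatedIn, …) to reconstruct the printed paths shown in the example slides.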


Ex 1: Narendra Modi and Indian National Congress

• Path from E1: Narendra_Modi --livesIn--> Gandhinagar; Gandhinagar --category--> Indian_capital_cities
• Path from E2: Indian_National_Congress --isLocatedIn--> New_Delhi; New_Delhi --category--> Indian_capital_cities
• Path from E1: Narendra_Modi --isAffiliatedTo--> Bharatiya_Janata_Party; Bharatiya_Janata_Party --category--> Political_parties_in_India
• Path from E2: Indian_National_Congress --category--> Political_parties_in_India


Ex 2: Mahesh Bhupathi and Mother Teresa

• Path from E1: Mahesh_Bhupathi --livesIn--> Bangalore; Bangalore --category--> Metropolitan_cities_in_India
• Path from E2: Mother_Teresa --diedIn--> Kolkata; Kolkata --category--> Metropolitan_cities_in_India
• Path from E1: Mahesh_Bhupathi --hasWonPrize--> Padma_Shri; Padma_Shri --category--> Civil_awards_and_decorations_of_India
• Path from E2: Mother_Teresa --hasWonPrize--> Bharat_Ratna; Bharat_Ratna --category--> Civil_awards_and_decorations_of_India


Ex 3: Michelle Obama and Frederick Jelinek

• Path from E1: Michelle_Obama --graduatedFrom--> Princeton_University; Princeton_University --category--> university_108286569
• Path from E2: Frederick_Jelinek --graduatedFrom--> Massachusetts_Institute_of_Technology; Massachusetts_Institute_of_Technology --category--> university_108286569


Ex 4: Sonia Gandhi and Benito Mussolini

• Path from E1: Sonia_Gandhi --isCitizenOf--> Italy; Italy --dealsWith--> Germany; Germany --isLocatedIn--> Europe
• Path from E2: Benito_Mussolini --isAffiliatedTo--> National_Fascist_Party; National_Fascist_Party --isLocatedIn--> Rome; Rome --isLocatedIn--> Europe


Ex 5: Narendra Modi and Mohan Bhagwat

• Path from E1: Narendra_Modi --isAffiliatedTo--> Bharatiya_Janata_Party; Bharatiya_Janata_Party <--isAffiliatedTo-- Hansraj_Gangaram_Ahir
• Path from E2: Mohan_Bhagwat --wasBornIn--> Chandrapur; Chandrapur <--livesIn-- Hansraj_Gangaram_Ahir


Parser Projection


Example
E: Delhi is the capital of India
H: dillii bhaarat kii raajdhaani hai

E-parse: [ [ [Delhi]NN ]NP [ [is]VBZ [ [the]ART [capital]NN ]NP [ [of]P [ [India]NNP ]NP ]PP ]VP ]S
H-parse: [ [ [dillii]NN ]NP [ [ [ [ [bhaarat]NNP ]NP [kii]P ]PP [raajdhaanii]NN ]NP [hai]VBZ ]VP ]S


Resource and Tools

• Parallel corpora in two languages L1 and L2

• Parser for language L1

• Word translation model
• A statistical model of the relationship between the syntactic structures of the two languages (can be effectively learned from a bilingual corpus by an unsupervised learning technique)


Challenges

• Conflation across languages
  – “goes” → “जाता है”

• Phrase-to-phrase translation required; some phrases are opaque to translation
  – E.g. phrases like “piece of cake”

• Noise introduced by misalignments


Natural Language Toolkit (NLTK)

• It is a platform for building Python programs to work with human language data.

• It provides easy-to-use interfaces to over 50 corpora and lexical resources such as WordNet.

• It has a suite of text processing libraries for classification, tokenization, stemming, tagging, parsing, and semantic reasoning.
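As a small illustration, two pieces of the toolkit that run without any corpus downloads: the Porter stemmer from its text-processing suite, and the n-gram utility that underlies the n-gram taggers and language models mentioned on the next slides.

```python
from nltk.stem import PorterStemmer
from nltk.util import ngrams

stemmer = PorterStemmer()
# Porter stemming strips the suffix and the doubled consonant:
print(stemmer.stem("tagging"))  # -> tag

# Sliding-window bigrams over a token list:
print(list(ngrams("the cat sat".split(), 2)))  # -> [('the', 'cat'), ('cat', 'sat')]
```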


NLTK Modules

Language processing task    NLTK modules                  Functionality
Collocation discovery       nltk.collocations             t-test, chi-squared, point-wise mutual information
Part-of-speech tagging      nltk.tag                      n-gram, backoff, Brill, HMM, TnT
Classification              nltk.classify, nltk.cluster   decision tree, maximum entropy, naive Bayes, EM, k-means
Chunking                    nltk.chunk                    regular expression, n-gram, named-entity
Parsing                     nltk.parse                    chart, feature-based, unification, probabilistic, dependency


NLTK Modules (Contd.)

Language processing task     NLTK modules               Functionality
Semantic interpretation      nltk.sem, nltk.inference   lambda calculus, first-order logic, model checking
Evaluation metrics           nltk.metrics               precision, recall, agreement coefficients
Probability and estimation   nltk.probability           frequency distributions, smoothed probability distributions
Applications                 nltk.app, nltk.chat        graphical concordancer, parsers, WordNet browser, chatbots
Linguistic fieldwork         nltk.toolbox               manipulate data in SIL Toolbox format
