View
216
Download
0
Tags:
Embed Size (px)
Citation preview
Pragmatics I: Reference resolution
Ling 571
Fei Xia
Week 7: 11/8/05
Outline
• Discourse: a related group of sentences– Ex: articles, dialogue, ….
• Pragmatics: the study of the relation between language and context-of-use– Reference resolution– Discourse structure
Reference resolution
Reference resolution
• Some terms: referents, referring expression• Discourse model• Types of referring expression• Types of referents• Constraints and preference for reference
resolution• Some algorithms for reference resolution
Some terms
• Ex: John bought a book yesterday. He thought it was cheap.
• Referring expression: the expression used to refer to an entity:– Ex: John, a book, he, it
• Referent: an entity that is referred to.
Some Terms (cont)
• Co-reference: two or more referring expressions refer to the same entity: e.g., “John” and “he”.
– Antecedents: a referring expression that licenses the use of others. Ex. John
– Anaphora: reference to an entity that has been previous introduced. Ex: he
Discourse Model
• A discourse model stores the representations of entities that have been referred to in the discourse and the relationships in which they participate.
• Two operations:– Evoke: first mention– Access: subsequence mention
John He
Refer (evoke) Refer (access)
Corefer
Five types of referring expressions
• Indefinite NPs: a car
• Definite NPs: the car
• Pronouns: it
• Demonstratives: this, that
• One-anaphora: one
Indefinite NPs
• Introduce entities that are new to the hearer
• The entity may or may not be identifiable to the speaker:– I saw an Acura today. (Specific reading)– I am going to the dealership to buy an Acura today.
(specific or non-specific)• I hope that they still have it. (Specific)• I hope that they have a car I like. (non-specific)
Definite NPs
• Identifiable to the hearer– I saw an Acura today. The Acura … (explicitly mentioned before in the context)
– The Eagles …. (the hearer’s knowledge about the world)
– The largest company in Seattle announced … (inherently unique)
Pronouns
• Pronouns refer to something that is identifiable to the hearer.
• The referent must have a high degree of salience in the discourse model.
• Pronouns can participate in cataphora, in which they appear before their referents.– Ex: Before he bought it, John checked over the
Acura very carefully.
Demonstratives
• Demonstratives refer to something that is identifiable to the hearer.
• They are used alone or as a determiner:– Ex: I want this. I want this car.
• “this” indicating closeness, “that” signaling distance: spatial/temporal distance.
One-anaphora
• “One” “One of them”• It selects a member from a set that is identifiable
to the hearer. • Ex:
– He had a BMW before, now he got another one.– Is he the one?– You like this one, or that one?– John has two BMWs, but I have only one.– One should not pay more than 20K for a Camry.
Five types of referring expressions
• Indefinite NPs: a car• Definite NPs: the car• Pronouns: it• Demonstratives: this, that• One-anaphora: one
Next question: what do a referring expression refers to?
Types of referents
• Ex: According to John, Bob bought Sue a BMW, and Sue bought Bob a Honda.– But that turned out to be a lie. (speech act)– But that was false. (proposition)– That caused Bob to become rather poor.
(event)– That caused them both to become rather
poor. (combination of events)
Inferrables
• Explicitly evoked in the text: John bought a car.
• Inferrables: inferrentially related to an evoked entity.– Whole-part: I almost bought a BMW today,
but a door had a dent and the engine seemed noisy.
– The results of action: Mix the flour and water, kneed the dough until smooth.
– …
Discontinuous sets
• Plural references may refer to entities that have been evoked separately.
• Ex:– John has an Acura, and Mary has a Mazda.
They drive them all the time. (pairwise reading)
Generics
• Generic references: individual generic
• Ex: I saw six BMWs today. They are the coolest cars.
John He
Refer (evoke) Refer (access)
Corefer
Constraints and preferences for reference resolution
• Constraints (filters):– Agreement: number, person, gender– Syntax: reflexives– Semantics: selectional restrictions
• Preferences:– Salience– Parallelism– Verb semantics
Agreement
• Number: – (1) John bought a BMW. – (2a) It is great.– (2b) They are great.– (2c) ??They are red.
• Person:– (1) John and I have BMWs.– (2a) We love them. – (2b) They love them.
Agreement (cont)
• Gender: she, he, it.– (1) John looked at the new ship.– (2) She was beautiful.
– (1’) Mary looked at the new ship.– (2) She was beautiful.
Syntactic constraints
• Reflexives and personal pronouns.– John bought himself a car.– John bought him a car.
– John wrapped a blanket around himself.– John wrapped a blanket around him.
Semantic constraints
• Selectional restrictions– (1) John parked his car in the garage.– (2a) He had driven it around for hours.– (2b) It is very messy, with old bike and car
parts lying around everywhere.
– (1) John parked his Acura in downtown Beverly Hills.
– (2) It is very messy, with old bikes and car parts lying around everywhere.
Preferences in pronoun interpretation
• Saliency:– Recency– Grammatical role– Repeated Mention
• Parallelism
• Verb semantics
Saliency
• Recency: – John has an Integra. …Bill has a BMW. Mary likes to
drive it.
• Grammatical role:– John went the dealership with Bill. He bought a car.
• Repeated mention:– John needed a car. He decided to get a BMW. Bill
went to the dealership with him. He bought one.
Parallelism
• Mary went with Sue to the Acura dealership. Sally went with her to the Mazda dealership.
Verb semantics
• John telephoned Bill. He lost the pamphlet on BMWs.
• John seized the pamphlet to Bill. He loves reading about cars.
• The car dealer admired John. He knows Acuras inside and out.
Thematic roles or world knowledge?
criticized
impressed
passed
Constraints and preferences for reference resolution
• Hard-and-fast constraints (filters):– Agreement: number, person, case, gender– Syntax: reflexives– Semantics: selectional restrictions
• Preferences:– Saliency: recency, thematic roles, repeated
mention– Parallelism– Verb semantics: thematic roles or world knowledge
Algorithms for pronoun resolution
• Heuristics approaches:– Lappin & Leass (1994)– Hobbs (1978)– Centering Theory (Grosz, Joshi, Weinstein
1995, and various)
• Machine learning approaches
Lappin & Leass 1994
• A heuristic approach.
• Use agreement and syntactic constraints.
• Represent preferences (saliency, parallelism) with weights.
• Not using: selectional restrictions, verb semantics, world knowledge.
Salience factors and weights
• Sentence recency: 100
• Subject: 80• Existential position: 70
– There is a car ….• Direct object: 50• Indirect object: 40
• Non-adv: 50– Inside his car, John …..
• Head noun of max NP: 80– The manual for the car is …
The algorithm
• Start with an empty set of referents.
• Process each sentence– For each referring expression
• Calculate the salience value of the expression.• If it could be merged with existing referents
then choose the referent with the highest saliency value
else add it as a new referent.
• Update the value of the corresponding referent.
– Cut the values of all the referents by half.
An example• John saw a beautiful Acura at the dealership.
Rec Subj Obj Non-adv
Head noun
Total
John 100 80 50 80 310
Acura 100 50 50 80 280
dealership
100 50 80 230
Before moving on to the 2nd sentence
Referent Referring expressions
Value
John {John} 155
Acura {Acura} 140
dealership {dealership} 115
Handling “He”
• He showed it to Bob.• The value of “He” is 310
Referent Referring expressions
Value
John {John} 155
Acura {Acura} 140
dealership {dealership} 115
After adding “he”
• He showed it to Bob.
Referent Referring expressions
Value
John {John, he} 465
Acura {Acura} 140
dealership {dealership} 115
Handling “it”
• He showed it to Bob.• The salience value of “it” is 280.• Two new factors:
– Role parallelism: 35– Cataphora (??): -175
Referent Expressions Value
John {John, he} 465
Acura {Acura} 140
dealership {dealership} 115
After adding “it”
• He showed it to Bob.• The salience value of “it” is 280.• Two new factors:
– Role parallelism: 35– Cataphora (??): -175
Referent Expressions Value
John {John, he} 465
Acura {Acura, it} 140+280+35=455
dealership {dealership} 115
Handling “Bob”
• He showed it to Bob.• The salience value of “Bob” is 270.
Referent Expressions Value
John {John, he} 465
Acura {Acura, it} 455
dealership {dealership} 115
After adding “Bob”
• He showed it to Bob.• The salience value of “Bob” is 270.
Referent Expressions value
John {John, he} 465
Acura {Acura, it} 455
Bob {Bob} 270
dealership {dealership} 115
Moving on to the 3rd sentence
• He bought it.
Referent Expressions value
John {John, he} 232.5
Acura {Acura, it} 227.5
Bob {Bob} 135
dealership {dealership} 57.5
He (John) bought it (Acura).
Core of the algorithm
• For each referring expression– Calculate the saliency value, x.– Collect all the referents that comply with
agreement and syntactic constraints.– If the set is not empty, choose the one with
the highest salience value, and increase the reference value by x.
– If the set is empty, add a new referent to the discourse model, and set its value to x.
Algorithms for reference resolution
• Heuristics approaches:– Lappin & Leass (1994)– Hobbs (1978)– Centering Theory (Grosz, Joshi, Weinstein
1995, and various)
• Machine learning approaches
Summary of reference resolution
• Some terms: referents, referring expression• Discourse model• Types of referring expression• Types of referents• Constraints and preference for reference
resolution• Some algorithms for reference resolution