31
Centering theory and its direct applications Lecture 2

Centering theory and its direct applications Lecture 2

  • View
    219

  • Download
    1

Embed Size (px)

Citation preview

Page 1: Centering theory and its direct applications Lecture 2

Centering theory and its direct applications

Lecture 2

Page 2: Centering theory and its direct applications Lecture 2

Some definitions

Discourse = coherent sequence of utterances

Several sentences following one another do not make a readable text

Defining specific computable measures of coherence is the goal of this seminar

Page 3: Centering theory and its direct applications Lecture 2

Centering theory ingredients

Deals with local coherenceWhat happens to the flow from

sentence to sentenceDoes not deal with global structuring

of the text (paragraphs/segments) Defines coherence as an estimate of

the processing load required to “understand” the text

Page 4: Centering theory and its direct applications Lecture 2

Processing load

Upon hearing a sentence a personCognitive effort to interpret the

expressions in the utteranceIntegrates the meaning of the

utterance with that of the previous sentence

Creates some expectations on what might come next

Page 5: Centering theory and its direct applications Lecture 2

Example

(1) John met his friend Mary today.

(2) He was surprised to see her.

(3) He thought she is still in Italy.

Form of referring expressions Anaphora needs to be resolved “Create” a discourse entity at first mention

with full noun phrase Creating expectations

Page 6: Centering theory and its direct applications Lecture 2

Creating and meeting expectations(1) a. John went to his favorite music store to buy a

piano. b. He had frequented the store for many years. c. He was excited that he could finally buy a piano. d. He arrived just as the store was closing for the day.

(2) a. John went to his favorite music store to buy a piano.

b. It was a store John had frequented for many years. c. He was excited that he could finally buy a piano. d. It was closing just as John arrived.

Page 7: Centering theory and its direct applications Lecture 2

Interpreting pronouns

a. Terry really goofs sometimes.

b. Yesterday was a beautiful day and he was excited about trying out his new sailboat.

c. He wanted Tony to join him on a sailing expedition.

d. He called him at 6am.

e. He was sick and furious at being woken up so early.

Page 8: Centering theory and its direct applications Lecture 2

Basic center definitions

Centers of an utteranceSet of entities serving to link that

utterance to the other utterances in the discourse segment that contains it

Not words or phrases themselvesSemantic interpretations of noun

phraes

Page 9: Centering theory and its direct applications Lecture 2

Types of centers

Forward looking centers An ordered set of entities What could we expect to hear about next Ordered by salience as determined by grammatical

function Subject > Indirect object > Object > Others

John gave the textbook to Mary. Cf = {John, Mary, textbook}

Preferred center Cp

The highest ranked forward looking center High expectation that the next utterance in the

segment will be about Cp

Page 10: Centering theory and its direct applications Lecture 2

Backward looking center

Single backward looking center, Cb (U)For each utterance other than the

segment-initial one The backward looking center of

utterance Un+1 connects with one of the forward looking centers of Un

Cb (U+1) is the most highly ranked element from Cf (Un) that is also realized in U+1

Page 11: Centering theory and its direct applications Lecture 2

Centering transitions ordering

Cb(Un+1)=Cb(Un) OR

Cb(Un)=[?]

Cb(Un+1) != Cb(Un)

Cb(Un+1) = Cp (Un+1) continue smooth-shift

Cb(Un+1) != Cp (Un+1) retain rough-shift

Page 12: Centering theory and its direct applications Lecture 2

Centering constraints

There is precisely one backward-looking center Cb(Un)

Cb(Un+1) is the highest-ranked element of Cf(Un) that is realized in Un+1

Page 13: Centering theory and its direct applications Lecture 2

Centering rules

If some element of Cf(Un) is realized as a pronoun in Un+1 then so is Cb(Un+1)

Transitions not equalcontinue > retain > smooth-shift >

rough-shift

Page 14: Centering theory and its direct applications Lecture 2

Centering analysis

Terry really goofs sometimes. Cf={Terry}, Cb=?, undef

Yesterday was a beautiful day and he was excited about trying out his new sailboat. Cf={Terry,sailboat}, Cb=Terry, continue

He wanted Tony to join him in a sailing expedition. Cf={Terry, Tony, expedition}, Cb=Terry, continue

He called him at 6am. Cf={Terry,Tony}, Cb=Terry, continue

Page 15: Centering theory and its direct applications Lecture 2

He called him at 6am. Cf={Terry,Tony}, Cb=Terry, continue

Tony was sick and furious at being woken up so early. Cf={Tony}, Cb=Tony, smooth shift

He told Terry to get lost and hung up. Cf={Tony,Terry}, Cb=Tony, continue

Of course, Terry hadn’t intended to upset Tony. Cf={Terry,Tony}, Cb = Tony, retain

Page 16: Centering theory and its direct applications Lecture 2

Rough shifts in evaluation of writing skills One of the graders of student essays in standardized tests is an

automatic program

ETS researchers have developed a number of applications that use natural language processing technologies to evaluate and score the writing abilities of test takers:

The CriterionSM Online Essay Evaluation Service automatically evaluates essay responses using e-rater and the Critique writing analysis tools.

E-rater® gives holistic scores for essays.

CritiqueTM provides real-time feedback about grammar, usage, mechanics and style, and organization and development.

C-raterTM offers automated analysis of conceptual information in short-answer, free responses.

Page 17: Centering theory and its direct applications Lecture 2

E-rater features

Syntactic variety Represented by features that quantify the

occurrence of clause types Clear transitions

Cue phrases in certain syntactic constructions

Existence of main and supporting points Appropriateness of the vocabulary content

of the essay What about local coherence?

Page 18: Centering theory and its direct applications Lecture 2

Ranking forward looking centers

Subject > Indirect object > Object > Others > Quantified indefinite subjects (people,

everyone) > Arbitrary plural pronominals

Page 19: Centering theory and its direct applications Lecture 2

Essay score model

Human score available E-rater prediction available Percentage of rough-shifts in each

essay: analysis done manually

Negative correlation between the human score and the percentage of rough-shifts

Page 20: Centering theory and its direct applications Lecture 2

Karamanis’07

Why are we reading this paper? Gives quite complete list of references on

later work on centering• Centering variants

Reminds that entity coherence is not the only factor in text flow

• We’ll be discussing rhetorical structure theory during the next class

Applications---can some aspects of the work be done differently/improved upon?

Page 21: Centering theory and its direct applications Lecture 2

Information ordering task

Given a set of sentences/clauses, what is the best presentation? Take a newspaper article and jumble the

sentences---the result will be much more difficult to read than the original

Criteria for deciding which of two orderings is better Centering would definitely be applicable

Summarization, question answering, generation

Page 22: Centering theory and its direct applications Lecture 2

Linear multi-factor regression Approximate the human score as a linear

function of the e-rater prediction and the percentage of rough-shifts

Adding rough shifts significantly improves the model of the score 0.5 improvement on 1—6 scale

How easy/difficult would it be to fully automate the rough-shift variable

Page 23: Centering theory and its direct applications Lecture 2

Centering variations

Continuity (NOCB=lack of continuity) Cf(Un) and Cf(Un+1) share at least one

element Coherence

Cb(Un) = Cb(Un+1) Salience

The Cb(U) = Cp(U) Coherence is more important than salience Cheapness (fulfilled expectations)

Cb (Un+1) = Cp(Un)

Page 24: Centering theory and its direct applications Lecture 2

GNOME corpus

20 descriptions of museum artifacts Split into finite unites (clauses) Semi-automatic centering annotation

Item 144 is a torc. Its present arrangement, twisted into three rings, may be a modern alteration; it should probably be a single ring, worn around the neck. The terminals are in the form of goats’ heads.

Page 25: Centering theory and its direct applications Lecture 2

Rhetorical coherence

Each text can be seen as a hierarchical tree structure

Different spans are related by some rhetorical relation Elaboration (adding more information) Contrast Sequence Purpose Summary etc

Page 26: Centering theory and its direct applications Lecture 2

Local rhetorical coherence

Applies only locally rather than on the text as a whole Signaled by cue phrases Contrast: but, however, on the other hand Continuation: and, then, later Consequence: because, in order to, so

These local rhetorical relations structure the text

When missing, entity coherence determines the flow 8 out of the 20 texts do not have any explicitly marked

rhetorical relations

Page 27: Centering theory and its direct applications Lecture 2

Joint centering and local rhetorical coherence In clauses directly marked for a rhetorical

relation Merge the Cf lists of the two clauses

Apply centering transitions on the resulting Cf list rather than the original

GNOME-RR contains 1.58 fewer CF lists compared to the original average number (8.35)

Page 28: Centering theory and its direct applications Lecture 2

Metrics of coherence

M.NOCB (no continuity) M.CHEAP (expectations not met) M.KP sum of the violations of

continuity, cheapness, coherence and salience

M. BFP seeks to maximize transitions according to Rule 2

Page 29: Centering theory and its direct applications Lecture 2

Experimental methodology

Gold-standard ordering The original order of the text (object

description, news article) Assume that other orderings are inferior

Classification error rate Percentage orderings that score better than

the gold-standard + 0.5*percentage of the orderings that score the same

Page 30: Centering theory and its direct applications Lecture 2

Results

NOCB gives best resultsSignificantly better than the other

metrics M.BFP is the second best metric

Adding the local rhetorical relations hurts performance---is this surprising?

Page 31: Centering theory and its direct applications Lecture 2

Reminders

Select topics you would like to present Should schedule next week now The second time you present one of the

goals will be to relate the papers with previous topics we have covered

Start thinking about the topic of your literature overview About 15 papers 5/6 pages Due Nov 12