
Kees van Deemter and Matthew Stone

Formal Issues in Natural Language Generation

Lecture 5
Stone, Doran, Webber, Bleam & Palmer

GRE and surface realization

Arguably, GRE uses a grammar.
– Parameters such as the preference order on properties reflect knowledge of how to communicate effectively.
– Decisions about usefulness or completeness of a referring expression reflect beliefs about utterance interpretation.

Maybe this is a good idea for NLG generally.

GRE and surface realization

But so far, we’ve treated GRE as outputting semantics:

referent: furniture886
type: desk
status: definite
color: brown
origin: sweden

GRE and surface realization

We also need to link this up with surface form:

the brown Swedish desk

Note: not ?the Swedish brown desk

Today’s initial observations

It’s hard to do realization on its own: mapping from semantics to surface structure.

It’s easy to combine GRE and realization, because GRE is grammatical reasoning! (if you have a good representation for syntax)

Why it’s hard to do realization

A pathological grammar of adjective order:

NP → the N(w)
N(w) → w N(w′)  if w is an adjective and w R w′
N(w) → w        if w is a noun

Syntax with this grammar

Derivation of the example "the brown Swedish desk":

NP → the N(brown)
   → the brown N(Swedish)
   → the brown Swedish N(desk)
   → the brown Swedish desk

Requires: brown R Swedish, Swedish R desk

Realization, formally

You start with k properties. Each property can be realized lexically.
(assume: one noun, many adjectives; not that it’s easy to enforce this)

Realization solution: an NP which realizes each property exactly once.

Quick formal analysis

View the problem graph-theoretically:
– the k words correspond to vertices in a graph
– R is a graph on the k words
– surface structure is a Hamiltonian path through R (one which visits each vertex exactly once)

This is a famous NP-complete problem, so surface realization itself is intractable!
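To make the reduction concrete, here is a brute-force sketch in Python; the word list, the relation R, and the function name are illustrative, not from the lecture. It enumerates all k! orderings and keeps just those the pathological grammar licenses: Hamiltonian paths through R that end at the noun.

from itertools import permutations

# (w, w2) in R means w may immediately precede w2.
R = {("brown", "Swedish"), ("Swedish", "desk")}

def realizations(words, noun, R):
    """Yield every surface order the pathological grammar licenses."""
    for order in permutations(words):
        if order[-1] != noun:
            continue  # N(w) -> w applies only when w is a noun, so the noun comes last
        if all(pair in R for pair in zip(order, order[1:])):
            yield "the " + " ".join(order)

print(list(realizations(["brown", "Swedish", "desk"], "desk", R)))
# ['the brown Swedish desk']

Brute force costs O(k!); the point of the NP-completeness result is that, as far as anyone knows, no algorithm does essentially better in the worst case.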

Moral of the example

Semantics underdetermines syntactic relations. Here, semantics underdetermines the syntactic relations of the adjectives to one another and to the head.

Searching for the correspondence is hard. See also Brew 1992, Koller and Striegnitz 2002.

Today’s initial observations

It’s hard to do realization on its own: mapping from semantics to surface structure.

It’s easy to combine GRE and realization, because GRE is grammatical reasoning! (if you have a good representation for syntax)

Syntactic processing for GRE

Lexicalization: steps of grammatical derivation correspond to meaningful choices in NLG.
E.g., steps of the grammar are synched with steps of adding a property to a description.

Syntactic processing for GRE

Key ideas: lexicalization, plus
– flat dependency structure (adjectives modify the noun)
– hierarchical representation of word order

[Tree for "the desk": an NP dominating "the", open modifier slots N(color), N(origin), N(size), N(material), and the head noun "desk"]

Syntactic processing for GRE

Other syntactic lexical entries:

[Entry: Adj "Swedish", attaching at N(origin)]
[Entry: Adj "brown", attaching at N(color)]

Describing syntactic combination

Operation of combination 1: Substitution

[Diagram: the open NP start node + the lexical tree for "the desk" = the NP tree for "the desk"; the entry plugs into the open NP slot]

Describing syntactic combination

Operation of combination 2: Sister adjunction

[Diagram: the NP tree for "the desk" + the entry Adj "brown" = the tree for "the brown desk"; the Adj is added as a daughter at N(color)]

Abstracting syntax

Tree rewriting:
– Each lexical item is associated with a structure.
– You have a starting structure.
– You have ways of combining two structures together.
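As a concrete illustration, here is a minimal Python sketch of the scheme; the Node encoding and the function names are my assumptions, not the lecture’s formalism. Substitution plugs an entry into an open slot; sister adjunction adds an entry as a new daughter.

class Node:
    def __init__(self, label, children=None, word=None):
        self.label = label             # e.g. "NP", "N(color)", "Adj"
        self.children = children or []
        self.word = word               # lexical anchor, if any

def substitute(tree, label, entry):
    """Combination 1: plug an entry in at the first open node with this label."""
    for i, child in enumerate(tree.children):
        if child.label == label and not child.children and child.word is None:
            tree.children[i] = entry
            return True
        if substitute(child, label, entry):
            return True
    return False

def sister_adjoin(tree, label, entry):
    """Combination 2: add an entry as a new daughter of the node with this label."""
    if tree.label == label:
        tree.children.append(entry)
        return True
    return any(sister_adjoin(child, label, entry) for child in tree.children)

def surface(tree):
    """Read the words off the tree, left to right."""
    words = [tree.word] if tree.word else []
    for child in tree.children:
        words.extend(surface(child))
    return words

# Starting structure: an open NP slot.
start = Node("ROOT", [Node("NP")])

# Lexical tree for "the desk", with ordered slots for modifiers.
desk = Node("NP", [
    Node("Det", word="the"),
    Node("N(color)"), Node("N(origin)"), Node("N(size)"), Node("N(material)"),
    Node("N", word="desk"),
])

substitute(start, "NP", desk)                                   # combination 1
sister_adjoin(start, "N(origin)", Node("Adj", word="Swedish"))  # combination 2
sister_adjoin(start, "N(color)", Node("Adj", word="brown"))
print(" ".join(surface(start)))  # the brown Swedish desk

Because the modifier slots are already ordered in the starting structure, the surface string comes out right even though "Swedish" is adjoined before "brown".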

Abstracting syntax

Derivation tree: records elements and how they are combined.

[Derivation tree: root "the desk", with daughters "brown" (sister-adjoined at color) and "Swedish" (sister-adjoined at origin)]

An extended incremental algorithm

• r = individual to be described
• P = lexicon of entries, in preference order
  – P is an individual entry
  – sem(P) is a property (a set of entities from the context)
  – syn(P) is a syntactic element
• L = surface syntax of description

Extended incremental algorithm

L := NP
C := Domain
For each P ∈ P do:
    If r ∈ sem(P) & C ⊈ sem(P)
    then do
        L := add(syn(P), L)
        C := C ∩ sem(P)
        If C = {r} then return L
Return failure
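As a minimal executable sketch of this loop in Python, assuming sem(P) is given as a set of individuals and add() combines syn(P) with the growing tree, as in the earlier tree-rewriting sketch; none of these names come from the lecture:

def incremental_describe(r, lexicon, domain, L, add):
    C = set(domain)                        # current set of candidate referents
    for P in lexicon:                      # lexicon is in preference order
        if r in P.sem and not C <= P.sem:  # true of r, and rules out a distractor
            L = add(P.syn, L)              # realize the property in the tree
            C &= P.sem                     # update the interpretation
            if C == {r}:                   # the description now identifies r
                return L
    return None                            # failure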

Observations

Why use tree-rewriting, rather than, e.g., CFG derivation?

NP → the N(w)
N(w) → w N(w′)  if w is an adjective and w R w′
N(w) → w        if w is a noun

CFG derivation forces you to select properties in the surface word order.

Observations

Tree-rewriting frees word order from choice order.

[Diagrams: starting from the NP tree for "the desk", sister-adjoin Adj "brown" at N(color), then Adj "Swedish" at N(origin); the result realizes "the brown Swedish desk"]

Observations

Tree-rewriting frees word order from choice order.

[Diagrams: the same NP results when Adj "Swedish" is adjoined at N(origin) first and Adj "brown" at N(color) second; either choice order yields "the brown Swedish desk"]

This is reflected in the derivation tree.

Derivation tree: records elements and how they are combined.

[Derivation tree: root "the desk", with daughters "brown" (sister-adjoined at color) and "Swedish" (sister-adjoined at origin)]

Formal results

Logical completeness: if there’s a flat derivation tree for an NP that identifies referent r, then the incremental algorithm finds it.

But:
– sensible combinations of properties may not yield surface NPs;
– hierarchical derivation trees may require lookahead in the usefulness check.

Formal results

Computational complexity: nothing changes; we just add properties, one after another…

Now, though, we’re choosing specific lexical entries.

[Trees: two candidate NPs, identical except for the departure modifier: one with Adj "3:35" at N(departure), the other with Adj "15:35"; both have the head "express", N "Trenton" at N(destination), and a slot N(stops)]

Maybe these lexical items express the same property…

• Use Adj "3:35" in a 12-hour time context
• Use Adj "15:35" in a 24-hour time context

What motivates these choices?

[Entries: Adj "3:35" attaching at N(departure); Adj "15:35" attaching at N(departure)]

• P = lexicon of entries, in preference order
  – P is an individual entry
  – sem(P) is a property (a set of entities from the context)
  – syn(P) is a syntactic element
  – prags(P) is a test which the context must satisfy for the entry to be appropriate

Need to extend grammar again

For example:

  syn: [Adj "15:35", attaching at N(departure)]
  sem: departure(x, 1535)
  prags: twentyfourhourtime
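One way such an entry might be encoded, as a Python sketch; the field names follow the slide, but the context representation and the sem extension are invented for illustration.

from dataclasses import dataclass
from typing import Any, Callable, Set

@dataclass
class Entry:
    syn: Any                       # e.g. the tree [Adj "15:35" @ N(departure)]
    sem: Set[Any]                  # extension of the property, here departure(x, 1535)
    prags: Callable[[dict], bool]  # test the context must satisfy

entry_1535 = Entry(
    syn='Adj "15:35" @ N(departure)',
    sem={"train886"},                                   # hypothetical individual
    prags=lambda ctx: ctx.get("time_format") == "24h",  # twentyfourhourtime
)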

Extended incremental algorithm

L := NP
C := Domain
For each P ∈ P do:
    If r ∈ sem(P) & C ⊈ sem(P) & prags(P) is true
    then do
        L := add(syn(P), L)
        C := C ∩ sem(P)
        If C = {r} then return L
Return failure
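With entries encoded as sketched above, the executable version changes in just one condition:

def incremental_describe_prags(r, lexicon, domain, context, L, add):
    C = set(domain)
    for P in lexicon:                      # in preference order
        if r in P.sem and not C <= P.sem and P.prags(context):
            L = add(P.syn, L)
            C &= P.sem
            if C == {r}:
                return L
    return None                            # failure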

Discussion: what does this entry do?

  syn: [NP "it"]
  sem: thing(x)
  prags: in-focus(x)

Suggestion: find best value

Given:
– a set of entries that combine syntactically with L in the same way,
– related by semantic generality and pragmatic specificity,
– the current distractors:

Take the entries that remove the most distractors.
Of those, take the most semantically general.
Of those, take the most pragmatically specific.

Extended incremental algorithm

L := NP
C := Domain
Repeat:
    Choices := { P : add(syn(P), L) applies at the next node
                     & r ∈ sem(P) & prags(P) is true }
    P := find-best-value(Choices)
    L := add(syn(P), L)
    C := C ∩ sem(P)
    If C = {r} then return L
Return failure
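A Python sketch of find-best-value and this loop; the numeric generality and specificity scores and the addable() test are stand-ins for the slide’s orderings, not anything the lecture defines.

def find_best_value(choices, C):
    """Most distractors removed; then most general; then most pragmatically specific."""
    return max(choices,
               key=lambda P: (len(C - P.sem), P.generality, P.specificity),
               default=None)

def describe_best_first(r, lexicon, domain, context, L, add, addable):
    C = set(domain)
    remaining = list(lexicon)              # copy, so chosen entries can be retired
    while True:
        choices = [P for P in remaining
                   if addable(P.syn, L) and r in P.sem and P.prags(context)]
        P = find_best_value(choices, C)
        if P is None:
            return None                    # failure: no applicable entry is left
        remaining.remove(P)                # retire P so the sketch terminates
        L = add(P.syn, L)
        C &= P.sem
        if C == {r}:
            return L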

What is generation anyway?

Generation is intentional (or rational) action; that’s why Grice’s maxims apply, for example.

– You have a goal.
– You build a plan to achieve it (& achieve it economically, in a recognizable way).
– You carry out the plan.

In GRE…

– The goal is for the hearer to know the identity of r (in general, a goal g).
– The plan will be to utter some NP U such that the interpretation of U identifies {r} (in general, C_U ⊆ C_g).
– Carrying out the plan means realizing this utterance.

In other words

GRE amounts to a process of deliberation.

– Adding a property to L incrementally is like committing to an action.
– These commitments are called intentions.
– Incrementality is characteristic of intentions, though in general intentions are open to revision.

Note: this connects with belief-desire-intention (BDI) models of bounded rationality.

GRE as (BDI) rational agency

L := NP                            // initial plan
C := Domain                        // interpretation
while (P := FindBest(P, C, L)) {   // deliberation
    L := add(syn(P), L)            // adopt a new intention
    C := C ∩ sem(P)                // update the interpretation
    if C = {r} return L            // goal satisfied
}
fail

NLG as (BDI) rational agency

L := X
C := Initial Interpretation
while (P := FindBest(P, C, L)) {
    L := AddSyntax(syn(P), L)
    C := AddInterpretation(sem(P), C)
    if GoalSatisfied(C) return L
}
fail

Conclusions for NLG researchers

It’s worth asking (and answering) formal questions about NLG.

– Questions of logical completeness: can a generator express everything it ought to?
– Questions of computational complexity: is the cost of a generation algorithm worth the results?

Conclusions for linguists

NLG offers a precise perspective on questions of language use. For example, what’s the best way of communicating some message?

NLG, as opposed to other perspectives, gives more complete, smaller-scale models.

Conclusions for AI in general

NLG does force us to characterize and implement representations & inference for practical interactive systems.

– Good motivation for computational semantics.
– Meaty problems like logical form equivalence.
– Many connections and possibilities for implementation (graphs, CSPs, circuit optimization, data mining, …).

Open Problems

• Sets and salience in REs.
• Generating parallel REs.
• Theoretical and empirical measures of quality/utility for REs.
• Avoiding ambiguity in REs.

Any problem in RE generalizes to one in NLG.

Followup information

Course web page: http://www.itri.brighton.ac.uk/home/Kees.van.Deemter/esslli-notes.html

– downloadable papers
– final lecture notes (by Monday August 26)
– papers we’ve talked about
– links (recent/upcoming events, siggen, sigsem)
