
Quasi-Synchronous Grammars

Based on key observations in MT:

Translated sentences often have some isomorphic syntactic structure, but not usually in entirety.

The strictness of the isomorphism may vary across words or syntactic rules.

Key idea: Unlike some synchronous grammars (e.g. SCFG, which is more strict and rigid), QG defines a monolingual grammar for the target tree, “inspired” by the source tree.

Quasi-Synchronous Grammars

In other words, we model the generation of the target tree, influenced by the source tree (and their alignment).

QA can be thought of as extremely free monolingual translation.

The linkage between question and answer trees in QA is looser than in MT, which gives QG an even bigger edge.

Model

Works on labeled dependency parse trees.

Learns the hidden structure (the alignment between the Q and A trees) by summing out ALL possible alignments.

One particular alignment tells us both the syntactic configurations and the word-to-word semantic correspondences

An example…

[Figure: a question (Q) dependency parse tree, an answer (A) dependency parse tree, and one alignment between them. Node labels shown: Bush (NNP, person), met (VBD), French (JJ, location), president (NN), Jacques Chirac (NNP, person), leader (NN), France (NNP, location). Edge labels shown: root, subj, obj, det, of, with.]

P(root | root)

P(noNE | noNE)

P(VB | VBD)

Our model makes local Markov assumptions to allow efficient computation via dynamic programming (details in the paper):

Given its parent, a word is independent of all other words (including siblings).
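The effect of this assumption can be checked on a toy problem. The sketch below is not the model from the paper: the tree, the number of answer nodes, and the `local_score` function are all made up for illustration. It only shows that, when each question word's alignment depends on nothing but its parent's alignment, the sum over all alignments can be computed bottom-up instead of by enumeration.

```python
from functools import lru_cache
from itertools import product

# Toy question tree (hypothetical): node 0 is the root.
children = {0: [1, 2], 1: [], 2: [3], 3: []}
n_answer = 3  # hypothetical number of answer nodes a question word may align to

def local_score(q, a, parent_a):
    """Made-up local score for aligning question node q to answer node a,
    given that q's parent is aligned to parent_a (None for the root)."""
    base = 1.0 + ((q * 7 + a * 3) % 5)
    if parent_a is None:
        return base
    return base / (1.0 + abs(a - parent_a))

def brute_force():
    """Enumerate every alignment and add up the products of local scores."""
    parent = {c: p for p, cs in children.items() for c in cs}
    nodes = sorted(children)
    total = 0.0
    for align in product(range(n_answer), repeat=len(nodes)):
        score = 1.0
        for q in nodes:
            pa = align[parent[q]] if q in parent else None
            score *= local_score(q, align[q], pa)
        total += score
    return total

@lru_cache(maxsize=None)
def below(q, a):
    """Total score of all alignments of q's descendants, given q aligned to a."""
    result = 1.0
    for c in children[q]:
        result *= sum(local_score(c, a2, a) * below(c, a2) for a2 in range(n_answer))
    return result

def dynamic_program():
    """Same total as brute_force, computed one tree node at a time."""
    return sum(local_score(0, a, None) * below(0, a) for a in range(n_answer))
```

Both functions return the same total, but `brute_force` visits every one of the `n_answer ** 4` alignments while `dynamic_program` does a small amount of work per (node, alignment) pair.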


P(subj | parent-child)

P(qword | person)

P(WP | NNP)


P(obj | grandparent-child)

P(noNE | noNE)

P(NN | NN)


P(det | same-word)

P(noNE | noNE)

P(DT | NN)


P(of | child-parent)

P(location | location)

P(NNP | JJ)

6 types of syntactic configurations (same as [D. Smith & Eisner ’06])

Parent-child

Same-word

Grandparent-child

Child-parent

Siblings

C-command
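As a rough illustration of what these labels mean, the sketch below classifies a pair of answer-tree nodes: the node aligned to a question parent and the node aligned to one of its children. The toy answer tree is only loosely modeled on the example sentence and its shape is an assumption, the function names are hypothetical, and the c-command test (the parent of one node is an ancestor of the other) is an assumed definition rather than one stated on the slides.

```python
def ancestors(node, parent):
    """Yield the proper ancestors of node, nearest first."""
    node = parent.get(node)
    while node is not None:
        yield node
        node = parent.get(node)

def configuration(a_parent, a_child, parent):
    """Classify how the answer nodes aligned to a question parent (a_parent)
    and to its question child (a_child) relate inside the answer tree."""
    if a_parent == a_child:
        return "same-word"
    if parent.get(a_child) == a_parent:
        return "parent-child"
    if parent.get(a_parent) == a_child:
        return "child-parent"
    if parent.get(parent.get(a_child)) == a_parent:
        return "grandparent-child"
    pp, pc = parent.get(a_parent), parent.get(a_child)
    if pp is not None and pp == pc:
        return "siblings"
    # Assumed definition: the parent of one node is an ancestor of the other.
    if (pp is not None and pp in ancestors(a_child, parent)) or \
       (pc is not None and pc in ancestors(a_parent, parent)):
        return "c-command"
    return "other"

# Toy answer tree (assumed shape): each node maps to its parent, None for the root.
answer_parent = {
    "met": None,
    "Bush": "met",
    "Jacques Chirac": "met",
    "president": "Jacques Chirac",
    "French": "president",
}
```

For instance, `configuration("met", "Bush", answer_parent)` falls into the parent-child case, while a pair that matches none of the six tests is reported as `"other"`.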

Modeling alignment

Base model

P(of | child-parent)

P(location | location)

P(NNP | JJ)


Modeling alignment cont.

Base model

Log-linear model: lexical-semantic features from WordNet (identity, hypernym, synonym, entailment, etc.)

Mixture model
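One way to picture the mixture is as a linear interpolation of two word distributions. The sketch below assumes that form; the function names, feature values, weights, and the coefficient `lam` are all hypothetical and are not taken from the slides or the paper.

```python
import math

def log_linear(features, weights, candidates):
    """Softmax over candidate words; features[c] maps feature names to values."""
    scores = {c: sum(weights.get(f, 0.0) * v for f, v in features[c].items())
              for c in candidates}
    z = sum(math.exp(s) for s in scores.values())
    return {c: math.exp(s) / z for c, s in scores.items()}

def mixture(p_base, p_loglin, lam):
    """Interpolate two distributions with mixture coefficient lam in [0, 1]."""
    return {c: lam * p_base[c] + (1.0 - lam) * p_loglin[c] for c in p_base}

# Toy setup: candidate question words for one answer word.
candidates = ["leader", "France", "who"]
features = {"leader": {"hypernym": 1.0}, "France": {}, "who": {}}  # made-up feature firings
weights = {"identity": 2.0, "synonym": 1.5, "hypernym": 1.0}       # made-up weights
p_base = {c: 1.0 / len(candidates) for c in candidates}            # stand-in for the base model

p_mixed = mixture(p_base, log_linear(features, weights, candidates), lam=0.5)
```

Because both inputs are proper distributions, the interpolated result also sums to one for any `lam` between 0 and 1.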

Parameter estimation

Things to be learnt:

Multinomial distributions in the base model

Log-linear model feature weights

Mixture coefficient

Training involves summing out hidden structures, so the objective is non-convex.

Solved using conditional Expectation-Maximization
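The full conditional EM procedure is not spelled out on the slides. As a much simpler stand-in, the sketch below runs ordinary EM for a single mixture coefficient between two fixed component probabilities. It is a toy, with a hypothetical function name and made-up inputs, meant only to show the E-step/M-step pattern and the non-decreasing likelihood.

```python
import math

def em_mixture_coefficient(p1, p2, lam=0.5, iterations=20):
    """Fit lam in the likelihood prod_i (lam * p1[i] + (1 - lam) * p2[i]),
    keeping both component probabilities fixed."""
    history = []
    for _ in range(iterations):
        # E-step: posterior probability that component 1 produced each item.
        resp = [lam * a / (lam * a + (1.0 - lam) * b) for a, b in zip(p1, p2)]
        # M-step: the new coefficient is the average responsibility.
        lam = sum(resp) / len(resp)
        # Track the log-likelihood under the updated coefficient.
        history.append(sum(math.log(lam * a + (1.0 - lam) * b)
                           for a, b in zip(p1, p2)))
    return lam, history

# Made-up per-item probabilities under the two components.
lam_hat, loglik = em_mixture_coefficient([0.6, 0.1, 0.7, 0.2], [0.2, 0.3, 0.1, 0.4])
```

Each iteration can only keep or raise the log-likelihood, but with hidden structure in the full model the point EM converges to depends on the starting value.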

Experiments

TREC 8-12 data set for training

TREC 13 questions for development and testing

Candidate answer generation

For each question, we take all documents from the TREC doc pool and extract the sentences that contain at least one non-stop keyword from the question.

For computational reasons (parsing speed, etc.), we kept only answer sentences of at most 40 words.
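The filtering step described above can be sketched as follows. The stop-word list, the tokenizer, and the function names are placeholders chosen for illustration; the slides do not say which ones were actually used.

```python
import re

# Tiny illustrative stop-word list (an assumption, not the one used in the experiments).
STOP_WORDS = {"the", "is", "of", "who", "a", "an", "in", "with", "what"}

def tokens(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())

def candidate_sentences(question, sentences, max_words=40):
    """Keep sentences that share at least one non-stop keyword with the
    question and are no longer than max_words tokens."""
    keywords = {t for t in tokens(question) if t not in STOP_WORDS}
    kept = []
    for sentence in sentences:
        words = tokens(sentence)
        if len(words) <= max_words and keywords & set(words):
            kept.append(sentence)
    return kept
```

Note that exact keyword matching would not retrieve "Bush met with French president Jacques Chirac." for "Who is the leader of France?", since "French" and "France" are different tokens; a real pipeline would need some normalization for such cases.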

Dataset statistics

Manually labeled 100 questions for training. Total: 348 positive Q/A pairs

84 questions for dev. Total: 1415 Q/A pairs (3.1+, 17.1-)

100 questions for testing. Total: 1703 Q/A pairs (3.6+, 20.0-)

Automatically labeled another 2193 questions to create a noisy training set, for evaluating model robustness

Experiments cont.

Each question and answer sentence is tokenized, POS-tagged (MX-POST), parsed (MSTParser), and labeled with named-entity tags (IdentiFinder).

Baseline systems (replications)

[Cui et al. SIGIR ’05]

The algorithm behind one of the best-performing systems in TREC evaluations.

It uses a mutual information-inspired score computed over dependency trees and a single fixed alignment between them.

[Punyakanok et al. NLE ’04]

Measures the similarity between Q and A by computing tree edit distance.

Both baselines are high-performing, syntax-based, and the most straightforward to replicate.

We further enhanced the algorithms by augmenting them with WordNet.

Results

Mean Average Precision

Mean Reciprocal Rank of Top 1

Statistically significantly better than the 2nd best score in each column

28.2% 23.9% 41.2% 30.3%
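For readers unfamiliar with the two metrics, here is a minimal sketch of how they are commonly computed from ranked candidate lists. Each list holds one boolean per candidate answer, in ranked order, marking whether it is correct; the function names and the toy rankings are illustrative, and the exact evaluation script used for the numbers above is not shown on the slides.

```python
def average_precision(ranked_labels):
    """Mean of precision@k over the ranks k at which a correct answer appears."""
    hits, total = 0, 0.0
    for rank, is_correct in enumerate(ranked_labels, start=1):
        if is_correct:
            hits += 1
            total += hits / rank
    return total / hits if hits else 0.0

def reciprocal_rank(ranked_labels):
    """1 / rank of the first correct answer, or 0 if there is none."""
    for rank, is_correct in enumerate(ranked_labels, start=1):
        if is_correct:
            return 1.0 / rank
    return 0.0

def mean(values):
    values = list(values)
    return sum(values) / len(values)

# Toy rankings for two questions.
rankings = [[True, False, True], [False, True, False]]
map_score = mean(average_precision(r) for r in rankings)
mrr_score = mean(reciprocal_rank(r) for r in rankings)
```

Both scores are averaged over questions, so a question with many candidates counts the same as one with few.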

Summing vs. Max

Switching back

Tree-edit CRFs
