
Getting the structure right for word alignment: LEAF

Alexander Fraser and Daniel Marcu

Presenter Qin Gao

Problem

IBM Models have a 1-N assumption

Solutions

A sophisticated generative story
Generative estimation of parameters

Additional solution

Decompose the model into components
Semi-supervised training

Result

Significant improvement on BLEU (AR-EN)

Quick summary

The generative story

Source words:
Head words: link to zero or more non-head words (same side)
Non-head words: linked from one head word (same side)
Deleted words: no link on the source side

Target words:
Head words: link to zero or more non-head words (same side)
Non-head words: linked from one head word (same side)
Spurious words: no link on the target side

Minimal translational correspondence
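To make these word roles concrete, here is a minimal sketch of the alignment structure as I understand it from the slides; the class and field names (Role, Word, Alignment, head_links) are my own, not from the paper.

```python
from dataclasses import dataclass, field
from enum import Enum

class Role(Enum):
    HEAD = "head"          # may link to zero or more non-head words on its side
    NON_HEAD = "non-head"  # linked from exactly one head word on its side
    UNLINKED = "unlinked"  # "deleted" (source side) or "spurious" (target side)

@dataclass
class Word:
    token: str
    role: Role
    head: int | None = None  # index of this word's head, for non-head words

@dataclass
class Alignment:
    source: list[Word]
    target: list[Word]
    # Head-to-head links across the two sides: the minimal translational
    # correspondence in this structure.
    head_links: list[tuple[int, int]] = field(default_factory=list)
```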

The generative story

(Each step was illustrated in the slides with a running example: source words A, B, C and target words X, Y, Z, with C(·) marking word classes.)

1a. Condition on the source word
1b. Determine the source word class
2a. Condition on the source word classes
2b. Determine the links between head words and non-head words
3a. Condition on the source head word
3b. Determine the target head word
4a. Condition on the source head word and the cept size
4b. Determine the target cept size
5a. Condition on the existing sentence length
5b. Determine the number of spurious target words
6a. Condition on the target word
6b. Determine the spurious words
7a. Condition on the target head word's class and the source word
7b. Determine the non-head words it links to
8a. Condition on the classes of the source and target head words
8b. Determine the position of the target head word
8c. Condition on the target word class
8d. Determine the positions of the non-head words
9. Fill the vacant positions uniformly
10. The real alignment
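The following toy sampler is my own illustration of the order in which these decisions are made; every distribution in it is a hand-coded placeholder, not the parameterization from the paper.

```python
import random

def word_class(word):
    # Steps 1a/1b: condition on the source word, determine its class
    # (here: a trivial two-class scheme based on word length).
    return "content" if len(word) > 3 else "function"

def sample_target(source, seed=0):
    rng = random.Random(seed)
    classes = [word_class(w) for w in source]

    # Steps 2a/2b: conditioned on the classes, link head words to non-head
    # words; for simplicity every source word is its own head (singleton cepts).
    src_heads = list(range(len(source)))

    target, links = [], []
    for i in src_heads:
        # Steps 3a/3b: conditioned on the source head word, pick a target
        # head word (a toy "translation": just uppercase the source word).
        tgt_head = source[i].upper()
        # Steps 4a/4b: pick the target cept size; size 0 deletes the source word.
        cept_size = rng.choice([0, 1]) if classes[i] == "function" else rng.choice([1, 2])
        if cept_size == 0:
            continue
        # Steps 7a/7b: attach non-head words to the target head word.
        cept = [tgt_head] + [tgt_head.lower()] * (cept_size - 1)
        links.append((i, len(target)))
        target.extend(cept)

    # Steps 5a/5b and 6a/6b: conditioned on the length so far, add spurious words.
    target.extend(["uh"] * rng.randint(0, 1))

    # Steps 8a-8d and 9 would place every word into a position (head words
    # first, non-heads next, spurious words filling vacancies uniformly);
    # this sketch simply keeps generation order.
    return target, links

print(sample_target(["the", "black", "cat"]))
```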

Unsupervised parameter estimation

Bootstrap using HMM alignments in two directions:
Use the intersection to determine head words
Use the 1-N alignment to determine target cepts
Use the M-1 alignment to determine source cepts
Could be infeasible
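A small sketch of this bootstrapping step, assuming the two directional HMM alignments have been normalized to a common (source_idx, target_idx) pair convention; the function and variable names are hypothetical.

```python
def bootstrap_structure(f2e_links, e2f_links):
    """f2e_links: source->target (1-N) HMM alignment, e2f_links: the
    target->source (M-1) one, both as sets of (source_idx, target_idx)."""
    # Intersection of the two directions -> candidate head-word links.
    heads = set(f2e_links) & set(e2f_links)
    # The 1-N direction suggests the target cept of each source head word.
    target_cepts = {i: sorted(j for (i2, j) in f2e_links if i2 == i)
                    for (i, _) in heads}
    # The M-1 direction suggests the source cept of each target head word.
    source_cepts = {j: sorted(i for (i, j2) in e2f_links if j2 == j)
                    for (_, j) in heads}
    return heads, target_cepts, source_cepts

f2e = {(0, 0), (0, 1), (1, 2)}
e2f = {(0, 0), (1, 2), (2, 2)}
print(bootstrap_structure(f2e, e2f))
```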

Training: Similar to model 3/4/5

From an initial alignment (it is not clear how this is obtained), apply one of seven operators to generate new alignments:

move a French non-head word to a new head;
move an English non-head word to a new head;
swap the heads of two French non-head words;
swap the heads of two English non-head words;
swap the English head-word links of two French head words;
link an English word to a French word, making new head words;
unlink an English and a French head word.

All alignments that can be generated by one of these operators are called neighbors of the alignment.

Training (hill climbing; see the sketch below):
If there is a better alignment in the neighborhood, update the current alignment
Continue until no better alignment can be found
Collect counts from the final neighborhood
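A minimal sketch of that loop, assuming a neighbors function that applies the seven operators and a score function giving the model probability (both hypothetical here).

```python
def hill_climb(alignment, neighbors, score):
    """Repeatedly move to the best neighbor until no neighbor improves."""
    while True:
        candidates = list(neighbors(alignment))
        best = max(candidates, key=score, default=alignment)
        if score(best) <= score(alignment):
            # No better alignment in the neighborhood: counts for EM are
            # collected from this final neighborhood, and the search stops.
            return alignment
        alignment = best

# Toy check: "alignments" are integers, neighbors are +/-1, score peaks at 5.
print(hill_climb(0, lambda a: [a - 1, a + 1], lambda a: -(a - 5) ** 2))  # -> 5
```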

Semi-supervised training:
Decompose the components of the large generative formula and treat them as features in a log-linear model, together with other features
Use the EMD (EM-Discriminative) algorithm
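As a rough sketch of how the decomposed components combine in a log-linear model (all names here are illustrative; the discriminative weight update of EMD is only indicated in a comment):

```python
def log_linear_score(features, weights, f_sent, e_sent, alignment):
    """Each decomposed component of the generative formula becomes a feature
    function h(f, e, a), mixed with additional features via learned weights."""
    return sum(w * h(f_sent, e_sent, alignment)
               for h, w in zip(features, weights))

def decode(candidate_alignments, features, weights, f_sent, e_sent):
    # EMD (roughly) alternates an E-style step that realigns under the current
    # weights with a discriminative step that re-tunes the weights against
    # labeled data; this helper only shows the scoring/decoding half.
    return max(candidate_alignments,
               key=lambda a: log_linear_score(features, weights, f_sent, e_sent, a))
```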

Experiment

First, a very weird operation: they fully link the alignments from ALL systems and then compare the performance

Training/Test Set

Experiments

French/English: phrase-based
Arabic/English: hierarchical (Chiang 2005)
Baseline: GIZA++ Model 4, union
Baseline discriminative: only using Model 4 components as features

Conclusion (mine)

The new structural features are useful in discriminative training
No evidence that the generative model is superior to Model 4

Unclear points

Are the F-scores "biased"?
No BLEU score is given for unsupervised LEAF
They used features in addition to the LEAF features; where does the contribution come from?