62
Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook Chart-based RRG parsing using an automatically extracted RRG grammar with features David Arps & Tatiana Bladier & Laura Kallmeyer International Conference on Role and Reference Grammar 19 August 2019 University at Buffalo, The State University of New York 1 / 32

Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

  • Upload
    others

  • View
    7

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Chart-based RRG parsingusing an automatically extracted

RRG grammar with features

David Arps & Tatiana Bladier & Laura Kallmeyer

International Conference on Role and Reference Grammar19 August 2019

University at Buffalo, The State University of New York

1 / 32

Page 2: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Overview

Design of RRG Grammars

Automatic RRG Grammar Extraction

Parsing experiments

Issues

Summary & Outlook

2 / 32

Page 3: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Introduction

1 RRG annotated treebank:RRGBank [2],

2 extract RRG elementarytrees automatically,

3 use these elementarytrees for RRG parsing.

SENTENCE

CLAUSE

CORE

NP

PRO

it

V

looks

NUC

PP

COREP

NUCP

P

like

DEF-OP

a

N

NUCN

COREN

NP

holiday

3 / 32

Page 4: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Introduction

1 RRG annotated treebank:RRGBank [2],

2 extract RRG elementarytrees automatically,

3 use these elementarytrees for RRG parsing.

NP

PRO

it

SENTENCE

CLAUSE

CORE

NP↓ NUC

V

looks

PP↓

PP

COREP

NUCP

P

like

NP↓

NP∗

DEF-OP

a

NP

COREN

NUCN

N

holiday

3 / 32

Page 5: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Introduction

1 RRG annotated treebank:RRGBank [2],

2 extract RRG elementarytrees automatically,

3 use these elementarytrees for RRG parsing.

NP

PRO

it

SENTENCE

CLAUSE

CORE

NP↓ NUC

V

looks

PP↓

PP

COREP

NUCP

P

like

NP↓

NP∗

DEF-OP

a

NP

COREN

NUCN

N

holiday

3 / 32

Page 6: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Outline

Design of RRG Grammars

Automatic RRG Grammar Extraction

Parsing experiments

Issues

Summary & Outlook

4 / 32

Page 7: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

RRGbank

? Corpus of RRG annotated sentences [2]→ automatically converted from PennTreebank,→ manually checked and validated;

? 395 gold sentences, 1090 silverannotated sentences;

? RRGbank and RRG annotation tool:rrgbank.phil.hhu.de.

SENTENCE

CLAUSE

CORE

NP

COREN

NUCN

N

Japanese

NUC

V

said

CORE

CLM

to

NUC

AUX

be

NP

COREN

AP-PERI

A

heavy

NUCN

N

buyers

TNS-OP

were

V

NUC

CORE

CLAUSETNS

5 / 32

Page 8: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Removing crossing branchesSENTENCE

CLAUSE

CORE

NP

COREN

NUCN

N

Japanese

NUC

V

said

CORE

CLM

to

NUC

AUX

be

NP

COREN

AP-PERI

A

heavy

NUCN

N

buyers

TNS-OP

were

V

NUC

CORE

CLAUSETNS

SENTENCE

CLAUSE

CORE

NP

COREN

NUCN

N

Japanese

NUC

V

said

CORE

CLM

to

NUC

AUX

be

NP

COREN

AP-PERI

A

heavy

NUCN

N

buyers

TNS-OP [OP=CL]

were

V

NUC

CORE

CLAUSETNS? we transform the RRG structures to remove crossing branches,

? we mark the original position of the node with [op=cl],? original tree structure is easily recovered.

6 / 32

Page 9: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Removing crossing branchesSENTENCE

CLAUSE

CORE

NP

COREN

NUCN

N

Japanese

NUC

V

said

CORE

CLM

to

NUC

AUX

be

NP

COREN

AP-PERI

A

heavy

NUCN

N

buyers

TNS-OP

were

V

NUC

CORE

CLAUSETNS

SENTENCE

CLAUSE

CORE

NP

COREN

NUCN

N

Japanese

NUC

V

said

CORE

CLM

to

NUC

AUX

be

NP

COREN

AP-PERI

A

heavy

NUCN

N

buyers

TNS-OP [OP=CL]

were

V

NUC

CORE

CLAUSETNS? we transform the RRG structures to remove crossing branches,? we mark the original position of the node with [op=cl],

? original tree structure is easily recovered.

6 / 32

Page 10: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Removing crossing branchesSENTENCE

CLAUSE

CORE

NP

COREN

NUCN

N

Japanese

NUC

V

said

CORE

CLM

to

NUC

AUX

be

NP

COREN

AP-PERI

A

heavy

NUCN

N

buyers

TNS-OP

were

V

NUC

CORE

CLAUSETNS

SENTENCE

CLAUSE

CORE

NP

COREN

NUCN

N

Japanese

NUC

V

said

CORE

CLM

to

NUC

AUX

be

NP

COREN

AP-PERI

A

heavy

NUCN

N

buyers

TNS-OP [OP=CL]

were

V

NUC

CORE

CLAUSETNS? we transform the RRG structures to remove crossing branches,? we mark the original position of the node with [op=cl],? original tree structure is easily recovered.

6 / 32

Page 11: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Operator projection and periphery can be recovered

SENTENCE

CLAUSE

CORE

NP

COREN

NUCN

N

Japanese

NUC

V

said

CORE

CLM

to

NUC

AUX

be

NP

COREN

AP-PERI

A

heavy

NUCN

N

buyers

TNS-OP

were

V

NUC

CORE

CLAUSETNS

SENTENCE

CLAUSE

CORE

NP

COREN

NUCN

N

Japanese

NUC

V

said

CORECLM

to

NUC

AUX

be

NP

COREN

AP

PERIPHERY

heavy

NUCN

N

buyerswere

V

NUC

CORE

CLAUSETNS

7 / 32

Page 12: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Elementary trees in RRG Grammars

? We follow Kallmeyer et al. (2013) and Osswald & Kallmeyer(2018) [3, 4] in design of the elementary trees in our grammar.

? Three tree composition operations:→ substitution ( argument slot filling)→ wrapping substitution (displaced argument slot filling)→ sister adjunction ( adding operators and periphery elements);

? Such RRG grammars capture long-distance dependencies→ for example, WH-movement.

8 / 32

Page 13: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Elementary trees in RRG Grammars

? We follow Kallmeyer et al. (2013) and Osswald & Kallmeyer(2018) [3, 4] in design of the elementary trees in our grammar.

? Three tree composition operations:

→ substitution ( argument slot filling)→ wrapping substitution (displaced argument slot filling)→ sister adjunction ( adding operators and periphery elements);

? Such RRG grammars capture long-distance dependencies→ for example, WH-movement.

8 / 32

Page 14: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Elementary trees in RRG Grammars

? We follow Kallmeyer et al. (2013) and Osswald & Kallmeyer(2018) [3, 4] in design of the elementary trees in our grammar.

? Three tree composition operations:→ substitution ( argument slot filling)

→ wrapping substitution (displaced argument slot filling)→ sister adjunction ( adding operators and periphery elements);

? Such RRG grammars capture long-distance dependencies→ for example, WH-movement.

8 / 32

Page 15: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Elementary trees in RRG Grammars

? We follow Kallmeyer et al. (2013) and Osswald & Kallmeyer(2018) [3, 4] in design of the elementary trees in our grammar.

? Three tree composition operations:→ substitution ( argument slot filling)→ wrapping substitution (displaced argument slot filling)

→ sister adjunction ( adding operators and periphery elements);

? Such RRG grammars capture long-distance dependencies→ for example, WH-movement.

8 / 32

Page 16: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Elementary trees in RRG Grammars

? We follow Kallmeyer et al. (2013) and Osswald & Kallmeyer(2018) [3, 4] in design of the elementary trees in our grammar.

? Three tree composition operations:→ substitution ( argument slot filling)→ wrapping substitution (displaced argument slot filling)→ sister adjunction ( adding operators and periphery elements);

? Such RRG grammars capture long-distance dependencies→ for example, WH-movement.

8 / 32

Page 17: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Combination operations: Substitution and sister adjunction

NP

PRO

I

CORE∗

TNS-OP [OP=CL]

have

SENTENCE

CLAUSE

CORE

NP↓ NUC

V

felt

NP↓

CORE*N

AP-PERI

COREA

NUCA

A

many

NP

COREN

NUCN

N

aftershocks

Sentence: I have felt many aftershocks

9 / 32

Page 18: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Combination operations: Substitution and sister adjunction

NP

PRO

I

CORE∗

TNS-OP [OP=CL]

have

SENTENCE

CLAUSE

CORE

NP↓ NUC

V

felt

NP↓

CORE*N

AP-PERI

COREA

NUCA

A

many

NP

COREN

NUCN

N

aftershocks

Sentence: I have felt many aftershocks

9 / 32

Page 19: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Combination operations: Substitution and sister adjunction

NP

PRO

I

CORE∗

TNS-OP [OP=CL]

have

SENTENCE

CLAUSE

CORE

NP↓ NUC

V

felt

NP↓

CORE*N

AP-PERI

COREA

NUCA

A

many

NP

COREN

NUCN

N

aftershocks

Sentence: I have felt many aftershocks

9 / 32

Page 20: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Combination operations: Substitution and sister adjunction

NP

PRO

I

CORE∗

TNS-OP [OP=CL]

have

SENTENCE

CLAUSE

CORE

NP↓ NUC

V

felt

NP↓

CORE*N

AP-PERI

COREA

NUCA

A

many

NP

COREN

NUCN

N

aftershocks

Sentence: I have felt many aftershocks

9 / 32

Page 21: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Combination operations: Substitution and sister adjunction

NP

PRO

I

CORE∗

TNS-OP [OP=CL]

have

SENTENCE

CLAUSE

CORE

NP↓ NUC

V

felt

NP↓

CORE*N

AP-PERI

COREA

NUCA

A

many

NP

COREN

NUCN

N

aftershocks

Sentence: I have felt many aftershocks

9 / 32

Page 22: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Combination operations: Substitution and sister adjunction

NP

PRO

I

CORE∗

TNS-OP [OP=CL]

have

SENTENCE

CLAUSE

CORE

NP↓ NUC

V

felt

NP↓

CORE*N

AP-PERI

COREA

NUCA

A

many

NP

COREN

NUCN

N

aftershocks

Sentence: I have felt many aftershocks

9 / 32

Page 23: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Combination operations: Substitution and sister adjunction

NP

PRO

I

CORE∗

TNS-OP [OP=CL]

have

SENTENCE

CLAUSE

CORE

NP↓ NUC

V

felt

NP↓

CORE*N

AP-PERI

COREA

NUCA

A

many

NP

COREN

NUCN

N

aftershocks

Sentence: I have felt many aftershocks

9 / 32

Page 24: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Combination operations: Substitution and sister adjunction

NP

PRO

I

CORE∗

TNS-OP [OP=CL]

have

SENTENCE

CLAUSE

CORE

NP↓ NUC

V

felt

NP↓ CORE*N

AP-PERI

COREA

NUCA

A

many

NP

COREN

NUCN

N

aftershocks

Sentence: I have felt many aftershocks

9 / 32

Page 25: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Combination operations: Substitution and sister adjunction

NP

PRO

I

CORE∗

TNS-OP [OP=CL]

have

SENTENCE

CLAUSE

CORE

NP↓ NUC

V

felt

NP↓ CORE*N

AP-PERI

COREA

NUCA

A

many

NP

COREN

NUCN

N

aftershocks

Sentence: I have felt many aftershocks

9 / 32

Page 26: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Combination operations: Substitution and sister adjunction

SENTENCE

CLAUSE

CORE

NP

PRO

I felt

V

NUC

NP

AP-PERI

COREA

NUCA

A

many

NUCN

N

aftershocks

TNS-OP [OP=CL]

have

COREN

Sentence: I have felt many aftershocks

10 / 32

Page 27: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Combination operations: Wrapping substitution

CL

CL↓CO

NUC

V

think

RP

Asterix

TNS

does

CL

CL

CO

NUC

V

drank

RP

Obelix

PrCS

RP

What

CL

CL

CO

NUC

V

drank

RP

Obelix

CO

NUC

V

think

RP

Asterix

TNS

does

PrCS

RP

What

Sentence: What does Asterix think Obelix drank

11 / 32

Page 28: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Combination operations: Wrapping substitution

CL

CL↓CO

NUC

V

think

RP

Asterix

TNS

does

CL

CL

CO

NUC

V

drank

RP

Obelix

PrCS

RP

What

CL

CL

CO

NUC

V

drank

RP

Obelix

CO

NUC

V

think

RP

Asterix

TNS

does

PrCS

RP

What

Sentence: What does Asterix think Obelix drank

11 / 32

Page 29: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Combination operations: Wrapping substitution

CL

CL↓CO

NUC

V

think

RP

Asterix

TNS

does

CL

CL

CO

NUC

V

drank

RP

Obelix

PrCS

RP

What

CL

CL

CO

NUC

V

drank

RP

Obelix

CO

NUC

V

think

RP

Asterix

TNS

does

PrCS

RP

What

Sentence: What does Asterix think Obelix drank

11 / 32

Page 30: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Combination operations: Wrapping substitution

CL

CL↓CO

NUC

V

think

RP

Asterix

TNS

does

CL

CL

CO

NUC

V

drank

RP

Obelix

PrCS

RP

What

CL

CL

CO

NUC

V

drank

RP

Obelix

CO

NUC

V

think

RP

Asterix

TNS

does

PrCS

RP

What

Sentence: What does Asterix think Obelix drank

11 / 32

Page 31: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Elementary trees with features

RRGbank Extracted Elementary Trees

NP

DEF-OP

the

CORE-N

NUC-N

N

average placeholder

NP∗

DEF-OP

the

NP

CORE-N

NUC-N

N

average

[DEF -OPS [NP -]

][DEF +OPS [NP +]

][DEF +]

[OP=NP]

[NUC-O lOPS [NP -]

] [NUC-O rOPS [NP -]

]

? elementary trees are enhanced with features

→ edge features→ node features

12 / 32

Page 32: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Elementary trees with features

RRGbank Extracted Elementary Trees

NP

DEF-OP

the

CORE-N

NUC-N

N

average placeholder

NP∗

DEF-OP

the

NP

CORE-N

NUC-N

N

average

[DEF -OPS [NP -]

][DEF +OPS [NP +]

][DEF +]

[OP=NP]

[NUC-O lOPS [NP -]

] [NUC-O rOPS [NP -]

]

? elementary trees are enhanced with features→ edge features

→ node features

12 / 32

Page 33: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Elementary trees with features

RRGbank Extracted Elementary Trees

NP

DEF-OP

the

CORE-N

NUC-N

N

average placeholder

NP∗

DEF-OP

the

NP

CORE-N

NUC-N

N

average

[DEF -OPS [NP -]

][DEF +OPS [NP +]

][DEF +]

[OP=NP]

[NUC-O lOPS [NP -]

] [NUC-O rOPS [NP -]

]

? elementary trees are enhanced with features→ edge features→ node features

12 / 32

Page 34: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Edge featuresRRGbank Extracted Elementary Trees

NP

DEF-OP

the

CORE-N

NUC-N

N

average placeholder

NP∗

DEF-OP

the

NP

CORE-N

NUC-N

N

average

[DEF -OPS [NP -]

][DEF +OPS [NP +]

][DEF +]

[OP=NP]

[NUC-O lOPS [NP -]

] [NUC-O rOPS [NP -]

]

Left and right edge feature structures:

? unify adjacent structures in the derived tree? model ordering constraints? percolate upwards until phrasal nodes

13 / 32

Page 35: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Edge featuresRRGbank Extracted Elementary Trees

NP

DEF-OP

the

CORE-N

NUC-N

N

average placeholder

NP∗

DEF-OP

the

NP

CORE-N

NUC-N

N

average

[DEF -OPS [NP -]

][DEF +OPS [NP +]

][DEF +]

[OP=NP]

[NUC-O lOPS [NP -]

] [NUC-O rOPS [NP -]

]

Left and right edge feature structures:? unify adjacent structures in the derived tree

? model ordering constraints? percolate upwards until phrasal nodes

13 / 32

Page 36: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Edge featuresRRGbank Extracted Elementary Trees

NP

DEF-OP

the

CORE-N

NUC-N

N

average placeholder

NP∗

DEF-OP

the

NP

CORE-N

NUC-N

N

average

[DEF -OPS [NP -]

][DEF +OPS [NP +]

][DEF +]

[OP=NP]

[NUC-O lOPS [NP -]

] [NUC-O rOPS [NP -]

]

Left and right edge feature structures:? unify adjacent structures in the derived tree? model ordering constraints

? percolate upwards until phrasal nodes

13 / 32

Page 37: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Edge featuresRRGbank Extracted Elementary Trees

NP

DEF-OP

the

CORE-N

NUC-N

N

average placeholder

NP∗

DEF-OP

the

NP

CORE-N

NUC-N

N

average

[DEF -OPS [NP -]

][DEF +OPS [NP +]

][DEF +]

[OP=NP]

[NUC-O lOPS [NP -]

] [NUC-O rOPS [NP -]

]

Left and right edge feature structures:? unify adjacent structures in the derived tree? model ordering constraints? percolate upwards until phrasal nodes

13 / 32

Page 38: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Node featuresRRGbank Extracted Elementary Trees

NP

DEF-OP

the

CORE-N

NUC-N

N

average placeholder

NP∗

DEF-OP

the

NP

CORE-N

NUC-N

N

average

[DEF -OPS [NP -]

][DEF +OPS [NP +]

][DEF +]

[OP=NP]

[NUC-O lOPS [NP -]

] [NUC-O rOPS [NP -]

]

? One feature structure per node:• unify during tree composition• store syntactic or syn-sem interface information.

? Unification successful → accept parse tree

14 / 32

Page 39: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Outline

Design of RRG Grammars

Automatic RRG Grammar Extraction

Parsing experiments

Issues

Summary & Outlook

15 / 32

Page 40: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

RRG Grammar extraction algorithm (1)

? Elementary tree extraction inspired by Xia [6] algorithmfor induction of Tree-Adjoining Grammars.

? Top-down extraction of elementary trees.? Heuristics from head-modifier percolation tables.? We use RRG structures from RRGbank for automatic grammarinduction→ rrgbank.phil.hhu.de.

16 / 32

Page 41: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

RRG Grammar extraction algorithm (2)

ROOT

CLAUSE

CORE

NP

COREN

NUCN

DEF-OP

The

N

average

V

dropped

NUC NP

COREN

CD

27

NUCN

N

points

TNS-OP [OP=CL]

had

NP∗

DEF-OP[OP = NP,DEF = +]

The

NP

COREN

NUCN

N

average

CORE∗

TNS-OP[OP = CL,TNS = past]

had

ROOT

CLAUSE

CORE

NP↓ NUC

V

dropped

NP↓

COREN∗

CD

27

NP

COREN

NUCN

N

points

17 / 32

Page 42: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Outline

Design of RRG Grammars

Automatic RRG Grammar Extraction

Parsing experiments

Issues

Summary & Outlook

18 / 32

Page 43: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Chart-based RRG Parser TuLiPA

? input: set of elementary trees and sentences to parse;

? output: all derivations that can be derived by combining theelementary trees;

? standard CYK algorithm;? bottom-up, left-to-right traversion of the derived tree;? software: TuLiPA RRG parser [1]

(https://github.com/spetitjean/TuLiPA-frames)TuLiPA = Tübingen Linguistic Parsing Architecture;

? suitable for hand-crafted precision RRG grammars;? suitable for automatically extracted RRG grammars.

19 / 32

Page 44: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Chart-based RRG Parser TuLiPA

? input: set of elementary trees and sentences to parse;? output: all derivations that can be derived by combining theelementary trees;

? standard CYK algorithm;? bottom-up, left-to-right traversion of the derived tree;? software: TuLiPA RRG parser [1]

(https://github.com/spetitjean/TuLiPA-frames)TuLiPA = Tübingen Linguistic Parsing Architecture;

? suitable for hand-crafted precision RRG grammars;? suitable for automatically extracted RRG grammars.

19 / 32

Page 45: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Chart-based RRG Parser TuLiPA

? input: set of elementary trees and sentences to parse;? output: all derivations that can be derived by combining theelementary trees;

? standard CYK algorithm;

? bottom-up, left-to-right traversion of the derived tree;? software: TuLiPA RRG parser [1]

(https://github.com/spetitjean/TuLiPA-frames)TuLiPA = Tübingen Linguistic Parsing Architecture;

? suitable for hand-crafted precision RRG grammars;? suitable for automatically extracted RRG grammars.

19 / 32

Page 46: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Chart-based RRG Parser TuLiPA

? input: set of elementary trees and sentences to parse;? output: all derivations that can be derived by combining theelementary trees;

? standard CYK algorithm;? bottom-up, left-to-right traversion of the derived tree;

? software: TuLiPA RRG parser [1](https://github.com/spetitjean/TuLiPA-frames)TuLiPA = Tübingen Linguistic Parsing Architecture;

? suitable for hand-crafted precision RRG grammars;? suitable for automatically extracted RRG grammars.

19 / 32

Page 47: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Chart-based RRG Parser TuLiPA

? input: set of elementary trees and sentences to parse;? output: all derivations that can be derived by combining theelementary trees;

? standard CYK algorithm;? bottom-up, left-to-right traversion of the derived tree;? software: TuLiPA RRG parser [1](https://github.com/spetitjean/TuLiPA-frames)TuLiPA = Tübingen Linguistic Parsing Architecture;

? suitable for hand-crafted precision RRG grammars;? suitable for automatically extracted RRG grammars.

19 / 32

Page 48: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Chart-based RRG Parser TuLiPA

? input: set of elementary trees and sentences to parse;? output: all derivations that can be derived by combining theelementary trees;

? standard CYK algorithm;? bottom-up, left-to-right traversion of the derived tree;? software: TuLiPA RRG parser [1](https://github.com/spetitjean/TuLiPA-frames)TuLiPA = Tübingen Linguistic Parsing Architecture;

? suitable for hand-crafted precision RRG grammars;

? suitable for automatically extracted RRG grammars.

19 / 32

Page 49: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Chart-based RRG Parser TuLiPA

? input: set of elementary trees and sentences to parse;? output: all derivations that can be derived by combining theelementary trees;

? standard CYK algorithm;? bottom-up, left-to-right traversion of the derived tree;? software: TuLiPA RRG parser [1](https://github.com/spetitjean/TuLiPA-frames)TuLiPA = Tübingen Linguistic Parsing Architecture;

? suitable for hand-crafted precision RRG grammars;? suitable for automatically extracted RRG grammars.

19 / 32

Page 50: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Extracted RRG grammar

• removed punctuation• exhaustive parsing (i.e. not probabilistic, overgenerating a lot)

• 2 versions:1 no features2 edge features for operators model adjunction constraints

• do feature structures eliminate parse trees that contradictlinguistic intuitions?

20 / 32

Page 51: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Parsing experiments

Gold Grammar Silver Grammarw feats w/o feats w feats w/o feats

Sentences 395 1480avg. sentence length 6.1 8.0token-supertag pairs 1526 1497 6288 6044avg. number of parses 6.9 12.7 1166 2939

savings 45.1% 39.7%

features decrease number of results by ≈ 45%

21 / 32

Page 52: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Features restrict adjunction of operators and periphery (1)

SENTENCE

PP-PERI

COREP

NUCP

By

NP[NUC-O rOPS [NP -]

][NUC-O lOPS [NP -]

]COREN

CD

9:45

CLAUSE

CORE

NP [NUC-O rOPS [NP -]

][NUC-O lOPS [NP -]

][NUC-O lOPS [NP -]

]AP-PERI

COREA

NUCA

A

industrial

COREN

NUCN

N

average

. . .

NP∗

DEF-OP

the

[NUC-O lOPS [NP -]

][NUC-O lOPS [NP +]

]

Sentence: By 9:45, the industrial average had dropped 27 points.

22 / 32

Page 53: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Features restrict adjunction of operators and periphery (2)

SENTENCE

PP-PERI

COREP

NUCP

By

NP

COREN

CD

9:45

CLAUSE

CORE

NP

AP-PERI

COREA

NUCA

A

industrial

COREN

NUCN

N

average

. . .

NP

DEF-OP

the

Sentence: By 9:45, the industrial average had dropped 27 points.

23 / 32

Page 54: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Outline

Design of RRG Grammars

Automatic RRG Grammar Extraction

Parsing experiments

Issues

Summary & Outlook

24 / 32

Page 55: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Attachment ambiguitiesCORE*

PP-PERI

CORE_P

NUC_P

P

in

CORE↓

NP*

PP-PERI

CORE_P

NUC_P

P

in

NP↓

CORE_N*

PP-PERI

CORE_P

NUC_P

P

in

NP↓

CORE*

PP-PERI

CORE_P

NUC_P

P

in

CORE↓

CORE

NP

CORE-N

NUC-N

N

bank

X∗

PP-PERI

CORE_P

X↓NUC_P

P

in

NP∗

NP

CORE-N

NUC-N

N

Texas

syntactic information needed that might not be in RRGBank25 / 32

Page 56: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Size of the grammar

Gold Grammar Silver Grammarw feats w/o feats w feats w/o feats

Sentences 395 1480avg. sentence length 6.1 8.0token-supertag pairs 1526 1497 6288 6044avg. number of parses 6.9 12.7 1166 2939

the number of parses per sentence increases with the size of the grammar

26 / 32

Page 57: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Extraction of wrapping substitution treesCL

CL↓CO

NUC

V

think

RP

Asterix

TNS

does

CL

CL

CO

NUC

V

drank

RP

Obelix

PrCS

RP

What

CL

CL

CO

NUC

V

drank

RP

Obelix

CO

NUC

V

think

RP

Asterix

TNS

does

PrCS

RP

What

? discontinuous constituents are marked with traces in PTB;

? no special marking in RRGBank;

? transfer traces from PTB to RRG trees in RRGbank?27 / 32

Page 58: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Outline

Design of RRG Grammars

Automatic RRG Grammar Extraction

Parsing experiments

Issues

Summary & Outlook

28 / 32

Page 59: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Summary

? automatically extracted elementary trees from RRGBank;? experiments with exhaustive parsing of 395/1480 sentences;? parsing w/o edge features → too many results;? some edge features already rule out bad results.

29 / 32

Page 60: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Future Work

? adding more edge features = better results;? extract wrapping elementary trees;? 100s of results → not satisfying;? ambiguity and annotation/extraction mistakes have badconsequences;

? use unlexicalized elementary trees (= supertags);? probabilistic grammar and parsing→ A* parsing algorithm ParTAGe by Waszczuk (2017) [5];

? Web GUI.

30 / 32

Page 61: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

Design of RRG Grammars Automatic RRG Grammar Extraction Parsing experiments Issues Summary & Outlook

Thank you!

THANK YOU VERY MUCH FOR YOUR ATTENTION!

31 / 32

Page 62: Chart-based RRG parsing using an automatically extracted RRG …€¦ · CO NUC V think RP Asterix TNS does CL CL CO NUC V drank RP Obelix PrCS RP What CL CL CO NUC V drank RP Obelix

References

References I

Arps, D. and Petitjean, S. (2018). A Parser for LTAG and Frame Semantics. In chair), N. C. C., Choukri, K., Cieri, C.,Declerck, T., Goggi, S., Hasida, K., Isahara, H., Maegaard, B., Mariani, J., Mazo, H., Moreno, A., Odijk, J.,Piperidis, S., and Tokunaga, T., editors, Proceedings of the Eleventh International Conference on LanguageResources and Evaluation (LREC 2018), Miyazaki, Japan. European Language Resources Association (ELRA).

Bladier, T., van Cranenburgh, A., Evang, K., Kallmeyer, L., Möllemann, R., and Osswald, R. (2018). RRGbank: a Roleand Reference Grammar Corpus of Syntactic Structures Extracted from the Penn Treebank. In Proceedings ofthe 17th International Workshop on Treebanks and Linguistic Theories (TLT 2018), December 13–14, 2018,Oslo University, Norway, number 155, pages 5–16. Linköping University Electronic Press.

Kallmeyer, L., Osswald, R., and Van Valin, Jr., R. D. (2013). Tree Wrapping for Role and Reference Grammar. In Morrill,G. and Nederhof, M.-J., editors, Formal Grammar 2012/2013, volume 8036 of LNCS, pages 175–190. Springer.

Osswald, R. and Kallmeyer, L. (2018). Towards a formalization of role and reference grammar. In Kailuweit, R., Künkel,L., and Staudinger, E., editors, Applying and Expanding Role and Reference Grammar., pages 355–378.Albert-Ludwigs-Universität, Universitätsbibliothek. [NIHIN studies], Freiburg.

Waszczuk, J. (2017). Leveraging MWEs in practical TAG parsing: towards the best of the two worlds. PhD thesis.

Xia, F. (1999). Extracting tree adjoining grammars from bracketed corpora. In Proceedings of the 5th Natural LanguageProcessing Pacific Rim Symposium (NLPRS-99), pages 398–403.

32 / 32