50
Maartje ter Hoeve [email protected] @maartjeterhoeve 1 Automatic Summarization Lecture Information Retrieval, DS 8th of March, 2019

Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve [email protected] @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

  • Upload
    others

  • View
    6

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �1

Automatic Summarization Lecture Information Retrieval, DS

8th of March, 2019

Page 2: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �2

Content

1. What is summarization?

2. What can summaries look like?

3. Evaluation

4. Today’s focus: Single Document Summarization (text & multimodal)

Page 3: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �3

Content

1. What is summarization?

2. What can summaries look like?

3. Evaluation

4. Today’s focus: Single Document Summarization (text & multimodal)

Disclaimer! The field is way too big to

cover everything in one lecture,

so we will just touch

upon a selection of different work!

Page 4: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �4

Content

1. What is summarization?

2. What can summaries look like?

3. Evaluation

4. Today’s focus: Single Document Summarization (text & multimodal)

Page 5: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �5

What is summarization?

Page 6: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �6

What is summarization?

Task: Find the important bits in

all the data and return these

Page 7: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �7

What is summarization?

Task: Find the important bits in

all the data and return these

Page 8: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �8

Why would you be interested in this?

Page 9: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �9

Why would you be interested in this?

Academically

Can we learn a smaller representation

that still captures our input?

Practical standpoint

Countless examples where amount of

information available is too much to

manually digest.

Law

Medicines

Police

News

Research

Page 10: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �10

Content

1. What is summarization?

2. What can summaries look like?

3. Evaluation

4. Today’s focus: Single Document Summarization (text & multimodal)

Page 11: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �11

What can summaries look like?

Extractive summarization Abstractive summarization

Page 12: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �12

Extractive Summarization

Page 13: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �13

Extractive Summarization

Line 1Line 2Line 3Line 4Line 5Line 6Line 7Line 8Line 9

Page 14: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �14

Extractive Summarization

Line 1Line 2Line 3Line 4Line 5Line 6Line 7Line 8Line 9

Page 15: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �15

Extractive Summarization

Line 1Line 2Line 3Line 4Line 5Line 6Line 7Line 8Line 9

Page 16: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �16

Extractive Summarization

Line 1Line 2Line 3Line 4Line 5Line 6Line 7Line 8Line 9

Page 17: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �17

Abstractive Summarization

Line 1Line 2Line 3Line 4Line 5Line 6Line 7Line 8

Line 9

New line 1

New line 2

New line 3

Page 18: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �18

Focus of this lecture

Single Document Summarization

• Text

• Multimodal

Page 19: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �19

Content

1. What is summarization?

2. What can summaries look like?

3. Evaluation

4. Today’s focus: Single Document Summarization (text & multimodal)

Page 20: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �20

Evaluation - what makes a summary ‘good’?

Page 21: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �21

ROUGELin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.” In Text Summarization Branches Out: Proceedings of the ACL-04 Workshop, edited by Stan Szpakowicz Marie-Francine Moens, 74–81. Barcelona, Spain: Association for Computational Linguistics, 2004.

ROUGE: Recall-Oriented Understudy for Gisting Evaluation

ROUGE is a metric to evaluate textual summaries.

Page 22: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �22

ROUGELin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.” In Text Summarization Branches Out: Proceedings of the ACL-04 Workshop, edited by Stan Szpakowicz Marie-Francine Moens, 74–81. Barcelona, Spain: Association for Computational Linguistics, 2004.

Predicted summary

ROUGE computes the quality of a

summary, by comparing the number of

overlapping n-grams in the predicted

summary and the reference summary.

Reference summary

ROUGE is a metric to compute the quality of

a textual summary, by comparing the number

of overlapping n-grams in the predicted

summary and the reference summary.

Page 23: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �23

ROUGELin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.” In Text Summarization Branches Out: Proceedings of the ACL-04 Workshop, edited by Stan Szpakowicz Marie-Francine Moens, 74–81. Barcelona, Spain: Association for Computational Linguistics, 2004.

Predicted summary

ROUGE computes the quality of a

summary, by comparing the number of

overlapping n-grams in the predicted

summary and the reference summary.

Reference summary

ROUGE is a metric to compute the quality of

a textual summary, by comparing the number

of overlapping n-grams in the predicted

summary and the reference summary.

Page 24: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �24

ROUGELin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.” In Text Summarization Branches Out: Proceedings of the ACL-04 Workshop, edited by Stan Szpakowicz Marie-Francine Moens, 74–81. Barcelona, Spain: Association for Computational Linguistics, 2004.

Predicted summary

ROUGE computes the quality of a

summary, by comparing the number of

overlapping n-grams in the predicted

summary and the reference summary.

# Unigrams: 22

# Bigrams: 21

Reference summary

ROUGE is a metric to compute the quality of

a textual summary, by comparing the number

of overlapping n-grams in the predicted

summary and the reference summary.

# Unigrams: 27

# Bigrams: 26

Page 25: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �25

ROUGELin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.” In Text Summarization Branches Out: Proceedings of the ACL-04 Workshop, edited by Stan Szpakowicz Marie-Francine Moens, 74–81. Barcelona, Spain: Association for Computational Linguistics, 2004.

Predicted summary

ROUGE computes the quality of a

summary, by comparing the number of

overlapping n-grams in the predicted

summary and the reference summary.

# Overlapping Unigrams: 21

# Overlapping Bigrams: 18

Reference summary

ROUGE is a metric to compute the quality of

a textual summary, by comparing the number

of overlapping n-grams in the predicted

summary and the reference summary.

Page 26: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �26

ROUGELin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.” In Text Summarization Branches Out: Proceedings of the ACL-04 Workshop, edited by Stan Szpakowicz Marie-Francine Moens, 74–81. Barcelona, Spain: Association for Computational Linguistics, 2004.

ROUGE − NRecall =# overlapping n-grams

# n-grams in reference summary

ROUGE − NPrecision =# overlapping n-grams

# n-grams in predicted summary

ROUGE − NF =(1 + β2) × ROUGE − NRecall × ROUGE − NPrecision

ROUGE − NRecall + β2 × ROUGE − NPrecision

Page 27: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �27

ROUGELin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.” In Text Summarization Branches Out: Proceedings of the ACL-04 Workshop, edited by Stan Szpakowicz Marie-Francine Moens, 74–81. Barcelona, Spain: Association for Computational Linguistics, 2004.

ROUGE − 1Recall =# overlapping unigrams

# unigrams in reference summary=

2127

= 0.78

ROUGE − 1Precision =# overlapping unigrams

# unigrams in predicted summary=

2122

= 0.95

ROUGE − 1F =(1 + β2) × ROUGE − 1Recall × ROUGE − 1Precision

ROUGE − 1Recall + β2 × ROUGE − 1Precision= 2 ×

0.78 × 0.950.78 + 0.95

= 0.86

We use β = 1

Page 28: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �28

ROUGELin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.” In Text Summarization Branches Out: Proceedings of the ACL-04 Workshop, edited by Stan Szpakowicz Marie-Francine Moens, 74–81. Barcelona, Spain: Association for Computational Linguistics, 2004.

ROUGE − 1Recall =# overlapping unigrams

# unigrams in reference summary=

2127

= 0.78

ROUGE − 1Precision =# overlapping unigrams

# unigrams in predicted summary=

2122

= 0.95

ROUGE − 1F =(1 + β2) × ROUGE − 1Recall × ROUGE − 1Precision

ROUGE − 1Recall + β2 × ROUGE − 1Precision= 2 ×

0.78 × 0.950.78 + 0.95

= 0.86

Same computation for Rouge-2,

but use the bigram counts

We use β = 1

Page 29: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �29

ROUGELin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.” In Text Summarization Branches Out: Proceedings of the ACL-04 Workshop, edited by Stan Szpakowicz Marie-Francine Moens, 74–81. Barcelona, Spain: Association for Computational Linguistics, 2004.

ROUGE-L

Do the same, but for longest common subsequence.

(Use the union of LCS for multiple sentences.)

Multiple references

Use argmax of all ROUGE-N scores.

Page 30: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �30

ROUGELin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.” In Text Summarization Branches Out: Proceedings of the ACL-04 Workshop, edited by Stan Szpakowicz Marie-Francine Moens, 74–81. Barcelona, Spain: Association for Computational Linguistics, 2004.

Correlation with Human Judgement

Okay?

People are not totally convinced and often perform a human evaluation as well.

Page 31: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �31

Content

1. What is summarization?

2. What can summaries look like?

3. Evaluation

4. Today’s focus: Single Document Summarization (text & multimodal)

Page 32: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �32

Single Document Summarization - TextAbstractive summarizationExtractive summarization

• Historical methods

• Binary labeling strategy

• Reinforcement learning approach

• …

x1 x2 x3 xT

Encoder

Attention

Context vector

yt-1 yt

Decoder

Page 33: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �33

Single Document Summarization - TextAbstractive summarizationExtractive summarization

• Historical methods

• Binary labeling strategy

• Reinforcement learning approach

• …

Disclaimer! The field is way too big to

cover everything in one lecture,

so we will just touch

upon a selection of different work!

x1 x2 x3 xT

Encoder

Attention

Context vector

yt-1 yt

Decoder

Page 34: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �34

Extractive Single Document Summarization - Text (1)Erkan, Gunes, and Dragomir R. Radev. “LexRank: Graph-Based Lexical Centrality as Salience in Text Summarization.” Journal of Artificial Intelligence Research 22 (December 1, 2004): 457–79. https://doi.org/10.1613/jair.1523.

Historical methods - LexRank as example

Make a graph from the document. Each sentence is a node.

Compute similarity between sentences. These scores are the edges.

Use PageRank to rank these sentences.

Select the highest ranking sentences as the summary.

Unsupervised!

Page 35: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �35

Extractive Single Document Summarization - Text (2)Nallapati, Ramesh, Feifei Zhai, and Bowen Zhou. “SummaRuNNer: A Recurrent Neural Network Based Sequence Model for Extractive Summarization of Documents.” ArXiv:1611.04230 [Cs], November 13, 2016. http://arxiv.org/abs/1611.04230.

Binary labeling strategy

Classify each sentence as either

‘belongs to summary’ or as ‘does not

belong to summary’.

Page 36: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �36

Extractive Single Document Summarization - Text (3)Dong, Yue, Yikang Shen, Eric Crawford, Herke van Hoof, and Jackie Chi Kit Cheung. “BanditSum: Extractive Summarization as a Contextual Bandit.” In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 3739–3748. Brussels, Belgium: Association for Computational Linguistics, 2018. http://www.aclweb.org/anthology/D18-1409.

Reinforcement Learning Strategy

Each document is a context, combinations of sentences in the document are

actions.

Learn a policy to perform the best action - get the best summary.

This method optimises directly for ROUGE!

Page 37: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �37

Abstractive Single Document Summarization - Text (1)Nallapati, Ramesh, Bowen Zhou, Cicero Nogueira dos santos, Caglar Gulcehre, and Bing Xiang. “Abstractive Text Summarization Using Sequence-to-Sequence RNNs and Beyond.” ArXiv:1602.06023 [Cs], February 18, 2016. http://arxiv.org/abs/1602.06023.

Main idea / contributions

• Introduce encoder / decoder structure

for summarisation.

• Add tags to the encoder.

Page 38: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �38

Abstractive Single Document Summarization - Text (1)Nallapati, Ramesh, Bowen Zhou, Cicero Nogueira dos santos, Caglar Gulcehre, and Bing Xiang. “Abstractive Text Summarization Using Sequence-to-Sequence RNNs and Beyond.” ArXiv:1602.06023 [Cs], February 18, 2016. http://arxiv.org/abs/1602.06023.

Main idea / contributions

• Introduce encoder / decoder structure

for summarisation.

• Add tags to the encoder.

• Introduce pointing probability, for

unknown words.

Page 39: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �39

Abstractive Single Document Summarization - Text (1)Nallapati, Ramesh, Bowen Zhou, Cicero Nogueira dos santos, Caglar Gulcehre, and Bing Xiang. “Abstractive Text Summarization Using Sequence-to-Sequence RNNs and Beyond.” ArXiv:1602.06023 [Cs], February 18, 2016. http://arxiv.org/abs/1602.06023.

Main idea / contributions

• Introduce encoder / decoder structure

for summarisation.

• Add tags to the encoder.

• Introduce pointing probability, for

unknown words.

• Introduce hierarchical attention

mechanism.

Page 40: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �40

Abstractive Single Document Summarization - Text (2)See, Abigail, Peter J. Liu, and Christopher D. Manning. “Get To The Point: Summarization with Pointer-Generator Networks.” ArXiv:1704.04368 [Cs], April 14, 2017. http://arxiv.org/abs/1704.04368.

Main idea / contributions

• Make use of pointer network, that

can copy words from the input at

any time - less unknown words.

• Introduce coverage mechanism

that keeps track of previously paid

attention.

Page 41: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �41

Abstractive Single Document Summarization - Text

Challenges

Questionable grammaticality.

Not very abstractive - Next method addresses this.

Page 42: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �42

Abstractive Single Document Summarization - Text (3)Narayan, Shashi, Shay B. Cohen, and Mirella Lapata. “Don’t Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization.” In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 1797–1807. Brussels, Belgium: Association for Computational Linguistics, 2018. http://www.aclweb.org/anthology/D18-1206.

Main idea / contributions

• Use convolutions, to capture the hierarchical

structure of the text.

• Add topic information, to force the summary

into a certain direction.

• They introduce a new dataset!

Page 43: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �43

Single Document Summarization - Multimodal

Page 44: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �44

Multimodal Summarization with Multimodal OutputZhu, Junnan, Haoran Li, Tianshang Liu, Yu Zhou, Jiajun Zhang, and Chengqing Zong. “MSMO: Multimodal Summarization with Multimodal Output.” In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 4154–4164. Brussels, Belgium: Association for Computational Linguistics, 2018. http://www.aclweb.org/anthology/D18-1448.

Page 45: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �45

Multimodal Summarization with Multimodal OutputZhu, Junnan, Haoran Li, Tianshang Liu, Yu Zhou, Jiajun Zhang, and Chengqing Zong. “MSMO: Multimodal Summarization with Multimodal Output.” In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 4154–4164. Brussels, Belgium: Association for Computational Linguistics, 2018. http://www.aclweb.org/anthology/D18-1448.

Page 46: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �46

Summary

• Automatic summarization has the goal to automatically find the important bits in all

the data and return these.

Page 47: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �47

Summary

• Automatic summarization has the goal to automatically find the important bits in all

the data and return these.

• We can summarise different types of data. In this lecture we have focussed on text and

we have briefly touched upon multimodal summarization.

Page 48: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �48

Summary

• Automatic summarization has the goal to automatically find the important bits in all

the data and return these.

• We can summarise different types of data. In this lecture we have focussed on text and

we have briefly touched upon multimodal summarization.

• Summarization techniques can be divided into extractive methods and abstractive

methods.

Page 49: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �49

Summary

• Automatic summarization has the goal to automatically find the important bits in all

the data and return these.

• We can summarise different types of data. In this lecture we have focussed on text and

we have briefly touched upon multimodal summarization.

• Summarization techniques can be divided into extractive methods and abstractive

methods.

• ROUGE and Human Evaluation are used to evaluate the produced summaries.

Page 50: Automatic Summarization - Maartje ter Hoeve · Maartje ter Hoeve m.a.terhoeve@uva.nl @maartjeterhoeve 24 ROUGE Lin, Chin-Yew. “ROUGE: A Package for Automatic Evaluation of Summaries.”

Maartje ter Hoeve [email protected] @maartjeterhoeve �50

Questions?

Slides are available at maartjeth.github.io/#talks