98
Systems & Applications: Introduction Ling 573 NLP Systems and Applications April 1, 2013

Systems & Applications: Introduction

  • Upload
    carina

  • View
    28

  • Download
    0

Embed Size (px)

DESCRIPTION

Systems & Applications: Introduction. Ling 573 NLP Systems and Applications April 1, 2013. Roadmap. Motivation 573 Structure Question-Answering Shared Tasks. Motivation. Information retrieval is very powerful Search engines index and search enormous doc sets - PowerPoint PPT Presentation

Citation preview

Page 1: Systems & Applications: Introduction

Systems & Applications:Introduction

Ling 573NLP Systems and Applications

April 1, 2013

Page 2: Systems & Applications: Introduction

RoadmapMotivation

573 Structure

Question-Answering

Shared Tasks

Page 3: Systems & Applications: Introduction

MotivationInformation retrieval is very powerful

Search engines index and search enormous doc setsRetrieve billions of documents in tenths of seconds

Page 4: Systems & Applications: Introduction

MotivationInformation retrieval is very powerful

Search engines index and search enormous doc setsRetrieve billions of documents in tenths of seconds

But still limited!

Page 5: Systems & Applications: Introduction

MotivationInformation retrieval is very powerful

Search engines index and search enormous doc setsRetrieve billions of documents in tenths of seconds

But still limited!Technically – keyword search (mostly)

Page 6: Systems & Applications: Introduction

MotivationInformation retrieval is very powerful

Search engines index and search enormous doc setsRetrieve billions of documents in tenths of seconds

But still limited!Technically – keyword search (mostly)Conceptually

User seeks information Sometimes a web site or document

Page 7: Systems & Applications: Introduction

MotivationInformation retrieval is very powerful

Search engines index and search enormous doc setsRetrieve billions of documents in tenths of seconds

But still limited!Technically – keyword search (mostly)Conceptually

User seeks information Sometimes a web site or document Very often, the answer to a question

Page 8: Systems & Applications: Introduction

Why Question-Answering?People ask questions on the web

Page 9: Systems & Applications: Introduction

Why Question-Answering?People ask questions on the web

Web logs:Which English translation of the bible is used in official

Catholic liturgies?Who invented surf music?What are the seven wonders of the world?

Page 10: Systems & Applications: Introduction

Why Question-Answering?People ask questions on the web

Web logs:Which English translation of the bible is used in official

Catholic liturgies?Who invented surf music?What are the seven wonders of the world?

12-15% of queries

Page 11: Systems & Applications: Introduction

Why Question-Answering?People ask questions on the web

Web logs:Which English translation of the bible is used in official

Catholic liturgies?Who invented surf music?What are the seven wonders of the world?

12-15% of queries

Search sites (e.g., Google) beginning to includeCanonical factoids, esp. Wikipedia infobox data

Dates, conversions, birthdates

Page 12: Systems & Applications: Introduction

Why Question Answering?Answer sites proliferate:

Page 13: Systems & Applications: Introduction

Why Question Answering?Answer sites proliferate:

Top hit for ‘questions’ :

Page 14: Systems & Applications: Introduction

Why Question Answering?Answer sites proliferate:

Top hit for ‘questions’ : Ask.com

Page 15: Systems & Applications: Introduction

Why Question Answering?Answer sites proliferate:

Top hit for ‘questions’ : Ask.comAlso: Yahoo! Answers, wiki answers, Facebook,…

Collect and distribute human answers

Page 16: Systems & Applications: Introduction

Why Question Answering?Answer sites proliferate:

Top hit for ‘questions’ : Ask.comAlso: Yahoo! Answers, wiki answers, Facebook,…

Collect and distribute human answersDo I Need a Visa to Go to Japan?

Page 17: Systems & Applications: Introduction

Why Question Answering?Answer sites proliferate:

Top hit for ‘questions’ : Ask.comAlso: Yahoo! Answers, wiki answers, Facebook,…

Collect and distribute human answers Do I Need a Visa to Go to Japan?

eHow.comRules regarding travel between the United States and Japan

are governed by both countries. Entry requirements for Japan are contingent on the purpose and length of a traveler's visit.

Passport Requirements Japan requires all U.S. citizens provide a valid passport and a return

on "onward" ticket for entry into the country. Additionally, the United States requires a passport for all citizens wishing to enter or re-enter the country.

Page 18: Systems & Applications: Introduction

Search Engines & QAWho was the prime minister of Australia during the

Great Depression?

Page 19: Systems & Applications: Introduction

Search Engines & QAWho was the prime minister of Australia during the

Great Depression?Rank 1 snippet:

The conservative Prime Minister of Australia, Stanley Bruce

Page 20: Systems & Applications: Introduction

Search Engines & QAWho was the prime minister of Australia during the

Great Depression?Rank 1 snippet:

The conservative Prime Minister of Australia, Stanley Bruce

Wrong!Voted out just before the Depression

Page 21: Systems & Applications: Introduction

Perspectives on QATREC QA track (1999---)

Initially pure factoid questions, with fixed length answersBased on large collection of fixed documents (news)Increasing complexity: definitions, biographical info, etc

Single response

Page 22: Systems & Applications: Introduction

Perspectives on QATREC QA track (~1999---)

Initially pure factoid questions, with fixed length answersBased on large collection of fixed documents (news)Increasing complexity: definitions, biographical info, etc

Single response

Reading comprehension (Hirschman et al, 2000---)Think SAT/GRE

Short text or article (usually middle school level)Answer questions based on text

Also, ‘machine reading’

Page 23: Systems & Applications: Introduction

Perspectives on QATREC QA track (~1999---)

Initially pure factoid questions, with fixed length answersBased on large collection of fixed documents (news) Increasing complexity: definitions, biographical info, etc

Single response

Reading comprehension (Hirschman et al, 2000---) Think SAT/GRE

Short text or article (usually middle school level)Answer questions based on text

Also, ‘machine reading’And, of course, Jeopardy! and Watson

Page 24: Systems & Applications: Introduction

Natural Language Processing and QA

Rich testbed for NLP techniques:

Page 25: Systems & Applications: Introduction

Natural Language Processing and QA

Rich testbed for NLP techniques: Information retrieval

Page 26: Systems & Applications: Introduction

Natural Language Processing and QA

Rich testbed for NLP techniques: Information retrievalNamed Entity Recognition

Page 27: Systems & Applications: Introduction

Natural Language Processing and QA

Rich testbed for NLP techniques: Information retrievalNamed Entity RecognitionTagging

Page 28: Systems & Applications: Introduction

Natural Language Processing and QA

Rich testbed for NLP techniques: Information retrievalNamed Entity RecognitionTagging Information extraction

Page 29: Systems & Applications: Introduction

Natural Language Processing and QA

Rich testbed for NLP techniques: Information retrievalNamed Entity RecognitionTagging Information extractionWord sense disambiguation

Page 30: Systems & Applications: Introduction

Natural Language Processing and QA

Rich testbed for NLP techniques: Information retrievalNamed Entity RecognitionTagging Information extractionWord sense disambiguationParsing

Page 31: Systems & Applications: Introduction

Natural Language Processing and QA

Rich testbed for NLP techniques: Information retrievalNamed Entity RecognitionTagging Information extractionWord sense disambiguationParsingSemantics, etc..

Page 32: Systems & Applications: Introduction

Natural Language Processing and QA

Rich testbed for NLP techniques: Information retrievalNamed Entity RecognitionTagging Information extractionWord sense disambiguationParsingSemantics, etc.. Co-reference

Page 33: Systems & Applications: Introduction

Natural Language Processing and QA

Rich testbed for NLP techniques: Information retrievalNamed Entity RecognitionTagging Information extractionWord sense disambiguationParsingSemantics, etc.. Co-reference

Deep/shallow techniques; machine learning

Page 34: Systems & Applications: Introduction

573 StructureImplementation:

Page 35: Systems & Applications: Introduction

573 StructureImplementation:

Create a factoid QA system

Page 36: Systems & Applications: Introduction

573 StructureImplementation:

Create a factoid QA systemExtend existing software componentsDevelop, evaluate on standard data set

Page 37: Systems & Applications: Introduction

573 StructureImplementation:

Create a factoid QA systemExtend existing software componentsDevelop, evaluate on standard data set

Presentation:

Page 38: Systems & Applications: Introduction

573 StructureImplementation:

Create a factoid QA systemExtend existing software componentsDevelop, evaluate on standard data set

Presentation:Write a technical reportPresent plan, system, results in class

Page 39: Systems & Applications: Introduction

573 StructureImplementation:

Create a factoid QA systemExtend existing software componentsDevelop, evaluate on standard data set

Presentation:Write a technical reportPresent plan, system, results in classGive/receive feedback

Page 40: Systems & Applications: Introduction

Implementation: Deliverables

Complex system:Break into (relatively) manageable components Incremental progress, deadlines

Page 41: Systems & Applications: Introduction

Implementation: Deliverables

Complex system:Break into (relatively) manageable components Incremental progress, deadlines

Key components:D1: Setup

Page 42: Systems & Applications: Introduction

Implementation: Deliverables

Complex system:Break into (relatively) manageable components Incremental progress, deadlines

Key components:D1: SetupD2: Baseline system, Passage retrieval

Page 43: Systems & Applications: Introduction

Implementation: Deliverables

Complex system:Break into (relatively) manageable components Incremental progress, deadlines

Key components:D1: SetupD2: Baseline system, Passage retrievalD3: Question processing, classification

Page 44: Systems & Applications: Introduction

Implementation: Deliverables

Complex system:Break into (relatively) manageable components Incremental progress, deadlines

Key components:D1: SetupD2: Baseline system, Passage retrievalD3: Question processing, classificationD4: Answer processing, final results

Page 45: Systems & Applications: Introduction

Implementation: Deliverables

Complex system: Break into (relatively) manageable components Incremental progress, deadlines

Key components: D1: Setup D2: Baseline system, Passage retrieval D3: Question processing, classification D4: Answer processing, final results

Deadlines: Little slack in schedule; please keep to time Timing: ~12 hours week; sometimes higher

Page 46: Systems & Applications: Introduction

PresentationTechnical report:

Follow organization for scientific paperFormatting and Content

Page 47: Systems & Applications: Introduction

PresentationTechnical report:

Follow organization for scientific paperFormatting and Content

Presentations:10-15 minute oral presentation for deliverables

Page 48: Systems & Applications: Introduction

PresentationTechnical report:

Follow organization for scientific paperFormatting and Content

Presentations:10-15 minute oral presentation for deliverablesExplain goals, methodology, success, issues

Page 49: Systems & Applications: Introduction

PresentationTechnical report:

Follow organization for scientific paperFormatting and Content

Presentations:10-15 minute oral presentation for deliverablesExplain goals, methodology, success, issuesCritique each others’ work

Page 50: Systems & Applications: Introduction

PresentationTechnical report:

Follow organization for scientific paperFormatting and Content

Presentations:10-15 minute oral presentation for deliverablesExplain goals, methodology, success, issuesCritique each others’ workAttend ALL presentations

Page 51: Systems & Applications: Introduction

Working in TeamsWhy teams?

Page 52: Systems & Applications: Introduction

Working in TeamsWhy teams?

Too much work for a single personRepresentative of professional environment

Page 53: Systems & Applications: Introduction

Working in TeamsWhy teams?

Too much work for a single personRepresentative of professional environment

Team organization:Form groups of 3 (possibly 2) people

Page 54: Systems & Applications: Introduction

Working in TeamsWhy teams?

Too much work for a single personRepresentative of professional environment

Team organization:Form groups of 3 (possibly 2) peopleArrange coordinationDistribute work equitably

Page 55: Systems & Applications: Introduction

Working in TeamsWhy teams?

Too much work for a single personRepresentative of professional environment

Team organization:Form groups of 3 (possibly 2) peopleArrange coordinationDistribute work equitably

All team members receive the same grade End-of-course evaluation

Page 56: Systems & Applications: Introduction

First TaskForm teams:

Email Ryan [email protected] with the team list

Page 57: Systems & Applications: Introduction

ResourcesReadings:

Current research papers in question-answering

Page 58: Systems & Applications: Introduction

ResourcesReadings:

Current research papers in question-answering Jurafsky & Martin/Manning & Schutze text

Background, reference, refresher

Page 59: Systems & Applications: Introduction

ResourcesReadings:

Current research papers in question-answering Jurafsky & Martin/Manning & Schutze text

Background, reference, refresher

Software:

Page 60: Systems & Applications: Introduction

ResourcesReadings:

Current research papers in question-answering Jurafsky & Martin/Manning & Schutze text

Background, reference, refresher

Software:Build on existing system components, toolkits

NLP, machine learning, etcCorpora, etc

Page 61: Systems & Applications: Introduction

Resources: PatasSystem should run on patas

Existing infrastructureSoftware systems

Corpora

Repositories

Page 62: Systems & Applications: Introduction

Shared Task Evaluations Goals:

Lofty:

Page 63: Systems & Applications: Introduction

Shared Task Evaluations Goals:

Lofty:Focus research community on key challenges

‘Grand challenges’

Page 64: Systems & Applications: Introduction

Shared Task Evaluations Goals:

Lofty:Focus research community on key challenges

‘Grand challenges’

Support the creation of large-scale community resources Corpora: News, Recordings, Video Annotation: Expert questions, labeled answers,..

Page 65: Systems & Applications: Introduction

Shared Task Evaluations Goals:

Lofty:Focus research community on key challenges

‘Grand challenges’

Support the creation of large-scale community resources Corpora: News, Recordings, Video Annotation: Expert questions, labeled answers,..

Develop methodologies to evaluate state-of-the-art Retrieval, Machine Translation, etc

Page 66: Systems & Applications: Introduction

Shared Task Evaluations Goals:

Lofty:Focus research community on key challenges

‘Grand challenges’

Support the creation of large-scale community resources Corpora: News, Recordings, Video Annotation: Expert questions, labeled answers,..

Develop methodologies to evaluate state-of-the-art Retrieval, Machine Translation, etc

Facilitate technology/knowledge transfer b/t industry/acad.

Page 67: Systems & Applications: Introduction

Shared Task EvaluationGoals:

Pragmatic:

Page 68: Systems & Applications: Introduction

Shared Task EvaluationGoals:

Pragmatic:Head-to-head comparison of systems/techniques

Same data, same task, same conditions, same timing

Page 69: Systems & Applications: Introduction

Shared Task EvaluationGoals:

Pragmatic:Head-to-head comparison of systems/techniques

Same data, same task, same conditions, same timingCentralizes funding, effort

Page 70: Systems & Applications: Introduction

Shared Task EvaluationGoals:

Pragmatic:Head-to-head comparison of systems/techniques

Same data, same task, same conditions, same timingCentralizes funding, effortRequires disclosure of techniques in exchange for

data

Base:

Page 71: Systems & Applications: Introduction

Shared Task EvaluationGoals:

Pragmatic:Head-to-head comparison of systems/techniques

Same data, same task, same conditions, same timingCentralizes funding, effortRequire disclosure of techniques in exchange for data

Base:Bragging rights

Page 72: Systems & Applications: Introduction

Shared Task EvaluationGoals:

Pragmatic:Head-to-head comparison of systems/techniques

Same data, same task, same conditions, same timingCentralizes funding, effortRequire disclosure of techniques in exchange for data

Base:Bragging rightsGovernment research funding decisions

Page 73: Systems & Applications: Introduction

Shared Tasks: PerspectiveLate ‘80s-90s:

Page 74: Systems & Applications: Introduction

Shared Tasks: PerspectiveLate ‘80s-90s:

ATIS: spoken dialog systemsMUC: Message Understanding: information

extraction

Page 75: Systems & Applications: Introduction

Shared Tasks: PerspectiveLate ‘80s-90s:

ATIS: spoken dialog systemsMUC: Message Understanding: information

extractionTREC (Text Retrieval Conference)

Arguably largest ( often >100 participating teams) Longest running (1992-current) Information retrieval (and related technologies)

Actually hasn’t had ‘ad-hoc’ since ~2000, thoughOrganized by NIST

Page 76: Systems & Applications: Introduction

TREC TracksTrack: Basic task organization

Page 77: Systems & Applications: Introduction

TREC TracksTrack: Basic task organizationPrevious tracks:

Ad-hoc – Basic retrieval from fixed document set

Page 78: Systems & Applications: Introduction

TREC TracksTrack: Basic task organizationPrevious tracks:

Ad-hoc – Basic retrieval from fixed document setCross-language – Query in one language, docs in

otherEnglish, French, Spanish, Italian, German, Chinese,

Arabic

Page 79: Systems & Applications: Introduction

TREC TracksTrack: Basic task organizationPrevious tracks:

Ad-hoc – Basic retrieval from fixed document setCross-language – Query in one language, docs in

otherEnglish, French, Spanish, Italian, German, Chinese,

ArabicGenomics

Page 80: Systems & Applications: Introduction

TREC TracksTrack: Basic task organizationPrevious tracks:

Ad-hoc – Basic retrieval from fixed document setCross-language – Query in one language, docs in

otherEnglish, French, Spanish, Italian, German, Chinese,

ArabicGenomicsSpoken Document Retrieval

Page 81: Systems & Applications: Introduction

TREC TracksTrack: Basic task organizationPrevious tracks:

Ad-hoc – Basic retrieval from fixed document setCross-language – Query in one language, docs in

otherEnglish, French, Spanish, Italian, German, Chinese,

ArabicGenomicsSpoken Document RetrievalVideo search

Page 82: Systems & Applications: Introduction

TREC TracksTrack: Basic task organizationPrevious tracks:

Ad-hoc – Basic retrieval from fixed document setCross-language – Query in one language, docs in

otherEnglish, French, Spanish, Italian, German, Chinese,

ArabicGenomicsSpoken Document RetrievalVideo searchQuestion Answering

Page 83: Systems & Applications: Introduction

Current TREC tracksTREC 2013:

Contextual SuggestionCrowdsourcingFederated Web SearchKnowledge-base AccelerationMicroblogSessionTemporal SummarizationWeb

Page 84: Systems & Applications: Introduction

Other Shared TasksInternational:

CLEF (Europe); NTCIR (Japan); FIRE (India)

Page 85: Systems & Applications: Introduction

Other Shared TasksInternational:

CLEF (Europe); NTCIR (Japan); FIRE (India)Other NIST:

DUC (Document Summarization)Machine TranslationTopic Detection & Tracking

Page 86: Systems & Applications: Introduction

Other Shared TasksInternational:

CLEF (Europe); NTCIR (Japan); FIRE (India)Other NIST:

DUC (Document Summarization)Machine TranslationTopic Detection & Tracking

Various:CoNLL (NE, parsing,..); SENSEVAL: WSD; PASCAL

(morphology); BioNLP (biological entities, relations)

Page 87: Systems & Applications: Introduction

Other Shared TasksInternational:

CLEF (Europe); NTCIR (Japan); FIRE (India)Other NIST:

DUC (Document Summarization)Machine TranslationTopic Detection & Tracking

Various:CoNLL (NE, parsing,..); SENSEVAL: WSD; PASCAL

(morphology); BioNLP (biological entities, relations)Mediaeval (multi-media information access)

Page 88: Systems & Applications: Introduction

TREC Question-AnsweringSeveral years (1999-2007)

Started with pure factoid questions from news sources

Page 89: Systems & Applications: Introduction

TREC Question-AnsweringSeveral years (1999-2007)

Started with pure factoid questions from news sources

Extended to lists, relationship

Page 90: Systems & Applications: Introduction

TREC Question-AnsweringSeveral years (1999-2007)

Started with pure factoid questions from news sources

Extended to lists, relationshipExtended to blog dataEmployed question series

Page 91: Systems & Applications: Introduction

TREC Question-AnsweringSeveral years (1999-2007)

Started with pure factoid questions from news sources

Extended to lists, relationshipExtended to blog dataEmployed question seriesAdded temporal constraints ‘Complex, interactive’ evaluation

Page 92: Systems & Applications: Introduction

TREC Question-AnsweringSeveral years (1999-2007)

Started with pure factoid questions from news sources Extended to lists, relationship Extended to blog data Employed question series Added temporal constraints ‘Complex, interactive’ evaluation

Combined with summarization to form TAC (2008---) Text Analytics Conference

Opinion Q/A, Knowledge-based population, Scientific Summarization

Page 93: Systems & Applications: Introduction

TREC Question-AnsweringProvides:

Lists of questionsDocument collections (licensed via LDC)Ranked document resultsEvaluation tools: Answer verification patternsDerived resources:

E.g. Roth and Li’s question categories, training/testReams of related publications

Page 94: Systems & Applications: Introduction

Questions<top>

<num> Number: 894<desc> Description: How far is it from Denver to

Aspen? </top>

Page 95: Systems & Applications: Introduction

Questions<top>

<num> Number: 894<desc> Description: How far is it from Denver to

Aspen? </top> <top>

<num> Number: 895 <desc> Description: What county is Modesto,

California in? </top>

Page 96: Systems & Applications: Introduction

Documents <DOC><DOCNO> APW20000817.0002 </DOCNO> <DOCTYPE> NEWS STORY </DOCTYPE><DATE_TIME> 2000-08-17

00:05 </DATE_TIME> <BODY> <HEADLINE> 19 charged with drug trafficking </HEADLINE> <TEXT><P> UTICA, N.Y. (AP) - Nineteen people involved in a drug trafficking

ring in the Utica area were arrested early Wednesday, police said. </P><P> Those arrested are linked to 22 others picked up in May and comprise

''a major cocaine, crack cocaine and marijuana distribution organization,'' according to the U.S. Department of Justice.

</P>

Page 97: Systems & Applications: Introduction

Answer Keys1394: French 1395: Nicole Kidman 1396: Vesuvius 1397: 62,046 1398: 1867 1399: Brigadoon

Page 98: Systems & Applications: Introduction

ReminderTeam up!