
Text Classification & Naïve Bayes

CMSC 723 / LING 723 / INST 725

MARINE CARPUAT

marine@cs.umd.edu

Some slides by Dan Jurafsky & James Martin, Jacob Eisenstein

Today

• Text classification problems

– and their evaluation

• Linear classifiers

– Features & Weights

– Bag of words

– Naïve Bayes

Machine Learning, Probability

Linguistics

TEXT CLASSIFICATION

Is this spam?

From: "Fabian Starr" <Patrick_Freeman@pamietaniepeerelu.pl>

Subject: Hey! Sofware for the funny prices!

Get the great discounts on popular software today for PC and Macintosh
http://iiled.org/Cj4Lmx

70-90% Discounts from retail price!!!
All sofware is instantly available to download - No Need Wait!

Who wrote which Federalist papers?

• 1787-8: anonymous essays try to convince New York to ratify the U.S. Constitution: Jay, Madison, Hamilton.

• Authorship of 12 of the letters in dispute

• 1963: solved by Mosteller and Wallace using Bayesian methods

James Madison Alexander Hamilton

Positive or negative movie review?

• unbelievably disappointing

• Full of zany characters and richly applied satire, and some great plot twists

• this is the greatest screwball comedy ever filmed

• It was pathetic. The worst part about it was the boxing scenes.

What is the subject of this article?

• Antagonists and Inhibitors

• Blood Supply

• Chemistry

• Drug Therapy

• Embryology

• Epidemiology

• …

(Figure: a MEDLINE article is assigned categories from the MeSH Subject Category Hierarchy)

Text Classification

• Assigning subject categories, topics, or genres

• Spam detection

• Authorship identification

• Age/gender identification

• Language Identification

• Sentiment analysis

• …

Text Classification: definition

• Input:

– a document w

– a fixed set of classes Y = {y1, y2,…, yJ}

• Output: a predicted class y ∈ Y

Classification Methods: Hand-coded rules

• Rules based on combinations of words or other features

– spam: black-list-address OR (“dollars” AND “have been selected”)

• Accuracy can be high

– If rules carefully refined by expert

• But building and maintaining these rules is expensive

Classification Methods: Supervised Machine Learning

• Input

– a document w

– a fixed set of classes Y = {y1, y2,…, yJ}

– A training set of m hand-labeled documents (w1,y1), …, (wm,ym)

• Output

– a learned classifier w → y

Aside: getting examples for supervised learning

• Human annotation

– By experts or non-experts (crowdsourcing)

– Found data

• Truth vs. gold standard

• How do we know how good a classifier is?

– Accuracy on held out data

Aside: evaluating classifiers

• How do we know how good a classifier is?

– Compare classifier predictions with human annotation

– On held out test examples

– Evaluation metrics: accuracy, precision, recall

The 2-by-2 contingency table

                correct   not correct
  selected        tp          fp
  not selected    fn          tn

Precision and recall

• Precision: % of selected items that are correct

  P = tp / (tp + fp)

• Recall: % of correct items that are selected

  R = tp / (tp + fn)

                correct   not correct
  selected        tp          fp
  not selected    fn          tn
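As a quick sanity check, precision and recall can be computed directly from the four cells of the table. The counts below are hypothetical, chosen only for illustration:

```python
# Hypothetical counts for the 2-by-2 contingency table (not from the slides).
tp, fp, fn, tn = 30, 10, 20, 40

precision = tp / (tp + fp)  # fraction of selected items that are correct
recall = tp / (tp + fn)     # fraction of correct items that are selected

print(precision)  # 0.75
print(recall)     # 0.6
```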

A combined measure: F

• A combined measure that assesses the P/R tradeoff is F measure (weighted harmonic mean):

  F = 1 / (α·(1/P) + (1−α)·(1/R)) = (β² + 1)·P·R / (β²·P + R)

• People usually use balanced F1 measure

– i.e., with β = 1 (that is, α = ½):

  F = 2PR / (P + R)
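The weighted form above can be sketched as a small helper; with the default β = 1 it reduces to the balanced F1 = 2PR/(P+R). The precision and recall values are the same hypothetical numbers used earlier:

```python
def f_measure(p, r, beta=1.0):
    """Weighted harmonic mean of precision and recall:
    F = (beta^2 + 1) * P * R / (beta^2 * P + R)."""
    return (beta**2 + 1) * p * r / (beta**2 * p + r)

# With beta = 1 this is the balanced F1 = 2PR / (P + R).
print(f_measure(0.75, 0.6))  # ≈ 0.667
```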

LINEAR CLASSIFIERS

Bag of words

Defining features

Linear classification

Linear Models for Classification

• Feature function representation

• Weights
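A minimal sketch of these two ingredients: a bag-of-words feature function and a weight vector combined by a dot product. The sentiment weights are hypothetical values invented for illustration, not from the slides:

```python
from collections import Counter

def bow_features(doc):
    """Bag-of-words feature function: word counts, word order discarded."""
    return Counter(doc.lower().split())

def score(features, weights):
    """Linear model: the score is the dot product of features and weights."""
    return sum(n * weights.get(w, 0.0) for w, n in features.items())

# Hypothetical sentiment weights (illustration only).
weights = {"great": 1.5, "disappointing": -2.0, "pathetic": -1.8}
print(score(bow_features("unbelievably disappointing"), weights))  # -2.0
```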

How can we learn weights?

• By hand

• Probability

– Today: Naïve Bayes

• Discriminative training

– e.g., perceptron, support vector machines

Generative Story for Multinomial Naïve Bayes

• A hypothetical stochastic process describing how training examples are generated
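That stochastic process can be sketched directly: draw a class y from P(y), then draw each word independently from the class-conditional distribution P(w | y). The distributions below are toy values made up for illustration:

```python
import random

# Toy distributions for the generative story (made up for illustration).
p_class = {"pos": 0.5, "neg": 0.5}
p_word = {"pos": {"great": 0.6, "fun": 0.4},
          "neg": {"boring": 0.7, "bad": 0.3}}

def generate_example(length=3):
    """Draw a class y ~ P(y), then each word independently from P(w | y)."""
    y = random.choices(list(p_class), weights=list(p_class.values()))[0]
    words = random.choices(list(p_word[y]), weights=list(p_word[y].values()),
                           k=length)
    return words, y
```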

Prediction with Naïve Bayes
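Prediction picks the class maximizing log P(y) + Σᵢ log P(wᵢ | y); working in log space avoids underflow from multiplying many small probabilities. A sketch under toy parameters (the floor for unseen words stands in for the smoothing discussed below):

```python
import math

# Toy parameters (made up for illustration; not from the slides).
priors = {"pos": 0.5, "neg": 0.5}
likelihoods = {"pos": {"great": 0.6, "fun": 0.4},
               "neg": {"boring": 0.7, "bad": 0.3}}

def predict(words, priors, likelihoods, floor=1e-10):
    """Naive Bayes decision rule in log space:
    y* = argmax_y [ log P(y) + sum_i log P(w_i | y) ]."""
    best_y, best_score = None, float("-inf")
    for y in priors:
        score = math.log(priors[y])
        for w in words:
            # tiny floor keeps log() finite for unseen words
            score += math.log(likelihoods[y].get(w, floor))
        if score > best_score:
            best_y, best_score = y, score
    return best_y

print(predict(["great", "fun"], priors, likelihoods))  # pos
```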

Parameter Estimation

• “count and normalize”

• Parameters of a multinomial distribution

– Relative frequency estimator

– Formally: this is the maximum likelihood estimate

• See CIML for derivation
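"Count and normalize" can be shown end to end on a toy labeled corpus (invented for illustration): the prior is the fraction of documents with each label, and each likelihood is a word's relative frequency within its class:

```python
from collections import Counter, defaultdict

# Toy training set (made up): (document words, class label).
train = [(["great", "fun"], "pos"),
         (["great"], "pos"),
         (["boring", "bad"], "neg")]

class_counts = Counter(y for _, y in train)
word_counts = defaultdict(Counter)
for words, y in train:
    word_counts[y].update(words)

# "Count and normalize": relative-frequency (maximum likelihood) estimates.
priors = {y: n / len(train) for y, n in class_counts.items()}
likelihoods = {y: {w: n / sum(wc.values()) for w, n in wc.items()}
               for y, wc in word_counts.items()}

print(priors["pos"])                # 2 of 3 documents -> 2/3
print(likelihoods["pos"]["great"])  # 2 of 3 pos tokens -> 2/3
```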

Smoothing
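The relative-frequency estimate assigns probability zero to any word not seen with a class, which zeroes out the whole product at prediction time. Add-α (Laplace when α = 1) smoothing fixes this; a sketch with toy counts:

```python
def smoothed(word, counts, vocab_size, alpha=1.0):
    """Add-alpha smoothing (Laplace when alpha = 1):
    P(w | y) = (count(w, y) + alpha) / (total(y) + alpha * |V|)."""
    return (counts.get(word, 0) + alpha) / (sum(counts.values())
                                            + alpha * vocab_size)

counts = {"great": 2, "fun": 1}  # toy class-conditional counts
print(smoothed("great", counts, vocab_size=4))   # (2+1)/(3+4) = 3/7
print(smoothed("boring", counts, vocab_size=4))  # unseen word: 1/7, not 0
```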

Naïve Bayes recap

Today

• Text classification problems

– and their evaluation

• Linear classifiers

– Features & Weights

– Bag of words

– Naïve Bayes

Machine Learning, Probability

Linguistics

Recommended