
MLE’s, Bayesian Classifiers and Naïve Bayes

Machine Learning 10-601

Tom M. Mitchell
Machine Learning Department

Carnegie Mellon University

January 30, 2008

Required reading:

• Mitchell draft chapter, sections 1 and 2. (available on class website)

Naïve Bayes in a Nutshell

Bayes rule:

$$P(Y = y_k \mid X_1, \ldots, X_n) = \frac{P(Y = y_k)\, P(X_1, \ldots, X_n \mid Y = y_k)}{\sum_j P(Y = y_j)\, P(X_1, \ldots, X_n \mid Y = y_j)}$$

Assuming conditional independence among the $X_i$'s:

$$P(Y = y_k \mid X_1, \ldots, X_n) = \frac{P(Y = y_k) \prod_i P(X_i \mid Y = y_k)}{\sum_j P(Y = y_j) \prod_i P(X_i \mid Y = y_j)}$$

So, the classification rule for $X^{new} = \langle X_1, \ldots, X_n \rangle$ is:

$$Y^{new} \leftarrow \arg\max_{y_k} P(Y = y_k) \prod_i P(X_i^{new} \mid Y = y_k)$$

Naïve Bayes Algorithm – discrete Xi

• Train Naïve Bayes (examples)

for each* value $y_k$, estimate

$$\hat{\pi}_k \equiv \hat{P}(Y = y_k)$$

for each* value $x_{ij}$ of each attribute $X_i$, estimate

$$\hat{\theta}_{ijk} \equiv \hat{P}(X_i = x_{ij} \mid Y = y_k)$$

• Classify ($X^{new}$)

$$Y^{new} \leftarrow \arg\max_{y_k} \hat{\pi}_k \prod_i \hat{P}(X_i^{new} \mid Y = y_k)$$

* Probabilities must sum to 1, so we need to estimate only $n - 1$ of the $n$ parameters.
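A minimal Python sketch of this train/classify loop for discrete attributes, using raw MLE counts (function and variable names like `train_nb` and `pi_hat` are illustrative, not from the lecture):

```python
from collections import Counter

def train_nb(examples):
    """examples: list of (x, y), where x is a tuple of discrete attribute values.
    Returns MLE estimates pi_hat[y] = P(Y=y) and
    theta_hat[(i, xi, y)] = P(X_i = xi | Y = y)."""
    y_counts = Counter(y for _, y in examples)
    xy_counts = Counter()
    for x, y in examples:
        for i, xi in enumerate(x):
            xy_counts[(i, xi, y)] += 1
    n = len(examples)
    pi_hat = {y: c / n for y, c in y_counts.items()}
    theta_hat = {k: c / y_counts[k[2]] for k, c in xy_counts.items()}
    return pi_hat, theta_hat

def classify_nb(pi_hat, theta_hat, x_new):
    """Return argmax_y P(Y=y) * prod_i P(X_i = xi | Y = y)."""
    best_y, best_score = None, -1.0
    for y, prior in pi_hat.items():
        score = prior
        for i, xi in enumerate(x_new):
            score *= theta_hat.get((i, xi, y), 0.0)  # unseen value -> 0 (Subtlety #1 below)
        if score > best_score:
            best_y, best_score = y, score
    return best_y
```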

Estimating Parameters: Y, Xi discrete-valued

Maximum likelihood estimates (MLE's):

$$\hat{\pi}_k = \hat{P}(Y = y_k) = \frac{\#D\{Y = y_k\}}{|D|}$$

$$\hat{\theta}_{ijk} = \hat{P}(X_i = x_{ij} \mid Y = y_k) = \frac{\#D\{X_i = x_{ij} \wedge Y = y_k\}}{\#D\{Y = y_k\}}$$

where $\#D\{Y = y_k\}$ denotes the number of items in set $D$ for which $Y = y_k$.

Example: Live in Sq Hill? $P(S \mid G, D, M)$

• S = 1 iff live in Squirrel Hill
• G = 1 iff shop at Giant Eagle
• D = 1 iff drive to CMU
• M = 1 iff Dave Matthews fan

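As a point of comparison (the arithmetic is an addition, not on the slide): estimating $P(S \mid G, D, M)$ directly would take $2^3 = 8$ parameters, one per setting of $(G, D, M)$, while Naïve Bayes needs only $1 + 2 \cdot 3 = 7$: one for $P(S)$, plus $P(G \mid S)$, $P(D \mid S)$, and $P(M \mid S)$ for each value of $S$. The gap is small here, but with $n$ binary attributes it is $2^n$ versus $2n + 1$.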

Naïve Bayes: Subtlety #1

If we are unlucky, our MLE estimate for $P(X_i \mid Y)$ may be zero (e.g., $X_{373}$ = Birthday_Is_January30, and no training example has that birthday).

• Why worry about just one parameter out of many?

• What can be done to avoid this?
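To make the first prompt concrete: the classification rule multiplies all $n$ per-attribute estimates together, so a single zero estimate, e.g. $\hat{P}(X_{373} = 1 \mid Y = y_k) = 0$, forces $\hat{P}(Y = y_k) \prod_i \hat{P}(X_i \mid Y = y_k) = 0$ and vetoes class $y_k$ no matter how strongly the other $n - 1$ attributes support it. The remedy, developed on the next slide, is to smooth the estimates with a prior.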

Estimating Parameters: Y, Xi discrete-valued

Maximum likelihood estimates:

$$\hat{\pi}_k = \frac{\#D\{Y = y_k\}}{|D|} \qquad \hat{\theta}_{ijk} = \frac{\#D\{X_i = x_{ij} \wedge Y = y_k\}}{\#D\{Y = y_k\}}$$

MAP estimates (Dirichlet priors):

$$\hat{\pi}_k = \frac{\#D\{Y = y_k\} + (\beta_k - 1)}{|D| + \sum_m (\beta_m - 1)} \qquad \hat{\theta}_{ijk} = \frac{\#D\{X_i = x_{ij} \wedge Y = y_k\} + (\beta_j - 1)}{\#D\{Y = y_k\} + \sum_{j'} (\beta_{j'} - 1)}$$

Only difference: the prior contributes "imaginary" examples to the counts. Setting every $\beta = 2$ adds one imaginary example of each value (Laplace smoothing), so no estimate is ever exactly zero.
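A minimal sketch of the smoothed estimate next to the raw MLE (the `pseudocount` parameter plays the role of the imaginary examples; names are illustrative):

```python
def estimate_theta(xy_count, y_count, n_values, pseudocount=1.0):
    """MAP-style estimate of P(X_i = x_ij | Y = y_k) with a symmetric Dirichlet prior.
    xy_count: #D{X_i = x_ij and Y = y_k};  y_count: #D{Y = y_k};
    n_values: number of distinct values attribute X_i can take.
    pseudocount=0 recovers the MLE; pseudocount=1 is Laplace smoothing."""
    return (xy_count + pseudocount) / (y_count + pseudocount * n_values)

# With pseudocount=1, an attribute value never seen with class y_k still
# gets a small nonzero probability instead of wiping out the whole product:
print(estimate_theta(0, 50, 2))                 # 1/52, about 0.019
print(estimate_theta(0, 50, 2, pseudocount=0))  # 0.0 -- the MLE problem
```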

Naïve Bayes: Subtlety #2

Often the Xi are not really conditionally independent

• We use Naïve Bayes in many cases anyway, and it often works pretty well
  – often the right classification, even when not the right probability (see [Domingos & Pazzani, 1996])

• What is the effect on the estimated P(Y|X)?
  – Special case: what if we add two copies: $X_i = X_k$? (worked out below)
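A worked version of the special case (a sketch, using the notation above): if $X_k$ is an exact copy of $X_i$, the Naïve Bayes product includes the factor $P(X_i \mid Y = y)$ twice, so the estimated posterior becomes proportional to

$$P(Y = y)\, P(X_i \mid Y = y)^2 \prod_{m \neq i, k} P(X_m \mid Y = y)$$

The evidence from $X_i$ is double-counted, pushing the estimated $P(Y \mid X)$ toward 0 or 1 (overconfidence), even though the argmax classification often remains unchanged.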

Learning to classify text documents

• Classify which emails are spam

• Classify which emails are meeting invites

• Classify which web pages are student home pages

How shall we represent text documents for Naïve Bayes?

Baseline: Bag of Words Approach

Represent each document by a vector of word counts over the vocabulary, e.g.:

word      count
aardvark  0
about     2
all       2
Africa    1
apple     0
anxious   0
...
gas       1
...
oil       1
Zaire     0
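A minimal sketch of turning raw text into such a count vector (the short `vocab` list here is an illustrative stand-in for a real vocabulary):

```python
import re
from collections import Counter

def bag_of_words(text, vocab):
    """Map a document to a vector of per-word counts over a fixed vocabulary."""
    counts = Counter(re.findall(r"[a-z']+", text.lower()))
    return [counts[w] for w in vocab]

vocab = ["aardvark", "about", "all", "africa", "apple", "gas", "oil", "zaire"]
doc = "All about oil and gas prices in Africa, all over again"
print(bag_of_words(doc, vocab))  # -> [0, 1, 2, 1, 0, 1, 1, 0]
```

Note that word order is discarded: only counts remain, which matches the conditional-independence assumption Naïve Bayes makes anyway.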

For code and data, see www.cs.cmu.edu/~tom/mlbook.html and click on "Software and Data".

What you should know:

• Training and using classifiers based on Bayes rule

• Conditional independence
  – What it is
  – Why it's important

• Naïve Bayes
  – What it is
  – Why we use it so much
  – Training using MLE and MAP estimates
  – Discrete variables (Bernoulli) and continuous variables (Gaussian)

Questions:

• Can you use Naïve Bayes for a combination of discrete and real-valued Xi?

• How can we easily model just 2 of n attributes as dependent?

• What does the decision surface of a Naïve Bayes classifier look like?

What is the form of the decision surface for a Naïve Bayes classifier?
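A sketch of one answer for boolean $X_i$ (this derivation is an addition, not from the slides): the log-odds implied by Naïve Bayes is

$$\log \frac{P(Y = 1 \mid X)}{P(Y = 0 \mid X)} = \log \frac{P(Y = 1)}{P(Y = 0)} + \sum_i \log \frac{P(X_i \mid Y = 1)}{P(X_i \mid Y = 0)}$$

and for boolean $X_i$ each term in the sum is linear in $X_i$, so the decision surface $\{X : \text{log-odds} = 0\}$ is a hyperplane. The same holds for Gaussian $X_i$ whose variances do not depend on the class.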
