Combining Multiple Learners

Lecture Slides for Introduction to Machine Learning 2e
ETHEM ALPAYDIN © The MIT Press, 2010
alpaydin@boun.edu.tr
http://www.cmpe.boun.edu.tr/~ethem/i2ml2e

Rationale

No Free Lunch Theorem: there is no single algorithm that is always the most accurate.

Generate a group of base-learners which, when combined, has higher accuracy.

Different learners use different:
- Algorithms
- Hyperparameters
- Representations / modalities / views
- Training sets
- Subproblems

There is a trade-off between diversity and accuracy.


Voting

Linear combination:

$$y = \sum_{j=1}^{L} w_j d_j \qquad \text{where } w_j \ge 0 \text{ and } \sum_{j=1}^{L} w_j = 1$$

Classification:

$$y_i = \sum_{j=1}^{L} w_j d_{ji}$$

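A minimal sketch of the weighted vote above; the learner outputs, weights, and shapes are illustrative assumptions, not values from the slides:

```python
import numpy as np

# d[j, i] = vote of base-learner j for class i (e.g., posterior probabilities)
d = np.array([[0.7, 0.2, 0.1],
              [0.5, 0.4, 0.1],
              [0.3, 0.3, 0.4]])

# w[j] = weight of learner j; non-negative and summing to 1
w = np.array([0.5, 0.3, 0.2])

# y_i = sum_j w_j * d_ji  (the linear combination above)
y = w @ d
print(y)           # combined per-class support
print(y.argmax())  # predicted class
```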

Bayesian perspective:

$$P(C_i \mid x) = \sum_{\text{all models } \mathcal{M}_j} P(C_i \mid x, \mathcal{M}_j)\, P(\mathcal{M}_j)$$

If the $d_j$ are iid:

$$E[y] = E\!\left[\frac{1}{L}\sum_j d_j\right] = \frac{1}{L}\, L\, E[d_j] = E[d_j]$$

$$\mathrm{Var}(y) = \mathrm{Var}\!\left(\frac{1}{L}\sum_j d_j\right) = \frac{1}{L^2}\,\mathrm{Var}\!\left(\sum_j d_j\right) = \frac{1}{L^2}\, L\, \mathrm{Var}(d_j) = \frac{1}{L}\,\mathrm{Var}(d_j)$$

Bias does not change; variance decreases by a factor of L.

If the $d_j$ are dependent, error increases with positive correlation:

$$\mathrm{Var}(y) = \frac{1}{L^2}\,\mathrm{Var}\!\left(\sum_j d_j\right) = \frac{1}{L^2}\left[\sum_j \mathrm{Var}(d_j) + 2\sum_j \sum_{i<j} \mathrm{Cov}(d_i, d_j)\right]$$

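A quick numerical check of the variance argument above; the Gaussian predictors, L, and mixing coefficients are arbitrary illustrative choices:

```python
import numpy as np

rng = np.random.default_rng(0)
L, trials = 10, 100_000

# L iid predictors, each with variance 1: Var of their mean should be ~ 1/L
d = rng.normal(0.0, 1.0, size=(trials, L))
print(d.mean(axis=1).var())       # ~ 0.1 = Var(d_j) / L

# Positively correlated predictors: a shared component inflates Var of the mean
shared = rng.normal(0.0, 1.0, size=(trials, 1))
d_corr = 0.7 * shared + 0.3 * rng.normal(0.0, 1.0, size=(trials, L))
print(d_corr.mean(axis=1).var())  # much larger than the iid case
```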

Fixed Combination Rules

Combination rules with no trainable parameters, e.g., the sum, weighted sum, median, minimum, maximum, and product of the learner outputs d_ji.

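A minimal sketch applying these fixed rules to made-up learner outputs (the sum rule is shown as a mean, which ranks classes identically):

```python
import numpy as np

# d[j, i] = output of learner j for class i (illustrative values)
d = np.array([[0.7, 0.2, 0.1],
              [0.5, 0.4, 0.1],
              [0.3, 0.3, 0.4]])

rules = {
    "sum":     d.mean(axis=0),
    "median":  np.median(d, axis=0),
    "minimum": d.min(axis=0),
    "maximum": d.max(axis=0),
    "product": d.prod(axis=0),
}
for name, y in rules.items():
    print(f"{name:8s} support={y} -> class {y.argmax()}")
```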

Error-Correcting Output Codes

K classes; L problems (Dietterich and Bakiri, 1995).
The code matrix W codes classes in terms of learners.

One per class, L = K:

$$W = \begin{bmatrix} +1 & -1 & -1 & -1 \\ -1 & +1 & -1 & -1 \\ -1 & -1 & +1 & -1 \\ -1 & -1 & -1 & +1 \end{bmatrix}$$

Pairwise, L = K(K−1)/2:

$$W = \begin{bmatrix} +1 & +1 & +1 & 0 & 0 & 0 \\ -1 & 0 & 0 & +1 & +1 & 0 \\ 0 & -1 & 0 & -1 & 0 & +1 \\ 0 & 0 & -1 & 0 & -1 & -1 \end{bmatrix}$$


Full code, L = 2^(K−1) − 1:

$$W = \begin{bmatrix} -1 & -1 & -1 & -1 & -1 & -1 & -1 \\ -1 & -1 & -1 & +1 & +1 & +1 & +1 \\ -1 & +1 & +1 & -1 & -1 & +1 & +1 \\ +1 & -1 & +1 & -1 & +1 & -1 & +1 \end{bmatrix}$$

With a reasonable L, find W such that the Hamming distances between rows and between columns are maximized.

Voting scheme:

$$y_i = \sum_{j=1}^{L} w_{ij} d_j$$

Subproblems may be more difficult than one-per-K.

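A minimal sketch of ECOC decoding with the one-per-class matrix above, using the voting scheme y_i = Σ_j w_ij d_j; the binary-learner outputs d_j are made-up values:

```python
import numpy as np

# One-per-class code matrix for K = 4 (rows: classes, columns: learners)
W = np.array([[+1, -1, -1, -1],
              [-1, +1, -1, -1],
              [-1, -1, +1, -1],
              [-1, -1, -1, +1]])

# d[j] = output of binary learner j in [-1, +1] (illustrative values)
d = np.array([-0.8, 0.6, -0.2, -0.9])

# Pick the class whose code word best matches the learner outputs:
# y_i = sum_j w_ij * d_j, i.e. the correlation with row i of W
y = W @ d
print(y, "-> class", y.argmax())
```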

Bagging

Use bootstrapping to generate L training sets and train one base-learner with each (Breiman, 1996).

Combine by voting (average or median for regression).

Unstable algorithms profit from bagging.

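A minimal bagging sketch; the base learner (a decision tree, a typically unstable model), L, and the numpy-array data interface are assumptions:

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

def bagging_fit(X, y, L=25, seed=0):
    """Train L base-learners, each on a bootstrap sample of (X, y)."""
    rng = np.random.default_rng(seed)
    n = len(X)
    learners = []
    for _ in range(L):
        idx = rng.integers(0, n, size=n)   # sample n points with replacement
        learners.append(DecisionTreeClassifier().fit(X[idx], y[idx]))
    return learners

def bagging_predict(learners, X):
    """Combine by voting (majority over the base-learners' predictions)."""
    votes = np.stack([m.predict(X) for m in learners])        # shape (L, n)
    return np.apply_along_axis(
        lambda v: np.bincount(v.astype(int)).argmax(), 0, votes)
```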

AdaBoost

Generate a sequence of base-learners, each focusing on the previous ones' errors (Freund and Schapire, 1996).

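A compact sketch of AdaBoost.M1 with decision stumps; the weight updates follow the standard algorithm, while the base learner, the ±1 label encoding, and the clamping of the error are assumptions of this sketch:

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

def adaboost_fit(X, y, L=20):
    """AdaBoost.M1 sketch for labels y in {-1, +1}."""
    n = len(X)
    p = np.full(n, 1.0 / n)                    # instance weights, start uniform
    learners, alphas = [], []
    for _ in range(L):
        stump = DecisionTreeClassifier(max_depth=1).fit(X, y, sample_weight=p)
        pred = stump.predict(X)
        eps = max(p[pred != y].sum(), 1e-10)   # weighted training error
        if eps >= 0.5:                         # no better than chance: stop
            break
        beta = eps / (1.0 - eps)
        p = np.where(pred == y, p * beta, p)   # shrink correctly classified
        p /= p.sum()
        learners.append(stump)
        alphas.append(np.log(1.0 / beta))      # voting weight of this stump
    return learners, alphas

def adaboost_predict(learners, alphas, X):
    score = sum(a * m.predict(X) for a, m in zip(alphas, learners))
    return np.sign(score)
```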

Mixture of Experts

Voting where the weights are input-dependent (gating) (Jacobs et al., 1991):

$$y = \sum_{j=1}^{L} w_j(x)\, d_j$$

Experts or gating can be nonlinear.

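A minimal sketch of the input-dependent vote; the softmax gating parameterization, the linear gating network, and the toy experts are all assumptions, not the slides' specific model:

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def mixture_predict(x, experts, gating_W):
    """y = sum_j w_j(x) d_j with softmax gating weights w_j(x)."""
    w = softmax(gating_W @ x)                        # input-dependent weights
    d = np.array([expert(x) for expert in experts])  # expert outputs d_j
    return w @ d

# Toy usage: two linear "experts" and a hypothetical 2x2 gating matrix
experts = [lambda x: x @ np.array([1.0, 0.0]),
           lambda x: x @ np.array([0.0, 1.0])]
print(mixture_predict(np.array([0.2, 0.8]), experts, np.eye(2)))
```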

Stacking

The combiner f(·) is another learner (Wolpert, 1992).

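A minimal stacking sketch: the combiner is trained on base-learner outputs for data the base-learners did not see. The hold-out split, binary-classification setup, and choice of logistic regression as f(·) are assumptions:

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

def stacking_fit(X, y, base_factories):
    # Hold out half the data so the combiner f() sees base-learner
    # outputs on instances the base-learners were not trained on.
    X1, X2, y1, y2 = train_test_split(X, y, test_size=0.5, random_state=0)
    bases = [make().fit(X1, y1) for make in base_factories]
    Z = np.column_stack([m.predict_proba(X2)[:, 1] for m in bases])
    combiner = LogisticRegression().fit(Z, y2)
    return bases, combiner

def stacking_predict(bases, combiner, X):
    Z = np.column_stack([m.predict_proba(X)[:, 1] for m in bases])
    return combiner.predict(Z)

# Example factories: two trees of different depth as base-learners
factories = [lambda: DecisionTreeClassifier(max_depth=2),
             lambda: DecisionTreeClassifier(max_depth=5)]
```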

Fine-Tuning an Ensemble

Given an ensemble of dependent classifiers, do not use it as is; try to get independence.

1. Subset selection: forward (growing) / backward (pruning) approaches to improve accuracy/diversity/independence (see the sketch after this list).

2. Train metaclassifiers: from the outputs of correlated classifiers, extract new combinations that are uncorrelated. Using PCA, we get "eigenlearners."

This is similar to feature selection vs. feature extraction.

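A minimal sketch of forward (growing) subset selection by validation accuracy; the pool of already-fitted learners, the validation set, and the majority-vote scorer are assumptions:

```python
import numpy as np

def vote_accuracy(members, X_val, y_val):
    """Validation accuracy of the majority vote over the chosen subset."""
    votes = np.stack([m.predict(X_val) for m in members])
    pred = np.apply_along_axis(
        lambda v: np.bincount(v.astype(int)).argmax(), 0, votes)
    return (pred == y_val).mean()

def forward_select(pool, X_val, y_val):
    """Greedily grow the ensemble while the vote's accuracy improves."""
    chosen, best = [], 0.0
    while True:
        scored = [(vote_accuracy(chosen + [m], X_val, y_val), m)
                  for m in pool if m not in chosen]
        if not scored:
            break
        acc, m = max(scored, key=lambda t: t[0])
        if acc <= best:
            break                      # no remaining candidate helps
        chosen.append(m)
        best = acc
    return chosen, best
```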

Cascading

Use d_j only if the preceding learners are not confident.

Cascade learners in order of complexity.

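A sketch of a cascade with a confidence threshold θ on the posterior; the threshold value, the ordering of the learners, and the predict_proba interface are assumptions:

```python
import numpy as np

def cascade_predict(learners, X, theta=0.9):
    """Try learners in order of complexity; stop once one is confident.

    learners: fitted classifiers ordered simple -> complex,
              each with a predict_proba method (assumed interface)
    theta:    confidence threshold on the posterior
    """
    preds = np.empty(len(X), dtype=int)
    for i, x in enumerate(X):
        for j, m in enumerate(learners):
            p = m.predict_proba(x.reshape(1, -1))[0]
            # Accept if confident, or if this is the last (fallback) learner
            if p.max() > theta or j == len(learners) - 1:
                preds[i] = p.argmax()
                break
    return preds
```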

Combining Multiple Sources

Early integration: concatenate all features and train a single learner.

Late integration: with each feature set, train one learner, then either use a fixed rule or stacking to combine their decisions.

Intermediate integration: with each feature set, calculate a kernel, then use a single SVM with multiple kernels.

Combining features vs. decisions vs. kernels.

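A sketch of intermediate integration: precompute one kernel per feature set, combine them as a fixed weighted sum, and feed the combined Gram matrix to a single SVM. The two feature sets X1/X2, the kernel choices, and the weight η are assumptions:

```python
import numpy as np
from sklearn.svm import SVC
from sklearn.metrics.pairwise import rbf_kernel, linear_kernel

def fit_multi_kernel(X1, X2, y, eta=0.5):
    """One kernel per feature set, combined into a single Gram matrix."""
    K = eta * rbf_kernel(X1) + (1 - eta) * linear_kernel(X2)
    return SVC(kernel="precomputed").fit(K, y)

def predict_multi_kernel(svm, X1_tr, X2_tr, X1_te, X2_te, eta=0.5):
    # Test-vs-train kernel matrices, combined with the same weights
    K = (eta * rbf_kernel(X1_te, X1_tr)
         + (1 - eta) * linear_kernel(X2_te, X2_tr))
    return svm.predict(K)
```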
