
Regression

CS294 Practical Machine Learning

Romain Thibaux, 09/18/06

Outline

• Ordinary Least Squares regression

– Derivation from minimizing the sum of squares

– Probabilistic interpretation

– Online version (LMS)

• Overfitting and Regularization

• Numerical stability

• L1 Regression

• Kernel Regression, Spline Regression

• Multivariate Adaptive Regression Splines (MARS)

Classification (reminder)

$X \to Y$

$X$ can be anything:

• continuous ($\mathbb{R}$, $\mathbb{R}^d$, …)

• discrete ({0,1}, {1,…,k}, …)

• structured (tree, string, …)

• …

$Y$ is discrete:

– {0,1} binary

– {1,…,k} multi-class

– tree, etc. structured


Perceptron

Logistic Regression

Support Vector Machine

Decision Tree, Random Forest

Kernel trick

Regression

$X \to Y$

$X$ can be anything:

• continuous ($\mathbb{R}$, $\mathbb{R}^d$, …)

• discrete ({0,1}, {1,…,k}, …)

• structured (tree, string, …)

• …

$Y$ is continuous:

– $\mathbb{R}$, $\mathbb{R}^d$


Examples

• Voltage → Temperature

• Processes, memory → Power consumption

• Protein structure → Energy [next week]

• Robot arm controls → Torque at effector

• Location, industry, past losses → Premium

Linear regression

[Figure: scatter plots of Temperature measurements]

[start Matlab demo lecture2.m]

Given examples $(x_i, y_i)$, $i = 1, \dots, n$

Predict $\hat{y}$ given a new point $x$

[Figure: linear fits to the Temperature data in 1D and 2D]

Linear regression

Prediction: $\hat{y} = w_0 + w_1 x_1$ (line) or $\hat{y} = w_0 + w_1 x_1 + w_2 x_2$ (plane)

Ordinary Least Squares (OLS)

[Figure: linear fit with a vertical bar marking the gap between each observation and its prediction]

Error or “residual”: $r_i = y_i - \hat{y}_i$, observation minus prediction

Sum squared error: $E(w) = \sum_{i=1}^n (y_i - w^\top x_i)^2$

Minimize the sum squared error

Setting the gradient of the sum squared error $E(w) = \sum_{i=1}^n (y_i - w^\top x_i)^2$ to zero,

$\nabla_w E(w) = -2 \sum_{i=1}^n (y_i - w^\top x_i)\, x_i = 0,$

gives a linear equation in $w$, i.e. a linear system:

$\Big(\sum_{i=1}^n x_i x_i^\top\Big)\, w = \sum_{i=1}^n x_i\, y_i$

Alternative derivation

Stack the inputs into the $n \times d$ matrix $X$ (one example per row) and the targets into the vector $y$. Then $E(w) = \|Xw - y\|^2$, and the minimizer satisfies the normal equations

$X^\top X\, w = X^\top y$

Solve the system (it’s better not to invert the matrix)
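A minimal Matlab sketch of this solve (the names X, y, w are placeholders, not taken from the lecture demo):

% Solve the normal equations X'*X*w = X'*y with backslash, which uses
% a factorization rather than computing inv(X'*X).
w = (X' * X) \ (X' * y);

% Equivalent and numerically safer: least squares directly on X,
% which avoids squaring the condition number.
w = X \ y;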

LMS Algorithm (Least Mean Squares)

$w \leftarrow w + \eta\, (y_i - w^\top x_i)\, x_i$

where $\eta$ is a small step size (learning rate)

Online algorithm: update $w$ after seeing each example, instead of solving the full system at once
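A sketch of one pass of LMS over the data, under the same placeholder names (eta is an illustrative step size; too large a value makes the updates diverge):

eta = 0.01;                       % illustrative step size
w = zeros(size(X, 2), 1);
for i = 1:size(X, 1)
    xi = X(i, :)';                % current example, as a column
    w  = w + eta * (y(i) - xi' * w) * xi;   % LMS update
end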

Beyond lines and planes

Everything is the same with features $\phi(x)$ in place of $x$, e.g. $\phi(x) = (1, x, x^2, \dots)$:

$\hat{y} = w^\top \phi(x)$, still linear in $w$

[Figure: polynomial fit to the 1D data]
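A sketch of this feature expansion for polynomial regression, assuming a column vector x and a Matlab version with implicit expansion (R2016b or later); the degree p and the query points xnew are illustrative:

p    = 3;                          % assumed polynomial degree
Phi  = x .^ (0:p);                 % n-by-(p+1) matrix [1, x, x.^2, x.^3]
w    = Phi \ y;                    % the same least-squares solve as before
yhat = (xnew .^ (0:p)) * w;        % predictions at new inputs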

Geometric interpretation

[Matlab demo]

The fitted values $Xw$ are the orthogonal projection of $y$ onto the span of the columns of $X$; the residual $y - Xw$ is perpendicular to that span.

[Figure: 3D view of $y$ projected onto the column space of $X$]

Ordinary Least Squares [summary]

Given examples $(x_i, y_i)$, $i = 1, \dots, n$

Let $X$ be the $n \times d$ matrix with rows $x_i^\top$ (for example, rows of expanded features $\phi(x_i)^\top$), and let $y$ be the vector of targets

Minimize $\|Xw - y\|^2$ by solving $X^\top X\, w = X^\top y$

Predict $\hat{y} = w^\top x$ for a new point $x$

Probabilistic interpretation

Model: $y_i = w^\top x_i + \varepsilon_i$ with independent noise $\varepsilon_i \sim \mathcal{N}(0, \sigma^2)$

Likelihood: $p(y \mid X, w) = \prod_{i=1}^n \frac{1}{\sqrt{2\pi}\,\sigma} \exp\!\Big(-\frac{(y_i - w^\top x_i)^2}{2\sigma^2}\Big)$
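Writing out the step the slide leaves implicit (under the Gaussian noise model above):

\begin{align*}
\log p(y \mid X, w)
  &= n \log \frac{1}{\sqrt{2\pi}\,\sigma}
   - \frac{1}{2\sigma^2} \sum_{i=1}^n (y_i - w^\top x_i)^2 \\
\arg\max_w\, \log p(y \mid X, w)
  &= \arg\min_w\, \sum_{i=1}^n (y_i - w^\top x_i)^2
\end{align*}

so maximizing the likelihood is exactly minimizing the sum squared error.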

Assumptions vs. Reality

[Figure: Intel sensor network data; empirical distributions of Voltage and Temperature readings]

Overfitting

[Matlab demo]

[Figure: degree 15 polynomial fit to the data]

Ridge Regression (Regularization)

Minimize $\|Xw - y\|^2 + \lambda \|w\|^2$ with $\lambda$ “small”

Minimize by solving $(X^\top X + \lambda I)\, w = X^\top y$

[Figure: effect of regularization (degree 19)]
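A minimal sketch of the ridge solve, in the same placeholder notation (the value of lambda is illustrative; in practice it is chosen by cross-validation):

lambda = 1e-3;                                % illustrative value
d = size(X, 2);
w = (X' * X + lambda * eye(d)) \ (X' * y);    % regularized normal equations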

Probabilistic interpretation

Likelihood: $p(y \mid X, w) \propto \exp\!\Big(-\frac{1}{2\sigma^2} \|Xw - y\|^2\Big)$

Prior: $w \sim \mathcal{N}(0, \tau^2 I)$, i.e. $p(w) \propto \exp\!\Big(-\frac{1}{2\tau^2} \|w\|^2\Big)$

Posterior: $p(w \mid X, y) \propto p(y \mid X, w)\, p(w)$; maximizing it recovers ridge regression
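Spelled out (with $\tau$ the prior scale, a parameter of this reading of the slide):

\begin{align*}
-\log p(w \mid X, y)
  &= \frac{1}{2\sigma^2} \|Xw - y\|^2 + \frac{1}{2\tau^2} \|w\|^2 + \text{const} \\
  &\propto \|Xw - y\|^2 + \frac{\sigma^2}{\tau^2} \|w\|^2
\end{align*}

so the MAP estimate is ridge regression with $\lambda = \sigma^2 / \tau^2$.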

Numerical Accuracy

Condition number: $\kappa(X^\top X) = \lambda_{\max} / \lambda_{\min}$; a well-conditioned system vs. an ill-conditioned one determines how much small errors in the data perturb the solution

We want covariates as perpendicular as possible, and roughly on the same scale; two remedies (see the sketch below):

• Regularization

• Preconditioning
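A minimal sketch of preconditioning by standardizing the covariates (assuming X has no constant column, whose standard deviation would be zero, and a Matlab with implicit expansion):

mu = mean(X, 1);
sd = std(X, 0, 1);
Xs = (X - mu) ./ sd;              % zero mean, unit scale per column
fprintf('cond before: %g, after: %g\n', cond(X' * X), cond(Xs' * Xs));
w = Xs \ y;                       % weights in the standardized basis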

Errors in Variables (Total Least Squares)

OLS assumes only $y$ is noisy; when $x$ is noisy too, total least squares minimizes the perpendicular distance from each point to the fit instead

[Figure: residuals measured perpendicular to the fitted line]

Sensitivity to outliers

High weight given to outliers: the squared loss grows quadratically with the residual, so a few bad points can drag the whole fit

[Figure: Temperature at noon; outliers pull the least-squares fit away from the bulk of the data]

Influence function: for the squared loss it is proportional to the residual itself, hence unbounded

L1 Regression

Minimize $\sum_{i=1}^n |y_i - w^\top x_i|$ instead of the sum of squares

Linear program: with auxiliary variables the objective becomes a linear program (see the sketch below)

Influence function: bounded; each point pulls with weight $\pm 1$ no matter how far off it is
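A sketch of the LP formulation, assuming the Optimization Toolbox's linprog (names are illustrative): introduce $t_i \ge |x_i^\top w - y_i|$ and minimize $\sum_i t_i$ subject to $-t \le Xw - y \le t$.

% Variables z = [w; t], with t(i) an upper bound on |x_i'*w - y(i)|.
[n, d] = size(X);
f = [zeros(d, 1); ones(n, 1)];    % objective: sum of t
A = [ X, -eye(n);                 %   Xw - y <= t
     -X, -eye(n)];                % -(Xw - y) <= t
b = [y; -y];
z = linprog(f, A, b);
w = z(1:d);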

Kernel Regression

Predict with a locally weighted average: $\hat{y}(x) = \frac{\sum_i K(x, x_i)\, y_i}{\sum_i K(x, x_i)}$, e.g. with a Gaussian kernel $K(x, x') = \exp\!\big(-(x - x')^2 / 2\sigma^2\big)$

[Figure: kernel regression fit with $\sigma = 1$]
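A minimal sketch of this smoother (sigma and the query grid are illustrative; assumes column vectors and implicit expansion):

sigma = 1;
xq = linspace(min(x), max(x), 200)';       % query grid
K  = exp(-(xq - x') .^ 2 / (2 * sigma^2)); % 200-by-n kernel weights
yq = (K * y) ./ sum(K, 2);                 % weighted average at each query
plot(x, y, 'o', xq, yq, '-');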

Spline Regression: regression on each interval

[Figure: separate fits on each interval of the data]

Spline Regression: with equality constraints, so that neighboring pieces agree at the interval boundaries

[Figure: continuous piecewise fit]
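A sketch of one such constrained fit, assuming the Optimization Toolbox's lsqlin, two linear pieces, and a single illustrative knot t:

% Pieces a1 + b1*x (x < t) and a2 + b2*x (x >= t), meeting at the knot.
t = 5400;                                 % hypothetical knot location
L = x < t;                                % segment indicator
C = [L, L .* x, ~L, ~L .* x];             % design for p = [a1 b1 a2 b2]
Aeq = [1, t, -1, -t];                     % a1 + b1*t = a2 + b2*t
p = lsqlin(C, y, [], [], Aeq, 0);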

Spline Regression: with L1 cost

[Figure: spline fit under the L1 cost]

[Figure: the underlying data, #requests per minute vs. Time (days)]

Heteroscedasticity: the noise variance is not constant across the input range, violating the constant-$\sigma^2$ assumption of the probabilistic model

MARS: Multivariate Adaptive Regression Splines

…on the board…
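The lecture develops MARS on the board. As a hedged illustration of its main ingredient only: MARS builds the fit from pairs of hinge functions at adaptively chosen knots (the knot t below is illustrative, and this is one step, not the full greedy forward/backward algorithm):

t = 10;                                   % hypothetical knot
H = [ones(size(x)), max(x - t, 0), max(t - x, 0)];  % hinge pair basis
w = H \ y;                                % least-squares fit on this basis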

Further topics

• Generalized Linear Models

• Gaussian process regression

• Local Linear regression

• Feature Selection [next class]

Recommended