Regularization: The Problem of Overfitting

Machine Learning, Andrew Ng


Example: Linear regression (housing prices)

Overfitting: If we have too many features, the learned hypothesis may fit the training set very well (J(\theta) = \frac{1}{2m} \sum_{i=1}^{m} (h_\theta(x^{(i)}) - y^{(i)})^2 \approx 0), but fail to generalize to new examples (predict prices on new examples).

[Figure: three fits of price vs. size: a straight line that underfits (high bias), a quadratic that fits well, and a 4th-order polynomial that passes through every training point but overfits (high variance).]
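As a quick illustration (not from the lecture), here is a sketch in Octave using made-up data: with five training points, a 4th-order polynomial has enough parameters to pass through every point, driving the training error to zero while extrapolating badly.

% Hypothetical house sizes and noisy, roughly linear prices.
sizes  = [1.0; 1.5; 2.0; 2.5; 3.0];
prices = [1.1; 1.9; 2.4; 2.6; 3.3];

p1 = polyfit(sizes, prices, 1);      % straight-line fit (may underfit)
p4 = polyfit(sizes, prices, 4);      % 4th-order fit: interpolates all 5 points

polyval(p4, sizes) - prices          % residuals ~ 0: J(theta) ~ 0 on the training set
polyval(p4, 3.5)                     % but predictions off the training grid go wild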


Example: Logistic regression

h_\theta(x) = g(\theta^T x)  (g = sigmoid function)

[Figure: three decision boundaries in the (x1, x2) plane: an underfit straight line, a well-fitting nonlinear boundary, and a highly contorted overfit boundary.]


Addressing overfitting:

[Figure: price vs. size with a wiggly high-order polynomial fit through the training points.]

― size of house
― no. of bedrooms
― no. of floors
― age of house
― average income in neighborhood
― kitchen size


Addressing overfitting:

Options:

1. Reduce number of features.
   ― Manually select which features to keep.
   ― Model selection algorithm (later in course).

2. Regularization.
   ― Keep all the features, but reduce magnitude/values of parameters \theta_j.
   ― Works well when we have a lot of features, each of which contributes a bit to predicting y.

Regularization: Cost Function


Intuition

Suppose our hypothesis is h_\theta(x) = \theta_0 + \theta_1 x + \theta_2 x^2 + \theta_3 x^3 + \theta_4 x^4, and we penalize and make \theta_3, \theta_4 really small:

\min_\theta \frac{1}{2m} \sum_{i=1}^{m} (h_\theta(x^{(i)}) - y^{(i)})^2 + 1000\,\theta_3^2 + 1000\,\theta_4^2

The only way to make this objective small is to drive \theta_3 \approx 0 and \theta_4 \approx 0, leaving an essentially quadratic fit.

[Figure: price vs. size of house, before and after penalizing \theta_3 and \theta_4: the wiggly 4th-order fit becomes essentially quadratic once \theta_3 \approx 0 and \theta_4 \approx 0.]


Small values for parameters \theta_0, \theta_1, ..., \theta_n
― "Simpler" hypothesis
― Less prone to overfitting

Regularization.

Housing:
― Features: x_1, x_2, ..., x_{100}
― Parameters: \theta_0, \theta_1, \theta_2, ..., \theta_{100}

J(\theta) = \frac{1}{2m} \left[ \sum_{i=1}^{m} (h_\theta(x^{(i)}) - y^{(i)})^2 + \lambda \sum_{j=1}^{n} \theta_j^2 \right]

Here \lambda is the regularization parameter; by convention the penalty starts at j = 1, so \theta_0 is not regularized.
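A minimal Octave sketch of this cost, assuming X is the m x (n+1) design matrix with a leading column of ones, y the m x 1 targets, and theta the (n+1) x 1 parameter vector (so theta(1) holds \theta_0 and is left out of the penalty); the function name is illustrative:

function J = regularizedCost(X, y, theta, lambda)
  % Regularized linear-regression cost J(theta).
  m = length(y);
  h = X * theta;                              % h_theta(x) for every example
  J = (1 / (2*m)) * (sum((h - y) .^ 2) ...
      + lambda * sum(theta(2:end) .^ 2));     % penalty skips theta_0
end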


Regularization.

[Figure: price vs. size of house, comparing the overfit curve with the smoother regularized fit.]


In regularized linear regression, we choose \theta to minimize

J(\theta) = \frac{1}{2m} \left[ \sum_{i=1}^{m} (h_\theta(x^{(i)}) - y^{(i)})^2 + \lambda \sum_{j=1}^{n} \theta_j^2 \right]

What if \lambda is set to an extremely large value (perhaps too large for our problem, say \lambda = 10^{10})? Then \theta_1, \theta_2, ..., \theta_n are all penalized toward zero, and the hypothesis reduces to h_\theta(x) \approx \theta_0: a flat line that underfits even the training data.

[Figure: price vs. size of house with a horizontal line h_\theta(x) = \theta_0, the underfit result of an extremely large \lambda.]

Regularization: Regularized Linear Regression


Regularized linear regression


Gradient descent

Repeat {
  \theta_0 := \theta_0 - \alpha \frac{1}{m} \sum_{i=1}^{m} (h_\theta(x^{(i)}) - y^{(i)}) x_0^{(i)}
  \theta_j := \theta_j - \alpha \left[ \frac{1}{m} \sum_{i=1}^{m} (h_\theta(x^{(i)}) - y^{(i)}) x_j^{(i)} + \frac{\lambda}{m} \theta_j \right]   (j = 1, 2, ..., n)
}

Equivalently, \theta_j := \theta_j (1 - \alpha \frac{\lambda}{m}) - \alpha \frac{1}{m} \sum_{i=1}^{m} (h_\theta(x^{(i)}) - y^{(i)}) x_j^{(i)}; since (1 - \alpha\lambda/m) is slightly less than 1, each iteration shrinks \theta_j a little before applying the usual gradient step.
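A sketch of one such update in vectorized Octave, under the same assumed variable names as before (X with a leading ones column, alpha the learning rate):

function theta = gradientStep(X, y, theta, alpha, lambda)
  % One simultaneous regularized update of every theta_j.
  m = length(y);
  h = X * theta;
  grad = (1 / m) * (X' * (h - y));                           % gradient without the penalty
  grad(2:end) = grad(2:end) + (lambda / m) * theta(2:end);   % add (lambda/m) theta_j for j >= 1
  theta = theta - alpha * grad;
end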


Normal equation

\theta = \left( X^T X + \lambda M \right)^{-1} X^T y

where M is the (n+1) \times (n+1) identity matrix with its top-left entry set to 0, so that \theta_0 is not regularized.
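In Octave this might look as follows (illustrative names; the backslash operator solves the system without forming an explicit inverse):

n = size(X, 2) - 1;          % number of features (X has a leading ones column)
M = eye(n + 1);
M(1, 1) = 0;                 % zero the top-left entry: theta_0 is not penalized
theta = (X' * X + lambda * M) \ (X' * y);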


Non-invertibility (optional/advanced).

Suppose m \le n  (m = #examples, n = #features). Then X^T X is non-invertible / singular.

If \lambda > 0, then X^T X + \lambda M is invertible, so regularization also resolves this numerical problem.
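A tiny made-up demonstration of this remark in Octave:

X = [1 2 3 4;
     1 5 6 7];                 % m = 2 examples, n + 1 = 4 columns (m <= n)
rank(X' * X)                   % prints 2: X'X is singular
M = eye(4);  M(1, 1) = 0;
rank(X' * X + 0.1 * M)         % prints 4: with lambda > 0 the matrix is invertible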

Regularization: Regularized Logistic Regression


Regularized logistic regression.

[Figure: a highly contorted decision boundary in the (x1, x2) plane, fit by a high-order polynomial hypothesis.]

Cost function:

J(\theta) = -\frac{1}{m} \sum_{i=1}^{m} \left[ y^{(i)} \log h_\theta(x^{(i)}) + (1 - y^{(i)}) \log (1 - h_\theta(x^{(i)})) \right] + \frac{\lambda}{2m} \sum_{j=1}^{n} \theta_j^2
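A minimal Octave sketch of this cost, again with illustrative names; the sigmoid is written inline:

function J = logisticCost(X, y, theta, lambda)
  % Regularized logistic-regression cost.
  m = length(y);
  h = 1 ./ (1 + exp(-X * theta));                  % sigmoid hypothesis
  J = -(1 / m) * sum(y .* log(h) + (1 - y) .* log(1 - h)) ...
      + (lambda / (2 * m)) * sum(theta(2:end) .^ 2);
end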


Gradient descent

Repeat {
  \theta_0 := \theta_0 - \alpha \frac{1}{m} \sum_{i=1}^{m} (h_\theta(x^{(i)}) - y^{(i)}) x_0^{(i)}
  \theta_j := \theta_j - \alpha \left[ \frac{1}{m} \sum_{i=1}^{m} (h_\theta(x^{(i)}) - y^{(i)}) x_j^{(i)} + \frac{\lambda}{m} \theta_j \right]   (j = 1, 2, ..., n)
}

This looks identical to the update for regularized linear regression, but here h_\theta(x) = \frac{1}{1 + e^{-\theta^T x}}.
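A corresponding gradient sketch in Octave, illustrative names as before:

function grad = logisticGradient(X, y, theta, lambda)
  % Regularized gradient: same algebraic form as linear regression,
  % but h is now the sigmoid hypothesis.
  m = length(y);
  h = 1 ./ (1 + exp(-X * theta));
  grad = (1 / m) * (X' * (h - y));
  grad(2:end) = grad(2:end) + (lambda / m) * theta(2:end);   % skip theta_0
end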


Advanced optimization

function [jVal, gradient] = costFunction(theta)
  jVal = [ code to compute J(theta) ];
  gradient(1) = [ code to compute dJ/dtheta_0 ];
  gradient(2) = [ code to compute dJ/dtheta_1 ];
  gradient(3) = [ code to compute dJ/dtheta_2 ];
  ...
  gradient(n+1) = [ code to compute dJ/dtheta_n ];
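The skeleton above is then passed to fminunc, following the call pattern from the earlier advanced-optimization lecture (initialTheta and n are placeholders):

options = optimset('GradObj', 'on', 'MaxIter', 100);   % use our gradient, cap iterations
initialTheta = zeros(n + 1, 1);                        % n = number of features
[optTheta, functionVal, exitFlag] = ...
    fminunc(@costFunction, initialTheta, options);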