Dimensionality Reduction Motivation I: Data Compression Machine Learning

Dimensionality ReductionMotivation I: Data Compression

Machine Learning

Data Compression(in

Reduce data from 2D to 1D

Data Compression

Reduce data from 2D to 1D

Data CompressionReduce data from 3D to 2D

Dimensionality ReductionMotivation II: Data Visualization

Machine Learning

Data Visualization

[resources from en.wikipedia.org]

Country

GDP (trillions of

Per capita GDP

(thousands of intl. $)

Human Develop-

ment IndexLife

expectancy

Poverty Index

(Gini as percentage)

Mean household

income (thousands

of US$) …Canada 1.577 39.17 0.908 80.7 32.6 67.293 …China 5.878 7.54 0.687 73 46.9 10.22 …India 1.632 3.41 0.547 64.7 36.8 0.735 …

Russia 1.48 19.84 0.755 65.5 39.9 0.72 …Singapore 0.223 56.69 0.866 80 42.5 67.1 …

USA 14.527 46.86 0.91 78.3 40.8 84.3 …… … … … … … …

Data Visualization

CountryCanada 1.6 1.2

China 1.7 0.3India 1.6 0.2

Russia 1.4 0.5Singapore 0.5 1.7

USA 2 1.5… … …

Data Visualization

Dimensionality ReductionPrincipal Component Analysis problem formulation

Machine Learning

Principal Component Analysis (PCA) problem formulation

The red line is the projections error

PCA tries to minimize the projections error

The red line is the projections error

Example for large projection error

Reduce from 2-dimension to 1-dimension: Find a direction (a vector )onto which to project the data so as to minimize the projection error.Reduce from n-dimension to k-dimension: Find vectors onto which to project the data, so as to minimize the projection error.

PCA is not linear regression

Dimensionality ReductionPrincipal Component Analysis algorithmMachine Learning

Training set:Preprocessing (feature scaling/mean normalization):

Data preprocessing

Replace each with .If different features on different scales (e.g., size of house, number of bedrooms), scale features to have comparable range of values.

Principal Component Analysis (PCA) algorithm

Reduce data from 2D to 1D Reduce data from 3D to 2D

Principal Component Analysis (PCA) algorithmReduce data from -dimensions to -dimensionsCompute “covariance matrix”:

Compute “eigenvectors” of matrix :[U,S,V] = svd(Sigma);

Covariance Matrix• Covariance measures the degree to which two variables change or vary

together (i.e. co-vary). • On the one hand, the covariance of two variables is positive if they vary

together in the same direction relative to their expected values (i.e. if one variable moves above its expected value, then the other variable also moves above its expected value).

• On the other hand, if one variable tends to be above its expected value when the other is below its expected value, then the covariance between the two variables is negative.

• If there is no linear dependency between the two variables, then the covariance is 0.

Principal Component Analysis (PCA) algorithmFrom , we get: [U,S,V] = svd(Sigma)

Principal Component Analysis (PCA) algorithm summary

After mean normalization (ensure every feature has zero mean) and optionally feature scaling:

Sigma =

[U,S,V] = svd(Sigma);Ureduce = U(:,1:k);z = Ureduce’*x;

Dimensionality ReductionReconstruction from compressed representation

Machine Learning

Reconstruction from compressed representation

Dimensionality ReductionChoosing the number of principal components

Machine Learning

Choosing (number of principal components)Average squared projection error:Total variation in the data:

Typically, choose to be smallest value so that

“99% of variance is retained”

Choosing (number of principal components)Algorithm:Try PCA with Compute

Check if

[U,S,V] = svd(Sigma)

Choosing (number of principal components)[U,S,V] = svd(Sigma)

Pick smallest value of for which

(99% of variance retained)

Dimensionality Reduction

Advice for applying PCA

Machine Learning

Supervised learning speedup

Extract inputs:Unlabeled dataset:

New training set:

Note: Mapping should be defined by running PCA only on the training set. This mapping can be applied as well to the examples and in the cross validation and test sets.

Application of PCA

- Compression- Reduce memory/disk needed to store data- Speed up learning algorithm

- Visualization

Bad use of PCA: To prevent overfittingUse instead of to reduce the number of features to Thus, fewer features, less likely to overfit.

This might work OK, but isn’t a good way to address overfitting. Use regularization instead.

PCA is sometimes used where it shouldn’t beDesign of ML system:

- Get training set- Run PCA to reduce in dimension to get- Train logistic regression on- Test on test set: Map to . Run on

How about doing the whole thing without using PCA?Before implementing PCA, first try running whatever you want to do with the original/raw data . Only if that doesn’t do what you want, then implement PCA and consider using .

Dimensionality Reduction Motivation I: Data Compression Machine Learning

Documents

Dimensionality and dimensionality … and dimensionality reductiondimensionality reduction Nuno Vasconcelos ECE Depp,artment, UCSD. Note ... The curse of dimensionality

Audio Compression Usha Sree CMSC 691M 10/12/04. Motivation Efficient Storage Streaming Interactive Multimedia Applications

Dimensionality Reduction - University of Pittsburghpeople.cs.pitt.edu/~iyad/DR.pdf · Dimensionality Reduction Problems of learning in high dimensional spaces: • Curse of dimensionality

PCA For Image Compression Extended with Shearing and ...PCA [1] is a matrix factorization algorithm used for dimensionality reduction. In the case of image compression, we take the

Significance of Dimensionality

DIMENSIONALITY AND EFFECTS OF INFORMATION MOTIVATION … · motivation on users’ approval of SNA. KEYWORDS | Social network sites, information motivation, social network advertising,

Nonlinear Dimensionality Reduction

Motivation & Previous Work Sentence Compression Approach Linguistically-motivated Heuristics Word Significance Compression Generation and Selection

Dimensionality Reduction for Stationary Time Series via ...papers.nips.cc/paper/7609-dimensionality-reduction-for-stationary-ti… · Dimensionality Reduction for Stationary Time

Motivation I: Data Compression

Dimensionality Reduction Mappings

DARTEL John Ashburner 2008. Overview Motivation –Dimensionality –Inverse-consistency Principles Geeky stuff Example Validation Future directions

12.2 Dimensionality Reduction

Dimensionality Reduction

Multilevel stochastic collocations with dimensionality reduction - … · 2017. 1. 27. · TUM, Chair of Scientiﬁc Computing in Computer Science (I5) 27.01.2017. Outline 1 Motivation

High dimensionality

Dimensionality reduction Usman Roshan CS 675. Dimensionality reduction What is dimensionality reduction? –Compress high dimensional data into lower dimensions

Graphical Models and Message passingPreliminaries on Graphical Models Motivation: Curse of dimensionality I Matroid enumeration I Polyhedron computation I Entropic Region I Machine

The Academic Motivation Scale: Dimensionality, Reliability ... · PDF fileThe Academic Motivation Scale 19 Types of motivation The AMS is based on self-determination theory (SDT),

Data Compression Basics - Rochester Institute of Technologymeseec.ce.rit.edu/eecc694-spring2000/694-5-9-2000.pdf · 2000. 5. 9. · Data Compression Basics • Main motivation: The