16

Transformations

Transformations. Although linear regression might produce a ‘good’ fit (high r value) to a set of data, the data set may still be non-linear. To remove

Download PPT Report

Upload
collin-brooks
View
214
Download
0

Tags:

Embed Size (px)

Citation preview

Page 1: Transformations. Although linear regression might produce a ‘good’ fit (high r value) to a set of data, the data set may still be non-linear. To remove

Transformations

Page 2: Transformations. Although linear regression might produce a ‘good’ fit (high r value) to a set of data, the data set may still be non-linear. To remove

Although linear regression might produce a ‘good’ fit (high r value) to a set of data, thedata set may still be non-linear. To remove (as much as is possible) such non-linearity, the data can be transformed.

Either the x-values, y-values, or both may be transformed in some way so that the

transformed data are more linear. This enables more accurate predictions (extrapolations and interpolations) from the regression equation.

Page 3: Transformations. Although linear regression might produce a ‘good’ fit (high r value) to a set of data, the data set may still be non-linear. To remove

To decide on an appropriate transformation, examine the points on a scatterplot with high values of x and or y (that is, away from the origin) and decide for each axis whether it needs to be stretched or compressed to make the points line up. The best way to see which of the transformations to use is to look at a number of ‘data patterns’.

Page 4: Transformations. Although linear regression might produce a ‘good’ fit (high r value) to a set of data, the data set may still be non-linear. To remove

Page 5: Transformations. Although linear regression might produce a ‘good’ fit (high r value) to a set of data, the data set may still be non-linear. To remove

Example: The seeds in the sunflower are arranged in

spirals for a compact head. Counting the number of seeds in the successive circles starting from the centre and moving outwards, the following number of seeds were counted.

Regression equation is:y = 18.77x -49.73OrNumber of seeds = 18.77(circle) – 49.73

Page 6: Transformations. Although linear regression might produce a ‘good’ fit (high r value) to a set of data, the data set may still be non-linear. To remove

Fit a least-squares regression line and plot the data.

Page 7: Transformations. Although linear regression might produce a ‘good’ fit (high r value) to a set of data, the data set may still be non-linear. To remove

Find the correlation coefficient. What does this mean?

r = 0.88 Strong, positive and linear correlation

Page 8: Transformations. Although linear regression might produce a ‘good’ fit (high r value) to a set of data, the data set may still be non-linear. To remove

Using the regression line for the original data, predict the number of seeds in the 11th circle. (note: 11th circle is an extrapolation analysis)

y = 18.77x -49.73 When x = 11

y = 18.77(11) – 49.73 y = 156.74 (round it off to the nearest no. of

seeds, y = 157)

Page 9: Transformations. Although linear regression might produce a ‘good’ fit (high r value) to a set of data, the data set may still be non-linear. To remove

Find the residuals.

Page 10: Transformations. Although linear regression might produce a ‘good’ fit (high r value) to a set of data, the data set may still be non-linear. To remove

Plot the residuals on a separate graph. Are the data linear?

Page 11: Transformations. Although linear regression might produce a ‘good’ fit (high r value) to a set of data, the data set may still be non-linear. To remove

What type of transformation could be applied to:

i) the x-values? Ii) the y-values?

Page 12: Transformations. Although linear regression might produce a ‘good’ fit (high r value) to a set of data, the data set may still be non-linear. To remove

Apply the log10 y transformation to the data used in the previous question.

Page 13: Transformations. Although linear regression might produce a ‘good’ fit (high r value) to a set of data, the data set may still be non-linear. To remove

Fit a least-squares regression line to the transformed data and plot it with the data.

Page 14: Transformations. Although linear regression might produce a ‘good’ fit (high r value) to a set of data, the data set may still be non-linear. To remove

Page 15: Transformations. Although linear regression might produce a ‘good’ fit (high r value) to a set of data, the data set may still be non-linear. To remove

Find the correlation coefficient. Is there an improvement? Why?

Complete the least-squares regression for the transformation.

Calculate the coefficient of determination. How does this explain variation?

Page 16: Transformations. Although linear regression might produce a ‘good’ fit (high r value) to a set of data, the data set may still be non-linear. To remove

Using the regression line for the transformed data, predict the number of seeds for the 11th circle.

How does this compare with the prediction from the previous question?

€¦ · Web viewDetermines the linear function that best models a set of data. Analyzes the relationship between a linear equation and its graph. Uses linear functions, inequalities

€¦ · Web viewDetermines the linear function that best models a set of data. Analyzes the relationship between a linear equation and its graph. Uses linear functions, inequalities

Documents

CREATING LINEAR MODELS FOR DATA · 12! Creating)linear)models)for)data)!)!!)!)!!!!)! !)!!!!!

CREATING LINEAR MODELS FOR DATA · 12! Creating)linear)models)for)data)!)!!)!)!!!!)! !)!!!!!

Documents

Linear Algebra Review - cseweb.ucsd.edu · Linear Spaces A linear space is the set of all vectors that can be expressed as a linear combination of a set of basis vectors. We say this

Linear Algebra Review - cseweb.ucsd.edu · Linear Spaces A linear space is the set of all vectors that can be expressed as a linear combination of a set of basis vectors. We say this

Documents

System of Linear Equations - NTUd00922011/matlab/260/20150923.pdf · System of Linear Equations A system of linear equations is a set of linear equations involving the same set of

System of Linear Equations - NTUd00922011/matlab/260/20150923.pdf · System of Linear Equations A system of linear equations is a set of linear equations involving the same set of

Documents

Kristin P. Bennett - rpi.edubennek/class/mds/lecture/lecture9-06b.pdf · every bootstrap on average. molecule size. ... Partition Training Data Training Set Validation Set Linear

Kristin P. Bennett - rpi.edubennek/class/mds/lecture/lecture9-06b.pdf · every bootstrap on average. molecule size. ... Partition Training Data Training Set Validation Set Linear

Documents

INVESTIGATING THE LINEAR EXPANSION, SET, AND STRENGTH

INVESTIGATING THE LINEAR EXPANSION, SET, AND STRENGTH

Documents

Objectives: Set up a Linear Programming Problem Solve a Linear Programming Problem

Objectives: Set up a Linear Programming Problem Solve a Linear Programming Problem

Documents

Section 3.2 linear models building linear functions from data

Section 3.2 linear models building linear functions from data

Education

NON LINEAR DATA STRUCTURES - Hi-Tech College of Engineering · NON LINEAR DATA STRUCTURES Trees Basic Concepts: A tree is a non-empty set one element of which is designated the root

NON LINEAR DATA STRUCTURES - Hi-Tech College of Engineering · NON LINEAR DATA STRUCTURES Trees Basic Concepts: A tree is a non-empty set one element of which is designated the root

Documents

Miniature linear guidance set - promsnab.info linear guidance set with...Miniature linear guidance set ... Miniature linear guidance systems are supplied coated with a preservative

Miniature linear guidance set - promsnab.info linear guidance set with...Miniature linear guidance set ... Miniature linear guidance systems are supplied coated with a preservative

Documents

Experimental evidence for an Optical Spring Experimental set up (LFF) The data and a simple model Evidence of the optical spring, linear data Evidence

Experimental evidence for an Optical Spring Experimental set up (LFF) The data and a simple model Evidence of the optical spring, linear data Evidence

Documents

Lecture 3 Linear Data Structures - York University Linear... · Lecture 3 Linear Data Structures Chapters 3.1-3.3 , 5.1-5.2, ... – Review the basic linear data structures ... Chapter

Lecture 3 Linear Data Structures - York University Linear... · Lecture 3 Linear Data Structures Chapters 3.1-3.3 , 5.1-5.2, ... – Review the basic linear data structures ... Chapter

Documents

Non-linear data structures

Non-linear data structures

Documents

Holt Algebra 1 11-4 Linear, Quadratic, and Exponential Models Compare linear, quadratic, and exponential models. Given a set of data, decide which type

Holt Algebra 1 11-4 Linear, Quadratic, and Exponential Models Compare linear, quadratic, and exponential models. Given a set of data, decide which type

Documents

Linear Data Structures

Linear Data Structures

Documents

Practice A - Manchester High School · Exponential Growth and Decay ... Linear, Quadratic, and Exponential Models Graph each data set. Write linear, quadratic, or exponential

Practice A - Manchester High School · Exponential Growth and Decay ... Linear, Quadratic, and Exponential Models Graph each data set. Write linear, quadratic, or exponential

Documents

Methods Validation with Simulated Data 1.Generate random linear objects in the model coordinate system. 2.Generate a random set of points on each linear

Methods Validation with Simulated Data 1.Generate random linear objects in the model coordinate system. 2.Generate a random set of points on each linear

Documents

Machine learning for set-identiﬁed linear models

Machine learning for set-identiﬁed linear models

Documents

Linear Regression and K-Nearest Neighborsbryce/cs63/s18/slides/3-28_KNN.pdf · Linear Regression Hypothesis Space Supervised learning •For every input in the data set, we know the

Linear Regression and K-Nearest Neighborsbryce/cs63/s18/slides/3-28_KNN.pdf · Linear Regression Hypothesis Space Supervised learning •For every input in the data set, we know the

Documents

Probabilistic Linear Discriminant Analysispeople.irisa.fr/.../Probabilistic_Linear_Discriminant_Analysis.pdf · of the data set. We seek the linear transformation x → WTx that maximizes

Probabilistic Linear Discriminant Analysispeople.irisa.fr/.../Probabilistic_Linear_Discriminant_Analysis.pdf · of the data set. We seek the linear transformation x → WTx that maximizes

Documents

M3_Non Linear Data Structures

M3_Non Linear Data Structures

Documents

Writing a Linear Equation to Fit Data Draw a line that fits or models a set of points Write an intercept equation that fits a set of real-world data

Writing a Linear Equation to Fit Data Draw a line that fits or models a set of points Write an intercept equation that fits a set of real-world data

Documents

Big Data: Data Analysis Boot Camp Linear Regression › ... › 090-linear-regression.pdf · Intro. Background Linear regression Improving outcomesHands-onQ & AConclusionReferencesFiles

Big Data: Data Analysis Boot Camp Linear Regression › ... › 090-linear-regression.pdf · Intro. Background Linear regression Improving outcomesHands-onQ & AConclusionReferencesFiles

Documents

Linear and Non-Linear Data Structures and Non-Linear Data Structures ... E.g. – array, linked list. Non-Linear data structure: ... Array- Representation of a stack:

Linear and Non-Linear Data Structures and Non-Linear Data Structures ... E.g. – array, linked list. Non-Linear data structure: ... Array- Representation of a stack:

Documents

Linear Data Structures (Queue)

Linear Data Structures (Queue)

Documents

Linear Data Structures (Stack)

Linear Data Structures (Stack)

Documents

TYPES DATA STRUCTURES( LINEAR AND NON LINEAR)

TYPES DATA STRUCTURES( LINEAR AND NON LINEAR)

Engineering

Special Set Linear Algebra

Special Set Linear Algebra

Documents

Data enriched linear regression

Data enriched linear regression

Technology

Linear Programming Problem Set

Linear Programming Problem Set

Documents