30
Correlation

3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

Embed Size (px)

Citation preview

Page 1: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

Correlation

Page 2: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

DEFINITION OF CORRELATION

Correlation analysis deals with the association between two or more variables.

-Simpson and Kafka Correlation analysis attempts to degree of relationship

between variables. -Ya-Lun Chou If two or more quantities vary in sympathy, so that movement

in one tend to be accompanied by corresponding movement in the other, then they are said to be correlated.

-Ya- Lun Chou Thus , correlation is a statistical technique which help in

analysis the relationship between two or more variables.

Page 3: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

TYPES OF CORRELATION

1. On the

basis of direction of change:

2. On the basis of change in

proportion

3. On the basis of number of

variable

s studied

1.PERFECT CORRELATION2. NEGATIVE CORRELATION

1.LINEAR CORRELATION 2. CURVI-LINEAR CORRELATION

1.SIMPLE CORRELATION2.PARTIAL CORRELATION3. MULTIPLE CORRELATION

Page 4: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

On the basis of direction of change

Positive Correlation

If two variables X and Y move in the same direction, i.e., if one rises, other rises too and vice versa, then it is a called as positive correlation.

Examples : Relationship between

price and supply, between money supply and prices, etc.

Negative CorrelationIf two variables X and Y move in opposite direction, i.e., if one rises, other falls, and if one falls, other rises, then it is called as negative correlation.

Examples : Relationship between

demand and price, investment and rate of interest, etc.

Page 5: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

On the basis of change in proportion

Linear Correlation If the ratio of change

of two variables X and Y remains constant throughout, then they are said to be linearly correlated.

EXAMPLE:

Supply of a commodity rises by 20% as often as its price rises by 1o%, then such two variables have linear relationship. These two variables gave a straight line graph.

Curvi-Linear Correlation

If the ratio of change between the two variables is not constant but changing correlation is said to be curvi-linear correlation.

EXAMPLE:

Price of a commodity rises by 10%, then sometimes its supply rises by 20% then two variables have non linear relationship. These two variables gave us a curve.

Page 6: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

On the basis of change in proportion

•When we study the relationship between two variables only, then it is called simple correlation.

•Example:

•Relationship between price and demand, height and weight, income and consumption, etc.

1. Simple Correlatio

n

•When three or more variables are taken but relationship between any two of the variables is studied as constant, then it is called partial correlation.

•Example:

•Relationship between amount of rainfall and wheat yield

2. Partial Correlatio

n

•When we study the relationship among three and more variables, then it is called multiple correlation.

•Example:

•Relationship between rainfall , temperature and yield of wheat.

3. Multiple

Correlation

Page 7: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

DEGREE OF CORRELATION

1. Perfect Correlation

2. High Degree of

Correlation

3. Moderate Degree of

Correlation

4. Low Degree of

Correlation

5. Absence of

Correlation

Degree of correlation can be known by coefficient of correlation (r). The following can be various types of the degree of correlation.

Page 8: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

(1) PERFECT CORRELATION: When two variables vary at constant ratio in the same direction, it is perfect correlation . In case of perfect positive correlation, correlation coefficient (r) is equal to +1,

(2) HIGH DEGREE OF CORRELATION: when direction of change is opposite, it is called perfect negative correlation. In case of perfect negative correlation, correlation coefficient(r) is equal to -1.

(3) MODERATE DEGREE OF CORRELATION: Correlation coefficient, on being within the limits +0.25 and +0.75 is termed as moderate degree of correlation.Correlation coefficient, on being within the limits +0.25 and +0.75 is termed as moderate degree of correlation.

(4) LOW DEGREE OF CORRELATION: When correlation exists in very small magnitude, then it is called as low degree of correlation. In such a case, correlation coefficient ranges between 0 and +0.25.

(5) ABSENCE OF CORRELATION: When there is no relationship between the variables, then correlation found to be absent. In case of absence of correlation, the values of correlation coefficient is zero.

Page 9: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

DEGREE OF CORELATION

DEGREE OF CORRELATION

POSITIVE NEGATIVE

Perfect Correlation

+1 -1

High Degree of Correlation

Between +0.75 to +1

Between -0.75 to -1

Moderate Degree of Correlation

Between +0.25 to +0.75

Between -0.25 to -0.75

Low Degree of Correlation

Between 0 to 0.75

Between 0 to -0.25

Absence of Correlation

0 0

Page 10: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

METHODS OF STUDYING CORRELATION

METHODS OF STUDYING

CORRELATION

1.GRAPHIC METHODS

1.SCATTER DIAGRAM

2.CORRELATION GRAPH

2.ALGEBRIC METHOD

3.KARL PEARSON COEFFICIENT OF CORRELATION

4.RANK CORRELATION

METHOD

.CONCURRENT DEVIATION METHOD

Page 11: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

1. GRAPHIC METHOD

(i) Scatter Diagram Scatter Diagram is a graphic method

to finding out correlation between two variables.

For constructing a scatter diagram, (1) X-variable is represented on X-axis (2) Y-variable on Y-axis. (3) Each pair of values of X and Y series

is plotted in two-dimensional space of X-Y.

Page 12: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

Thus we get scatter diagram by plotting all the pair of values. So, the direction and magnitude of correlation in the following ways:1. PERFECT POSITIVE CORRELATION (R=+1): If a points are plotted in the shape of a straight line, passing from the lower corner of left side to the upper corner at right side, then both series X and Y have perfect positive correlation.

2. PERFECT NEGATIVE CORRELATION (R=-1): When all points lie on a straight line from up to down, then X and Y have perfect negative correlation.

2 4 6 80

2

4

6

8

10

12

2 4 6 80

2

4

6

8

10

12

Page 13: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

3. HIGH DEGREE OF POSITIVE CORRELATION : When concentration of points moves from left to right upward and the points are all close to each other, then X and Y have high degree of positive correlation.

4. HIGH DEGREE OF NEGATIVE CORRELATION: When points are concentrated from left to right downward, and the points are close to each other, then X and Y have high degree of negative correlation.

5. ZERO CORRELATION (R=0): When all the points are scattered in four directions here and there and lacking in any pattern, then there is absence of correlation.

2 4 6 802468

1012

2 4 6 802468

1012

2 4 6 802468

1012

Page 14: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

(ii) Correlation GraphCorrelation can also be determined with help of correlation graph. Under this method, two curves are drawn by marking the time, place, serial number, etc. on X-axis and the values of both correlation variables’ series on Y-axis. The degree and direction is judged in the basis of these curves in the following ways: (a)If curves of both series move up or down in the same direction,

then they have positive correlation (b) If curves of both series move in a opposite direction, then they

have negative correlation.

HIGH DEGREE OF POSITIVE CORRELATION EXAMPLE:

1990 1991 1992 1993012345678

Column1Series 2

Page 15: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

2. ALGEBRAIC METHOD (i) Karl Pearson’s Coefficient of Correlation It is a quantitative method of measuring correlation. This method is

known as Pearson’s coefficient of correlation. This method has the following main characteristics:

(1).KNOWLEDGE OF DIRECTION OF CORRELATION: Whether it is positive or negative.

(2). KNOWLEDGE OF DEGREE OF RELATIONSHIP: We can measure correlation quantity whether range between -1 and +1.

(3). IDEAL MEASURE: It is based on mean and standard deviation.

(4). COVARIANCE: Karl Pearson’s method is based on covariance. The formula is as follows:

Cov (X,Y) = ∑ ( X – X ) ( Y – Y ) = ∑XY – X Y N N

Page 16: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

CALCULATION OF KARL PEARSON’S COEFFICIENT OF CORRELATION

A. Calculation of Coefficient of Correlation in the case of Individual Series

(1) ACTUAL MEAN METHOD ∑xy ∑(X – X)(Y- Y) r = or ∑x² × ∑y² √ ∑(X – X)² √ ∑(Y – Y) ² Where, arithmetic mean of X an Y seriesDeviations of X-series are denoted by x and Y-

series are denoted by yDeviations of the two series are squared and

added up to get ∑x² and ∑y²

Page 17: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

(2)ASSUMED MEAN METHOD

N. ∑dxdy - ∑dx. ∑dy r = √ N . ∑dx² - ( ∑dx)² √N . ∑dy² - ( ∑dy)²

Where N = Number of pairs of scores∑dxdy = Sum of the paired of deviations from assumed mean∑dx = Sum of the deviations of X series from assumed mean (X – Ax)∑dy = Sum of the deviations Y series from assumed mean (Y – Ay)∑dx² = Sum of squared X deviations from assumed mean∑dy² = Sum of squared of Y deviations

Page 18: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

(3) METHOD BASED ON THE USE OF ACTUAL DATA

N. ∑XY - ∑X . ∑Y r = √ N . ∑X² - ( ∑X)² √ N . ∑Y² - (∑Y)²

Where N = Number of pairs of scores∑X = Summation of variables of X series∑ Y = Summation of variables of Y series∑X² = Value of variables of X series are squared up and added∑Y²= Value of variables of Y series are squared up and added∑XY = Value of X variables and Y variables are multiplied and then added

Page 19: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

(4) VARIANCE- COVARIANCE METHOD

Cov (X ,Y ) r = √ Var (X) √ Var(Y) Where, ∑xy ∑(X – X) (Y- Y) ∑XYCov (X, Y) = = = - X Y N N N

The formula can also be written as : ∑ xy r = where, x = X – X , Y – Y N . σx σy

Page 20: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

B. Calculation of coefficient of Correlation in Grouped Data N × ∑ fdxdy – ( ∑ fdx ) ( ∑ fdy) r = √ N × ∑ fdx² - ( ∑ fdx)² √ N × ∑ fdy² -∑ fdy)²

WhereN = Number of pairs of scores∑ fdx = Step deviation of X variables are multiplied by corresponding frequency and then added∑ fdy = Step deviation of Y variables are multiplied by corresponding frequency and then added∑ fdxdy= Multiplying dx and dy and further multiply it with their corresponding frequencies yield

Page 21: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

Assumptions of Karl Pearson’ Coefficient of Correlation

(1) Affected by a Large Number of Independent Causes: Series or variables which are correlated, are affected by a large number of factors that result in a normal distribution.

(2) Cause and Effect Relation : There is a cause and effect relationship between the forces affecting the distribution of the items in the two series.

(3) Linear Relationship: Two variables are linearly related. Plotting the values of the variables in a scatter diagram yield a straight line.

Page 22: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

PROPERTIES OF THE COEFFICIENT OF CORRELATION

(1) Limits of coefficient of Correlation: Karl Pearson’s coefficient of correlation lies between -1 and +1. Symbolically,

-1 < r < +1 (2) Change of Origin and Scale: Coefficient of

correlation is independent of change of origin and scale.

(3) Geometric Mean of Regression Coefficient: Correlation coefficient is the geometric mean of the regression coefficient bxy and bxy. Symbolically:

r= √bxy . byx

Page 23: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

(4) If X and Y are independent variables then coefficient of correlation is zero but the converse is not necessarily true.

(5) Pure Number : ‘r’ is a pure number and is independent of the units of measurements viz.; rainfall in inches, and yield of crops in quintals, the value of correlation coefficient comes out with a pure number . Thus , it does not require that the units of both the variables should be the same.

(6) Symmetric: The coefficient of correlation between the two variables x and y is symmetric i.e., rxy = ryx . It means that either we compute the value of correlation coefficient between x and y or between y and x, the coefficient of correlation remains the same.

Page 24: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

Probable Error and Karl Pearson’s Coefficient of Correlation

To test the reliability of Karl Pearson’s correlation coefficient , probable error is used. The following formula is used to determine probable error:

Probable Error (P.E.) = 0.6745 × 1 - r² √N Where, r is the coefficient of correlation N, the number of pairs of observationsIf the constant 0.6745 is omitted from the above

formula of probable error, we get the standard error of the coefficient of correlation. Thus,

SE = 1 - r² √N

Page 25: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

UTILITY OF PROBABLE ERROR

(1) Probable error is used to interpret the value of the correlation coefficient. Interpretation of r with the help of probable error is made clear by the following points:

(i) If |r| > 6 P.E., then coefficient of correlation (r) is taken to be significant.

(ii) (ii) If |r| < P.E., then coefficient of correlation (r) is taken to be insignificant. This means that, there is no evidence of the existence of correlation in both the series.

(2) Probable error also determines the upper and lower limits within the correlation of a randomly selected sample from the same universe will fall. Symbolically,

Upper Limit = r + P.E. , Lower Limit = r – P.E.

Page 26: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

SPEARMAN’S RANK CORRELATION METHOD

This method of determining correlation was propounded by Prof. Spearman in 1904. By this method, correlation between qualitative data namely beauty, honesty etc, can be computed. The formula for computation of rank correlation coefficient :

R = 1 – 6 ∑ D² N³ - N Where, R= Rank coefficient of correlation D= Difference between two ranks (R1 –R2) N= Number of pair of observationsThe value of rank correlation coefficient always lie between -1

and+1.

Note: 1. The value of rank correlation will be equal to the value of Pearson’s

Coefficient of Correlation for the two characteristics taking the rank as value of the variables, provided no rank value is repeated i.e. the rank value of all the variables are different.

2. The sum total of rank difference is always equal to zero

Page 27: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

MERITS AND DEMERITS OF RANK CORRELATION METHOD

MERITS(1) This method is simple to understand and easy to

apply.(2) When the data are of qualitative nature like beauty,

honesty, intelligence, etc., (3) When we are given the rank and not the actual data,

this method can be usefully employed.

DEMERITS(1) This method is not suitable for finding correlation in a

grouped frequency distribution.(2) When the number of items exceed 30, the calculation

become quite tedious and require a lot of time

Page 28: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

Concurrent Deviation Method

Concurrent deviation method of determining the correlation on the basis of direction of the deviations. Under this method, taking into consideration the direction of deviation, they are assigned (+) or (-) or (0) signs.

Steps to find out correlation in this method: (1) The series X and Y are to be studied for correlation, each

item of the series is compared with its preceding item. If the values is more than its preceding value then its deviation is assigned (+) sign, if less than preceding value then (-) sign and if equal to the preceding value then (0) sign is assigned. After this, third item is compared with the second, fourth item is compared with the third and this process goes on till the deviation of all items in a series are worked out.

Page 29: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

(2) The deviations of X and Y series (dx) and (dy) are multiplied to get dxdy. Product of similar signs will be positive (+) and opposite signs will be negative (-).

(3) Summing the positive dxdy sings, their number is counted. This is known as the number of concurrent deviations. It is denoted by the sign ‘C’. (4) Finally, the following formula is used for determining coefficient of concurrent deviations r = ± ± 2C – n √ nHere , r = Coefficient of concurrent deviations C= Number of concurrent deviation s n = Number of pairs of observations minus one = N-1.

Page 30: 3. Multiple Correlation 1. Perfect Correlation 2. High Degree of Correlation 3. Moderate Degree of Correlation 4. Low Degree of Correlation 5

THANKS