Upload
haider-ali
View
214
Download
0
Embed Size (px)
Citation preview
8/3/2019 Regression Analysis II
1/2
Managerial EconomicsRegression and Multiple Regression
It is a basic human instinct to relate many different happenings all around. Political analysts
always try to find out causes of the events taking place in a country or internationally.Stock market analysts point out reasons for a rise or fall in the price of a single share or
share index. These analysts and others always rely on some qualified information or
observation that they have, about the factors contributing change in a particular situation.
More formally, we can say, they (the analysts) are sure about the independent and
dependent variables involved in a specific situation. They try to keep their discussion
focused around chosen independent and dependent variables. It is generally found that
an analysis that involves quantifiable variables is more conclusive than the one using
qualitative variables.Regression Analysis is an endeavor to establish quantitative relationship between a cause
(independent variable) and its effect on something (dependent variable) [SIMPLE
REGRESSION] or the individual1 and collective2 impact of a number of variables on
something (dependent variable) [MULTIPLE REGRESSION].
The process of regression analysis starts with a simple step of choosing variables, about
which, you want to estimate a relationship. It could be the impact of global warming
(measured in terms of centigrade) on rainfall (measured in terms of millimeters). Lets
assume that you are given with the information about average global temperature (T) and
average global rainfall (R). If you want to estimate relationship between the two variables,
or more simply, you want to estimate the impact of global warming on global rainfall, you
need to process the information with you. Following is the detail of global temperature &
rainfall3.
Years 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006T 22 20 25 22 23 27 19 21 23 23
R 1022 1100 1232 1435 1407 1385 980 1355 1387 1416
We can describe the relationship as Rt = f (Tt)4
We can build a model as R = a5 + b T +
Where a stands for the impact of everything else on global rainfall (except temperature); b
stands for the impact change in temperature on global rainfall; and is termed as the error
term (a provision for the limitation of our estimation).
Now the next step is to find out the values ofa & b6.The formulas of a & b are given as
under:
b =( )( )
( ) 2tT
TT
RRT
t
t
& a = TbR
For this purpose we need to find out certain other things. Firstly, we shall find out an
average of all the T and R values for all the years ( T = T / n ; R = R / n n stands for
the number of years). Then, secondly, we need ( tT -T ) & ( )RRt ---- this means wehave to subtract each value of T & R from their respective average values. Then thirdly,
1 With the help of beta coefficients2 with coefficient of determination3 Data is fictitious and is given only for the practice purposes.4 t here refers to the time period5 a is called the interceptand measures the impact of everything else (besides global temperature) on theglobal rainfall.6 the estimated values of a & b (called theslope) are quoted as a & b
Handout Managerial Economics
M. Salahuddin
8/3/2019 Regression Analysis II
2/2
we shall generate the values for ( tT - T ) * ( )RRt -- the multiplication of the values ofeach of the two multiplying terms. In order to make use of these values in the formula of
b , we shall add all these multiplied values -- ( tT - T ) * ( )RRt . In the fourth step,we shall take squares of all the values for ( tT - T ) and add them to quote against
( )2
TTt in the denominator of the formula for b
. At this stage, we shall have all thevalues needed for finding a . At this stage, we can express the regression equation (with
the help of estimated values of a & b ) as:
=R 297.14 + 43.32 tT
Now, having the estimated regression, we need to interpret this equation. The equation says
that one degree increase in global temperature brings about 42.32 mm more global rainfall.
Moreover, of the total global rainfall 297.14 mm is because of all the factors other than
global temperature.
The next logical question is to check whether the estimates of this regression equation are
true or not. For this purpose, we shall conduct a test of our parametric 7 estimates. Usually,
the value of intercept is not tested and to test the significance ofb
, we compare twodifferent values t statistic & critical value of t distribution.
A t statistic is a ratio ofb
sb
8. The denominator of the formula can be found as:
( )( ) ( ) 2
2
TTkn
RRs
t
t
b
=
9 where k stands for number estimated parameters ( a & b )
At this stage we need estimated values of global rainfall for each value of the global
temperature given in the table. After having all tR values for each value of temperature in
the table we shall deduct all these values from their respective table values. Thereafter, we
shall square all the results of this subtraction process to get ( )2RR
t
. The sum total of all
the squared terms would be used in the numerator of the above mentioned highlighted
formula. By furnishing the remaining terms of the formula as well, we will have the value
of bs for using it in the formula for t-statistic.
The critical value of t distribution in the t-table has two points of reference the degree
of freedom (d/f) & the level of significance10. The d/f is quoted in the extreme left column
of the table. For any d/f there are eight different values quoted in the corresponding row.
Each value corresponds to a specific level of significance (quoted on the top row of the
table). Most tests are performed at 95% level of significance (.05 in the table). Thus, if the
critical table value of t is found less than the t statistic we declare our estimates a & b as
significant and reject our estimates if the results are otherwise. If the value of t-statistic inour example is 1.92 and the critical value in the table for 8 d/f and 95% significance is
2.306 we shall accept the null hypothesis. However at 90% significance the critical value is
1.86 (which is less than the t-statistic) and we shall reject null hypothesis,
7 we call the estimates as parameters because there values dont remain the same in the face of increase or
decrease in values and other changes.8
Termed as standard error of b
9 (n - k) is termed as degree of freedom & ( RRt
is termed as error term
10 For this see the slides in the shares folder