Upload
others
View
4
Download
0
Embed Size (px)
Citation preview
27.03.2014
1
QMT 3001 BUSINESS FORECASTING
1
Exploring Data Patterns&
A I d i F i T h iAn Introduction to Forecasting Techniques
Aysun KAPUCUGİL-İKİZ, PhD.
Forecasting 2
27.03.2014
2
3
4
27.03.2014
3
5
6
27.03.2014
4
Time Series Data Patterns
7
The two steps in analyzing time series data are:(a) Graph the time series data
Horizontal (stationary) / Trend / Cyclical / Seasonal
(a) Graph the time series data The data should be graphed to visually see the type of pattern: is
the series progressively increasingor is it decreasing through time?
There are various graphing techniques available including scatter diagrams, line graphs, or bar graphs. You can choose the visual approach that is optimal for your data.
(b) Generate an autocorrelation function(b) Generate an autocorrelation function The pattern of the autocorrelations will usually help explain the
pattern of the data. The autocorrelation output will also provide you with statistical tests to determine if the autocorrelation is important(i.e., "significant" in statistical terms).
8
A stationary data series does not increaseor decrease overtime.
27.03.2014
5
9
10
27.03.2014
6
11
12
= observation in time period t
= observation at time period t-k
tY
ktY
27.03.2014
7
13
14
27.03.2014
8
15
16
27.03.2014
9
17
Stat > Time Series > Autocorrelation
18
27.03.2014
10
19
randomness
trend
20
seasonality
Is AC significant?
27.03.2014
11
21
22
The standard error is the The standard error is the difference between a predicted value and the actual value for avariable. If the autocorrelation coefficient isdivided by the standard error, the outcome should be >2 for a significant outcome.
27.03.2014
12
23
"Box-Ljung Statistic" (BLS) or modified Box-Pierce Q Statistic:o ju g S a s c ( S) o od ed o e ce Q S a s c:.05 or less of level of significance value of Box-Ljung, is desirable -this means the forecaster has a less than a 5% chance of being wrong in stating autocorrelation exists between two variables.
24
27.03.2014
13
25
26
27.03.2014
14
27
28
r1= 0.572
r2= 0.463
27.03.2014
15
29
Differencing data is needed when forecasting two data patterns:1. Data with a trend.2. Data with a strong autocorrelation component at lag 1 (above 0.90), where the autocorrelation at subsequent lags diminishes slowly.
30
Differencing simply generates a new time series by subtracting the current value from the previous value for the entire original series.value from the previous value for the entire original series.
27.03.2014
16
31
32
27.03.2014
17
33
34
27.03.2014
18
35
36
27.03.2014
19
Examples37 Examples
Example:"Anna-Marie's Pools and Spas" is a chain of stores in Manitoba selling pools and pool supplies.
Anna-Marie is considering opening a new store in Saxon, Manitoba and has
38
approached you as an advisor.
She has a number of markets she is considering for her new store and wants to carefully examine each of these markets before making her selection.
She wants to know if this is a good year to open a new store in Saxon, or if she would be better advised to wait a few years.
She has asked you to examine the pattern of pool sales in Saxon in past years, using data on pool permits as a proxy for sales. Table shows this data for the last 15 years.
27.03.2014
20
The first step is to graph this data over time.
39
The next step in the analysis is to generate the autocorrelation function, to see if the data is indeed random.
40
27.03.2014
21
Correlogram41
Example:Table shows the number of houses under construction (housing starts) in July in Toronto for the period 1994 to 2004.
42
27.03.2014
22
Scatter graph43
44
27.03.2014
23
Housing under Construction in Toronto, Monthly, from January 1972 to May 2005(extended the housing data series back to 1972 and include all months, rather than just July.)
45
To illustrateTo illustrate a cyclical pattern
46
For forecasting purposes this indicates that the future values will dependon the last available level.However, using this approach to forecast cyclical time-series is problematic
When time series data changes by smallmargins from period to period, the bestapproach is to
problematic
ppexplore how the datamoves (e.g., the rate of change).
"differencing".
27.03.2014
24
An example of differenced data for the firstfive rows of the Housing Under Construction database.
47
= 39455 - 41744
Differenced Data48
27.03.2014
25
49
When is the best
50
Example:
time for the new home builder to hold open houses in order to time these with the
ddi k ?wedding market?
Table shows the number of marriages recorded in Canada from 1995 to 2004, on a quarterly basis (3 month intervals).
27.03.2014
26
Side-by-side Bar Chart51
52
27.03.2014
27
53
The optimal forecasting technique for anygiven situation depends on the nature ofavailable data and the decision to be madeor problem to be solved.
54
27.03.2014
28
55
56
27.03.2014
29
57
58
27.03.2014
30
59
60
Other Factors for Choosing a Forecasting Technique: Level of Details.
Time horizon.
Based on judgment or data manipulation.
Management acceptance.
Cost.
27.03.2014
31
Types of Forecasts
Forecasting Techniques
No single method is superior
MovingAverage
Exponential Smoothing
Time-Series Methods: include historical data over a time interval
DelphiMethods
Jury of ExecutiveOpinion
Qualitative Models: attempt to include subjective factors
Causal Methods: include a
variety of factors
Regression Analysis
Multiple Regression
Trend ProjectionsSales ForceComposite
ConsumerMarket Survey
Decomposition
General considerations for choosing the appropriate method
62
Method Uses ConsiderationsJ d C b d i h b f S bj i i bj hJudgment Can be used in the absence of
historical data (e.g. new product).Most helpful in medium- and long-term forecasts
Subjective estimates are subject to the biases and motives of estimators.
Causal Sophisticated methodVery good for medium- and long-term forecasts
Must have historical data.Relationships can be difficult to specifylong term forecasts specify
Time series Easy to implement Work well when the series is relatively stable
Rely exclusively on past data. Most useful for short-term estimates.
27.03.2014
32
63
65
27.03.2014
33
66
67
27.03.2014
34
68
69
27.03.2014
35
70
71
27.03.2014
36
Example:Check the quality of forecast of the data on July Housing Under Construction
Compare two forecasts using:
d f h h l d
72
1. the second naïve forecast (which includes a difference term)
2. moving average method
The in-sample period will be 1994 to 2002, p p ,
the out-of-sample check will use the years 2003 and2004.
First step: Check the errors of the forecast for autocorrelation.
73
27.03.2014
37
Results for the naïve forecast74
Results for the moving average forecast75
27.03.2014
38
Results for the moving average forecast76
Second step: Check the errors for 2003 and 2004 and test which forecast produces a more accurate result.
77
27.03.2014
39
the "in-sample" test78
the "out-of-sample" test79
27.03.2014
40
80
The out-of-sample test confirms the results of the in-sample test.
The error measures are all much larger for the moving average forecast
than they are for the naïve forecast.
This confirms that the naïve forecast is superior for short-term forecasts
for this data.
REFERENCES
Business Forecasting. John E. Hanke and Dean W. Wichern 9th Edition Pearson Ed cation 2009
82
Wichern, 9th Edition, Pearson Education, 2009.