Power 14 - UCSB Economics

May 10, 2018 | Author: Anonymous | Category: Math, Statistics And Probability, Statistics

Short Description

Download Power 14 - UCSB Economics...

Description

1

Econ 240 C Lecture 14

2

Project II

I. Work in Groups

II. You will be graded based on a PowerPoint presentation and a written report. III. Your report should have an executive summary of one to one and a half pages that summarizes your findings in words for a nontechnical reader. It should explain the problem being examined from an economic perspective, i.e. it should motivate interest in the issue on the part of the reader. Your report should explain how you are investigating the issue, in simple language. It should explain why you are approaching the problem in this particular fashion. Your executive report should explain the economic importance of your findings.

The technical details of your findings you can attach as an appendix

Technical Appendix 1. Table of Contents 2. Spreadsheet of data used and sources or, if extensive, a subsample of the data 3. Describe the analytical time series techniques you are using 4. Show descriptive statistics and histograms for the variables in the study 5. Use time series data for your project; show a plot of each variable against time

4

5 Group A

Group B

Group C

Tara Copello Zhimin Zhou Andrea Cardani Jonathan Hester Evan Nakano Eric Laschinger Yana Ten

Pungdalis Suos Micah Witt Charles Rabkin Will Hippen Thomas Bruister Arnaud Piechaud Kyu-Sang Park

Calvin Yeung Andrew Cahill Ashley Hedberg Jesse Smith Darren Doi Sarab Khalsa Jong Duk Woo

Group D

Group E

Carl-Einar Thorner Robert Connor Gleason Antung Anthony Liu Hamid Ghofrani Joonho Shin Ufook Sahilliohlu

Jeffrey Ahlvin Russell Ludwick Aren Megerdichian Carrie Koen Anthony Kasza Matthew Stevens

6

Outline 

Exponential Smoothing  Back

of the envelope formula: geometric distributed lag: L(t) = a*y(t-1) + (1-a)*L(t-1); F(t) = L(t)  ARIMA (p,d,q) = (0,1,1); ∆y(t) = e(t) –(1-a)e(t-1)  Error correction: L(t) =L(t-1) + a*e(t) 

Intervention Analysis

7

Part I: Exponential Smoothing Exponential smoothing is a technique that is useful for forecasting short time series where there may not be enough observations to estimate a Box-Jenkins model  Exponential smoothing can be understood from many perspectives; one perspective is a formula that could be calculated by hand 

8 Santa Barbara South Coast Median House Price in Nominal Thousands 1000

HSEPRC

800

600

400

200

0 1970

1980

1990

2000

2010

Three Rates of Growth

9

7

LNHSEPRC

6

5

4

3 1970

1980

1990 Y EAR

2000

2010

10 Santa Barbara South Coast House Price, 000 04 $

1000

HSEPRC04

800

600

400

200

0 1970

1980

1990 YEAR

2000

2010

11

Simple exponential smoothing

Simple exponential smoothing, also known as single exponential smoothing, is most appropriate for a time series that is a random walk with first order moving average error structure  The levels term, L(t), is a weighted average of the observation lagged one, y(t-1) plus the previous levels, L(t-1):  L(t) = a*y(t-1) + (1-a)*L(t-1) 

12

Single exponential smoothing The parameter a is chosen to minimize the sum of squared errors where the error is the difference between the observation and the levels term: e(t) = y(t) – L(t)  The forecast for period t+1 is given by the formula: L(t+1) = a*y(t) + (1-a)*L(t)  Example from John Heinke and Arthur Reitsch, Business Forecasting, 6th Ed. 

observations

Sales

1

500

2

350

3

250

4

400

5

450

6

350

7

200

8

300

9

350

10

200

11

150

12

400

13

550

14

350

15

250

16

550

17

550

18

400

19

350

20

600

21

750

22

500

23

400

24

650

13

14

Single exponential smoothing For observation #1, set L(1) = Sales(1) = 500, as an initial condition  As a trial value use a = 0.1  So L(2) = 0.1*Sales(1) + 0.9*Level(1) L(2) = 0.1*500 + 0.9*500 = 500  And L(3) = 0.1*Sales(2) + 0.9*Level(2) L(3) = 0.1*350 + 0.9*500 = 485 

observations

Sales

1

500

2

350

3

250

4

400

5

450

6

350

7

200

8

300

9

350

10

200

11

150

12

400

13

550

14

350

15

250

16

550

17

550

18

400

19

350

20

600

21

750

22

500

23

400

24

650

Level 500

15

observations

Sales

16

Level

1

500

500

2

350

500

3

250

485

4

400

5

450

6

350

7

200

8

300

9

350

10

200

11

150

12

400

13

550

14

350

15

250

16

550

17

550

18

400

19

350

20

600

21

750

22

500

23

400

24

650

a = 0.1

17

Single exponential smoothing So the formula can be used to calculate the rest of the levels values, observation #4-#24  This can be set up on a spread-sheet 

observations

Sales

18

Level

1

500

500

2

350

500

3

250

485

4

400

461.5

5

450

455.4

6

350

454.8

7

200

444.3

8

300

419.9

9

350

407.9

10

200

402.1

11

150

381.9

12

400

358.7

13

550

362.8

14

350

381.6

15

250

378.4

16

550

365.6

17

550

384.0

18

400

400.6

19

350

400.5

20

600

395.5

21

750

415.9

22

500

449.3

23

400

454.4

24

650

449.0

a = 0.1

19

Single exponential smoothing The forecast for observation #25 is: L(25) = 0.1*sales(24)+0.9*(24)  Forecast(25)=Levels(25)=0.1*650+0.9*449  Forecast(25) = 469.1 

Single Exponential Sm oothing

800 700 600

Value

500 Sales

400

Levels

300 200 100 0 1

2

3

4

5

6

7

8

9

10

11

12

13

14

Observation

15

16

17

18

19

20

21

22

23

24

21

Single exponential distribution 

The errors can now be calculated: e(t) = sales(t) – levels(t)

observations

Sales

Level

22

error

1

500

500

0

2

350

500

-150

3

250

485

-235

4

400

461.5

-61.5

5

450

455.4

-5.35

6

350

454.8

-104.8

7

200

444.3

-244.3

8

300

419.9

-119.9

9

350

407.9

-57.9

10

200

402.1

-202.1

11

150

381.9

-231.9

12

400

358.7

41.3

13

550

362.8

187.2

14

350

381.6

-31.6

15

250

378.4

-128.4

16

550

365.6

184.4

17

550

384.0

166.0

18

400

400.6

-0.6

19

350

400.5

-50.5

20

600

395.5

204.5

21

750

415.9

334.1

22

500

449.3

50.7

23

400

454.4

-54.4

24

650

449.0

201.0

a = 0.1

observations

Sales

Level

23

error squared

error

1

500

500

0

0

2

350

500

-150

22500

3

250

485

-235

55225

4

400

461.5

-61.5

3782.25

5

450

455.4

-5.35

28.62

6

350

454.8

-104.8

10986.18

7

200

444.3

-244.3

59698.86

8

300

419.9

-119.9

14376.05

9

350

407.9

-57.9

3353.58

10

200

402.1

-202.1

40852.14

11

150

381.9

-231.9

53780.95

12

400

358.7

41.3

1704.33

13

550

362.8

187.2

35027.05

14

350

381.6

-31.6

996.06

15

250

378.4

-128.4

16487.67

16

550

365.6

184.4

34016.68

17

550

384.0

166.0

27553.51

18

400

400.6

-0.6

0.37

19

350

400.5

-50.5

2554.91

20

600

395.5

204.5

41823.74

21

750

415.9

334.1

111594.53

22

500

449.3

50.7

2565.62

23

400

454.4

-54.4

2960.80

24

650

449.0

201.0

40412.28

a = 0.1

observations

Sales

Level

24

error squared

error

1

500

500

0

0

2

350

500

-150

22500

3

250

485

-235

55225

4

400

461.5

-61.5

3782.25

5

450

455.4

-5.35

28.62

6

350

454.8

-104.8

10986.18

7

200

444.3

-244.3

59698.86

8

300

419.9

-119.9

14376.05

9

350

407.9

-57.9

3353.58

10

200

402.1

-202.1

40852.14

11

150

381.9

-231.9

53780.95

12

400

358.7

41.3

1704.33

13

550

362.8

187.2

35027.05

14

350

381.6

-31.6

996.06

15

250

378.4

-128.4

16487.67

16

550

365.6

184.4

34016.68

17

550

384.0

166.0

27553.51

18

400

400.6

-0.6

0.37

19

350

400.5

-50.5

2554.91

20

600

395.5

204.5

41823.74

21

750

415.9

334.1

111594.53

22

500

449.3

50.7

2565.62

23

400

454.4

-54.4

2960.80

24

650

449.0

201.0

40412.28

a = 0.1

sum sq res

582281.2

25

Single exponential smoothing For a = 0.1, the sum of squared errors is: S = (errors)2 = 582,281.2  A grid search can be conducted for the parameter value a, to find the value between 0 and 1 that minimizes the sum of squared errors  The calculations of levels, L(t), and errors, e(t) = sales(t) – L(t) for a =0.6 

observa tions

26 Sales

Levels

1

500

500

2

350

500

3

250

410

4

400

314

5

450

365.6

6

350

416.2

7

200

376.5

8

300

270.6

9

350

288.2

10

200

325.3

11

150

250.1

12

400

190.0

13

550

316.0

14

350

456.4

15

250

392.6

16

550

307.0

17

550

452.8

18

400

511.1

19

350

444.4

20

600

387.8

21

750

515.1

22

500

656.0

23

400

562.4

24

650

465.0

a = 0.6

27

Single exponential smoothing 

Forecast(25) = Levels(25) = 0.6*sales(24) + 0.4*levels(24) = 0.6*650 + 0.4*465 = 776

observa tions

Sales

Levels

28

error square

error

1

500

500

0

0

2

350

500

-150

22500

3

250

410

-160

25600

4

400

314

86

7396

5

450

365.6

84.4

7123.36

6

350

416.2

-66.2

4387.74

7

200

376.5

-176.5

31150.84

8

300

270.6

29.4

864.45

9

350

288.2

61.8

3814.38

10

200

325.3

-125.3

15699.02

11

150

250.1

-100.1

10023.67

12

400

190.0

210.0

44080.13

13

550

316.0

234.0

54747.14

14

350

456.4

-106.4

11322.57

15

250

392.6

-142.6

20324.22

16

550

307.0

243.0

59036.75

17

550

452.8

97.2

9445.88

18

400

511.1

-111.1

12348.55

19

350

444.4

-94.4

8920.73

20

600

387.8

212.2

45037.39

21

750

515.1

234.9

55172.40

22

500

656.0

-156.0

24349.97

23

400

562.4

-162.4

26379.58

24

650

465.0

185.0

34237.15

a = 0.6

Sum of Sq Res

533961.9

29

Single exponential smoothing 

Grid search plot

Grid Search for Sm oothing Param eter

590000 580000

Sum of Squared Residuals

570000 560000 550000 540000 530000 520000 510000 500000 490000 0

0.2

0.4

0.6 Sm oothing Param eter

0.8

1

1.2

31

Single Exponential Smoothing

EVIEWS: Algorithmic search for the smoothing parameter a  In EVIEWS, select time series sales(t), and open 

  



In the sales window, go to the PROCS menu and select exponential smoothing Select single the best parameter a = 0.26 with sum of squared errors = 472982.1 and root mean square error = 140.4 = (472982.1/24)1/2 The forecast, or end of period levels mean = 532.4

32

33

Forecast = L(25) = 0.26*Sales(24) + 0.74L(24) = 532.4 =0.26*650 + 0.74*491.07 =532.4

34

35

36

Part II. Three Perspectives on Single Exponential Smoothing 

The formula perspective  L(t)

= a*y(t-1) + (1 - a)*L(t-1)  e(t) = y(t) - L(t)

The Box-Jenkins Perspective  The Updating Forecasts Perspective 

Box Jenkins Perspective 

37

Use the error equation to substitute for L(t) in the formula, L(t) = a*y(t-1) + (1 - a)*L(t-1)  L(t)

= y(t) - e(t)  y(t) - e(t) = a*y(t-1) + (1 - a)*[y(t-1) - e(t-1)] y(t) = e(t) + y(t-1) - (1-a)*e(t-1)  or Dy(t) = y(t) - y(t-1) = e(t) - (1-a) e(t-1) 

So y(t) is a random walk plus MAONE noise, i.e y(t) is a (0,1,1) process where (p,d,q) are the orders of AR, differencing, and MA.

38

Box-Jenkins Perspective 

In Lab Eight, we will apply simple exponential smoothing to retail sales, a process you used for forecasting trend in Lab 3, and which can be modeled as (0,1,1).

39

40

41

Retail Sales: Simple Exponential Smoothing

170000

160000

150000

140000

130000 90

91

92 RETAIL

93

94

95

RETAILSM

96

43

44

Box-Jenkins Perspective 

If the smoothing parameter approaches one, then y(t) is a random walk:  Dy(t)

= y(t) - y(t-1) = e(t) - (1-a) e(t-1)  if a = 1, then Dy(t) = y(t) - y(t-1) = e(t) 

In Lab Eight, we will use the price of gold to make this point

45 Weekly Closing Price of Gold , Nov. 14, 2003-April 29, 2005

460

440

420

400

380

360 10

20

30

40 GOLD

50

60

70

46

47

48

Box-Jenkins Perspective 

49

The levels or forecast, L(t), is a geometric distributed lag of past observations of the series, y(t), hence the name “exponential” smoothing  L(t)

= a*y(t-1) + (1 - a)*L(t-1)  L(t) = a*y(t-1) + (1 - a)*ZL(t)  L(t) - (1 - a)*ZL(t) = a*y(t-1)  [1 - (1-a)Z] L(t) = a*y(t-1)  L(t) = {1/ [1 - (1-a)Z]} a*y(t-1)  L(t) = [1 +(1-a)Z + (1-a)2 Z2 + …] a*y(t-1)  L(t) = a*y(t-1) + (1-a)*a*y(t-2) + (1-a)2a*y(t-3) + ….

50

51

The Updating Forecasts Perspective 

Use the error equation to substitute for y(t) in the formula, L(t) = a*y(t-1) + (1 - a)*L(t-1)  y(t)

= L(t) + e(t)  L(t) = a*[L(t-1) + e(t-1)] + (1 - a)*L(t-1) 

So L(t) = L(t-1) + a*e(t-1),  i.e.

the forecast for period t is equal to the forecast for period t-1 plus a fraction a of the forecast error from period t-1.

52

Part III. Double Exponential Smoothing

With double exponential smoothing, one estimates a “trend” term, R(t), as well as a levels term, L(t), so it is possible to forecast, f(t), out more than one period k 1  f(t+k) = L(t) + k*R(t), k>=1  L(t) = a*y(t) + (1-a)*[L(t-1) + R(t-1)]  R(t) = b*[L(t) - L(t-1)] + (1-b)*R(t-1) 

k 1

 so

the trend, R(t), is a geometric distributed lag of the change in levels, DL(t)

Part III. Double Exponential 53 Smoothing If the smoothing parameters a = b, then we have double exponential smoothing  If the smoothing parameters are different, then it is the simplest version of HoltWinters smoothing 

Part III. Double Exponential Smoothing

54

Holt- Winters can also be used to forecast seasonal time series, e.g. monthly  f(t+k) = L(t) + k*R(t) + S(t+k-12) k>=1  L(t) = a*[y(t)-S(t-12)]+ (1-a)*[L(t-1) + R(t-1)]  R(t) = b*[L(t) - L(t-1)] + (1-b)*R(t-1)  S(t) = c*[y(t) - L(t)] + (1-c)*S(t-12) 

55

Part V. Intervention Analysis

56

Intervention Analysis The approach to intervention analysis parallels Box-Jenkins in that the actual estimation is conducted after prewhitening, to the extent that nonstationarity such as trend and seasonality are removed  Example: preview of Lab 8 

57

Telephone Directory Assistance 

A telephone company was receiving increased demand for free directory assistance, i.e. subscribers asking operators to look up numbers. This was increasing costs and the company changed policy, providing a number of free assisted calls to subscribers per month, but charging a price per call after that number.

58

Telephone Directory Assistance This policy change occurred at a known time, March 1974  The time series is for calls with directory assistance per month  Did the policy change make a difference? 

59

60

The simple-minded approach

D=549 - 162 D=387

61

62

63

64

Principle The event may cause a change, and affect time series characteristics  Consequently, consider the pre-event period, January 1962 through February 1974, the event March 1974, and the post-event period, April 1974 through December 1976  First difference and then seasonally difference the entire series 

65

Analysis: Entire Differenced Series

66

67

68

69

70

Analysis: Pre-Event Differences

71

72

73

74

So Seasonal Nonstationarity It was masked in the entire sample by the variance caused by the difference from the event  The seasonality was revealed in the preevent differenced series 

75

76

Pre-Event Analysis 

Seasonally differenced, differenced series

77

78

79

80

81

Pre-Event Box-Jenkins Model 

[1-Z12 ][1 –Z]Assist(t) = WN(t) – a*WN(t-12)

82

83

84

85

Modeling the Event 

Step function

86

Entire Series Assist and Step  Dassist and Dstep  Sddast sddstep 

87

88

89

90

Model of Series and Event Pre-Event Model: [1-Z12 ][1 –Z]Assist(t) = WN(t) – a*WN(t-12)  In Levels Plus Event: Assist(t)=[WN(t) – a*WN(t-12)]/[1-Z]*[1-Z12] + (-b)*step  Estimate: [1-Z12 ][1 –Z]Assist(t) = WN(t) – a*WN(t-12) + (-b)* [1-Z12 ][1 –Z]*step 

91

92

93

Policy Change Effect Simple: decrease of 387 (thousand) calls per month  Intervention model: decrease of 397 with a standard error of 22 

94

95

Stochastic Trends: Random Walks with Drift

We have discussed earlier in the course how to model the Total Return to the Standard and Poor’s 500 Index  One possibility is this time series could be a random walk around a deterministic trend”  Sp500(t) = exp{a + d*t +WN(t)/[1-Z]}  And taking logarithms, 

96

Stochastic Trends: Random Walks with Drift

Lnsp500(t) = a + d*t + WN(t)/[1-Z]  Lnsp500(t) –a –d*t = WN(t)/[1-Z]  Multiplying through by the difference operator, D = [1-Z]  [1-Z][Lnsp500(t) –a –d*t] = WN(t-1) 

[LnSp500(t) – a –d*t] - [LnSp500(t-1) – a –d*(t1)] = WN(t)  D Lnsp500(t) = d + WN(t) 

97

So the fractional change in the total return to the S&P 500 is drift, d, plus white noise  More generally,  y(t) = a + d*t + {1/[1-Z]}*WN(t)  [y(t) –a –d*t] = {1/[1-Z]}*WN(t)  [y(t) –a –d*t]- [y(t-1) –a –d*(t-1)] = WN(t)  [y(t) –a –d*t]= [y(t-1) –a –d*(t-1)] + WN(t)  Versus the possibility of an ARONE: 

98      



[y(t) –a –d*t]=b*[y(t-1)–a–d*(t-1)]+WN(t) Y(t) = a + d*t + b*[y(t-1)–a–d*(t-1)]+WN(t) Or y(t) = [a*(1-b)+b*d]+[d*(1-b)]*t+b*y(t-1) +wn(t) Subtracting y(t-1) from both sides’ D y(t) = [a*(1-b)+b*d] + [d*(1-b)]*t + (b-1)*y(t-1) +wn(t) So the coefficient on y(t-1) is once again interpreted as b-1, and we can test the null that this is zero against the alternative it is significantly negative. Note that we specify the equation with both a constant, [a*(1-b)+b*d] and a trend [d*(1-b)]*t

99

Part IV. Dickey Fuller Tests: Trend

100

Example 

Lnsp500(t) from Lab 2

101

102

103

104

Power 14 - UCSB Economics

Short Description

Description

Comments

We need your help!