STA 6104 Financial Time Series. Moving Averages and Exponential Smoothing

Similar documents
Economic Forecasting Lecture 9: Smoothing Methods

DEPARTMENT OF QUANTITATIVE METHODS & INFORMATION SYSTEMS

BUSINESS FORECASTING

Chapter 5: Forecasting

Time-series Forecasting - Exponential Smoothing

Lecture 7: Exponential Smoothing Methods Please read Chapter 4 and Chapter 2 of MWH Book

Forecasting: The First Step in Demand Planning

Forecasting. Simon Shaw 2005/06 Semester II

The Art of Forecasting

CHAPTER 18. Time Series Analysis and Forecasting

Foundations - 1. Time-series Analysis, Forecasting. Temporal Information Retrieval

Chapter 8 - Forecasting

PPU411 Antti Salonen. Forecasting. Forecasting PPU Forecasts are critical inputs to business plans, annual plans, and budgets

Multiplicative Winter s Smoothing Method

THE ROYAL STATISTICAL SOCIETY 2009 EXAMINATIONS SOLUTIONS GRADUATE DIPLOMA MODULAR FORMAT MODULE 3 STOCHASTIC PROCESSES AND TIME SERIES

Time-Series Analysis. Dr. Seetha Bandara Dept. of Economics MA_ECON

Lecture 4 Forecasting

Dennis Bricker Dept of Mechanical & Industrial Engineering The University of Iowa. Exponential Smoothing 02/13/03 page 1 of 38

TIMES SERIES INTRODUCTION INTRODUCTION. Page 1. A time series is a set of observations made sequentially through time

Forecasting models and methods

Ch. 12: Workload Forecasting

Forecasting. Copyright 2015 Pearson Education, Inc.

Antti Salonen PPU Le 2: Forecasting 1

Antti Salonen KPP Le 3: Forecasting KPP227

Lecture Prepared By: Mohammad Kamrul Arefin Lecturer, School of Business, North South University

ECON/FIN 250: Forecasting in Finance and Economics: Section 3.2 Filtering Time Series

NATCOR. Forecast Evaluation. Forecasting with ARIMA models. Nikolaos Kourentzes

Improved Holt Method for Irregular Time Series

3 Time Series Regression

Forecasting Chapter 3

CP:

Forecasting. Chapter Copyright 2010 Pearson Education, Inc. Publishing as Prentice Hall

Copyright 2010 Pearson Education, Inc. Publishing as Prentice Hall.

Operations Management

Forecasting of Electric Consumption in a Semiconductor Plant using Time Series Methods

Dennis Bricker Dept of Mechanical & Industrial Engineering The University of Iowa. Forecasting demand 02/06/03 page 1 of 34

Modified Holt s Linear Trend Method

Cyclical Effect, and Measuring Irregular Effect

Comparing the Univariate Modeling Techniques, Box-Jenkins and Artificial Neural Network (ANN) for Measuring of Climate Index

Chapter 7 Forecasting Demand

Introduction to Forecasting

CHAPTER 14. Time Series Analysis and Forecasting STATISTICS IN PRACTICE:

Time Series and Forecasting

Econ 300/QAC 201: Quantitative Methods in Economics/Applied Data Analysis. 17th Class 7/1/10

Industrial Engineering Prof. Inderdeep Singh Department of Mechanical & Industrial Engineering Indian Institute of Technology, Roorkee

SOLVING PROBLEMS BASED ON WINQSB FORECASTING TECHNIQUES

Time Series Analysis -- An Introduction -- AMS 586

Operation and Supply Chain Management Prof. G. Srinivasan Department of Management Studies Indian Institute of Technology, Madras

14. Time- Series data visualization. Prof. Tulasi Prasad Sariki SCSE, VIT, Chennai

Based on the original slides from Levine, et. all, First Edition, Prentice Hall, Inc

Time Series and Forecasting

Decision 411: Class 3

A B C 1 Robert's Drugs 2 3 Week (t ) Sales t. Forec t

Exponentially weighted forecasts

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

22/04/2014. Economic Research

Forecasting. Operations Analysis and Improvement Spring

at least 50 and preferably 100 observations should be available to build a proper model

Decision 411: Class 3

Growth Models for the Forecasting of New Product Market Adoption

WEATHER NORMALIZATION METHODS AND ISSUES. Stuart McMenamin Mark Quan David Simons

Decision 411: Class 3

AUTO SALES FORECASTING FOR PRODUCTION PLANNING AT FORD

FORECASTING FLUCTUATIONS OF ASPHALT CEMENT PRICE INDEX IN GEORGIA

7. Forecasting with ARIMA models

15 yaş üstü istihdam ( )

Exponential smoothing is, like the moving average forecast, a simple and often used forecasting technique

Section 6.8 Exponential Models; Newton's Law of Cooling; Logistic Models

BUSI 460 Suggested Answers to Selected Review and Discussion Questions Lesson 7

Product and Inventory Management (35E00300) Forecasting Models Trend analysis

CHAPTER 1: Decomposition Methods

Time Series Analysis. Smoothing Time Series. 2) assessment of/accounting for seasonality. 3) assessment of/exploiting "serial correlation"

Advances in promotional modelling and analytics

Forecasting. Dr. Richard Jerz rjerz.com

Lecture Prepared By: Mohammad Kamrul Arefin Lecturer, School of Business, North South University

Modeling Recurrent Events in Panel Data Using Mixed Poisson Models

More on Estimation. Maximum Likelihood Estimation.

TIME SERIES ANALYSIS AND FORECASTING USING THE STATISTICAL MODEL ARIMA

Time Series Analysis

CHAPTER 4: DATASETS AND CRITERIA FOR ALGORITHM EVALUATION

Forecasting Using Time Series Models

ECON 343 Lecture 4 : Smoothing and Extrapolation of Time Series. Jad Chaaban Spring

Forecasting. Al Nosedal University of Toronto. March 8, Al Nosedal University of Toronto Forecasting March 8, / 80

Forecasting: Principles and Practice. Rob J Hyndman. 1. Introduction to forecasting OTexts.org/fpp/1/ OTexts.org/fpp/2/3

An approach to make statistical forecasting of products with stationary/seasonal patterns

Forecasting Automobile Sales using an Ensemble of Methods

Chapter 3: Regression Methods for Trends

Cost and Preference in Recommender Systems Junhua Chen LESS IS MORE

A time series is called strictly stationary if the joint distribution of every collection (Y t

Operations Management

Models with Limited Dependent Variables

Geology 560, Prof. Thomas Johnson Unit IV, Part 2: Activation energies and the responses of reaction rates to temperature and compositional changes.

Simple Descriptive Techniques

INVERSE PROBLEMS IN MARKOV MODELS AND MEDICAL INSURANCE

A MACRO-DRIVEN FORECASTING SYSTEM FOR EVALUATING FORECAST MODEL PERFORMANCE

A Metzlerian business cycle model with nonlinear heterogeneous expectations

FORECASTING TIME SERIES WITH BOOT.EXPOS PROCEDURE

From Argentina to Zimbabwe: Where Should I Sell my Widgets?

STATS24x7.com 2010 ADI NV, INC.

The SAB Medium Term Sales Forecasting System : From Data to Planning Information. Kenneth Carden SAB : Beer Division Planning

Transcription:

STA 6104 Financial Time Series Moving Averages and Exponential Smoothing

Smoothing Our objective is to predict some future value Y n+k given a past history {Y 1, Y 2,..., Y n } of observations up to time n. Smoothing always involves some form of local averaging of data such that the non-systematic components of individual observations cancel each other out. All the time-series smoothing techniques use a form of weighted average of past observations to smooth up-and-down movements, i.e., some statistical methods of suppressing short-term fluctuations. The underlying assumption of these methods is that the fluctuations in past values represent random departures from some smooth curve that can plausibly be extrapolated into the future to produce a forecast or series of forecasts. 1

Moving Averages The most common technique is moving average smoothing which replaces each element of the series by either the simple or weighted average of n most recent values, where n is the width of the smoothing window (see Box & Jenkins, 1976; Velleman & Hoaglin, 1981). The simple moving average (SMA) is most often used for simple descriptive patterns of a time-series variable and is a very simple procedure for forecasting a time-series variable. The calculation of the simple moving average is given by SMA t = 1 k (Y t + Y t 1 + + Y t (k 1) ), where SMA t is the simple k-period moving average in time period t. 2

Moving Averages This calculation involves the previous k periods and is used for generating the moving average for this time period. We can use the SMA to generate a one-period ahead forecast by Ŷ t+1 = SMA t. The naive method that assumes the next period will be identical to the present is a special case of SMA with k = 1. The term moving average refers to an average that is updated each time period by deleting one observation at the beginning of the period and replacing it with another at the end of the period. 3

Moving Averages The choice of how many of the most recent time periods k to use in computing the moving average is determined by the researcher (subjectively) or choosing the value of k that generates the smallest error (objectively). The accuracy of the forecast using a simple moving average is, in most cases, dependent upon the choice of k. The larger the choice of k, the smoother the series will be. Example: (c3t1.xls) Exchange rate with Japan 4

Centered Moving Averages The major difference between the simple moving average and the centered moving average is the selection of observation used. The simple moving average uses the current observation plus previous observations, e.g., a five-period SMA uses the current time period observation and the four previous time period observations. The centered moving average (CMA) centers its average on the current period using both the previous time period observations and the forward time period observations, e.g., a five-period CMA uses the current time period observation, the two previous time period observations and the two subsequent time period observations. 5

Centered Moving Averages The specific method of calculating a centered moving average depends upon whether the period of averaging, k, is even or odd. If k is odd, then the centered moving average is calculated by CMA t = 1 k (Y t ((k 1)/2)) + + Y t + + Y t+((k 1)/2)), where Y t is the mid-point in the range of k data observations. Note that (k 1)/2 observations are lost both at the beginning and the end of the series. 6

Centered Moving Averages In most time-series data, the centered moving average most commonly is associated with an even number of observations. For example, we are often analyzing either quarterly or monthly data, where the choice of k would be 4 for quarterly or 12 for monthly data. In these cases, the SMA does not correspond to the time period observations in the original data series. To obtain an average that does correspond to the original time periods, CMA are calculated using two-period moving averages of the initial moving averages. 7

Centered Moving Averages If k is even, the CMA is calculated in a two-step process. 1. Calculate two moving averages that bound the time period: MA t(1) = 1 k (Y t (k/2)+1 + + Y t + + Y t+(k/2)) MA t(2) = 1 k (Y t (k/2)+2 + + Y t + + Y t+(k/2)+1) 2. Take the simple average of these two values in order to center the average on a corresponding time period of the original series: CMA t = 1 2 (MA t(1) + MA t(2) ) The number of observation lost at each end of the series is L/2. 8

Exponential Smoothing Exponential smoothing is a method for continually revising an estimate or forecast by accounting for more recent changes or for fluctuations in the data. Here, we assume there is no systematic trend or seasonal effects in the process, or that these have been identified and removed. The mean of the process can change from one time step to the next, but we have no information about the likely direction of these changes. The model is Y t = µ t + e t, where µ t is the non-stationary mean of the process at time t and e t are independent random deviations with a mean of 0 and a standard deviation σ. 9

Exponential Smoothing A typical application is forecasting sales of a well-established product in a stable market. Exponential smoothing assigns exponentially decreasing weights as the observation get older. Recent observations are given relatively more weight in forecasting than the older observations. In exponential smoothing, however, there are one or more smoothing parameters to be determined (or estimated) and these choices determine the weights assigned to the observations. 10

Exponential Smoothing Let a t be our estimate of µ t. Given that there is no systematic trend, an intuitively reasonable estimate of the mean at time t is given by a weighted average of our observation at time t and our estimate of the mean at time t - 1: a t+1 = αy t + (1 α)a t, 0 < α < 1, where a t is the exponentially weighted moving average at time t. The value of α determines the amount of smoothing, and it is referred to as the smoothing parameter. 11

Exponential Smoothing If α is near 1, there is little smoothing and a t is approximately Y t. This would only be appropriate if the changes in the mean level were expected to be large by comparison with σ At the other extreme, a value of α near 0 gives highly smoothed estimates of the mean level and takes little account of the most recent observation. This would only be appropriate if the changes in the mean level were expected to be small compared with σ. A typical compromise figure for α is 0.2 since in practice we usually expect that the change in the mean between time t 1 and time t is likely to be smaller than σ. 12

Exponential Smoothing As time passes the smoothed statistics becomes the weighted average of a greater and greater number of the past observations Y t n, and the weights assigned to previous observations are in general proportional to the terms of the geometric progression {1,(1 α),(1 α) 2,(1 α) 3,...}. A geometric progression is the discrete version of an exponential function, so this is where the name for this smoothing method originated: w(t) = w(0)exp(kt), k = ln(1 α), w(0) = 1. 13

Holt s Exponential Smoothing We usually have more information about the market than exponential smoothing can take into account. Sales are often seasonal, and we may expect trends to be sustained for short periods at least. But trends will change. The first extension of the simple exponential smoothing is to adjust the smoothing model for any trend in the data. When a trend exists, the forecast may then by improved by adjusting for this trend by using a two-parameter exponential smoothing (originated by C. C. Holt, 1957 Forecasting trends and seasonals by exponentially weighted moving averages, ONR Research Memorandum, Carnegie Institute of Technology 52.). 14

Holt s Exponential Smoothing The model adds a growth factor (or trend factor) to the smoothing equation as a way to adjust for the trend. The Holt s exponential smoothing is given by where a t+1 = αy t + (1 α)(a t + b t ) b t+1 = β(a t+1 a t ) + (1 β)b t a t+1 = smoothed value for time period t + 1 α = smoothing constant for the level (0 < α < 1) x t = actual value in time period t a t = smoothed value for time period t b t+1 = trend estimate for time period t + 1 b t = trend estimate for time period t β = smoothing constant for the trend estimate (0 < β < 1) 15

Additive and Multiplicative Models Time-series models can basically be classified into two types: additive models and multiplicative models. For additive model, we assume the data is the sum of the time-series components, i.e., Y t = Tr t + Sn t + Cl t + e t. If the data does not contain one of the components, the value for that is equal to zero. The seasonal (or cyclical) component of an additive model is independent of the trend, and thus the magnitude of the seasonal swing (movement) is constant over time. 16

Additive and Multiplicative Models For multiplicative model, the data is the product of the various components, i.e., Y t = Tr t Sn t Cl t e t. If the data does not contain one of the components, the value for that is equal to 1. The seasonal (or cyclical) component of a multiplicative model is proportional (a ratio) of the trend, and thus the magnitude of the seasonal swing is increases or decreases according to the behavior of the trend. Although most data that posses seasonal (cyclical) variations cannot be precisely classified as additive or multiplicative in nature, we usually look at forecasts obtained in both models and choose the model that yields the smallest error. 17

Winter s Exponential Smoothing Winter s exponential smoothing model is another extension of the simple exponential smoothing model; it is used for data that exhibit both trend and seasonality. It is a three-parameter exponential smoothing model, which has an additional equation to adjust for the seasonal component. The Winter s exponential smoothing is due to P. R. Winters (1960, Forecasting sales by exponentially weighted moving averages, Management Science 6, 324 342). 18

Winter s Exponential Smoothing The additive Holt-Winters prediction function (for time series with period length p) is Ŷ t+h = a t + h b t + s t+1+(h 1) mod p, where a t, b t and s t are given by a t = α(y t s t p ) + (1 α)(a t 1 + b t 1 ) where b t s t = β(a t a t 1 ) + (1 β)b t 1 = γ(y t a t ) + (1 γ)s t p a t+1 = smoothed value for time period t + 1 α = smoothing constant for the level (0 < α < 1) x t = actual value in time period t a t = smoothed value for time period t b t+1 = trend estimate for time period t + 1 b t = trend estimate for time period t β = smoothing constant for the trend estimate (0 < β < 1) γ = smoothing constant for the seasonality estimate (0 < γ < 1) h = number of periods in forecast lead period p = number of periods in the seasonal cycle = seasonality estimate for time period t s t 19

Winter s Exponential Smoothing The multiplicative Holt-Winters prediction function (for time series with period length p) is Ŷ t+h = (a t + hb t )s t+1+(h 1) mod p, where a t, b t and s t are given by a t = α(y t /s t p ) + (1 α)(a t 1 + b t 1 ) b t = β(a t a t 1 ) + (1 β)b t 1 s t = γ(y t /a t ) + (1 γ)s t p As with simple and Holt s exponential smoothing, initial values must be selected to initialize or warm up the model. Over a long time period, the particular values selected have little effect on the forecast. 20

New Product Forecasting For new products, because they typically lack historical data, most forecasting techniques cannot produce satisfying results. It is typically impossible for Holt s exponential smoothing to determine the trend since the dataset is too small. To overcome this difficulty, forecasters use a number of models that generally fall in the category called diffusing models. These models are alternatively called S-curves, growth models, saturation models, or substitution curves. These models are most commonly used to forecast the sales of new products and technology life cycles. 21

New Product Forecasting Life cycles usually follow a common pattern: 1. A period of slow growth just after introduction during an embryonic stage. 2. A period of rapid growth. 3. Slowing growth in a mature phase. 4. Decline. The forecaster s task is to identify and estimate the parameters of such a pattern of growth. Each new-product model has its own lower and upper limit and expert opinion is needed to determine these limits. In most cases, the lower limitation is 0 and the determination of the upper limit is a more complicated task. 22

Gompertz Curve Two most common forms of S-curves used in forecasting are the Gompertz curve and the logistics curve (also known as the Peral curve). The Gompertz curve is named after its developer, Benjamin Gompertz, an English actuary. In 1825, Benjamin Gompertz proposed an exponential increase in death rates with age. The Gompertz s Law of Mortality states that the death rate is increasing in a geometric progression. 23

Gompertz Curve The Gompertz function is given as Y t = Le ae bt, where a and b are the parameters that describing the curve and L is the upper limit of Y. The Gompertz curve will range in value from zero to U as t varies from to. The Gompertz curve is best used in situations where it becomes more difficult to achieve an increase in the growth rate as the maximum value is approached. Example: Color TV adoptions (c3t8) 24

Logistics Curve The logistics curve is another way of forecasting with sparse data and is also used frequently to forecast new-product sales. The logistics curve has the following form: Y t = L 1 + ae bt, where a and b are the parameters that describing the curve and L is the upper limit of Y. The logistics curve is symmetric about its point of inflection, i.e., the upper half of the curve is a reflection of the lower half. Note that the Gompertz curve is not necessarily symmetric about its point of inflection. 25

Logistics Curve vs. Gompertz Curve To choose in between Logistics curve and Gompertz curve, it lies in whether it is easier to achieve the maximum value the closer you get to it, or whether it becomes more difficult to attain the maximum value the closer you get to it. If there is an offsetting factor such that growth is more difficult to maintain as the maximum is approached, then the Gompertz curve will be the best choice. If there are no such offsetting factors hindering the attainments of the maximum value, the logistics curve will be the best choice. Example: Cellular phone adoption (c3t9) 26

Bass Model Professor Frank M. Bass published a paper describing his mathematical model, which quantified the theory of adoption and diffusion of a new product by society, in Management Science nearly fifty years ago (Bass, 1969). The mathematics is straightforward, and the model has been influential in marketing, and on a variety of biological, medical and scientific forecasts. An entrepreneur with a new invention will often use the Bass model when making a case for funding. 27

Bass Model The Bass formula for the number of people, N t who have bought a product at time t depends on three parameters: p: the coefficient of innovation (the probability of initial purchase of a new product independent of the influence of previous buyers) q: the coefficient of imitation (the pressure of imitation on previous purchasers) m: the total number of people who eventually buy the product The Bass formula (discrete-time version) is N t+1 = N t + p(m N t ) + qn t (m N t )/m Rationale for the model is that initial sales will be to people who are interested in the novelty of the product, whereas later sales will be to people who are drawn to the product after seeing their friends and acquaintances use it. 28

Bass Model The above formula is a difference equation and its solution is N t = m 1 e (p+q)t 1 + (q/p)e (p+q)t (We will verify this result for the continuous-time version of the model). One interpretation of the Bass model is that the time from product launch until purchase is assumed to have a probability distribution that can be parameterized in terms of p and q. The interpretation of the hazard is that if it is multiplied by a small time increment, it gives the probability that a random purchaser who has not yet made the purchase will do so in the next small time increment. 29

Bass Model The continuous-time model of the Bass formula can be expressed in terms of the hazard function and the CDF: h(t) = p + qf(t). This differential equation give the solution for F(t) as F(t) = 1 e (p+q)t 1 + (q/p)e (p+q)t. Therefore, the pdf is f(t) = (p + q)2 e (p+q)t p[1 + (q/p)e (p+q)t ] 2. Cumulative sales are given by m F(t). The time to peak is t = ln(q) ln(p) p+q. Example: Adoption of Telephone-answering devices in the United States (c3t12) 30