Discrete time processes


Discrete time processes
"Predictions are difficult, especially about the future." (Mark Twain)
Florian Herzog, 2013

Modeling observed data
When we model observed (realized) data, we usually encounter the following situations:
- Time-series data of a single time series ($D_t$), e.g. economic data.
- Online observations, also a single time series ($D_t$).
- Ensemble data from measurements (experiments) or seasonal data ($D_t^j$).
Modeling ensemble data is often easier than modeling a single time series, because at a given point in time $t$ we have multiple realizations carrying the same stochastic information. By $D_t$, $t = 1, \dots, T$, we denote the observation at time $t$ when we have only a single time series. By $D_t^j$, $t = 1, \dots, T$, $j = 1, \dots, l$, we denote the observation at time $t$ of measurement $j$. The data $D_t$ are recorded at discrete intervals (sampled data) and assumed to be generated by an SDE.

Modeling real-world data
Stochastic differential equations can be viewed as stochastic processes from two points of view:
- A stochastic process with a probability distribution $p(t, \theta)$ across time, where $\theta$ denotes the parameters.
- A stochastic process where the changes in the resulting time series form the stochastic process, i.e. $p(D_t \mid D_{t-1}, \theta)$ or $p(\Delta D_t \mid D_{t-1}, \theta)$ with $\Delta D_t = D_t - D_{t-1}$.
The first interpretation is helpful for describing ensemble data, the second for analyzing single time series. In order to deal with discrete data, all SDEs need to be discretized.

Discrete-time white noise
Assume a discrete white noise process $\varepsilon_t$ that is identically and independently distributed (i.i.d.) with zero expectation and variance $\sigma^2$. Then the following statistical properties apply:
$$E[\varepsilon_t] = 0 \quad (1)$$
$$E[\varepsilon_t^2] = \sigma^2 \quad (2)$$
$$E[\varepsilon_t \varepsilon_\tau] = 0 \;\text{ for } t \neq \tau \quad (3)$$
The last property follows from the assumption of independence. A discretized Brownian motion is a discrete-time white noise process with $\varepsilon_t \sim \mathcal{N}(0, \Delta t)$, where $\Delta t$ is the time discretization. In discrete-time models, a white noise process can be normally distributed (Gaussian white noise) but may follow any other distribution as long as the i.i.d. assumption is valid.
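As a quick numerical illustration (not part of the original slides; $\sigma = 0.5$ and the sample size are assumed values), the following Python sketch simulates Gaussian white noise and checks properties (1)-(3) empirically:

```python
import numpy as np

# Minimal sketch (assumed sigma = 0.5): simulate Gaussian white noise and
# check E[e_t] = 0, E[e_t^2] = sigma^2, E[e_t e_tau] = 0 for t != tau.
rng = np.random.default_rng(0)
sigma, T = 0.5, 100_000
eps = rng.normal(0.0, sigma, size=T)

print(eps.mean())                      # ~ 0              (property 1)
print((eps**2).mean())                 # ~ sigma^2 = 0.25 (property 2)
print(np.mean(eps[1:] * eps[:-1]))     # ~ 0              (property 3, lag 1)
```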

Statistical Theory
Stationarity: A process is covariance-stationary if and only if (iff):
$$E[D_t] = \mu \quad \forall t \quad (4)$$
$$E[(D_t - \mu)(D_{t-j} - \mu)] = \nu_j \quad \forall t, j. \quad (5)$$
This form of stationarity is often also called weak stationarity. For example, consider the process $D_t = \mu + \varepsilon_t$, where $\varepsilon_t$ is a white noise process with zero mean and variance $\sigma^2$. In this case $D_t$ is stationary, since
$$E[D_t] = \mu, \qquad E[(D_t - \mu)(D_{t-j} - \mu)] = \begin{cases} \sigma^2 & j = 0 \\ 0 & j \neq 0 \end{cases}$$

Statistical Theory
Stationarity: A process is strictly stationary iff, for any values of $j_1, j_2, j_3, \dots$, the joint distribution of $D_t, D_{t+j_1}, D_{t+j_2}, D_{t+j_3}, \dots$ depends only on the intervals separating the different values of $j$ and not on the time $t$ itself.
Ergodicity: A process is called ergodic for the mean iff
$$\lim_{T \to \infty} \frac{1}{T} \sum_{t=1}^{T} D_t \xrightarrow{p} \mu. \quad (6)$$
A process is said to be ergodic for the second moments iff
$$\lim_{T \to \infty} \frac{1}{T - j} \sum_{t=j+1}^{T} (D_t - \mu)(D_{t-j} - \mu) \xrightarrow{p} \nu_j \quad (7)$$
where $\xrightarrow{p}$ denotes convergence in probability. From an ergodic process we can estimate all necessary quantities without possessing ensemble data. If a process is not ergodic and we have only one observation (a single time series), we cannot identify its parameters.

Statistical Theory
It can be shown that if the autocovariances of a covariance-stationary process satisfy
$$\sum_{j=0}^{\infty} |\nu_j| < \infty \quad (8)$$
then the process is ergodic for the mean.

First Order Moving Average Process
Let $\varepsilon_k$ be a Gaussian white noise process with mean zero and variance $\sigma^2$, and consider the process
$$Y_k = \mu + \varepsilon_k + \theta \varepsilon_{k-1}, \quad (9)$$
where $\mu$ and $\theta$ are any real-valued constants. This process is called a first-order moving average process, usually denoted MA(1). The expectation of $Y_k$ is given as
$$E[Y_k] = E[\mu + \varepsilon_k + \theta \varepsilon_{k-1}] = \mu + E[\varepsilon_k] + \theta E[\varepsilon_{k-1}] = \mu.$$
The constant $\mu$ is thus the mean of the process.

First Order Moving Average Process
The variance of the MA(1) is calculated as
$$\mathrm{Var}[Y_k] = E[(Y_k - \mu)^2] = E[(\varepsilon_k + \theta \varepsilon_{k-1})^2] = E[\varepsilon_k^2 + \theta^2 \varepsilon_{k-1}^2 + 2\theta \varepsilon_k \varepsilon_{k-1}] = \sigma^2 + \theta^2 \sigma^2 + 0 = (1 + \theta^2)\sigma^2.$$
The first autocovariance is computed as
$$E[(Y_k - \mu)(Y_{k-1} - \mu)] = E[(\varepsilon_k + \theta \varepsilon_{k-1})(\varepsilon_{k-1} + \theta \varepsilon_{k-2})] = E[\varepsilon_k \varepsilon_{k-1} + \theta \varepsilon_{k-1}^2 + \theta \varepsilon_k \varepsilon_{k-2} + \theta^2 \varepsilon_{k-1} \varepsilon_{k-2}] = 0 + \theta\sigma^2 + 0 + 0 = \theta\sigma^2.$$

First Order Moving Average Process
The higher-order autocovariances are all zero:
$$E[(Y_k - \mu)(Y_{k-j} - \mu)] = E[(\varepsilon_k + \theta \varepsilon_{k-1})(\varepsilon_{k-j} + \theta \varepsilon_{k-j-1})] = 0, \quad j > 1.$$
Since the mean and autocovariances are not functions of time, an MA(1) process is covariance stationary regardless of the value of $\theta$. Moreover,
$$\sum_{j=0}^{\infty} |\nu_j| = (1 + \theta^2)\sigma^2 + |\theta|\sigma^2 < \infty, \quad (10)$$
so condition (8) is satisfied. When $\varepsilon_k$ is a Gaussian white noise process, the MA(1) is ergodic for all moments.

First Order Moving Average Process
The autocorrelation is defined as $\rho_1 = \mathrm{Cov}[Y_k, Y_{k-1}] / \sqrt{\mathrm{Var}[Y_k]\,\mathrm{Var}[Y_{k-1}]}$, with $\mathrm{Var}[Y_k] = \mathrm{Var}[Y_{k-1}] = (1 + \theta^2)\sigma^2$. Therefore we get
$$\rho_1 = \frac{\theta\sigma^2}{(1 + \theta^2)\sigma^2} = \frac{\theta}{1 + \theta^2}. \quad (11)$$
All higher-order autocorrelations are zero. Positive values of $\theta$ induce positive autocorrelation, negative values induce negative autocorrelation. The autocorrelation of an MA(1) process lies between $-0.5$ and $0.5$. For any value of $\rho_1$, there exist two values of $\theta$ ($\theta$ and $1/\theta$), since
$$\rho_1 = \frac{1/\theta}{1 + (1/\theta)^2} = \frac{\theta^2 (1/\theta)}{\theta^2 \big(1 + (1/\theta)^2\big)} = \frac{\theta}{1 + \theta^2}. \quad (12)$$
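The following sketch (assumed values $\theta = 0.6$, $\sigma = 1$, not from the original slides) simulates an MA(1) path and compares the sample lag-1 autocorrelation with $\theta/(1+\theta^2)$, checking that higher lags are near zero:

```python
import numpy as np

# Minimal sketch (assumed mu=0, theta=0.6, sigma=1): simulate an MA(1)
# and compare the sample lag-1 autocorrelation with theta / (1 + theta^2).
rng = np.random.default_rng(1)
theta, sigma, T = 0.6, 1.0, 200_000
eps = rng.normal(0.0, sigma, size=T + 1)
y = eps[1:] + theta * eps[:-1]          # Y_k = eps_k + theta * eps_{k-1}

def sample_acf(x, lag):
    x = x - x.mean()
    return np.dot(x[lag:], x[:-lag]) / np.dot(x, x)

print(sample_acf(y, 1))        # ~ theta / (1 + theta^2) = 0.441
print(sample_acf(y, 2))        # ~ 0
```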

Q-th Order Moving Average Process
A q-th order moving average process, denoted MA(q), is given by
$$Y_k = \mu + \varepsilon_k + \theta_1 \varepsilon_{k-1} + \theta_2 \varepsilon_{k-2} + \dots + \theta_q \varepsilon_{k-q}, \quad (13)$$
where $\mu$ and $\theta_1, \dots, \theta_q$ are real-valued constants. The expectation of $Y_k$ is given as
$$E[Y_k] = \mu + E[\varepsilon_k] + \theta_1 E[\varepsilon_{k-1}] + \dots + \theta_q E[\varepsilon_{k-q}] = \mu.$$
The constant $\mu$ is thus the mean of the process. The variance of the MA(q) is calculated as
$$\mathrm{Var}[Y_k] = E[(Y_k - \mu)^2] = E[(\varepsilon_k + \theta_1 \varepsilon_{k-1} + \theta_2 \varepsilon_{k-2} + \dots + \theta_q \varepsilon_{k-q})^2].$$
Since all $\varepsilon$'s are uncorrelated, the expectations of all cross terms $\varepsilon_j \varepsilon_k$ (with $j \neq k$) are zero.

Q-th Order Moving Average Process
The variance is therefore
$$\mathrm{Var}[Y_k] = \sigma^2 + \theta_1^2 \sigma^2 + \dots + \theta_q^2 \sigma^2 = \sigma^2 \Big(1 + \sum_{j=1}^{q} \theta_j^2\Big).$$
The autocovariance at the i-th lag is computed as
$$\nu_i = E\Big[\Big(\varepsilon_k + \sum_{j=1}^{q} \theta_j \varepsilon_{k-j}\Big)\Big(\varepsilon_{k-i} + \sum_{j=1}^{q} \theta_j \varepsilon_{k-j-i}\Big)\Big] = E\big[\theta_i \varepsilon_{k-i}^2 + \theta_{i+1}\theta_1 \varepsilon_{k-i-1}^2 + \theta_{i+2}\theta_2 \varepsilon_{k-i-2}^2 + \dots + \theta_q \theta_{q-i} \varepsilon_{k-q}^2\big].$$

Q-th Order Moving Average Process
The $\varepsilon$'s at different dates have been dropped since the expectation of their product is zero. For $i > q$ there are no $\varepsilon$'s with common dates in the definition of $\nu_i$. We thus get for the autocovariance
$$\nu_i = \begin{cases} (\theta_i + \theta_{i+1}\theta_1 + \theta_{i+2}\theta_2 + \dots + \theta_q \theta_{q-i})\,\sigma^2 & i = 1, 2, \dots, q \\ 0 & i > q \end{cases}$$
The autocorrelation is given as
$$\rho_i = \begin{cases} \dfrac{\theta_i + \theta_{i+1}\theta_1 + \theta_{i+2}\theta_2 + \dots + \theta_q \theta_{q-i}}{1 + \sum_{j=1}^{q} \theta_j^2} & i = 1, 2, \dots, q \\ 0 & i > q \end{cases}$$
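The autocovariance formula can be checked numerically. The sketch below (assumed values $\theta_1 = 0.5$, $\theta_2 = -0.3$, $\sigma = 1$, with the convention $\theta_0 = 1$; not from the original slides) compares the theoretical $\nu_i$ with sample autocovariances from a simulated MA(2) path:

```python
import numpy as np

# Minimal sketch (assumed theta = [0.5, -0.3], sigma = 1): theoretical MA(q)
# autocovariances nu_i = sigma^2 * (theta_i + theta_{i+1} theta_1 + ... + theta_q theta_{q-i}),
# with theta_0 = 1, compared against sample autocovariances of a simulated path.
rng = np.random.default_rng(2)
theta = np.array([1.0, 0.5, -0.3])      # theta_0 = 1, theta_1, ..., theta_q
sigma, q, T = 1.0, 2, 200_000

def ma_autocov(theta, sigma, i):
    """Theoretical nu_i for an MA(q) process."""
    return 0.0 if i > q else sigma**2 * np.dot(theta[i:], theta[:len(theta) - i])

eps = rng.normal(0.0, sigma, size=T + q)
y = sum(theta[j] * eps[q - j: T + q - j] for j in range(q + 1))   # Y_k = sum_j theta_j eps_{k-j}

yc = y - y.mean()
for i in range(q + 2):
    sample = np.dot(yc[i:], yc[:T - i]) / T
    print(i, ma_autocov(theta, sigma, i), round(sample, 3))
```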

First Order Autoregressive Process
We analyze the so-called first-order autoregressive process (AR(1)):
$$Y_{k+1} = c + \Phi Y_k + \varepsilon_k$$
Long-term mean: We take the expectation on both sides of the equation:
$$E[Y_{k+1}] = E[c + \Phi Y_k + \varepsilon_k]$$
$$E[Y_{k+1}] = c + \Phi E[Y_k]$$
$$\mu = c + \Phi\mu \quad \Rightarrow \quad \mu = \frac{c}{1 - \Phi}$$

First Order Autoregressive Process
$$\mu = \frac{c}{1 - \Phi}$$
From this formula we can see that $\Phi$ must be smaller than 1, otherwise the mean would be negative when $c > 0$. We have also assumed that $E[Y_k] = E[Y_{k+1}] = \mu$ exists. Second moment: substituting $c = \mu(1 - \Phi)$,
$$Y_{k+1} = \mu(1 - \Phi) + \Phi Y_k + \varepsilon_k$$
$$Y_{k+1} - \mu = \Phi(Y_k - \mu) + \varepsilon_k$$

First Order Autoregressive Process
$$E[(Y_{k+1} - \mu)^2] = E[(\Phi(Y_k - \mu) + \varepsilon_k)^2] = \Phi^2 E[(Y_k - \mu)^2] + 2\Phi E[(Y_k - \mu)\varepsilon_k] + E[\varepsilon_k^2]$$
$$E[(Y_{k+1} - \mu)^2] = \Phi^2 E[(Y_k - \mu)^2] + \sigma^2$$
where $E[(Y_k - \mu)\varepsilon_k] = 0$. Assuming covariance stationarity, i.e. $E[(Y_{k+1} - \mu)^2] = E[(Y_k - \mu)^2] = \nu$, we get:
$$\nu = \Phi^2 \nu + \sigma^2 \quad \Rightarrow \quad \nu = \frac{\sigma^2}{1 - \Phi^2}$$

First Order Autoregressive Process
Autocorrelation:
$$E[(Y_{k+1} - \mu)(Y_k - \mu)] = E[(\Phi(Y_k - \mu) + \varepsilon_k)(Y_k - \mu)] = \Phi E[(Y_k - \mu)^2] + E[\varepsilon_k (Y_k - \mu)].$$
Again with $E[\varepsilon_k (Y_k - \mu)] = 0$ and $E[(Y_k - \mu)^2] = \nu$ we get:
$$E[(Y_{k+1} - \mu)(Y_k - \mu)] = \Phi\nu = \Phi \frac{\sigma^2}{1 - \Phi^2}.$$
The autocorrelation is defined as $\mathrm{Cov}[Y_{k+1}, Y_k] / \sqrt{\mathrm{Var}[Y_{k+1}]\,\mathrm{Var}[Y_k]}$ with $\mathrm{Var}[Y_k] = \mathrm{Var}[Y_{k+1}] = \nu$. Therefore we get
$$\mathrm{acf}(Y_{k+1}, Y_k) = \Phi \quad (14)$$

First Order Autoregressive Process
The AR(1) process $Y_{k+1} = \Phi Y_k + c + \varepsilon_k$ has the following properties:
Mean: $\mu = \dfrac{c}{1 - \Phi}$
Variance: $\nu = \dfrac{\sigma^2}{1 - \Phi^2}$
Autocorrelation: $\mathrm{acf}(Y_{k+j}, Y_k) = \Phi^j$
A covariance-stationary AR(1) process is only well defined when $|\Phi| < 1$; otherwise the variance is infinite and the mean makes no sense. We can check whether a process is an AR(1) process by calculating the mean, the variance over time, and the autocorrelation. From these three quantities the parameters can be estimated. It is important to notice that the lag-one autocorrelation is exactly the coefficient of the autoregression.
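A simulation can illustrate these three properties. The following sketch (assumed values $c = 1$, $\Phi = 0.8$, $\sigma = 0.5$; not from the original slides) compares the sample mean, variance, and lag-1 autocorrelation with the formulas above:

```python
import numpy as np

# Minimal sketch (assumed c=1.0, Phi=0.8, sigma=0.5): simulate an AR(1) and
# compare sample mean, variance, and lag-1 autocorrelation with
# mu = c/(1-Phi), nu = sigma^2/(1-Phi^2), acf(1) = Phi.
rng = np.random.default_rng(3)
c, Phi, sigma, T = 1.0, 0.8, 0.5, 200_000

y = np.empty(T)
y[0] = c / (1 - Phi)                          # start at the long-term mean
for k in range(T - 1):
    y[k + 1] = c + Phi * y[k] + sigma * rng.normal()

yc = y - y.mean()
print(y.mean(), c / (1 - Phi))                           # mean
print(y.var(), sigma**2 / (1 - Phi**2))                  # variance
print(np.dot(yc[1:], yc[:-1]) / np.dot(yc, yc), Phi)     # lag-1 autocorrelation
```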

Example: GBM and mean reversion
The two examples have a discrete-time structure of a first-order autoregressive process with:
(I) $\Phi = 1 + \mu \Delta t$
(II) $\Phi = 1 - b \Delta t$
When we assume that $\mu > 0$, the first model (GBM) is unstable and we cannot use the techniques from autoregressions to estimate the parameters. We have to proceed differently in this case, by using $p = \ln(x)$, since then we obtain an SDE with constant coefficients. When we assume that $b > 0$, the second process is a true mean-reverting process that can be estimated by AR(1) methods. In this case, the discrete-time model is also stable.

First Order Difference Equation
A first-order difference equation given by $y_{k+1} = \Phi y_k + w_k$ is of the same type as an AR(1) equation. It can be solved by recursive substitution, since $y_1 = \Phi y_0 + w_0$, $y_2 = \Phi y_1 + w_1$, $y_3 = \Phi y_2 + w_2$, .... When we plug the result for $y_1$ into the equation for $y_2$, we get $y_2 = \Phi^2 y_0 + \Phi w_0 + w_1$, which in turn we plug into the equation for $y_3$ and get $y_3 = \Phi^3 y_0 + \Phi^2 w_0 + \Phi w_1 + w_2$. Expanding this further, we get the general solution
$$y_{k+1} = \Phi^{k+1} y_0 + \sum_{j=0}^{k} \Phi^{k-j} w_j$$
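The closed-form solution can be verified numerically against the recursion; the sketch below uses an assumed $\Phi = 0.9$ and random $w_k$:

```python
import numpy as np

# Minimal sketch (assumed Phi=0.9, random w_k): verify that recursive
# substitution and the closed form y_{k+1} = Phi^{k+1} y_0 + sum_j Phi^{k-j} w_j agree.
rng = np.random.default_rng(4)
Phi, y0, K = 0.9, 2.0, 50
w = rng.normal(size=K + 1)

# recursion
y = y0
for k in range(K + 1):
    y = Phi * y + w[k]

# closed form
y_closed = Phi**(K + 1) * y0 + sum(Phi**(K - j) * w[j] for j in range(K + 1))

print(y, y_closed)    # identical up to floating-point error
```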

p-th Order Difference Equation
A p-th order difference equation is given by
$$y_{k+1} = \Phi_1 y_k + \Phi_2 y_{k-1} + \dots + \Phi_p y_{k-p+1} + w_k$$
We write this in vectorized (companion) form with
$$\xi_k = \begin{pmatrix} y_k \\ y_{k-1} \\ y_{k-2} \\ \vdots \\ y_{k-p+1} \end{pmatrix}, \quad F = \begin{pmatrix} \Phi_1 & \Phi_2 & \Phi_3 & \dots & \Phi_{p-1} & \Phi_p \\ 1 & 0 & 0 & \dots & 0 & 0 \\ 0 & 1 & 0 & \dots & 0 & 0 \\ \vdots & & & \ddots & & \vdots \\ 0 & 0 & 0 & \dots & 1 & 0 \end{pmatrix}, \quad v_k = \begin{pmatrix} w_k \\ 0 \\ 0 \\ \vdots \\ 0 \end{pmatrix}$$
and get $\xi_{k+1} = F \xi_k + v_k$.

p-th Order Difference Equation
The solution is given as:
$$\xi_{k+1} = F^{k+1} \xi_0 + F^{k} v_0 + F^{k-1} v_1 + F^{k-2} v_2 + \dots + F v_{k-1} + v_k$$
The p-th order difference equation is stable (or stationary) when the eigenvalues of $F$ are all smaller than one in absolute value. The eigenvalue polynomial is given as:
$$\lambda^p - \Phi_1 \lambda^{p-1} - \Phi_2 \lambda^{p-2} - \dots - \Phi_{p-1}\lambda - \Phi_p = 0$$
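Stability can be checked numerically by forming the companion matrix $F$ and inspecting its eigenvalues; the sketch below uses assumed coefficients $\Phi_1 = 1.2$, $\Phi_2 = -0.5$ (not from the original slides):

```python
import numpy as np

# Minimal sketch (assumed Phi = [1.2, -0.5] for p=2): build the companion
# matrix F and check stability via |eigenvalues| < 1.
Phi = np.array([1.2, -0.5])
p = len(Phi)

F = np.zeros((p, p))
F[0, :] = Phi                 # first row holds Phi_1, ..., Phi_p
F[1:, :-1] = np.eye(p - 1)    # subdiagonal of ones

eigvals = np.linalg.eigvals(F)
print(eigvals)
print("stable:", np.all(np.abs(eigvals) < 1))
```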

Lag Operators
A highly useful operator for time series models is the lag operator. Suppose that we start with a sequence of data $x_k$, $k = \dots, -1, 0, 1, \dots$, and generate a new sequence $y_k$ where the value of $y$ at date $k$ equals the value $x$ took at time $k-1$: $y_k = x_{k-1}$. This is described as applying the lag operator to $x_k$ and is represented by the symbol $L$: $L x_k = x_{k-1}$. Consider applying the lag operator twice to a series: $L(L x_k) = L(x_{k-1}) = x_{k-2}$, which is denoted $L^2$, i.e. $L^2 x_k = x_{k-2}$.

Lag Operators
In general, for any integer $l$: $L^l x_k = x_{k-l}$. The lag operator is linear and can be treated like a multiplication operator, thus
$$L(\beta x_k) = \beta L x_k = \beta x_{k-1}$$
$$L(x_k + w_k) = L x_k + L w_k$$
$$y_k = (a + bL)L x_k = (aL + bL^2)x_k = a x_{k-1} + b x_{k-2}$$
An expression such as $(aL + bL^2)$ is referred to as a polynomial in the lag operator. Notice that if $x_k = c$ is constant, then $L x_k = x_{k-1} = c$.

First Order Difference Equation and Lag Operators
A first-order difference equation given by $y_{k+1} = \Phi y_k + w_k$ can be written as
$$y_{k+1} = \Phi L y_{k+1} + w_k$$
$$y_{k+1} - \Phi L y_{k+1} = w_k$$
$$(1 - \Phi L)\, y_{k+1} = w_k$$
When we multiply both sides by $(1 + \Phi L + \Phi^2 L^2 + \Phi^3 L^3 + \dots + \Phi^k L^k)$ we get
$$(1 + \Phi L + \Phi^2 L^2 + \dots + \Phi^k L^k)(1 - \Phi L)\, y_{k+1} = (1 + \Phi L + \Phi^2 L^2 + \dots + \Phi^k L^k)\, w_k$$

First Order Difference Equation and Lag Operators
$$(1 - \Phi^{k+1} L^{k+1})\, y_{k+1} = (1 + \Phi L + \Phi^2 L^2 + \dots + \Phi^k L^k)\, w_k$$
$$y_{k+1} = \Phi^{k+1} L^{k+1} y_{k+1} + (1 + \Phi L + \Phi^2 L^2 + \dots + \Phi^k L^k)\, w_k$$
$$y_{k+1} = \Phi^{k+1} y_0 + w_k + \Phi w_{k-1} + \Phi^2 w_{k-2} + \Phi^3 w_{k-3} + \dots + \Phi^k w_0$$
which is exactly the same solution as obtained by recursive substitution.

P-th Order Difference Equation and Lag Operators
The p-th order difference equation given by
$$y_{k+1} = \Phi_1 y_k + \Phi_2 y_{k-1} + \dots + \Phi_p y_{k-p+1} + w_k$$
can be written using the lag operator as
$$(1 - \Phi_1 L - \Phi_2 L^2 - \Phi_3 L^3 - \dots - \Phi_p L^p)\, y_{k+1} = w_k,$$
which results in the same polynomial as in the case of the eigenvalues of the matrix $F$.

P-th Order Difference Equation and Lag Operators
Since
$$(1 - \Phi_1 z - \Phi_2 z^2 - \Phi_3 z^3 - \dots - \Phi_p z^p) = (1 - \lambda_1 z)(1 - \lambda_2 z)\cdots(1 - \lambda_p z),$$
by defining $z^{-1} = \lambda$ and multiplying both sides by $z^{-p}$ we get
$$(\lambda^p - \Phi_1 \lambda^{p-1} - \Phi_2 \lambda^{p-2} - \dots - \Phi_p) = (\lambda - \lambda_1)(\lambda - \lambda_2)\cdots(\lambda - \lambda_p),$$
which is nothing but the root search for the left-hand-side polynomial:
$$\lambda^p - \Phi_1 \lambda^{p-1} - \Phi_2 \lambda^{p-2} - \dots - \Phi_{p-1}\lambda - \Phi_p = 0$$

Second Order Autoregression
A second-order autoregressive process (denoted AR(2)) has the form
$$Y_{k+1} = c + \Phi_1 Y_k + \Phi_2 Y_{k-1} + \varepsilon_k$$
or, written in lag notation,
$$(1 - \Phi_1 L - \Phi_2 L^2)\, Y_{k+1} = c + \varepsilon_k$$
The AR(2) is stable if the roots of $(1 - \Phi_1 z - \Phi_2 z^2) = 0$ lie outside the unit circle. The process has the mean value:
$$E[Y_{k+1}] = c + \Phi_1 E[Y_k] + \Phi_2 E[Y_{k-1}] + E[\varepsilon_k]$$
$$\mu = c + \Phi_1 \mu + \Phi_2 \mu \quad \Rightarrow \quad \mu = \frac{c}{1 - \Phi_1 - \Phi_2}$$

Second Order Autoregression
To find the second moments of the AR(2), we write
$$Y_{k+1} = \mu(1 - \Phi_1 - \Phi_2) + \Phi_1 Y_k + \Phi_2 Y_{k-1} + \varepsilon_k$$
$$Y_{k+1} - \mu = \Phi_1 (Y_k - \mu) + \Phi_2 (Y_{k-1} - \mu) + \varepsilon_k.$$
Multiplying both sides by $Y_{k+1-j} - \mu$ and taking expectations yields
$$\nu_j = \Phi_1 \nu_{j-1} + \Phi_2 \nu_{j-2}, \quad j = 1, 2, 3, \dots$$
The autocovariances follow the same second-order difference equation as the AR(2) process itself. Dividing the last equation by $\nu_0$ gives the autocorrelations
$$\rho_j = \Phi_1 \rho_{j-1} + \Phi_2 \rho_{j-2}, \quad j = 1, 2, 3, \dots$$

Second Order Autoregression
For $j = 1$ we get for the autocorrelation:
$$\rho_1 = \Phi_1 + \Phi_2 \rho_1 \quad \Rightarrow \quad \rho_1 = \frac{\Phi_1}{1 - \Phi_2},$$
since $\rho_0 = 1$ and $\rho_{-1} = \rho_1$. For $j = 2$ we get
$$\rho_2 = \Phi_1 \rho_1 + \Phi_2 \rho_0 = \frac{\Phi_1^2}{1 - \Phi_2} + \Phi_2.$$
The variance $\nu_0$ is computed from
$$E[(Y_{k+1} - \mu)(Y_{k+1} - \mu)] = \Phi_1 E[(Y_k - \mu)(Y_{k+1} - \mu)] + \Phi_2 E[(Y_{k-1} - \mu)(Y_{k+1} - \mu)] + E[\varepsilon_k (Y_{k+1} - \mu)],$$

Second Order Autoregression
The variance equation can be written as
$$\nu_0 = \Phi_1 \nu_1 + \Phi_2 \nu_2 + \sigma^2.$$
The last term follows from noticing that
$$E[\varepsilon_k (Y_{k+1} - \mu)] = E[\varepsilon_k (\Phi_1 (Y_k - \mu) + \Phi_2 (Y_{k-1} - \mu) + \varepsilon_k)] = \Phi_1 \cdot 0 + \Phi_2 \cdot 0 + \sigma^2.$$
Using $\nu_1 = \rho_1 \nu_0$ and $\nu_2 = \rho_2 \nu_0$, the variance equation becomes
$$\nu_0 = \Phi_1 \rho_1 \nu_0 + \Phi_2 \rho_2 \nu_0 + \sigma^2, \qquad \nu_0 = \frac{(1 - \Phi_2)\,\sigma^2}{(1 + \Phi_2)\big[(1 - \Phi_2)^2 - \Phi_1^2\big]}$$
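These moment formulas can be verified by simulation. The sketch below (assumed values $\Phi_1 = 0.5$, $\Phi_2 = 0.3$, $\sigma = 1$; not from the original slides) compares $\rho_1$, $\rho_2$, and $\nu_0$ with their sample counterparts:

```python
import numpy as np

# Minimal sketch (assumed c=0, Phi1=0.5, Phi2=0.3, sigma=1): compare the
# theoretical AR(2) quantities rho_1, rho_2, nu_0 with sample estimates.
rng = np.random.default_rng(5)
Phi1, Phi2, sigma, T = 0.5, 0.3, 1.0, 300_000

rho1 = Phi1 / (1 - Phi2)
rho2 = Phi1**2 / (1 - Phi2) + Phi2
nu0 = (1 - Phi2) * sigma**2 / ((1 + Phi2) * ((1 - Phi2)**2 - Phi1**2))

y = np.zeros(T)
for k in range(2, T):
    y[k] = Phi1 * y[k - 1] + Phi2 * y[k - 2] + sigma * rng.normal()

yc = y - y.mean()
acf = lambda j: np.dot(yc[j:], yc[:-j]) / np.dot(yc, yc)
print(rho1, acf(1))
print(rho2, acf(2))
print(nu0, y.var())
```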

P-th Order Autoregression
A p-th order autoregressive process (denoted AR(p)) has the form
$$Y_{k+1} = c + \Phi_1 Y_k + \Phi_2 Y_{k-1} + \dots + \Phi_p Y_{k-p+1} + \varepsilon_k$$
or, written in lag notation,
$$(1 - \Phi_1 L - \Phi_2 L^2 - \dots - \Phi_p L^p)\, Y_{k+1} = c + \varepsilon_k$$
The AR(p) is stable if the roots of $(1 - \Phi_1 z - \Phi_2 z^2 - \dots - \Phi_p z^p) = 0$ lie outside the unit circle. The process has the mean value:
$$E[Y_{k+1}] = c + \Phi_1 E[Y_k] + \Phi_2 E[Y_{k-1}] + \dots + \Phi_p E[Y_{k-p+1}] + E[\varepsilon_k]$$
$$\mu = c + \Phi_1 \mu + \Phi_2 \mu + \dots + \Phi_p \mu \quad \Rightarrow \quad \mu = \frac{c}{1 - \Phi_1 - \Phi_2 - \dots - \Phi_p}$$

P-th Order Autoregression
The autocovariances are given as
$$\nu_j = \begin{cases} \Phi_1 \nu_{j-1} + \Phi_2 \nu_{j-2} + \dots + \Phi_p \nu_{j-p} & j = 1, 2, 3, \dots \\ \Phi_1 \nu_1 + \Phi_2 \nu_2 + \dots + \Phi_p \nu_p + \sigma^2 & j = 0 \end{cases}$$
The equation for the autocorrelations is given as
$$\rho_j = \Phi_1 \rho_{j-1} + \Phi_2 \rho_{j-2} + \dots + \Phi_p \rho_{j-p}, \quad j = 1, 2, 3, \dots$$
which is known as the Yule-Walker equation. The equation is again a p-th order difference equation.
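The Yule-Walker equations can be solved numerically for $\rho_1, \dots, \rho_p$ and then extended recursively to higher lags; the sketch below uses the assumed AR(2) coefficients $\Phi = (0.5, 0.3)$ (not from the original slides):

```python
import numpy as np

# Minimal sketch (assumed Phi = [0.5, 0.3] for an AR(2)): solve the Yule-Walker
# equations rho_j = Phi_1 rho_{j-1} + ... + Phi_p rho_{j-p} for rho_1..rho_p,
# then extend to higher lags with the same p-th order difference equation.
Phi = np.array([0.5, 0.3])
p = len(Phi)

A = np.eye(p)
b = Phi.copy()
for j in range(1, p + 1):                 # equation for rho_j
    for i in range(1, p + 1):
        if i == j:
            continue                       # Phi_j * rho_0 = Phi_j goes to the right-hand side
        A[j - 1, abs(j - i) - 1] -= Phi[i - 1]
rho = np.linalg.solve(A, b)                # rho_1, ..., rho_p

for j in range(p, 10):                     # recursion for lags > p
    rho = np.append(rho, np.dot(Phi, rho[j - p: j][::-1]))
print(rho)                                 # rho_1, ..., rho_10
```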

Mixed Autoregressive Moving Average Process (ARMA)
An ARMA(p,q) process includes autoregressive as well as moving average elements:
$$Y_{k+1} = c + \Phi_1 Y_k + \Phi_2 Y_{k-1} + \dots + \Phi_p Y_{k-p+1} + \varepsilon_k + \theta_1 \varepsilon_{k-1} + \theta_2 \varepsilon_{k-2} + \dots + \theta_q \varepsilon_{k-q}.$$
The process is written in lag form as
$$(1 - \Phi_1 L - \Phi_2 L^2 - \dots - \Phi_p L^p)\, Y_{k+1} = c + (1 + \theta_1 L + \theta_2 L^2 + \dots + \theta_q L^q)\, \varepsilon_k.$$
Provided that the roots of $(1 - \Phi_1 z - \Phi_2 z^2 - \dots - \Phi_p z^p) = 0$ lie outside the unit circle, we obtain the following general form for an ARMA(p,q) process:
$$Y_{k+1} = \mu + \frac{1 + \theta_1 L + \theta_2 L^2 + \dots + \theta_q L^q}{1 - \Phi_1 L - \Phi_2 L^2 - \dots - \Phi_p L^p}\, \varepsilon_k.$$

Mixed Autoregressive Moving Average Process (ARMA)
The mean is given as
$$\mu = \frac{c}{1 - \Phi_1 - \Phi_2 - \dots - \Phi_p}.$$
The stationarity of an ARMA process depends only on the autoregressive parameters and not on the moving average parameters. A more convenient form of the ARMA(p,q) process is
$$Y_{k+1} - \mu = \Phi_1 (Y_k - \mu) + \Phi_2 (Y_{k-1} - \mu) + \dots + \Phi_p (Y_{k-p+1} - \mu) + \varepsilon_k + \theta_1 \varepsilon_{k-1} + \theta_2 \varepsilon_{k-2} + \dots + \theta_q \varepsilon_{k-q}.$$
The autocovariance equation for $j > q$ is given as
$$\nu_j = \Phi_1 \nu_{j-1} + \Phi_2 \nu_{j-2} + \dots + \Phi_p \nu_{j-p}, \quad j = q+1, q+2, \dots$$

Mixed Autoregressive Moving Average Process (ARMA)
Thus, after q lags, the autocovariance function $\nu_j$ follows a p-th order difference equation governed by the autoregressive part. Note that the last equation does not hold for $j \leq q$, since the correlation between $\theta_j \varepsilon_{k-j}$ and $Y_{k-j}$ cannot be neglected. In order to analyze the autocovariances, we introduce the autocovariance-generating function
$$g_Y(z) = \sum_{j=-\infty}^{\infty} \nu_j z^j,$$
which is constructed by taking the j-th autocovariance, multiplying it by $z$ to the power of $j$, and summing over all possible values of $j$.
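The behaviour of the autocovariances for $j > q$ can be illustrated by simulation. The sketch below (assumed ARMA(1,1) with $\Phi_1 = 0.7$, $\theta_1 = 0.4$, $\sigma = 1$; not from the original slides) checks that the sample autocovariances approximately satisfy $\nu_j \approx \Phi_1 \nu_{j-1}$ for $j > 1$:

```python
import numpy as np

# Minimal sketch (assumed Phi1=0.7, theta1=0.4, sigma=1): simulate an ARMA(1,1)
# and check that for lags j > q the sample autocovariances satisfy
# nu_j ~= Phi1 * nu_{j-1}, i.e. the AR part governs the decay.
rng = np.random.default_rng(6)
Phi1, theta1, sigma, T = 0.7, 0.4, 1.0, 300_000

eps = sigma * rng.normal(size=T)
y = np.zeros(T)
for k in range(1, T):
    y[k] = Phi1 * y[k - 1] + eps[k] + theta1 * eps[k - 1]

yc = y - y.mean()
acov = lambda j: np.dot(yc[j:], yc[:T - j]) / T
for j in range(2, 6):                     # j > q = 1
    print(j, acov(j), Phi1 * acov(j - 1))
```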

Parameter Identification
We now examine the appropriate log-likelihood function within our framework of discretized SDEs. Let $\psi \in \mathbb{R}^d$ be the vector of unknown parameters, which belongs to the admissible parameter space $\Psi$, e.g. $a$, $b$, etc. The various system matrices of the discrete models, as well as the variances of the stochastic processes given in the previous section, depend on $\psi$. The likelihood function of the state space model is given by the joint density of the observational data $x = (x_N, x_{N-1}, \dots, x_1)$:
$$l(x, \psi) = p(x_N, x_{N-1}, \dots, x_1; \psi) \quad (15)$$
which reflects how likely it would have been to observe the data if $\psi$ were the true values of the parameters.

Parameter Identification
Using the definition of conditional probability and employing Bayes' theorem recursively, we can write the joint density as a product of conditional densities:
$$l(x, \psi) = p(x_N \mid x_{N-1}, \dots, x_1; \psi)\cdots p(x_k \mid x_{k-1}, \dots, x_1; \psi)\cdots p(x_1; \psi) \quad (16)$$
where we approximate the initial density function $p(x_1; \psi)$ by $p(x_1 \mid x_0; \psi)$. Furthermore, since we deal with a Markovian system, future values $x_l$ with $l > k$ depend on $(x_k, x_{k-1}, \dots, x_1)$ only through the current value $x_k$; the expression in (16) can therefore be rewritten to depend only on the last observed value in each factor, and reduces to
$$l(x, \psi) = p(x_N \mid x_{N-1}; \psi)\cdots p(x_k \mid x_{k-1}; \psi)\cdots p(x_1; \psi) \quad (17)$$

Parameter Identification
An Euler-discretized SDE is given as
$$x_{k+1} = x_k + f(x_k, k)\,\Delta t + g(x_k, k)\,\sqrt{\Delta t}\,\varepsilon_k \quad (18)$$
where $\varepsilon_k$ is an i.i.d. random variable with normal distribution, zero expectation, and unit variance. Since $f$ and $g$ are deterministic functions, $x_{k+1}$ is, conditional on $x_k$, normally distributed with
$$E[x_{k+1}] = x_k + f(x_k, k)\,\Delta t \quad (19)$$
$$\mathrm{Var}[x_{k+1}] = g(x_k, k)\, g(x_k, k)^T \Delta t \quad (20)$$
$$p(x_{k+1}) = \frac{1}{(2\pi)^{\frac{n}{2}} \,|\mathrm{Var}[x_{k+1}]|^{\frac{1}{2}}} \, e^{-\frac{1}{2}(x_{k+1} - E[x_{k+1}])^T \mathrm{Var}[x_{k+1}]^{-1} (x_{k+1} - E[x_{k+1}])} \quad (21)$$

Parameter Identification
The parameters of an SDE are estimated by maximizing the log-likelihood. Maximizing the log-likelihood is equivalent to maximizing the likelihood, since the log function is strictly increasing. The log-likelihood is given as:
$$\ln(l(x, \psi)) = \ln\big(p(x_N \mid x_{N-1}; \psi)\cdots p(x_k \mid x_{k-1}; \psi)\cdots p(x_1; \psi)\big) \quad (22)$$
$$\ln(l(x, \psi)) = \sum_{j=1}^{N} \ln p(x_j \mid x_{j-1}; \psi) \quad (23)$$
$$\ln(l(x, \psi)) = \sum_{j=1}^{N} \Big\{ -\ln\big((2\pi)^{\frac{n}{2}} |\mathrm{Var}[x_j]|^{\frac{1}{2}}\big) - \frac{1}{2}(x_j - E[x_j])^T \mathrm{Var}[x_j]^{-1} (x_j - E[x_j]) \Big\} \quad (24)$$
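For the scalar case, the conditional-Gaussian log-likelihood (22)-(24) of an Euler-discretized SDE can be coded directly. The sketch below assumes a linear drift $f(x) = a x$ and constant diffusion $g(x) = b$; these choices, the function name, and the parameter values are assumptions for illustration, not part of the original slides:

```python
import numpy as np

# Minimal sketch (scalar case, assumed drift f(x)=a*x, diffusion g(x)=b):
# Gaussian log-likelihood of an Euler-discretized SDE, summing
# ln p(x_j | x_{j-1}; psi) as in equations (22)-(24).
def euler_loglik(x, a, b, dt):
    mean = x[:-1] + a * x[:-1] * dt            # E[x_j | x_{j-1}]
    var = b**2 * dt                            # Var[x_j | x_{j-1}]
    resid = x[1:] - mean
    return np.sum(-0.5 * np.log(2 * np.pi * var) - 0.5 * resid**2 / var)

# usage on a simulated path with assumed true parameters a=-0.5, b=0.2, dt=0.01
rng = np.random.default_rng(7)
a, b, dt, N = -0.5, 0.2, 0.01, 5_000
x = np.empty(N)
x[0] = 1.0
for k in range(N - 1):
    x[k + 1] = x[k] + a * x[k] * dt + b * np.sqrt(dt) * rng.normal()

print(euler_loglik(x, a, b, dt))               # log-likelihood at the true parameters
```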

Example: Parameter Identification
We want to identify the parameters of an AR(1) process without any constant,
$$x_{k+1} = \Phi x_k + \sigma \varepsilon_k \quad (25)$$
which has the log-likelihood function
$$\ln(l(x, \psi)) = -N \ln\big((2\pi)^{\frac{1}{2}} \sigma\big) - \sum_{j=1}^{N} \frac{(x_{j+1} - \Phi x_j)^2}{2\sigma^2}. \quad (26)$$
The function is maximized by taking the derivatives with respect to $\Phi$ and $\sigma$. The log-likelihood function has the same form as a so-called ordinary least-squares (OLS) estimation.

Example: Parameter Identification
The solution of the maximum likelihood estimation is:
$$\Phi = \frac{\sum_{j=1}^{N} x_{j+1} x_j}{\sum_{j=1}^{N} x_j^2} \quad (27)$$
$$\sigma = \sqrt{\frac{1}{N} \sum_{j=1}^{N} (x_{j+1} - \Phi x_j)^2} \quad (28)$$
The solution for $\Phi$ is the same as for an OLS regression. Furthermore, it is exactly the definition of the autocorrelation at lag one. The estimate of $\sigma$ is the empirical standard deviation of the estimated empirical white noise, i.e. $\varepsilon_k = x_{k+1} - \Phi x_k$.
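The closed-form estimators (27)-(28) are straightforward to apply; the sketch below (assumed true values $\Phi = 0.85$, $\sigma = 0.3$; not from the original slides) estimates them from a simulated AR(1) path:

```python
import numpy as np

# Minimal sketch (assumed Phi=0.85, sigma=0.3): estimate the AR(1) parameters
# with the closed-form ML/OLS solution (27)-(28) from a simulated path.
rng = np.random.default_rng(8)
Phi_true, sigma_true, N = 0.85, 0.3, 100_000

x = np.zeros(N + 1)
for k in range(N):
    x[k + 1] = Phi_true * x[k] + sigma_true * rng.normal()

x_now, x_next = x[:-1], x[1:]
Phi_hat = np.dot(x_next, x_now) / np.dot(x_now, x_now)        # equation (27)
sigma_hat = np.sqrt(np.mean((x_next - Phi_hat * x_now)**2))   # equation (28)

print(Phi_hat, sigma_hat)    # ~ 0.85, ~ 0.3
```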