A SARIMAX coupled modelling applied to individual load curves intraday forecasting

Similar documents
Univariate ARIMA Models

Problem Set 2: Box-Jenkins methodology

A time series is called strictly stationary if the joint distribution of every collection (Y t

Forecasting using R. Rob J Hyndman. 2.4 Non-seasonal ARIMA models. Forecasting using R 1

Empirical Market Microstructure Analysis (EMMA)

Univariate Time Series Analysis; ARIMA Models

Advanced Econometrics

FORECASTING SUGARCANE PRODUCTION IN INDIA WITH ARIMA MODEL

TIME SERIES ANALYSIS AND FORECASTING USING THE STATISTICAL MODEL ARIMA

Autoregressive Moving Average (ARMA) Models and their Practical Applications

Lesson 13: Box-Jenkins Modeling Strategy for building ARMA models

Circle a single answer for each multiple choice question. Your choice should be made clearly.

at least 50 and preferably 100 observations should be available to build a proper model

STAT 443 Final Exam Review. 1 Basic Definitions. 2 Statistical Tests. L A TEXer: W. Kong

Quantitative Finance I

IDENTIFICATION OF ARMA MODELS

Econometrics I: Univariate Time Series Econometrics (1)

Ch 6. Model Specification. Time Series Analysis

FE570 Financial Markets and Trading. Stevens Institute of Technology

Lecture 2: Univariate Time Series

Applied time-series analysis

Introduction to Time Series Analysis. Lecture 11.

Econometrics II Heij et al. Chapter 7.1

Time Series I Time Domain Methods

TMA4285 December 2015 Time series models, solution.

10. Time series regression and forecasting

Chapter 4: Models for Stationary Time Series

Prof. Dr. Roland Füss Lecture Series in Applied Econometrics Summer Term Introduction to Time Series Analysis

MODELING INFLATION RATES IN NIGERIA: BOX-JENKINS APPROACH. I. U. Moffat and A. E. David Department of Mathematics & Statistics, University of Uyo, Uyo

A Data-Driven Model for Software Reliability Prediction

Econometrics for Policy Analysis A Train The Trainer Workshop Oct 22-28, 2016 Organized by African Heritage Institution

3 Theory of stationary random processes

Non-Stationary Time Series and Unit Root Testing

Circle the single best answer for each multiple choice question. Your choice should be made clearly.

Ross Bettinger, Analytical Consultant, Seattle, WA

Non-Stationary Time Series and Unit Root Testing

Classic Time Series Analysis

EASTERN MEDITERRANEAN UNIVERSITY ECON 604, FALL 2007 DEPARTMENT OF ECONOMICS MEHMET BALCILAR ARIMA MODELS: IDENTIFICATION

Module 3. Descriptive Time Series Statistics and Introduction to Time Series Models

Univariate linear models

CHAPTER 8 FORECASTING PRACTICE I

Nonlinear time series

APPLIED ECONOMETRIC TIME SERIES 4TH EDITION

Lecture 1: Fundamental concepts in Time Series Analysis (part 2)

Estimating AR/MA models

Econometría 2: Análisis de series de Tiempo

Non-Stationary Time Series and Unit Root Testing

Marcel Dettling. Applied Time Series Analysis SS 2013 Week 05. ETH Zürich, March 18, Institute for Data Analysis and Process Design

The Identification of ARIMA Models

Time Series Analysis -- An Introduction -- AMS 586

Review Session: Econometrics - CLEFIN (20192)

Covariances of ARMA Processes

Ch 5. Models for Nonstationary Time Series. Time Series Analysis

Lesson 2: Analysis of time series

Introduction to ARMA and GARCH processes

STAT Financial Time Series

E 4101/5101 Lecture 6: Spectral analysis

Chapter 6: Model Specification for Time Series

Lab: Box-Jenkins Methodology - US Wholesale Price Indicator

Modelling Monthly Rainfall Data of Port Harcourt, Nigeria by Seasonal Box-Jenkins Methods

Exercises - Time series analysis

Midterm Suggested Solutions

Ch 4. Models For Stationary Time Series. Time Series Analysis

Chapter 2: Unit Roots

Financial Time Series Analysis: Part II

{ } Stochastic processes. Models for time series. Specification of a process. Specification of a process. , X t3. ,...X tn }

STAT 443 (Winter ) Forecasting

Stat 5100 Handout #12.e Notes: ARIMA Models (Unit 7) Key here: after stationary, identify dependence structure (and use for forecasting)

Forecasting. Simon Shaw 2005/06 Semester II

7. Forecasting with ARIMA models

Stationary Stochastic Time Series Models

Homework 4. 1 Data analysis problems

ECON/FIN 250: Forecasting in Finance and Economics: Section 6: Standard Univariate Models

Econometrics I. Professor William Greene Stern School of Business Department of Economics 25-1/25. Part 25: Time Series

Lecture 3: Autoregressive Moving Average (ARMA) Models and their Practical Applications

Time Series: Theory and Methods

A SEASONAL TIME SERIES MODEL FOR NIGERIAN MONTHLY AIR TRAFFIC DATA

Time Series Models and Inference. James L. Powell Department of Economics University of California, Berkeley

Covariance Stationary Time Series. Example: Independent White Noise (IWN(0,σ 2 )) Y t = ε t, ε t iid N(0,σ 2 )

7. Integrated Processes

Some Time-Series Models

AR, MA and ARMA models

ECONOMETRIA II. CURSO 2009/2010 LAB # 3

7. Integrated Processes

NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER II EXAMINATION MAS451/MTH451 Time Series Analysis TIME ALLOWED: 2 HOURS

Econometrics for Policy Analysis A Train The Trainer Workshop Oct 22-28, 2016

Using Analysis of Time Series to Forecast numbers of The Patients with Malignant Tumors in Anbar Provinc

Problem Set 2 Solution Sketches Time Series Analysis Spring 2010

Unit root problem, solution of difference equations Simple deterministic model, question of unit root

Topic 4 Unit Roots. Gerald P. Dwyer. February Clemson University

Final Examination 7/6/2011

Moving Average (MA) representations

6 NONSEASONAL BOX-JENKINS MODELS

2. An Introduction to Moving Average Models and ARMA Models

4. MA(2) +drift: y t = µ + ɛ t + θ 1 ɛ t 1 + θ 2 ɛ t 2. Mean: where θ(l) = 1 + θ 1 L + θ 2 L 2. Therefore,

Lecture 7: Model Building Bus 41910, Time Series Analysis, Mr. R. Tsay

Time Series Analysis. James D. Hamilton PRINCETON UNIVERSITY PRESS PRINCETON, NEW JERSEY

Part III Example Sheet 1 - Solutions YC/Lent 2015 Comments and corrections should be ed to

Ch 8. MODEL DIAGNOSTICS. Time Series Analysis

Econometrics of financial markets, -solutions to seminar 1. Problem 1

Transcription:

A SARIMAX coupled modelling applied to individual load curves intraday forecasting Frédéric Proïa Workshop EDF Institut Henri Poincaré - Paris 05 avril 2012 INRIA Bordeaux Sud-Ouest Institut de Mathématiques de Bordeaux 1 / 40

Plan 1 Introduction 2 On a SARIMAX coupled modelling Stationary ARMA processes Identification for stationary AR(p) and MA(q) processes Linear relationship between consumption and temperature The SARIMAX modelling Application to forecasting 3 Application to forecasting on a load curve Procedure Seasonality and stationarity ACF and PACF Selection criteria Selection on bayesian criteria Selection on bayesian criteria 4 Application on a huge dataset Technical issues Procedure Modelling Forecasting 5 Conclusion 2 / 40

Introduction Motivations. New energy meters to gather individual consumption with high frequencie. Economic issue for EDF : anticipate to optimize. Objectives. Distinguish nonthermosensitive from thermosensitive customers. Intraday daily forecasting. Introduce temperature as an exogenous contribution. 3 / 40

Introduction Detecting thermosensitivity. Deterministic criterion. Thermosensitivity if max Mt min t N t N Mt > δ where M t is the empirical median of ( C t... ) C t +1. = 1440 et δ = 1000. More pertinent than SR/DR. 70% of nonthermosensitive customers, only including 75% of SR. 4 / 40

Plan 1 Introduction 2 On a SARIMAX coupled modelling Stationary ARMA processes Identification for stationary AR(p) and MA(q) processes Linear relationship between consumption and temperature The SARIMAX modelling Application to forecasting 3 Application to forecasting on a load curve Procedure Seasonality and stationarity ACF and PACF Selection criteria Selection on bayesian criteria Selection on bayesian criteria 4 Application on a huge dataset Technical issues Procedure Modelling Forecasting 5 Conclusion 5 / 40

On a SARIMAX coupled modelling Stationary ARMA processes Stationary ARMA processes. Definition (Stationarity) A time series (Y t ) is said to be weakly stationary if, for all t Z, E[Y 2 t ] <, E[Y t ] = m and, for all s, t Z, Cov(Y t, Y s) = Cov(Y t s, Y 0 ). Definition (ARMA) Let (Y t ) be a stationary time series with zero mean. It is said to be an ARMA(p, q) process if, for every t Z, p q Y t a k Y t k = ε t + b k ε t k k=1 k=1 where (ε t ) is a white noise of variance σ 2 > 0, a R p and b R q. 6 / 40

On a SARIMAX coupled modelling Stationary ARMA processes Causality of ARMA processes. Compact expression, for all 1 t T, where the polynomials A(B)Y t = B(B)ε t A(z) = 1 a 1 z... a pz p and B(z) = 1 + b 1 z +... + b qz q. Definition (Causality) Let (Y t ) be an ARMA(p, q) process for which the polynomials A and B have no common zeroes. Then, (Y t ) is causal if and only if A(z) 0 for all z C such that z 1. Implications. Causality implies the existence of a MA( ) structure for (Y t ). Causality implies stationarity of the process. On N, causality often coincides with stationarity. 7 / 40

On a SARIMAX coupled modelling Stationary ARMA processes Existence and unicity of a stationary solution. Proposition If A(z) 0 for all z C such that z 1, then the ARMA equation A(B)Y t = B(B)ε t have the unique stationary solution Y t = ψ k ε t k, k=0 and the coefficients (ψ k ) k N are determined by the relation A 1 (z)b(z) = ψ k z k k=0 with ψk 2 <. k=0 Explosive cases. On Z, no zeroes on the unit circle is a sufficient condition. Irrelevant for practical purposes. 8 / 40

On a SARIMAX coupled modelling Identification for stationary AR(p) and MA(q) processes Autocorrelation function. To identify q. Definition (ACF) Let (Y t ) be a stationary time series. The autocorrelation function ρ associated with (Y t ) is defined, for all t Z, as ρ(t) = γ(t) γ(0) where the autocovariance function γ(t) = Cov(Y t, Y 0 ). Proposition The stationary time series (Y t ) with zero mean is a MA(q) process such that b q 0 if and only if ρ(q) 0 and ρ(t) = 0 for all t > q. 9 / 40

On a SARIMAX coupled modelling Identification for stationary AR(p) and MA(q) processes Examples. MA(1) : only ρ(1) nonzero, exponential decay of α(t). MA(2) : only ρ(1) and ρ(2) nonzero, damped exponential and sine wave for α(t). 10 / 40

On a SARIMAX coupled modelling Identification for stationary AR(p) and MA(q) processes Partial autocorrelation function. To identify p. Definition (PACF) Let (Y t ) be a stationary time series with zero mean. The partial autocorrelation function α is defined as α(0) = 1, and, for all t N, as α(t) = φ t, t where (φ t, t ) t N are computed via the Durbin-Levinson recursion. Proposition If there exists a square-integrable sequence (ψ k ) k N such that (Y t ) has a MA( ) expression with ψ 0 = 1, then the stationary time series (Y t ) with zero mean is an AR(p) process such that a p 0 if and only if α(p) 0 and α(t) = 0 for all t > p. 11 / 40

On a SARIMAX coupled modelling Identification for stationary AR(p) and MA(q) processes Examples. AR(1) : only α(1) nonzero, exponential decay of ρ(t). AR(2) : only α(1) and α(2) nonzero, damped exponential and sine wave for ρ(t). 12 / 40

On a SARIMAX coupled modelling Identification for stationary AR(p) and MA(q) processes Examples. ARMA(1,1) : exponential decay of ρ(t) and α(t) from first lag. ARMA(2,2) : exponential decay of ρ(t) and α(t) from second lag. 13 / 40

On a SARIMAX coupled modelling Linear relationship between consumption and temperature Variance-stabilizing Box-Cox transformation. Logarithmic transform given, for all 1 t T, by Y t = log ( C t + e m) where m ensures that Y t = m when C t = 0. 14 / 40

On a SARIMAX coupled modelling Linear relationship between consumption and temperature Linear relationship between consumption and temperature. On a thermosensitive load curve, for all 1 t T, For all z C, Y t = c 0 + C(B)U t + ε t. r 1 C(z) = c k+1 z k. k=0 Unknown vector c R r+1 estimated by OLS. Seasonal residuals. Residuals (ε t ) regarded as a seasonal time series. 15 / 40

On a SARIMAX coupled modelling The SARIMAX modelling Residuals (ε t ) as a seasonal time series. SARIMA(p, d, q) (P, D, Q) s modelling, for all 1 t T, (1 B) d (1 B s ) D A(B)A s(b)ε t = B(B)B s(b)v t, where (V t ) is a white noise of variance σ 2 > 0. Polynomials defined, for all z C, as p A(z) = 1 a k z k, k=1 P A s(z) = 1 α k z sk, k=1 B(z) = 1 q b k z k, k=1 B s(z) = 1 Q β k z sk, Parameters a R p, b R q, α R P and β R Q estimated by GLS. A and A s are causal. k=1 16 / 40

On a SARIMAX coupled modelling The SARIMAX modelling The dynamic coupled modelling. Definition (SARIMAX) In the particular framework of the study, a random process (Y t ) will be said to follow a SARIMAX(p, d, q, r) (P, D, Q) s coupled modelling if, for all 1 t T, it satisfies { Yt = c 0 + C(B)U t + ε t, (1 B) d (1 B s ) D A(B)A s(b)ε t = B(B)B s(b)v t. As soon as d + D > 0, (1 B) d (1 B s ) D A(B)A s(b) (Y t C(B)U t ) = B(B)B s(b)v t. 17 / 40

On a SARIMAX coupled modelling The SARIMAX modelling Existence of a stationary solution. Let I be the identity matrix of order T and 1 U T U T 1... U T r+1 Y 1 1 U T 1 U T 2... U T r Y 2 U =, Y =...... 1 U 1 U 0... U r+2 Y T Theorem Assume that U U is invertible. Then, the differenced process ( d D s εt ) where εt is given, for all 1 t T, by the vector form ( ε = I U(U U) 1 U ) Y is a stationary solution of the coupled model suitably specified. ADF and KPSS tests : d and D. Box and Jenkins methodology : p, q, r, P, Q, s. 18 / 40

On a SARIMAX coupled modelling Application to forecasting Forecasting using time series analysis. Let ε T +1 be the predictor of (ε t ) at stage T + 1. Let ĉ T be the OLS estimate of c. Assume that r has been evaluated. The predictor at horizon 1 is given by Ỹ T +1 = ĉ 0,T + The predictor at horizon H is given by Ỹ T +H = ĉ 0,T + r ĉ k,t U T k+2 + ε T +1. k=1 r ĉ k,t U T k+h+1 + ε T +H. k=1 19 / 40

Plan 1 Introduction 2 On a SARIMAX coupled modelling Stationary ARMA processes Identification for stationary AR(p) and MA(q) processes Linear relationship between consumption and temperature The SARIMAX modelling Application to forecasting 3 Application to forecasting on a load curve Procedure Seasonality and stationarity ACF and PACF Selection criteria Selection on bayesian criteria Selection on bayesian criteria 4 Application on a huge dataset Technical issues Procedure Modelling Forecasting 5 Conclusion 20 / 40

Application to forecasting on a load curve Procedure 21 / 40

Application to forecasting on a load curve Procedure Box and Jenkins methodology. For a given r, estimation of the residual set ( ε t ), for all 1 t T, ε t = Y t ĉ 0,T Select ŝ by investigating the seasonality of ( ε t ). r ĉ k,t U t k+1. Select d and D by investigating the stationarity of ( d Ḓ s εt ). Select p, q, P and Q by looking at ACF and PACF on ( d Ḓ s εt ). Adjust p, q, r, P and Q by minimizing bayesian or prediction criteria. Test of white noise on the fitted innovations. k=1 22 / 40

Application to forecasting on a load curve Seasonality and stationarity Seasonality. Fourier spectrogram on ( ε t ), ( 12 ε t ) and ( 24 ε t ), for T = 730 24 and r = 1. Stationarity. ( ε t ) is not stationary. ( ε t ), ( 24 ε t ) and ( 24 ε t ) are stationary around a deterministic trend. 23 / 40

Application to forecasting on a load curve ACF and PACF Sample autocorrelations. On the estimated residuals ( ε t ). On the seasonally differenced residuals ( 24 ε t ). 24 / 40

Application to forecasting on a load curve ACF and PACF Sample autocorrelations. On the doubly differenced residuals ( 24 ε t ). Identified models. SARIMAX(p, 0, 0, r) (P, 1, Q) 24 with p 5, r 2, P 1 and Q = 1. SARIMAX(p, 1, q, r) (P, 1, Q) 24 with p 1, q = 2, r 2, P 1 and Q = 1. 25 / 40

Application to forecasting on a load curve Selection criteria Bayesian criteria. Akaike information criterion and Schwarz bayesian criterion, AIC = 2 log L + 2k and SBC = 2 log L + k log T where L is the model likelihood and k the number of parameters. Log-likelihood. Overall randomness of successive innovations. Prediction criteria. We define C A and C R as follows, C A = 1 NH ( NH NH C T +k C T +k and CR = k=1 k=1 C T +k ) 1 ( NH k=1 ) C T +k C T +k where ( C T +1,..., C T +NH ) are N consecutive predictions at horizon H from time T. 26 / 40

Application to forecasting on a load curve Selection on bayesian criteria Selection on bayesian criteria. p d q r P D Q s AIC SBC LL VAR WN SARIMAX 1 0 0 1 0 1 1 24-362.4-330.4 186.2 0.053 SARIMAX 3 0 0 1 0 1 1 24-393.6-348.8 203.8 0.053 SARIMAX 5 0 0 1 0 1 1 24-424.7-367.2 221.4 0.053 SARIMAX 3 0 2 1 0 1 1 24-446.9-389.4 232.5 0.052 SARIMAX 3 0 2 2 0 1 1 24-457.2-393.3 238.6 0.052 SARIMAX 0 1 1 1 0 1 1 24 100.4 132.3-45.2 0.059 SARIMAX 0 1 2 1 0 1 1 24-238.7-200.4 125.4 0.055 SARIMAX 1 1 1 1 0 1 1 24-389.5-351.2 200.8 0.053 SARIMAX 2 1 2 1 0 1 1 24-435.1-384.0 225.6 0.052 Modelling with SARIMAX(3, 0, 2, 2) (0, 1, 1) 24 with T = 730 24. 27 / 40

Application to forecasting on a load curve Selection on bayesian criteria Selection on bayesian criteria. Modelling with SARIMAX(3, 0, 2, 2) (0, 1, 1) 24 with T = 730 24. Least squares estimation, for all 28 t T, C t = exp ( ) ĉ 0 + ĉ 1 U t + ĉ 2 U t 1 + ε t exp(5), ε t = ε t 24 + â 1 (ε t 1 ε t 25 ) + â 2 (ε t 2 ε t 26 ) + â 3 (ε t 3 ε t 27 ) + (V t b 1 V t 1 b 2 V t 2 ) β 1 (V t 24 b 1 V t 25 b 2 V t 26 ), in which ĉ 0 = 7.9871, ĉ 1 = 0.0166, ĉ 2 = 0.0420, â 1 = 0.4776, â 2 = 0.9030, â 3 = 0.4305, b 1 = 0.0801, b 2 = 0.8524, β 1 = 0.8125. 28 / 40

Application to forecasting on a load curve Selection on prediction criteria Selection on prediction criteria. p d q r P D Q s C A C R SARIMAX 1 0 0 1 0 1 1 24 193.9 0.1814 SARIMAX 1 0 1 2 0 1 1 24 193.3 0.1808 SARIMAX 3 0 0 1 0 1 1 24 196.0 0.1833 SARIMAX 3 0 2 1 0 1 1 24 199.9 0.1870 SARIMAX 3 0 2 2 0 1 1 24 198.9 0.1861 SARIMAX 1 1 1 1 0 1 1 24 195.4 0.1828 SARIMAX 2 1 2 1 0 1 1 24 195.3 0.1828 Forecasting with SARIMAX(1, 0, 1, 2) (0, 1, 1) 24 with T = 730 24. 29 / 40

Application to forecasting on a load curve Selection on prediction criteria Selection on prediction criteria. Forecasting with SARIMAX(1, 0, 1, 2) (0, 1, 1) 24 with T = 730 24. Parsimony is a central issue in time series analysis. Results slightly improved with a sliding window of 2 months. Around 2% of relative error between (ỸT +1,..., Y T +NH ) and (Y T +1,..., ỸT +NH ). 30 / 40

Application to forecasting on a load curve Selection on prediction criteria Refining... Influence of r and the size of the sliding windows M on C R. M = 2 months is the optimal sliding window. No more influence of r as soon as r 2. 31 / 40

Plan 1 Introduction 2 On a SARIMAX coupled modelling Stationary ARMA processes Identification for stationary AR(p) and MA(q) processes Linear relationship between consumption and temperature The SARIMAX modelling Application to forecasting 3 Application to forecasting on a load curve Procedure Seasonality and stationarity ACF and PACF Selection criteria Selection on bayesian criteria Selection on bayesian criteria 4 Application on a huge dataset Technical issues Procedure Modelling Forecasting 5 Conclusion 32 / 40

Application on a huge dataset Technical issues Huge dataset. More than 2000 load curves. Around 70% nonthermosensitive. High quality : more than 9 months of data per curve, very little missing values. Technical problems. Exponential growth of computing time with parsimony. 33 / 40

Application on a huge dataset Procedure On a representative panel. Selection of 200 heterogeneous thermosensitive curves (size, peaks intensity, etc.) Massive statistical KPSS procedures and visual first conclusions. Bayesian criteria to select the best models on average. Application to forecasting. Prediction criteria to select the best models on average. All parameters vary in their neighborhood. Consider technical issues : a night of computation for some models. 2 more bayesian criteria. Reliability index. Percentage of significance of the first exogenous coefficient. Assess the relevance on large-scale. Caution : main assumptions for t-test not satisfied! 34 / 40

Application on a huge dataset Modelling Stationarity, for r = 1 and M = 3 months. Less than 30% of ( ε t ) stationary. 100% of ( 24 ε t ) and ( 24 ε t ) stationary. SARIMAX(p, 0, q, r) (P, 1, Q) 24 and SARIMAX(p, 1, q, r) (P, 1, Q) 24. Making r increase. Relative evolution of AIC and SBC with r. Clearly, r = 1. SARIMAX(3, 0, 2, 1) (0, 1, 1) 24, SARIMAX(3, 1, 2, 1) (0, 1, 1) 24. Gain over the naive model : 50%. Gain over the SARIMA model : 2%. 35 / 40

Application on a huge dataset Modelling Significance of c 1 for M = 6 months. Substantial on thermosensitive curves. 36 / 40

Application on a huge dataset Forecasting Evolution of C R. SARIMAX(1, 1, 1, 1) (0, 1, 1) 24 and SARIMAX(2, 0, 1, 1) (0, 1, 1) 24. M = 2 months. Gain over the naive model : 65%. Gain over the SARIMA model : 3%. 37 / 40

Application on a huge dataset Forecasting Examples. 38 / 40

Plan 1 Introduction 2 On a SARIMAX coupled modelling Stationary ARMA processes Identification for stationary AR(p) and MA(q) processes Linear relationship between consumption and temperature The SARIMAX modelling Application to forecasting 3 Application to forecasting on a load curve Procedure Seasonality and stationarity ACF and PACF Selection criteria Selection on bayesian criteria Selection on bayesian criteria 4 Application on a huge dataset Technical issues Procedure Modelling Forecasting 5 Conclusion 39 / 40

Conclusion Nonthermosensitive curves. SARIMA(1, 0, 1) (0, 1, 1) 24 Thermosensitive curves. SARIMAX(1, 1, 1, 1) (0, 1, 1) 24 and SARIMAX(2, 0, 1, 1) (0, 1, 1) 24. A careful study curve by curve would provide better results. Of course... Caution : technical issues for huge datasets. Is a computation time 1000 for a gain of 0.1% relevant? Engineering approach, corporate vision. Thank you for your attention. Comments or questions? 40 / 40