Modelling and forecasting of offshore wind power fluctuations with Markov-Switching models

Similar documents
Chapter 3 - Estimation by direct maximization of the likelihood

TIME SERIES ANALYSIS AND FORECASTING USING THE STATISTICAL MODEL ARIMA

Chapter 8 - Extensions of the basic HMM

Forward Problems and their Inverse Solutions

Hmms with variable dimension structures and extensions

Sequential Monte Carlo Methods for Bayesian Computation

FORECASTING: A REVIEW OF STATUS AND CHALLENGES. Eric Grimit and Kristin Larson 3TIER, Inc. Pacific Northwest Weather Workshop March 5-6, 2010

Suan Sunandha Rajabhat University

Regime-switching modelling of the fluctuations of offshore wind generation

Spatio-temporal precipitation modeling based on time-varying regressions

Dynamic linear models (aka state-space models) 1

Contents. Part I: Fundamentals of Bayesian Inference 1

INTRODUCTION TO TIME SERIES ANALYSIS. The Simple Moving Average Model

Modelling Non-linear and Non-stationary Time Series

Hidden Markov Models for precipitation

Machine Learning. Probabilistic KNN.

Prashant Pant 1, Achal Garg 2 1,2 Engineer, Keppel Offshore and Marine Engineering India Pvt. Ltd, Mumbai. IJRASET 2013: All Rights are Reserved 356

Technical note on seasonal adjustment for M0

State-Space Methods for Inferring Spike Trains from Calcium Imaging

Cross-sectional space-time modeling using ARNN(p, n) processes

Introduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation. EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016

Probabilistic Machine Learning

Economics 618B: Time Series Analysis Department of Economics State University of New York at Binghamton

Bayesian Model Comparison:

MCMC Sampling for Bayesian Inference using L1-type Priors

LECTURE 15 Markov chain Monte Carlo

MCMC analysis of classical time series algorithms.

Downscaling in Time. Andrew W. Robertson, IRI. Advanced Training Institute on Climate Variability and Food Security, 12 July 2002

CPSC 540: Machine Learning

Markov Chain Monte Carlo methods

Bayesian inference for multivariate extreme value distributions

Nonlinear Time Series

Learning the hyper-parameters. Luca Martino

Bayesian Graphical Models

GAMINGRE 8/1/ of 7

The Particle Filter. PD Dr. Rudolph Triebel Computer Vision Group. Machine Learning for Computer Vision

Multiple Regression Analysis

Stat 516, Homework 1

Non-Parametric Bayes

Dynamic models. Dependent data The AR(p) model The MA(q) model Hidden Markov models. 6 Dynamic models

Markov Chain Monte Carlo

Advanced Statistical Modelling

Assessing Regime Uncertainty Through Reversible Jump McMC

Scenario 5: Internet Usage Solution. θ j

Monte Carlo Methods. Geoff Gordon February 9, 2006

Markov Chain Monte Carlo, Numerical Integration

Statistical Inference and Methods

Model selection and checking

TIME SERIES ANALYSIS. Forecasting and Control. Wiley. Fifth Edition GWILYM M. JENKINS GEORGE E. P. BOX GREGORY C. REINSEL GRETA M.

ECO 513 Fall 2009 C. Sims HIDDEN MARKOV CHAIN MODELS

Consistent Downscaling of Seismic Inversions to Cornerpoint Flow Models SPE

Non-Markovian Regime Switching with Endogenous States and Time-Varying State Strengths

Bayesian construction of perceptrons to predict phenotypes from 584K SNP data.

Object tracking: Using HMMs to estimate the geographical location of fish

High-resolution forecasting of wind power generation with regime switching models and off-site observations

(5) Multi-parameter models - Gibbs sampling. ST440/540: Applied Bayesian Analysis

Advances and Applications in Perfect Sampling

Stat 451 Lecture Notes Markov Chain Monte Carlo. Ryan Martin UIC

Density Estimation. Seungjin Choi

Infinite-State Markov-switching for Dynamic. Volatility Models : Web Appendix

Solar irradiance forecasting for Chulalongkorn University location using time series models

Recursive Deviance Information Criterion for the Hidden Markov Model

Point spread function reconstruction from the image of a sharp edge

Bayesian Inference and MCMC

Gaussian kernel GARCH models

Lecture 16: Mixtures of Generalized Linear Models

Computer Intensive Methods in Mathematical Statistics

STA 4273H: Statistical Machine Learning

Computer intensive statistical methods

State Space Models for Wind Forecast Correction

Wind Power Forecasting

Nonparametric Bayesian Methods (Gaussian Processes)

Non-homogeneous Markov Mixture of Periodic Autoregressions for the Analysis of Air Pollution in the Lagoon of Venice

Coupled Hidden Markov Models: Computational Challenges

Adaptive Monte Carlo methods

Bayesian Estimation of Input Output Tables for Russia

Theory of Stochastic Processes 8. Markov chain Monte Carlo

Switching Regime Estimation

Recent Advances in Bayesian Inference Techniques

Statistical Models for Rainfall with Applications to Index Insura

Bayesian Methods with Monte Carlo Markov Chains II

Statistics & Data Sciences: First Year Prelim Exam May 2018

Bayesian Variable Selection for Nowcasting Time Series

STAT Advanced Bayesian Inference

4. MA(2) +drift: y t = µ + ɛ t + θ 1 ɛ t 1 + θ 2 ɛ t 2. Mean: where θ(l) = 1 + θ 1 L + θ 2 L 2. Therefore,

Time Series I Time Domain Methods

Probabilistic forecasting of solar radiation

Modelling Wind Farm Data and the Short Term Prediction of Wind Speeds

Bayesian Networks BY: MOHAMAD ALSABBAGH

ABC methods for phase-type distributions with applications in insurance risk problems

Time Series Analysis. James D. Hamilton PRINCETON UNIVERSITY PRESS PRINCETON, NEW JERSEY

Monitoring and characterizing offshore wind variability with weather radars for wind energy applications

A STUDY OF ARIMA AND GARCH MODELS TO FORECAST CRUDE PALM OIL (CPO) EXPORT IN INDONESIA

Bayesian Regression Linear and Logistic Regression

Making rating curves - the Bayesian approach

COURSE INTRODUCTION. J. Elder CSE 6390/PSYC 6225 Computational Modeling of Visual Perception

Bayesian Inference for DSGE Models. Lawrence J. Christiano

Frequency Forecasting using Time Series ARIMA model

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

Why Bayesian approaches? The average height of a rare plant

Transcription:

Modelling and forecasting of offshore wind power fluctuations with Markov-Switching models 02433 - Hidden Markov Models Pierre-Julien Trombe, Martin Wæver Pedersen, Henrik Madsen Course week 10 MWP, compiled June 7, 2011

Offshore vs Onshore Wind power There is a large difference in the magnitude of the fluctuations in on and offshore wind power production. Since there is no smoothing effect in offshore environments fluctuations can become larger. Fluctuations at large offshore wind farms have a significant impact on the control and management strategies of their power output. Critical issue: keep a balance between power demand and power generation. The presence of different dynamics in the power fluctutaions indicates the need for regime switching models which can be formulated as HMMs. 2

Classical regime-switching approach In the classical regime-switching approach switches occur between three predefined intervals of the power range: low, medium and high power. This approach can be refined by combining HMMs and dynamical models. Power [% of Pn] 0 20 40 60 80 100 High power? Medium power? Low power? Aug 05 Aug 07 Aug 09 Aug 11 Aug 13 Time [year 2005] 3

Offshore wind power data Frequency Case study: Horns Rev I offshore windfarm, installed capacity 160 MW. 0 2000 4000 6000 8000 0 20 40 60 80 100 Power [% of Pn] - Time-series of wind power averaged over 10 minutes intervals. - 49536 data points (16 Feb 2005 to 26 Jan 2006). - 7885 (16%) missing data. - Bounded on [0,100] % of Nominal power. 3023 (6.1%) data points lie at max production. 870 (1.8%) data points lie a zero production. The power production fluctuates differently when at max (100% of Pn) or min (0% of Pn) capacity compared to when at mid capacity. This is indicated by the histogram above, where Pn is the nominal production capacity (installed capacity). 4

Heteroscedasticity of data The variance of the data is not homogeneous. This can be accounted for in Auto Regressive Conditional Heteroscedastic (ARCH) models or Generalized ARCH (GARCH) models. 0.4 Squared residuals 0.3 0.2 0.1 0.0 Sept 05 Oct 05 Nov 05 Dec 05 5

Model specification To model the stochastic behavior of a given wind power time series x t, a MS(s)-AR(r)-GARCH(p, q) model is proposed as follows: x t = θ (Ct ) 0 + r i=1 θ (Ct ) i x t i + ɛ t ɛ t N(0, σt 2 ) q σt 2 = a (Ct ) 0 + a (Ct ) j ɛ 2 t j + j=1 p i=1 b (Ct ) i σt i 2 with C t a first order Markov chain, which governs the switches between different parameter values in the dynamical models (AR or GARCH) in each regime. As always Γ is the probability transition matrix for C t. 6

Estimation scheme Numerical maximizaton of the likelihood is computationally infeasible in general. A Bayesian estimation approach with Markov Chain Monte Carlo (MCMC) is simpler. MCMC algorithm: loop until convergence over Sample the state sequence (Data Augmentation). Sample the transition probabilities (Dirichlet distribution). Sample the state dependent AR and GARCH parameters (Giddy-Gibbs sampler). Compute the Bayes estimates (posterior distributions). 7

Data Augmentation (1/2) For a given t in {1,..., T } and for all states i in the finite space {1,..., m}: Pr(C t = i C t, X (T ) = x (T ), Θ) = Pr(C(T ), X (T ) = x (T ), Θ) Pr(C t, X (T ) = x (T ), Θ) = Pr(X(T ) = x (T ) C (T ), Θ)Pr(C (T ), Θ) Pr(X (T ) = x (T ) C t, Θ)Pr(C t, Θ) = Pr(X(T ) = x (T ) C t, Θ)Pr(C t = i C t, Θ), Pr(X (T ) = x (T ) C t, Θ) where C t is the states at all times except time t. We therefore get Pr(C t = i C t, X (T ) = x (T ), Θ) Pr(X (T ) = x (T ) C (T ), Θ)Pr(C t = i C t, Θ). So, C t can be sampled using the Gibbs sampler. 8

Data augmentation (2/2) Pr(X (T ) = x (T ) C (T ), Θ) is, for a given sample of the state sequence C (T ), the likelihood function Pr(X (T ) = x (T ) C (T ), Θ) = = T Pr(X t = x t X (t 1) = x (t 1), C (T ) Θ) t=1 T t=1 where the Markov Property leads to ( ) 1 (xt θ(ct 0 + ) r ) i=1 exp θ(ct i x t i ) 2, 2πσ 2 t 2σt 2 Pr(C t = i C t, Θ) = Pr(C t = i C t 1 = j, C t+1 = k) γ ji γ ik = m i=1 γ. jiγ ik 9

Simulation of synthetic data Simulating data from a MS(2)-AR(2)-GARCH(1,1) process: Top is the log-variance and bottom is the simulated C t, i.e. the switching sequence. 5 0 5 10 15 0 500 1000 1500 2000 state 1.0 1.4 1.8 0 500 1000 1500 2000 10

Results, synthetic data True values Prior bounds Means Std. Dev. θ (1) 0 0.1 [-0.3 0.5] 0.080 0.050 θ (1) 1 0.3 [-0.2 0.7] 0.227 0.057 θ (1) 2-0.1 [-0.5 0.3] -0.074 0.057 a (1) 0 0.05 [0 0.2] 0.067 0.012 b (1) 1 0.65 [0.2 1] 0.547 0.044 a (1) 1 0.2 [0 0.5] 0.307 0.047 θ (2) 0 0.5 [0.1 0.9] 0.478 0.052 θ (2) 1 0.7 [0.2 1.3] 0.731 0.060 θ (2) 2 0.25 [-0.3 0.7] 0.183 0.056 a (2) 0 0.25 [0 0.35] 0.319 0.026 b (2) 1 0.8 [0 0.7] 0.670 0.020 a (2) 1 0.1 [0 0.5] 0.233 0.033 γ 11 0.98 [0 1] 0.977 0.005 γ 22 0.95 [0 1] 0.958 0.009 11

Posterior distributions of the AR coefficients intercept lag 1 lag 2 state 1 0 100 200 300 400 500 0 200 400 600 800 1000 0 200 400 600 800 1000 0.05 0.05 0.15 0.25 0.1 0.2 0.3 0.4 0.5 0.3 0.1 0.1 0.2 state 2 0 200 400 600 800 1000 1200 0 200 400 600 800 1000 0 200 400 600 800 1000 0.3 0.4 0.5 0.6 0.7 0.5 0.6 0.7 0.8 0.9 1.0 0.0 0.1 0.2 0.3 0.4 12

Results: Regime characterization on wind power data Regime 1 has little process noise, regime 2 is the most common with moderate process noise, regime 3 accounts for spikes in the data. Pr. regime 3 Pr. regime 2 Pr. regime 1 Wind power 1 0.5 0 1 0.5 0 1 0.5 0 1 0.5 0 Inference on the regimes 200 400 600 800 1000 1200 1400 200 400 600 800 1000 1200 1400 200 400 600 800 1000 1200 1400 200 400 600 800 1000 1200 1400 Time 13

Results: Point Forecast The performance of the point forecast is evaluated as the root mean square error (R) of the conditional expectation of the one-step-ahead observation µ t+1 = E(X t+1 X (t) = x (t) ), i.e. R = 1 T 1 T 1 (X t+1 µ t+1) 2, where R has the unit percentage of nominal capacity i.e. [% of Pn]. t=1 No. Model R [% of Pn] R [% of Pn] in sample out-of-sample 1 Persistence (naive) 4.62 4.49 2 AR(3)-GARCH(1,1) 4.65 4.48 3 MS(3)-AR(3)-GARCH(1,1) 4.49 4.29 Clearly the model allowing switches between different dynamical models (no. 3) has a lower out-of-sample R when compared to the two other modelling approaches. 14

Exercise a) Download and plot the wind data set (winddata.txt) from the course website. b) Fit AR models with varying number of lags to the data using R s built in arima function. c) Fit AR models with varying number of lags to the data by writing your own code. Ensure that your optimal likelihood value is similar to the one obtained with the arima function. d) Fit a two state MSAR model to the data by extending your code for fitting AR models. Assume γ 11 = γ 22 = 0.95. Compare the AIC for this model with the AIC for the simple AR model. 15

End of example 16