Forecasting the electricity consumption by aggregating specialized experts

Similar documents
SYSTEM BRIEF DAILY SUMMARY

GAMINGRE 8/1/ of 7

SYSTEM BRIEF DAILY SUMMARY

A Second-order Bound with Excess Losses

Robust online aggregation of ensemble forecasts

Online Learning and Sequential Decision Making

CWV Review London Weather Station Move

peak half-hourly South Australia

OPERA: Online Prediction by ExpeRt Aggregation

peak half-hourly New South Wales

Determine the trend for time series data

January 2009 Videmus Stellae

Summary of Seasonal Normal Review Investigations CWV Review

Activity Sheet Counting M&Ms

Motion to Review the Academic Calendar Passed by Admissions and Standards, September 24, 2009

Mountain View Community Shuttle Monthly Operations Report

JANUARY MONDAY TUESDAY WEDNESDAY THURSDAY FRIDAY SATURDAY SUNDAY

YACT (Yet Another Climate Tool)? The SPI Explorer

peak half-hourly Tasmania

Astrological Calendar. for Central Time

Flow- based Market Coupling: Implementa9on of the CWE FB Project. Andrew Claxton APX Co- Chair, CWE Steering Commi5ee

Design of a Weather-Normalization Forecasting Model

Jackson County 2013 Weather Data

REPORT ON LABOUR FORECASTING FOR CONSTRUCTION

Day Ahead Hourly Load and Price Forecast in ISO New England Market using ANN

Climatography of the United States No

BAYESIAN PROCESSOR OF ENSEMBLE (BPE): PRIOR DISTRIBUTION FUNCTION

Constructing a typical meteorological year -TMY for Voinesti fruit trees region and the effects of global warming on the orchard ecosystem

TILT, DAYLIGHT AND SEASONS WORKSHEET

DAILY QUESTIONS 28 TH JUNE 18 REASONING - CALENDAR

Time Series Analysis

Future Weather in Toronto and the GTA

Computing & Telecommunications Services Monthly Report January CaTS Help Desk. Wright State University (937)

Lesson Adaptation Activity: Analyzing and Interpreting Data

Smoothed Prediction of the Onset of Tree Stem Radius Increase Based on Temperature Patterns

Jackson County 2018 Weather Data 67 Years of Weather Data Recorded at the UF/IFAS Marianna North Florida Research and Education Center

The Coupled Earth Reanalysis system [CERA]

Winter Season Resource Adequacy Analysis Status Report

Salem Economic Outlook

Changing Hydrology under a Changing Climate for a Coastal Plain Watershed

DESC Technical Workgroup. CWV Optimisation Production Phase Results. 17 th November 2014

Multivariate Regression Model Results

Sections 01, 02, & 04 (Bressoud and Ehren) 9 October, 2015

Basic human requirements

Jayalath Ekanayake Jonas Tappolet Harald Gall Abraham Bernstein. Time variance and defect prediction in software projects: additional figures

Climate Change Impact Analysis

Lunar Cycle. About This Lesson Phases of the Moon

= observed volume on day l for bin j = base volume in jth bin, and = residual error, assumed independent with mean zero.

Champaign-Urbana 2000 Annual Weather Summary

ACCA Interactive Timetable

Aggregation of sleeping predictors to forecast electricity consumption

A look into the factor model black box Publication lags and the role of hard and soft data in forecasting GDP

Nonparametric forecasting of the French load curve

Alberto Troccoli, Head of Weather and Energy Research Unit, CSIRO, Australia ICCS 2013 Jamaica, 5 December 2013 (remotely, unfortunately)

Computing & Telecommunications Services

Renewables and the Smart Grid. Trip Doggett President & CEO Electric Reliability Council of Texas

NRCSE. Misalignment and use of deterministic models

classroomsecrets.com Reasoning and Problem Solving Read and Interpret Line Graphs Teaching Information

How to use the guidance tool (Producing Guidance and Verification)

DOWNLOAD OR READ : THE YEARLY SON SHINE JOURNAL PDF EBOOK EPUB MOBI

Climatography of the United States No

Climatography of the United States No

Interannual variation of MODIS NDVI in Lake Taihu and its relation to climate in submerged macrophyte region

Predicting intraday-load curve using High-D methods

Accuracy of near real time updates in wind power forecasting with regard to different weather regimes

Predictability of Sudden Stratospheric Warmings in sub-seasonal forecast models

Multi Time Scale Wind Energy Forecasting Model based on Meteorological Simulation and Onsite Measurement

Climatography of the United States No

Climatography of the United States No

Global Climates. Name Date

Electricity Demand Forecasting using Multi-Task Learning

SHADOW - Main Result. windpro CUMULTATIEVE EFFECTEN SLAGSCHADUW HERENTALS. EDF Luminus Markiesstraat Brussel

ACCA Interactive Timetable & Fees

CLIMATE OVERVIEW. Thunder Bay Climate Overview Page 1 of 5

2015 Summer Readiness. Bulk Power Operations

TIME SERIES ANALYSIS AND FORECASTING USING THE STATISTICAL MODEL ARIMA

Faisal S. Syed, Shahbaz M.,Nadia R.,Siraj I. K., M. Adnan Abid, M. Ashfaq, F. Giorgi, J. Pal, X. Bi

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

CHAPTER 1 EXPRESSIONS, EQUATIONS, FUNCTIONS (ORDER OF OPERATIONS AND PROPERTIES OF NUMBERS)

Stochastic swell events generator

UWM Field Station meteorological data

Communicating Climate Change Consequences for Land Use

2003 Water Year Wrap-Up and Look Ahead

arxiv: v1 [stat.ap] 4 Mar 2016

United States Climate

Jackson County 2014 Weather Data

WHEN IS IT EVER GOING TO RAIN? Table of Average Annual Rainfall and Rainfall For Selected Arizona Cities

Year 9 Science Work Rate Calendar Term

Climatography of the United States No

Climatography of the United States No

Relationships between Synoptic- and Meso-scale scale Conditions and NO x Concentrations in Tel-Aviv

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Transcription:

Forecasting the electricity consumption by aggregating specialized experts Pierre Gaillard (EDF R&D, ENS Paris) with Yannig Goude (EDF R&D) Gilles Stoltz (CNRS, ENS Paris, HEC Paris) June 2013 WIPFOR

Goal Consumption (GW) 25 30 35 40 Goal Short-term (one-day-ahead) forecasting of the French electricity consumption Mon Tue Wed Thu Fri Sat Sun Many models developed by EDF R&D: parametric, semi-parametric, and non-parametric Evolution of the electrical scene in France existing models get questionable Adaptive methods of models aggregation 2 / 20

Setting Sequential prediction with expert advice Each instance t - Each expert suggests a prediction x i,t of the consumption y t - We assign weight to each expert and we predict ŷ t = p t x t (= ) N p i,t x i,t i=1 Our goal is to minimize our cumulative loss (ŷ t y t) 2 Our loss = min (x i,t y t) 2 i=1,...,n Loss of the best expert + R T Estimation error Good set of experts Good aggregating algorithm 3 / 20

Setting Sequential prediction with expert advice Each instance t - Each expert suggests a prediction x i,t of the consumption y t - We assign weight to each expert and we predict ŷ t = p t x t (= ) N p i,t x i,t i=1 Our goal is to minimize our cumulative loss (ŷ t y t) 2 Our loss = min (q x t y t) 2 q N Loss of the best convex combination Good set of experts As varied as possible + R T Estimation error Good aggregating algorithm 4 / 20

Minimizing both approximation and estimation error (ŷ t y t) 2 Our loss = min (q x t y t) 2 q N Approximation error + R T Estimation error Approximation error good heterogeneous set of experts Ex: specializing the experts, bagging, boosting,... Estimation error efficient algorithm for aggregating specialized experts Ex: Exponentially weighted average, Exponentiated Gradient, Ridge,... Prediction Learning and Games, Cesa-Bianchi and Lugosi, 2006 5 / 20

I. Aggregating algorithms Prediction Learning and Games, Cesa-Bianchi and Lugosi, 2006 June 2013 WIPFOR

Exponentially weighted average forecaster (EWA) Each instance t - Each expert suggests a prediction x i,t of the consumption y t - We assign to expert i the weight - and we predict ŷ t = N i=1 pi,txi,t exp ( η t 1 s=1 (xi,s ys)2) p i,t = N j=1 exp ( η t 1 s=1 (xj,s ys)2) Our cumulated loss is upper bounded by (ŷ t y t) 2 min (x i,t y t) 2 i=1,...,d } {{ } + T log N Our loss Loss of the best expert Estimation error 7 / 20

Exponentially weighted average forecaster (EWA) Each instance t - Each expert suggests a prediction x i,t of the consumption y t - We assign to expert i the weight - and we predict ŷ t = N i=1 pi,txi,t exp ( η t 1 s=1 (xi,s ys)2) p i,t = N j=1 exp ( η t 1 s=1 (xj,s ys)2) Our cumulated loss is upper bounded by (ŷ t y t) 2 min (q x t y t) q N Our loss 2 +? Loss of the best Estimation error convex combination 8 / 20

Motivation of convex combinations Gam KWF EWA EG Residuals 2000 1000 0 1000 Weights 0.0 0.2 0.4 0.6 0.8 1.0 Gam KWF EWA EG Oct Nov Dec Jan Mar Avr Oct Nov Dec Jan Mar Avr 9 / 20

Exponentiated gradient forecaster (EG) Each instance t - Each expert suggests a prediction x i,t of the consumption y t - We assign to expert i the weight p i,t exp ( η ) t 1 s=1 li,s where l i,s = 2(ŷ s y s)x i,s - and we predict ŷ t = N i=1 pi,txi,t Our cumulated loss is then bounded as follow (ŷ t y t) 2 (q x t y t) 2 Our loss min q N } {{ } Loss of the best convexe combination + T log N Estimation error Idea of proof T (ŷt yt)2 (q x t y t) 2 T 2( p t x t y t)x t ( p t q ) = T l t ( p t q ) T p t l t T min i li,t 10 / 20

II. A good set of experts June 2013 WIPFOR

Consider as heterogeneous experts as possible Some ideas to get more variety inside the set of experts Consider heterogeneous prediction methods - Gam: semi-parametric method Generalized Additive Models, Wood, 2006 - KWF: functional method based on similarity between days Clustering functional data using Wavelets, Antoniadis and al, 2013 Create new experts from the same method thanks to boosting, bagging Vary the considered covariate: weather, calendar,... Specializing the experts: focus on specific situation (cloudy days,... ) during the training 12 / 20

The dataset The dataset includes 1 696 days from January 1, 2008 to June 15, 2012 The electricity consumption of EDF customers Side information - weather: temperature, nebulosity, wind - temporal: date, EJP - loss of clients We remove uncommon days (public holidays ±2) i.e., 55 days each year. We split the dataset in two subsets Jan. 2008 Aug. 2011: training set to build the experts Sept. 2011 Jun. 2012: testing set 13 / 20

Performance of the forecasting methods and of the aggregating algorithms EWA Method RMSE (MW) Gam 847 KWF 1287 EWA 813 EG 778 Weights Weights 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 Gam KWF Oct Nov Dec Jan Mar Avr EG Oct Nov Dec Jan Mar Avr 14 / 20

Specializing the experts to diversify Idea Focus on specific scenarios during the training of the methods Meteorological scenarios High / low temperature High / low variation of the temperature (since the last day, during the day) Other scenarios High / low consumption Winter / summer Such specialized experts suggest prediction only the days corresponding to their scenario 15 / 20

Specializing a method in cold days At day t, we consider T t = average temperature of the day We normalize T t on [0, 1] and we choose for each day the weight Density 0.0 0.5 1.0 1.5 0.0 0.2 0.4 0.6 0.8 1.0 Activation weight w t = (1 T t ) 2 We then train our forecasting method using the prior weights w t on the training days Activation weights 0.0 0.2 0.4 0.6 0.8 Feb May Jul Oct 16 / 20

Weights given in 2008 for several specializing scenarios Difference of temp. with last day Hot / cold days Activation weights 0.0 0.2 0.4 0.6 0.8 High / low consumption Variation of temp. during the day Activation weights 0.0 0.2 0.4 0.6 0.8 Feb May Jul Oct Feb May Jul Oct 17 / 20

Aggregating experts that specialize Setting Each day some of the experts are active and output predictions (according to their specialization) while other experts do not When the expert i is non active, we do not have access to its prediction A solution is to assume that non active experts output the same prediction ŷ t as we do and solve the fixed-point equation ŷ t = p j,t x j,t + p i,t ŷ t j active i non active Can be extended to activation functions of the experts [0, 1] Forecasting the electricity consumption by aggregating specialized experts, Devaine and al., 2013 18 / 20

Performance of algorithms with specialized experts Méthode RMSE (MW) Gam 847 KWF 1287 EWA 813 EG 778 Spec + EWA 765 Spec + EG 714 19 / 20

Performance of algorithms with specialized experts Hour Month RMSE (MW) 500 1000 1500 Gam KWF 500 1000 1500 EWA EG Spec.EWA Spec.EG 4 8 12 16 20 Jan Feb Mar Avr May Jun Sep Oct Nov Dec 20 / 20