Predicting intraday-load curve using High-D methods

Size: px
Start display at page:

Download "Predicting intraday-load curve using High-D methods"

Transcription

1 Predicting intraday-load curve using High-D methods LPMA- Université Paris-Diderot-Paris 7 Mathilde Mougeot UPD, Vincent Lefieux RTE, Laurence Maillard RTE Horizon Maths 2013

2 Intraday load curve during a week Monday January 25 th to Sunday January 31 th 9 x

3 Intraday load curve forecasting -here 48h- 5.6 x Y 3.8 Yapx tpred

4 Forecasting procedure 1 Construction of a smart encyclopedia of past scenarios out of a data basis using learning algorithms. 2 Build a set of prediction experts consulting the encyclopedia. 3 Aggregate the prediction experts

5 Data basis The past data basis Electrical consumption of the past Meteorological input Endogenous variables: calendar data, functional bases

6 Electrical consumption of the past Recorded every half hour from January 1 st, 2003 to August 31 th, For this period of time, the global consumption signal is split into N = 2800 sub signals (Y 1,...,Y t,...,y N ). Y t R n, defines the intra day load curve for the t th day of size n = 48.

7 Intraday load curve for seven days Monday January 25 th to Sunday January 31 th 9 x

8 Example of calendar variations (seasonnal) 6 x x x x Figure: autumn, winter, spring and summer

9 Meteorological inputs A total of 371 (=2x39+293) meteorological variables recorded each day half-hourly over the 2800 days of the same period of time. Temperature: T k for k = 1,...,39 measured in 39 weather stations scattered all over the French territory. Cloud Cover: N k for k = 1,...,39 measured in the same 39 weather stations. Wind: W k for k = 1,...,293 available at 293 network points scattered all over the territory.

10 Weather stations Figure: Temperature and Cloud covering measurement stations. Wind stations

11 Brest- Lille- Marseille (a) T (b) CC (c) W Figure: Brest (blue line), Lille (red line) and Marseille (green).

12 Building the smart encyclopedia : several issues 1 Large dimension 2 All the variables (load curve, meteo) are highly correlated 3 Necessity to introduce typical patterns

13 Load curves : functional regression and clustering Compression of the intraday load curves : the signals Y t are treated as functions of the time and sparsely represented on a dictionary of functions (combination of Fourier basis and Haar basis). Clustering the previous sparse representations of the signals, into homogeneous groups. Pattern : define a pattern of consumption inside each group. Calendar attribution : Translate the clustering into calendar (predictable) variables

14 Reduced set of explanatory variables For each t index of the day of interest, we register the daily electrical consumption signal Y t and Z t = [P t M t ] = [C t B t [T] t [N] t [W] t ] is the concatenation of the "calendar and functional" variables and "climate variables" also in reduced dimension.

15 Sparse approximation on the learning set Sparse Approximation of each consumption day on a learning set of days ( ), using the reduced set of explanatory variables. For each day t of the learning set, we build an approximation Ŷ t of the (observed) signal Y t with the help of the new set of explanatory variables (Z t ): Ŷ t = G t (Z t )

16 Sparse approximation on the learning set Sparse Approximation For each day t of the learning set, Z t = [P t M t ] = [C t B t [T] t [N] t [W] t ] is the concatenation of "calendar" variables and "climate variables. We build an approximation Ŷ t of the (observed) signal Y t using (Z t ): Ŷ t = G t (Z t ) G t (Z t ) = Z tˆβt ( ) Sparse Approximation and Knowledge Extraction for Electrical Consumption Signals, 2012, M. Mougeot, D. P., K. Tribouley & V. Lefieux, L. Teyssier-Maillard

17 High dimensional Linear Models Y = Xβ +ǫ β IR p is the unknown parameter (to be estimated) ǫ = (ǫ 1,...,ǫ n ) is a (non observed) vector of random errors. It is assumed to be variables i.i.d. N(0,σ 2 ) X is a known matrix n p. High dimension : p >> n t

18 Smart Encyclopedia contents For each day t, 1 t 2800: the daily electrical consumption Y t. a qualitative description of t, given by calendar statements, clustering allocation. the meteorological indicators over the French territory M t = [T t N t W t ]. the estimated coefficient ˆβ t. the approximation of the daily consumption Ŷ t = Z t ˆβt.

19 Forecasting procedure Forecasting using the encyclopedia Construction of a set of forecasting experts. Aggregation of the experts.

20 Expert associated to the strategy M Forecasting experts Strategy : M a function, data dependent or not, from N to N such that for any d N,M(d) < d (purely non anticipative). Plug-in To the strategy M we associate the expert Ỹ M t : the prediction of the signal of day t using forecasting strategy M, Ỹ M t = G M(t) (Z t ) = Z tˆβm(t)

21 Examples of strategies : time depending tm1: Refer to the day before: (The coefficients used for prediction are those calculated the previous day) M(d) = d 1 Ỹ tm1 t = Z tˆβ t 1 tm7: Refer to one week before: M(d) = d 7 Ỹt tm7 = Z tˆβt 7

22 Experts introducing meteorological scenarios T: Find the day having the closest temperature indicators, regarding the sup distance (over the days, and over the indicators): M(d) = ArgMin t sup k {1,...,6}, i {1,...,48} T k d (i) Tk t (i) T m : Find the day having the closest median temperature with the sup distance (over the days): M(d) = ArgMin t sup i {1,...,48} T 3 d (i) T3 t(i)

23 Experts introducing a climatic configuration of the day constrained by the type of the day N s/j : Closest cloud covering indicators (min, max, med, std) regarding the sup distance for days which are of the same type as Y t : M = ArgMin J(d)=J(t) sup i,k N k d (i) Nk t(i) within the same type of day as Y t

24 MAPE error For day t, the prediction MAPE error over the interval [0,T] is defined by: MAPE(Y,Ỹ M t )(T) = 1 T MISE(Y,Ỹ M t )(T) = 1 T T i=1 ỸM t (i) Y t (i) Y t (i) T Ỹt M (i) Y t (i) 2 i=1

25 Prediction evaluation Names average median std Naive Yday Week T med T med /W T med /N T T/G T/D T/C N N/G N/D N/C W W/G W/D W/C

26 Prediction evaluation-comparing experts Yday Week Tm Tm/N Tm/W T T/g T/d T/c N N/g N/d N/c W W/g W/d W/c Figure: Frequencies of best performances computed for one year of data from September 1 th 2009 to August 31 th 2010.

27 Prediction evaluation-comparing experts on days 0.25 Ranking Predictor Performances per Day tm1 tm7 Ts Ts/N Tm Tm/N T/g N/g T/j N/j T/c N/c Figure: Percentage of best predictor among days (1:monday,... 7:sunday)

28 Prediction evaluation-comparing experts 0.18 Best Predictor Performances per Month tm1 tm7 Ts Ts/N Tm Tm/N T/g N/g T/j N/j T/c N/c Figure: Percentage of best predictor among month

29 Aggregation of predictors: Exponential weights (inspired by various theoretical results -see Lecue, Rigollet, Stolz, Tsybakov,...-) with M Ỹ wgt d m=1 = wm d Ỹm d M m=1 wm d w M d = exp( 1 Tθ T ỸM d (i) Y d (i) 2 ) i=1 θ is a parameter, (often called temperature in physic applications, see the discussion below) T = Tpred.

30 Forecasting (mape=0.7%). 5.6 x Y 3.8 Yapx tpred

31 Sparse methods

32 Sparse approximation on the learning set Sparse Approximation For each day t of the learning set, Z t = [P t M t ] = [C t B t [T] t [N] t [W] t ] is the concatenation of "calendar" variables and "climate variables. We build an approximation Ŷ t of the (observed) signal Y t using (Z t ): Ŷ t = G t (Z t ) G t (Z t ) = Z tˆβt

33 High dimensional Linear Models Y = Xβ +ǫ β IR p is the unknown parameter (to be estimated) ǫ = (ǫ 1,...,ǫ n ) is a (non observed) vector of random errors. It is assumed to be variables i.i.d. N(0,σ 2 ) X is a known matrix n p. High dimension : p >> n t ( ) M. Mougeot, D. P., K. Tribouley, JRSS B 2012,B Stat. Methodol. vol 74

34 Conditions generally required to solve the problem Sparsity conditions on the vector β restricted identity conditions on the matrix X

35 Sparsity conditions

36 Restricted identity property For C {1,... p}, denote X C the matrix X restricted to the raws which are in C and the associated Gram-matrix M(C) := 1 n Xt C X C Restricted identity property means that M(C) is almost the identity matrix for any C small enough.

37 Example 1: RIP RIP(m 0,ν) assumes that There exist 0 ν < 1 and m 0 1 such that : x IR m, x 2 l 2 (m) (1 ν) xt M(C)x x 2 l 2 (m) (1+ν),

38 Example 2: Coherence condition M := 1 n Xt X. M jj = 1 for all j. Coherence τ n = sup M lm = sup 1 l m l m n Coherence = RIP( ν/τ n,ν) n X il X im i=1

39 Sparsity conditions #{l {1,...,p}, β l 0} S β l q M, 0 < q < 1 (B q (M)) l SMALL NUMBER OF BIG COEFFICIENTS

40 Penalization for sparsity Many penalizations introduced historically in the regression framework (to put identification constraints on β) Ridge: E(β,λ) = Y Xβ 2 +λσ j β 2 j Lasso: E(β,λ) = Y Xβ 2 +λσ j β j Scad: E(β,λ) = Y Xβ 2 +λσ j w j g(β j ) Solutions based on: Convex Optimization for L1, non convex Opti. for Scad Fan & Lv (2008, 2010), Candes & Tao (2007)... Many others...

41 2-thresholding-step Procedures Y = Xβ +ǫ Y (n 1), X (n p) steps compute size Step1=pre-selection Find b Leaders X b (n, b) b < n << p REGRESSION on Leaders β = (X b X b ) 1 X by (1, b) Step2=denoising the coefficients ˆβ (1, Ŝ)

42 Winter forecast 9 x Figure: Forecast (solid blue line) and observed (dashed dark line) electrical consumption for a winter week from Monday February 1 st to Sunday January 7 th 2010.

43 Spring forecast 6 x Figure: Forecast (solid blue line) and observed (dashed dark line) electrical consumption for a spring week from Monday June 14 th to Sunday June 21 th 2010.

High dimensional statistical learning methods applied to energy data sets

High dimensional statistical learning methods applied to energy data sets High dimensional statistical learning methods applied to energy data sets Dominique Picard Université Paris-Diderot LPMA Data science challenge : Post-modern Rationality Replace human logic by statistical

More information

Nonparametric forecasting of the French load curve

Nonparametric forecasting of the French load curve An overview RTE & UPMC-ISUP ISNPS 214, Cadix June 15, 214 Table of contents 1 Introduction 2 MAVE modeling 3 IBR modeling 4 Sparse modeling Electrical context Generation must be strictly equal to consumption

More information

Graph Functional Methods for Climate Partitioning

Graph Functional Methods for Climate Partitioning Graph Functional Methods for Climate Partitioning Mathilde Mougeot - with D. Picard, V. Lefieux*, M. Marchand* Université Paris Diderot, France *Réseau Transport Electrique (RTE) Buenos Aires, 2015 Mathilde

More information

Multivariate Regression Model Results

Multivariate Regression Model Results Updated: August, 0 Page of Multivariate Regression Model Results 4 5 6 7 8 This exhibit provides the results of the load model forecast discussed in Schedule. Included is the forecast of short term system

More information

Forecasting the electricity consumption by aggregating specialized experts

Forecasting the electricity consumption by aggregating specialized experts Forecasting the electricity consumption by aggregating specialized experts Pierre Gaillard (EDF R&D, ENS Paris) with Yannig Goude (EDF R&D) Gilles Stoltz (CNRS, ENS Paris, HEC Paris) June 2013 WIPFOR Goal

More information

Defining Normal Weather for Energy and Peak Normalization

Defining Normal Weather for Energy and Peak Normalization Itron White Paper Energy Forecasting Defining Normal Weather for Energy and Peak Normalization J. Stuart McMenamin, Ph.D Managing Director, Itron Forecasting 2008, Itron Inc. All rights reserved. 1 Introduction

More information

peak half-hourly New South Wales

peak half-hourly New South Wales Forecasting long-term peak half-hourly electricity demand for New South Wales Dr Shu Fan B.S., M.S., Ph.D. Professor Rob J Hyndman B.Sc. (Hons), Ph.D., A.Stat. Business & Economic Forecasting Unit Report

More information

Electricity Demand Forecasting using Multi-Task Learning

Electricity Demand Forecasting using Multi-Task Learning Electricity Demand Forecasting using Multi-Task Learning Jean-Baptiste Fiot, Francesco Dinuzzo Dublin Machine Learning Meetup - July 2017 1 / 32 Outline 1 Introduction 2 Problem Formulation 3 Kernels 4

More information

The Dayton Power and Light Company Load Profiling Methodology Revised 7/1/2017

The Dayton Power and Light Company Load Profiling Methodology Revised 7/1/2017 The Dayton Power and Light Company Load Profiling Methodology Revised 7/1/2017 Overview of Methodology Dayton Power and Light (DP&L) load profiles will be used to estimate hourly loads for customers without

More information

peak half-hourly Tasmania

peak half-hourly Tasmania Forecasting long-term peak half-hourly electricity demand for Tasmania Dr Shu Fan B.S., M.S., Ph.D. Professor Rob J Hyndman B.Sc. (Hons), Ph.D., A.Stat. Business & Economic Forecasting Unit Report for

More information

A Bayesian Perspective on Residential Demand Response Using Smart Meter Data

A Bayesian Perspective on Residential Demand Response Using Smart Meter Data A Bayesian Perspective on Residential Demand Response Using Smart Meter Data Datong-Paul Zhou, Maximilian Balandat, and Claire Tomlin University of California, Berkeley [datong.zhou, balandat, tomlin]@eecs.berkeley.edu

More information

Short-Term Electrical Load Forecasting for Iraqi Power System based on Multiple Linear Regression Method

Short-Term Electrical Load Forecasting for Iraqi Power System based on Multiple Linear Regression Method Volume 00 No., August 204 Short-Term Electrical Load Forecasting for Iraqi Power System based on Multiple Linear Regression Method Firas M. Tuaimah University of Baghdad Baghdad, Iraq Huda M. Abdul Abass

More information

Consistent high-dimensional Bayesian variable selection via penalized credible regions

Consistent high-dimensional Bayesian variable selection via penalized credible regions Consistent high-dimensional Bayesian variable selection via penalized credible regions Howard Bondell bondell@stat.ncsu.edu Joint work with Brian Reich Howard Bondell p. 1 Outline High-Dimensional Variable

More information

Development of Short-term Demand Forecasting Model And its Application in Analysis of Resource Adequacy. For discussion purposes only Draft

Development of Short-term Demand Forecasting Model And its Application in Analysis of Resource Adequacy. For discussion purposes only Draft Development of Short-term Demand Forecasting Model And its Application in Analysis of Resource Adequacy For discussion purposes only Draft January 31, 2007 INTRODUCTION In this paper we will present the

More information

WEATHER NORMALIZATION METHODS AND ISSUES. Stuart McMenamin Mark Quan David Simons

WEATHER NORMALIZATION METHODS AND ISSUES. Stuart McMenamin Mark Quan David Simons WEATHER NORMALIZATION METHODS AND ISSUES Stuart McMenamin Mark Quan David Simons Itron Forecasting Brown Bag September 17, 2013 Please Remember» Phones are Muted: In order to help this session run smoothly,

More information

Stepwise Searching for Feature Variables in High-Dimensional Linear Regression

Stepwise Searching for Feature Variables in High-Dimensional Linear Regression Stepwise Searching for Feature Variables in High-Dimensional Linear Regression Qiwei Yao Department of Statistics, London School of Economics q.yao@lse.ac.uk Joint work with: Hongzhi An, Chinese Academy

More information

Demand Forecasting Models

Demand Forecasting Models E 2017 PSE Integrated Resource Plan Demand Forecasting Models This appendix describes the econometric models used in creating the demand forecasts for PSE s 2017 IRP analysis. Contents 1. ELECTRIC BILLED

More information

A Sparse Linear Model and Significance Test. for Individual Consumption Prediction

A Sparse Linear Model and Significance Test. for Individual Consumption Prediction A Sparse Linear Model and Significance Test 1 for Individual Consumption Prediction Pan Li, Baosen Zhang, Yang Weng, and Ram Rajagopal arxiv:1511.01853v3 [stat.ml] 21 Feb 2017 Abstract Accurate prediction

More information

Non-linear Supervised High Frequency Trading Strategies with Applications in US Equity Markets

Non-linear Supervised High Frequency Trading Strategies with Applications in US Equity Markets Non-linear Supervised High Frequency Trading Strategies with Applications in US Equity Markets Nan Zhou, Wen Cheng, Ph.D. Associate, Quantitative Research, J.P. Morgan nan.zhou@jpmorgan.com The 4th Annual

More information

Time Series and Forecasting

Time Series and Forecasting Chapter 8 Time Series and Forecasting 8.1 Introduction A time series is a collection of observations made sequentially in time. When observations are made continuously, the time series is said to be continuous;

More information

A summary of the weather year based on data from the Zumwalt weather station

A summary of the weather year based on data from the Zumwalt weather station ZUMWALT PRAIRIE WEATHER 2016 A summary of the weather year based on data from the Zumwalt weather station Figure 1. An unusual summer storm on July 10, 2016 brought the second-largest precipitation day

More information

Oslo Class 6 Sparsity based regularization

Oslo Class 6 Sparsity based regularization RegML2017@SIMULA Oslo Class 6 Sparsity based regularization Lorenzo Rosasco UNIGE-MIT-IIT May 4, 2017 Learning from data Possible only under assumptions regularization min Ê(w) + λr(w) w Smoothness Sparsity

More information

Smoothly Clipped Absolute Deviation (SCAD) for Correlated Variables

Smoothly Clipped Absolute Deviation (SCAD) for Correlated Variables Smoothly Clipped Absolute Deviation (SCAD) for Correlated Variables LIB-MA, FSSM Cadi Ayyad University (Morocco) COMPSTAT 2010 Paris, August 22-27, 2010 Motivations Fan and Li (2001), Zou and Li (2008)

More information

Analysis of Fast Input Selection: Application in Time Series Prediction

Analysis of Fast Input Selection: Application in Time Series Prediction Analysis of Fast Input Selection: Application in Time Series Prediction Jarkko Tikka, Amaury Lendasse, and Jaakko Hollmén Helsinki University of Technology, Laboratory of Computer and Information Science,

More information

peak half-hourly South Australia

peak half-hourly South Australia Forecasting long-term peak half-hourly electricity demand for South Australia Dr Shu Fan B.S., M.S., Ph.D. Professor Rob J Hyndman B.Sc. (Hons), Ph.D., A.Stat. Business & Economic Forecasting Unit Report

More information

A Bootstrap Lasso + Partial Ridge Method to Construct Confidence Intervals for Parameters in High-dimensional Sparse Linear Models

A Bootstrap Lasso + Partial Ridge Method to Construct Confidence Intervals for Parameters in High-dimensional Sparse Linear Models A Bootstrap Lasso + Partial Ridge Method to Construct Confidence Intervals for Parameters in High-dimensional Sparse Linear Models Jingyi Jessica Li Department of Statistics University of California, Los

More information

Predicting Future Energy Consumption CS229 Project Report

Predicting Future Energy Consumption CS229 Project Report Predicting Future Energy Consumption CS229 Project Report Adrien Boiron, Stephane Lo, Antoine Marot Abstract Load forecasting for electric utilities is a crucial step in planning and operations, especially

More information

Chapter 3. Regression-Based Models for Developing Commercial Demand Characteristics Investigation

Chapter 3. Regression-Based Models for Developing Commercial Demand Characteristics Investigation Chapter Regression-Based Models for Developing Commercial Demand Characteristics Investigation. Introduction Commercial area is another important area in terms of consume high electric energy in Japan.

More information

September 2018 Weather Summary West Central Research and Outreach Center Morris, MN

September 2018 Weather Summary West Central Research and Outreach Center Morris, MN September 2018 Weather Summary The mean temperature for September was 60.6 F, which is 1.5 F above the average of 59.1 F (1886-2017). The high temperature for the month was 94 F on September 16 th. The

More information

LOADS, CUSTOMERS AND REVENUE

LOADS, CUSTOMERS AND REVENUE EB-00-0 Exhibit K Tab Schedule Page of 0 0 LOADS, CUSTOMERS AND REVENUE The purpose of this evidence is to present the Company s load, customer and distribution revenue forecast for the test year. The

More information

Homogeneity Pursuit. Jianqing Fan

Homogeneity Pursuit. Jianqing Fan Jianqing Fan Princeton University with Tracy Ke and Yichao Wu http://www.princeton.edu/ jqfan June 5, 2014 Get my own profile - Help Amazing Follow this author Grace Wahba 9 Followers Follow new articles

More information

(Part 1) High-dimensional statistics May / 41

(Part 1) High-dimensional statistics May / 41 Theory for the Lasso Recall the linear model Y i = p j=1 β j X (j) i + ɛ i, i = 1,..., n, or, in matrix notation, Y = Xβ + ɛ, To simplify, we assume that the design X is fixed, and that ɛ is N (0, σ 2

More information

Summary of Seasonal Normal Review Investigations CWV Review

Summary of Seasonal Normal Review Investigations CWV Review Summary of Seasonal Normal Review Investigations CWV Review DESC 31 st March 2009 1 Contents Stage 1: The Composite Weather Variable (CWV) An Introduction / background Understanding of calculation Stage

More information

Regression Analysis. y t = β 1 x t1 + β 2 x t2 + β k x tk + ϵ t, t = 1,..., T,

Regression Analysis. y t = β 1 x t1 + β 2 x t2 + β k x tk + ϵ t, t = 1,..., T, Regression Analysis The multiple linear regression model with k explanatory variables assumes that the tth observation of the dependent or endogenous variable y t is described by the linear relationship

More information

By Gareth M. James, Jing Wang and Ji Zhu University of Southern California, University of Michigan and University of Michigan

By Gareth M. James, Jing Wang and Ji Zhu University of Southern California, University of Michigan and University of Michigan The Annals of Statistics 2009, Vol. 37, No. 5A, 2083 2108 DOI: 10.1214/08-AOS641 c Institute of Mathematical Statistics, 2009 FUNCTIONAL LINEAR REGRESSION THAT S INTERPRETABLE 1 arxiv:0908.2918v1 [math.st]

More information

STAT763: Applied Regression Analysis. Multiple linear regression. 4.4 Hypothesis testing

STAT763: Applied Regression Analysis. Multiple linear regression. 4.4 Hypothesis testing STAT763: Applied Regression Analysis Multiple linear regression 4.4 Hypothesis testing Chunsheng Ma E-mail: cma@math.wichita.edu 4.4.1 Significance of regression Null hypothesis (Test whether all β j =

More information

The lasso. Patrick Breheny. February 15. The lasso Convex optimization Soft thresholding

The lasso. Patrick Breheny. February 15. The lasso Convex optimization Soft thresholding Patrick Breheny February 15 Patrick Breheny High-Dimensional Data Analysis (BIOS 7600) 1/24 Introduction Last week, we introduced penalized regression and discussed ridge regression, in which the penalty

More information

November 2018 Weather Summary West Central Research and Outreach Center Morris, MN

November 2018 Weather Summary West Central Research and Outreach Center Morris, MN November 2018 Weather Summary Lower than normal temperatures occurred for the second month. The mean temperature for November was 22.7 F, which is 7.2 F below the average of 29.9 F (1886-2017). This November

More information

of the 7 stations. In case the number of daily ozone maxima in a month is less than 15, the corresponding monthly mean was not computed, being treated

of the 7 stations. In case the number of daily ozone maxima in a month is less than 15, the corresponding monthly mean was not computed, being treated Spatial Trends and Spatial Extremes in South Korean Ozone Seokhoon Yun University of Suwon, Department of Applied Statistics Suwon, Kyonggi-do 445-74 South Korea syun@mail.suwon.ac.kr Richard L. Smith

More information

Holiday Demand Forecasting

Holiday Demand Forecasting Holiday Demand Forecasting Yue Li Senior Research Statistician Developer SAS #AnalyticsX Outline Background Holiday demand modeling techniques Weekend day Holiday dummy variable Two-stage methods Results

More information

Construction of an Informative Hierarchical Prior Distribution: Application to Electricity Load Forecasting

Construction of an Informative Hierarchical Prior Distribution: Application to Electricity Load Forecasting Construction of an Informative Hierarchical Prior Distribution: Application to Electricity Load Forecasting Anne Philippe Laboratoire de Mathématiques Jean Leray Université de Nantes Workshop EDF-INRIA,

More information

Short Term Load Forecasting Based Artificial Neural Network

Short Term Load Forecasting Based Artificial Neural Network Short Term Load Forecasting Based Artificial Neural Network Dr. Adel M. Dakhil Department of Electrical Engineering Misan University Iraq- Misan Dr.adelmanaa@gmail.com Abstract Present study develops short

More information

Course Notes for EE227C (Spring 2018): Convex Optimization and Approximation

Course Notes for EE227C (Spring 2018): Convex Optimization and Approximation Course Notes for EE227C (Spring 2018): Convex Optimization and Approximation Instructor: Moritz Hardt Email: hardt+ee227c@berkeley.edu Graduate Instructor: Max Simchowitz Email: msimchow+ee227c@berkeley.edu

More information

Selection of Smoothing Parameter for One-Step Sparse Estimates with L q Penalty

Selection of Smoothing Parameter for One-Step Sparse Estimates with L q Penalty Journal of Data Science 9(2011), 549-564 Selection of Smoothing Parameter for One-Step Sparse Estimates with L q Penalty Masaru Kanba and Kanta Naito Shimane University Abstract: This paper discusses the

More information

MIT 9.520/6.860, Fall 2018 Statistical Learning Theory and Applications. Class 08: Sparsity Based Regularization. Lorenzo Rosasco

MIT 9.520/6.860, Fall 2018 Statistical Learning Theory and Applications. Class 08: Sparsity Based Regularization. Lorenzo Rosasco MIT 9.520/6.860, Fall 2018 Statistical Learning Theory and Applications Class 08: Sparsity Based Regularization Lorenzo Rosasco Learning algorithms so far ERM + explicit l 2 penalty 1 min w R d n n l(y

More information

Chapter 2 Available Solar Radiation

Chapter 2 Available Solar Radiation Chapter 2 Available Solar Radiation DEFINITIONS Figure shows the primary radiation fluxes on a surface at or near the ground that are important in connection with solar thermal processes. DEFINITIONS It

More information

Bayesian variable selection via. Penalized credible regions. Brian Reich, NCSU. Joint work with. Howard Bondell and Ander Wilson

Bayesian variable selection via. Penalized credible regions. Brian Reich, NCSU. Joint work with. Howard Bondell and Ander Wilson Bayesian variable selection via penalized credible regions Brian Reich, NC State Joint work with Howard Bondell and Ander Wilson Brian Reich, NCSU Penalized credible regions 1 Motivation big p, small n

More information

Colorado State University, Fort Collins, CO Weather Station Monthly Summary Report

Colorado State University, Fort Collins, CO Weather Station Monthly Summary Report Colorado State University, Fort Collins, CO Weather Station Monthly Summary Report Month: December Year: 2017 Temperature: Mean T max was 47.2 F which is 4.4 above the 1981-2010 normal for the month. This

More information

Sensing systems limited by constraints: physical size, time, cost, energy

Sensing systems limited by constraints: physical size, time, cost, energy Rebecca Willett Sensing systems limited by constraints: physical size, time, cost, energy Reduce the number of measurements needed for reconstruction Higher accuracy data subject to constraints Original

More information

Summary of Seasonal Normal Review Investigations. DESC 31 st March 2009

Summary of Seasonal Normal Review Investigations. DESC 31 st March 2009 Summary of Seasonal Normal Review Investigations DESC 31 st March 9 1 Introduction to the Seasonal Normal Review The relationship between weather and NDM demand is key to a number of critical processes

More information

Regularization and Variable Selection via the Elastic Net

Regularization and Variable Selection via the Elastic Net p. 1/1 Regularization and Variable Selection via the Elastic Net Hui Zou and Trevor Hastie Journal of Royal Statistical Society, B, 2005 Presenter: Minhua Chen, Nov. 07, 2008 p. 2/1 Agenda Introduction

More information

MSA220/MVE440 Statistical Learning for Big Data

MSA220/MVE440 Statistical Learning for Big Data MSA220/MVE440 Statistical Learning for Big Data Lecture 9-10 - High-dimensional regression Rebecka Jörnsten Mathematical Sciences University of Gothenburg and Chalmers University of Technology Recap from

More information

Lesson 2: Analysis of time series

Lesson 2: Analysis of time series Lesson 2: Analysis of time series Time series Main aims of time series analysis choosing right model statistical testing forecast driving and optimalisation Problems in analysis of time series time problems

More information

False Discovery Rate

False Discovery Rate False Discovery Rate Peng Zhao Department of Statistics Florida State University December 3, 2018 Peng Zhao False Discovery Rate 1/30 Outline 1 Multiple Comparison and FWER 2 False Discovery Rate 3 FDR

More information

Gradient descent. Barnabas Poczos & Ryan Tibshirani Convex Optimization /36-725

Gradient descent. Barnabas Poczos & Ryan Tibshirani Convex Optimization /36-725 Gradient descent Barnabas Poczos & Ryan Tibshirani Convex Optimization 10-725/36-725 1 Gradient descent First consider unconstrained minimization of f : R n R, convex and differentiable. We want to solve

More information

Product and Inventory Management (35E00300) Forecasting Models Trend analysis

Product and Inventory Management (35E00300) Forecasting Models Trend analysis Product and Inventory Management (35E00300) Forecasting Models Trend analysis Exponential Smoothing Data Storage Shed Sales Period Actual Value(Y t ) Ŷ t-1 α Y t-1 Ŷ t-1 Ŷ t January 10 = 10 0.1 February

More information

Estimating subgroup specific treatment effects via concave fusion

Estimating subgroup specific treatment effects via concave fusion Estimating subgroup specific treatment effects via concave fusion Jian Huang University of Iowa April 6, 2016 Outline 1 Motivation and the problem 2 The proposed model and approach Concave pairwise fusion

More information

Short-term wind forecasting using artificial neural networks (ANNs)

Short-term wind forecasting using artificial neural networks (ANNs) Energy and Sustainability II 197 Short-term wind forecasting using artificial neural networks (ANNs) M. G. De Giorgi, A. Ficarella & M. G. Russo Department of Engineering Innovation, Centro Ricerche Energia

More information

Flexible Spatio-temporal smoothing with array methods

Flexible Spatio-temporal smoothing with array methods Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session IPS046) p.849 Flexible Spatio-temporal smoothing with array methods Dae-Jin Lee CSIRO, Mathematics, Informatics and

More information

A Short Introduction to the Lasso Methodology

A Short Introduction to the Lasso Methodology A Short Introduction to the Lasso Methodology Michael Gutmann sites.google.com/site/michaelgutmann University of Helsinki Aalto University Helsinki Institute for Information Technology March 9, 2016 Michael

More information

CP:

CP: Adeng Pustikaningsih, M.Si. Dosen Jurusan Pendidikan Akuntansi Fakultas Ekonomi Universitas Negeri Yogyakarta CP: 08 222 180 1695 Email : adengpustikaningsih@uny.ac.id Operations Management Forecasting

More information

Forecasting: The First Step in Demand Planning

Forecasting: The First Step in Demand Planning Forecasting: The First Step in Demand Planning Jayant Rajgopal, Ph.D., P.E. University of Pittsburgh Pittsburgh, PA 15261 In a supply chain context, forecasting is the estimation of future demand General

More information

High-dimensional Ordinary Least-squares Projection for Screening Variables

High-dimensional Ordinary Least-squares Projection for Screening Variables 1 / 38 High-dimensional Ordinary Least-squares Projection for Screening Variables Chenlei Leng Joint with Xiangyu Wang (Duke) Conference on Nonparametric Statistics for Big Data and Celebration to Honor

More information

Minwise hashing for large-scale regression and classification with sparse data

Minwise hashing for large-scale regression and classification with sparse data Minwise hashing for large-scale regression and classification with sparse data Nicolai Meinshausen (Seminar für Statistik, ETH Zürich) joint work with Rajen Shah (Statslab, University of Cambridge) Simons

More information

Mathematical introduction to Compressed Sensing

Mathematical introduction to Compressed Sensing Mathematical introduction to Compressed Sensing Lesson 1 : measurements and sparsity Guillaume Lecué ENSAE Mardi 31 janvier 2016 Guillaume Lecué (ENSAE) Compressed Sensing Mardi 31 janvier 2016 1 / 31

More information

A SARIMAX coupled modelling applied to individual load curves intraday forecasting

A SARIMAX coupled modelling applied to individual load curves intraday forecasting A SARIMAX coupled modelling applied to individual load curves intraday forecasting Frédéric Proïa Workshop EDF Institut Henri Poincaré - Paris 05 avril 2012 INRIA Bordeaux Sud-Ouest Institut de Mathématiques

More information

ECE G: Special Topics in Signal Processing: Sparsity, Structure, and Inference

ECE G: Special Topics in Signal Processing: Sparsity, Structure, and Inference ECE 18-898G: Special Topics in Signal Processing: Sparsity, Structure, and Inference Sparse Recovery using L1 minimization - algorithms Yuejie Chi Department of Electrical and Computer Engineering Spring

More information

Homogenization of the Hellenic cloud amount time series

Homogenization of the Hellenic cloud amount time series Homogenization of the Hellenic cloud amount time series A Argiriou 1, A Mamara 2, E Dimadis 1 1 Laboratory of Atmospheric Physics, 2 Hellenic Meteorological Service October 19, 2017 A Argiriou 1, A Mamara

More information

14 Multiple Linear Regression

14 Multiple Linear Regression B.Sc./Cert./M.Sc. Qualif. - Statistics: Theory and Practice 14 Multiple Linear Regression 14.1 The multiple linear regression model In simple linear regression, the response variable y is expressed in

More information

Predicting the Electricity Demand Response via Data-driven Inverse Optimization

Predicting the Electricity Demand Response via Data-driven Inverse Optimization Predicting the Electricity Demand Response via Data-driven Inverse Optimization Workshop on Demand Response and Energy Storage Modeling Zagreb, Croatia Juan M. Morales 1 1 Department of Applied Mathematics,

More information

DAILY QUESTIONS 28 TH JUNE 18 REASONING - CALENDAR

DAILY QUESTIONS 28 TH JUNE 18 REASONING - CALENDAR DAILY QUESTIONS 28 TH JUNE 18 REASONING - CALENDAR LEAP AND NON-LEAP YEAR *A non-leap year has 365 days whereas a leap year has 366 days. (as February has 29 days). *Every year which is divisible by 4

More information

Lecture 24 May 30, 2018

Lecture 24 May 30, 2018 Stats 3C: Theory of Statistics Spring 28 Lecture 24 May 3, 28 Prof. Emmanuel Candes Scribe: Martin J. Zhang, Jun Yan, Can Wang, and E. Candes Outline Agenda: High-dimensional Statistical Estimation. Lasso

More information

Gradient Descent. Ryan Tibshirani Convex Optimization /36-725

Gradient Descent. Ryan Tibshirani Convex Optimization /36-725 Gradient Descent Ryan Tibshirani Convex Optimization 10-725/36-725 Last time: canonical convex programs Linear program (LP): takes the form min x subject to c T x Gx h Ax = b Quadratic program (QP): like

More information

STAT 462-Computational Data Analysis

STAT 462-Computational Data Analysis STAT 462-Computational Data Analysis Chapter 5- Part 2 Nasser Sadeghkhani a.sadeghkhani@queensu.ca October 2017 1 / 27 Outline Shrinkage Methods 1. Ridge Regression 2. Lasso Dimension Reduction Methods

More information

3. If a forecast is too high when compared to an actual outcome, will that forecast error be positive or negative?

3. If a forecast is too high when compared to an actual outcome, will that forecast error be positive or negative? 1. Does a moving average forecast become more or less responsive to changes in a data series when more data points are included in the average? 2. Does an exponential smoothing forecast become more or

More information

EVALUATION OF ALGORITHM PERFORMANCE 2012/13 GAS YEAR SCALING FACTOR AND WEATHER CORRECTION FACTOR

EVALUATION OF ALGORITHM PERFORMANCE 2012/13 GAS YEAR SCALING FACTOR AND WEATHER CORRECTION FACTOR EVALUATION OF ALGORITHM PERFORMANCE /3 GAS YEAR SCALING FACTOR AND WEATHER CORRECTION FACTOR. Background The annual gas year algorithm performance evaluation normally considers three sources of information

More information

An Introduction to Sparse Approximation

An Introduction to Sparse Approximation An Introduction to Sparse Approximation Anna C. Gilbert Department of Mathematics University of Michigan Basic image/signal/data compression: transform coding Approximate signals sparsely Compress images,

More information

Statistics 300B Winter 2018 Final Exam Due 24 Hours after receiving it

Statistics 300B Winter 2018 Final Exam Due 24 Hours after receiving it Statistics 300B Winter 08 Final Exam Due 4 Hours after receiving it Directions: This test is open book and open internet, but must be done without consulting other students. Any consultation of other students

More information

Electric Load Forecasting Using Wavelet Transform and Extreme Learning Machine

Electric Load Forecasting Using Wavelet Transform and Extreme Learning Machine Electric Load Forecasting Using Wavelet Transform and Extreme Learning Machine Song Li 1, Peng Wang 1 and Lalit Goel 1 1 School of Electrical and Electronic Engineering Nanyang Technological University

More information

1-bit Matrix Completion. PAC-Bayes and Variational Approximation

1-bit Matrix Completion. PAC-Bayes and Variational Approximation : PAC-Bayes and Variational Approximation (with P. Alquier) PhD Supervisor: N. Chopin Bayes In Paris, 5 January 2017 (Happy New Year!) Various Topics covered Matrix Completion PAC-Bayesian Estimation Variational

More information

IJESRT. Scientific Journal Impact Factor: (ISRA), Impact Factor: [Gupta et al., 3(5): May, 2014] ISSN:

IJESRT. Scientific Journal Impact Factor: (ISRA), Impact Factor: [Gupta et al., 3(5): May, 2014] ISSN: IJESRT INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY Short Term Weather -Dependent Load Forecasting using Fuzzy Logic Technique Monika Gupta Assistant Professor, Marwadi Education

More information

From Sales to Peak, Getting It Right Long-Term Demand Forecasting

From Sales to Peak, Getting It Right Long-Term Demand Forecasting From Sales to Peak, Getting It Right Long-Term Demand Forecasting 12 th Annual Energy Forecasters Meeting Las Vegas, NV April 2 April 3, 2014 Terry Baxter, NV Energy Manager, Forecasting Getting the Peak

More information

Confidence Intervals for Low-dimensional Parameters with High-dimensional Data

Confidence Intervals for Low-dimensional Parameters with High-dimensional Data Confidence Intervals for Low-dimensional Parameters with High-dimensional Data Cun-Hui Zhang and Stephanie S. Zhang Rutgers University and Columbia University September 14, 2012 Outline Introduction Methodology

More information

High-dimensional statistics: Some progress and challenges ahead

High-dimensional statistics: Some progress and challenges ahead High-dimensional statistics: Some progress and challenges ahead Martin Wainwright UC Berkeley Departments of Statistics, and EECS University College, London Master Class: Lecture Joint work with: Alekh

More information

Application and verification of the ECMWF products Report 2007

Application and verification of the ECMWF products Report 2007 Application and verification of the ECMWF products Report 2007 National Meteorological Administration Romania 1. Summary of major highlights The medium range forecast activity within the National Meteorological

More information

The Adaptive Lasso and Its Oracle Properties Hui Zou (2006), JASA

The Adaptive Lasso and Its Oracle Properties Hui Zou (2006), JASA The Adaptive Lasso and Its Oracle Properties Hui Zou (2006), JASA Presented by Dongjun Chung March 12, 2010 Introduction Definition Oracle Properties Computations Relationship: Nonnegative Garrote Extensions:

More information

SOLVING NON-CONVEX LASSO TYPE PROBLEMS WITH DC PROGRAMMING. Gilles Gasso, Alain Rakotomamonjy and Stéphane Canu

SOLVING NON-CONVEX LASSO TYPE PROBLEMS WITH DC PROGRAMMING. Gilles Gasso, Alain Rakotomamonjy and Stéphane Canu SOLVING NON-CONVEX LASSO TYPE PROBLEMS WITH DC PROGRAMMING Gilles Gasso, Alain Rakotomamonjy and Stéphane Canu LITIS - EA 48 - INSA/Universite de Rouen Avenue de l Université - 768 Saint-Etienne du Rouvray

More information

Multi-Plant Photovoltaic Energy Forecasting Challenge with Regression Tree Ensembles and Hourly Average Forecasts

Multi-Plant Photovoltaic Energy Forecasting Challenge with Regression Tree Ensembles and Hourly Average Forecasts Multi-Plant Photovoltaic Energy Forecasting Challenge with Regression Tree Ensembles and Hourly Average Forecasts Kathrin Bujna 1 and Martin Wistuba 2 1 Paderborn University 2 IBM Research Ireland Abstract.

More information

Asymptotic Equivalence of Regularization Methods in Thresholded Parameter Space

Asymptotic Equivalence of Regularization Methods in Thresholded Parameter Space Asymptotic Equivalence of Regularization Methods in Thresholded Parameter Space Jinchi Lv Data Sciences and Operations Department Marshall School of Business University of Southern California http://bcf.usc.edu/

More information

Various types of likelihood

Various types of likelihood Various types of likelihood 1. likelihood, marginal likelihood, conditional likelihood, profile likelihood, adjusted profile likelihood 2. semi-parametric likelihood, partial likelihood 3. empirical likelihood,

More information

A Blockwise Descent Algorithm for Group-penalized Multiresponse and Multinomial Regression

A Blockwise Descent Algorithm for Group-penalized Multiresponse and Multinomial Regression A Blockwise Descent Algorithm for Group-penalized Multiresponse and Multinomial Regression Noah Simon Jerome Friedman Trevor Hastie November 5, 013 Abstract In this paper we purpose a blockwise descent

More information

Champaign-Urbana 2000 Annual Weather Summary

Champaign-Urbana 2000 Annual Weather Summary Champaign-Urbana 2000 Annual Weather Summary ILLINOIS STATE WATER SURVEY 2204 Griffith Dr. Champaign, IL 61820 wxobsrvr@sws.uiuc.edu Maria Peters, Weather Observer January: January started on a mild note,

More information

Online Learning and Sequential Decision Making

Online Learning and Sequential Decision Making Online Learning and Sequential Decision Making Emilie Kaufmann CNRS & CRIStAL, Inria SequeL, emilie.kaufmann@univ-lille.fr Research School, ENS Lyon, Novembre 12-13th 2018 Emilie Kaufmann Online Learning

More information

ZONAL AND REGIONAL LOAD FORECASTING IN THE NEW ENGLAND WHOLESALE ELECTRICITY MARKET: A SEMIPARAMETRIC REGRESSION APPROACH

ZONAL AND REGIONAL LOAD FORECASTING IN THE NEW ENGLAND WHOLESALE ELECTRICITY MARKET: A SEMIPARAMETRIC REGRESSION APPROACH ZONAL AND REGIONAL LOAD FORECASTING IN THE NEW ENGLAND WHOLESALE ELECTRICITY MARKET: A SEMIPARAMETRIC REGRESSION APPROACH Jonathan T. Farland and Bernard J. Morzuch * Abstract We use a regression-based

More information

Nonconcave Penalized Likelihood with A Diverging Number of Parameters

Nonconcave Penalized Likelihood with A Diverging Number of Parameters Nonconcave Penalized Likelihood with A Diverging Number of Parameters Jianqing Fan and Heng Peng Presenter: Jiale Xu March 12, 2010 Jianqing Fan and Heng Peng Presenter: JialeNonconcave Xu () Penalized

More information

Predicting wildfire ignitions, escapes, and large fire activity using Predictive Service s 7-Day Fire Potential Outlook in the western USA

Predicting wildfire ignitions, escapes, and large fire activity using Predictive Service s 7-Day Fire Potential Outlook in the western USA http://dx.doi.org/10.14195/978-989-26-0884-6_135 Chapter 4 - Fire Risk Assessment and Climate Change Predicting wildfire ignitions, escapes, and large fire activity using Predictive Service s 7-Day Fire

More information

FORECAST ACCURACY REPORT 2017 FOR THE 2016 NATIONAL ELECTRICITY FORECASTING REPORT

FORECAST ACCURACY REPORT 2017 FOR THE 2016 NATIONAL ELECTRICITY FORECASTING REPORT FORECAST ACCURACY REPORT 2017 FOR THE 2016 NATIONAL ELECTRICITY FORECASTING REPORT Published: November 2017 Purpose The National Electricity Rules (Rules) require AEMO to report to the Reliability Panel

More information

The retrieval of the atmospheric humidity parameters from NOAA/AMSU data for winter season.

The retrieval of the atmospheric humidity parameters from NOAA/AMSU data for winter season. The retrieval of the atmospheric humidity parameters from NOAA/AMSU data for winter season. Izabela Dyras, Bożena Łapeta, Danuta Serafin-Rek Satellite Research Department, Institute of Meteorology and

More information

High-dimensional regression

High-dimensional regression High-dimensional regression Advanced Methods for Data Analysis 36-402/36-608) Spring 2014 1 Back to linear regression 1.1 Shortcomings Suppose that we are given outcome measurements y 1,... y n R, and

More information