Predicting intraday-load curve using High-D methods
|
|
- Gwen Lane
- 5 years ago
- Views:
Transcription
1 Predicting intraday-load curve using High-D methods LPMA- Université Paris-Diderot-Paris 7 Mathilde Mougeot UPD, Vincent Lefieux RTE, Laurence Maillard RTE Horizon Maths 2013
2 Intraday load curve during a week Monday January 25 th to Sunday January 31 th 9 x
3 Intraday load curve forecasting -here 48h- 5.6 x Y 3.8 Yapx tpred
4 Forecasting procedure 1 Construction of a smart encyclopedia of past scenarios out of a data basis using learning algorithms. 2 Build a set of prediction experts consulting the encyclopedia. 3 Aggregate the prediction experts
5 Data basis The past data basis Electrical consumption of the past Meteorological input Endogenous variables: calendar data, functional bases
6 Electrical consumption of the past Recorded every half hour from January 1 st, 2003 to August 31 th, For this period of time, the global consumption signal is split into N = 2800 sub signals (Y 1,...,Y t,...,y N ). Y t R n, defines the intra day load curve for the t th day of size n = 48.
7 Intraday load curve for seven days Monday January 25 th to Sunday January 31 th 9 x
8 Example of calendar variations (seasonnal) 6 x x x x Figure: autumn, winter, spring and summer
9 Meteorological inputs A total of 371 (=2x39+293) meteorological variables recorded each day half-hourly over the 2800 days of the same period of time. Temperature: T k for k = 1,...,39 measured in 39 weather stations scattered all over the French territory. Cloud Cover: N k for k = 1,...,39 measured in the same 39 weather stations. Wind: W k for k = 1,...,293 available at 293 network points scattered all over the territory.
10 Weather stations Figure: Temperature and Cloud covering measurement stations. Wind stations
11 Brest- Lille- Marseille (a) T (b) CC (c) W Figure: Brest (blue line), Lille (red line) and Marseille (green).
12 Building the smart encyclopedia : several issues 1 Large dimension 2 All the variables (load curve, meteo) are highly correlated 3 Necessity to introduce typical patterns
13 Load curves : functional regression and clustering Compression of the intraday load curves : the signals Y t are treated as functions of the time and sparsely represented on a dictionary of functions (combination of Fourier basis and Haar basis). Clustering the previous sparse representations of the signals, into homogeneous groups. Pattern : define a pattern of consumption inside each group. Calendar attribution : Translate the clustering into calendar (predictable) variables
14 Reduced set of explanatory variables For each t index of the day of interest, we register the daily electrical consumption signal Y t and Z t = [P t M t ] = [C t B t [T] t [N] t [W] t ] is the concatenation of the "calendar and functional" variables and "climate variables" also in reduced dimension.
15 Sparse approximation on the learning set Sparse Approximation of each consumption day on a learning set of days ( ), using the reduced set of explanatory variables. For each day t of the learning set, we build an approximation Ŷ t of the (observed) signal Y t with the help of the new set of explanatory variables (Z t ): Ŷ t = G t (Z t )
16 Sparse approximation on the learning set Sparse Approximation For each day t of the learning set, Z t = [P t M t ] = [C t B t [T] t [N] t [W] t ] is the concatenation of "calendar" variables and "climate variables. We build an approximation Ŷ t of the (observed) signal Y t using (Z t ): Ŷ t = G t (Z t ) G t (Z t ) = Z tˆβt ( ) Sparse Approximation and Knowledge Extraction for Electrical Consumption Signals, 2012, M. Mougeot, D. P., K. Tribouley & V. Lefieux, L. Teyssier-Maillard
17 High dimensional Linear Models Y = Xβ +ǫ β IR p is the unknown parameter (to be estimated) ǫ = (ǫ 1,...,ǫ n ) is a (non observed) vector of random errors. It is assumed to be variables i.i.d. N(0,σ 2 ) X is a known matrix n p. High dimension : p >> n t
18 Smart Encyclopedia contents For each day t, 1 t 2800: the daily electrical consumption Y t. a qualitative description of t, given by calendar statements, clustering allocation. the meteorological indicators over the French territory M t = [T t N t W t ]. the estimated coefficient ˆβ t. the approximation of the daily consumption Ŷ t = Z t ˆβt.
19 Forecasting procedure Forecasting using the encyclopedia Construction of a set of forecasting experts. Aggregation of the experts.
20 Expert associated to the strategy M Forecasting experts Strategy : M a function, data dependent or not, from N to N such that for any d N,M(d) < d (purely non anticipative). Plug-in To the strategy M we associate the expert Ỹ M t : the prediction of the signal of day t using forecasting strategy M, Ỹ M t = G M(t) (Z t ) = Z tˆβm(t)
21 Examples of strategies : time depending tm1: Refer to the day before: (The coefficients used for prediction are those calculated the previous day) M(d) = d 1 Ỹ tm1 t = Z tˆβ t 1 tm7: Refer to one week before: M(d) = d 7 Ỹt tm7 = Z tˆβt 7
22 Experts introducing meteorological scenarios T: Find the day having the closest temperature indicators, regarding the sup distance (over the days, and over the indicators): M(d) = ArgMin t sup k {1,...,6}, i {1,...,48} T k d (i) Tk t (i) T m : Find the day having the closest median temperature with the sup distance (over the days): M(d) = ArgMin t sup i {1,...,48} T 3 d (i) T3 t(i)
23 Experts introducing a climatic configuration of the day constrained by the type of the day N s/j : Closest cloud covering indicators (min, max, med, std) regarding the sup distance for days which are of the same type as Y t : M = ArgMin J(d)=J(t) sup i,k N k d (i) Nk t(i) within the same type of day as Y t
24 MAPE error For day t, the prediction MAPE error over the interval [0,T] is defined by: MAPE(Y,Ỹ M t )(T) = 1 T MISE(Y,Ỹ M t )(T) = 1 T T i=1 ỸM t (i) Y t (i) Y t (i) T Ỹt M (i) Y t (i) 2 i=1
25 Prediction evaluation Names average median std Naive Yday Week T med T med /W T med /N T T/G T/D T/C N N/G N/D N/C W W/G W/D W/C
26 Prediction evaluation-comparing experts Yday Week Tm Tm/N Tm/W T T/g T/d T/c N N/g N/d N/c W W/g W/d W/c Figure: Frequencies of best performances computed for one year of data from September 1 th 2009 to August 31 th 2010.
27 Prediction evaluation-comparing experts on days 0.25 Ranking Predictor Performances per Day tm1 tm7 Ts Ts/N Tm Tm/N T/g N/g T/j N/j T/c N/c Figure: Percentage of best predictor among days (1:monday,... 7:sunday)
28 Prediction evaluation-comparing experts 0.18 Best Predictor Performances per Month tm1 tm7 Ts Ts/N Tm Tm/N T/g N/g T/j N/j T/c N/c Figure: Percentage of best predictor among month
29 Aggregation of predictors: Exponential weights (inspired by various theoretical results -see Lecue, Rigollet, Stolz, Tsybakov,...-) with M Ỹ wgt d m=1 = wm d Ỹm d M m=1 wm d w M d = exp( 1 Tθ T ỸM d (i) Y d (i) 2 ) i=1 θ is a parameter, (often called temperature in physic applications, see the discussion below) T = Tpred.
30 Forecasting (mape=0.7%). 5.6 x Y 3.8 Yapx tpred
31 Sparse methods
32 Sparse approximation on the learning set Sparse Approximation For each day t of the learning set, Z t = [P t M t ] = [C t B t [T] t [N] t [W] t ] is the concatenation of "calendar" variables and "climate variables. We build an approximation Ŷ t of the (observed) signal Y t using (Z t ): Ŷ t = G t (Z t ) G t (Z t ) = Z tˆβt
33 High dimensional Linear Models Y = Xβ +ǫ β IR p is the unknown parameter (to be estimated) ǫ = (ǫ 1,...,ǫ n ) is a (non observed) vector of random errors. It is assumed to be variables i.i.d. N(0,σ 2 ) X is a known matrix n p. High dimension : p >> n t ( ) M. Mougeot, D. P., K. Tribouley, JRSS B 2012,B Stat. Methodol. vol 74
34 Conditions generally required to solve the problem Sparsity conditions on the vector β restricted identity conditions on the matrix X
35 Sparsity conditions
36 Restricted identity property For C {1,... p}, denote X C the matrix X restricted to the raws which are in C and the associated Gram-matrix M(C) := 1 n Xt C X C Restricted identity property means that M(C) is almost the identity matrix for any C small enough.
37 Example 1: RIP RIP(m 0,ν) assumes that There exist 0 ν < 1 and m 0 1 such that : x IR m, x 2 l 2 (m) (1 ν) xt M(C)x x 2 l 2 (m) (1+ν),
38 Example 2: Coherence condition M := 1 n Xt X. M jj = 1 for all j. Coherence τ n = sup M lm = sup 1 l m l m n Coherence = RIP( ν/τ n,ν) n X il X im i=1
39 Sparsity conditions #{l {1,...,p}, β l 0} S β l q M, 0 < q < 1 (B q (M)) l SMALL NUMBER OF BIG COEFFICIENTS
40 Penalization for sparsity Many penalizations introduced historically in the regression framework (to put identification constraints on β) Ridge: E(β,λ) = Y Xβ 2 +λσ j β 2 j Lasso: E(β,λ) = Y Xβ 2 +λσ j β j Scad: E(β,λ) = Y Xβ 2 +λσ j w j g(β j ) Solutions based on: Convex Optimization for L1, non convex Opti. for Scad Fan & Lv (2008, 2010), Candes & Tao (2007)... Many others...
41 2-thresholding-step Procedures Y = Xβ +ǫ Y (n 1), X (n p) steps compute size Step1=pre-selection Find b Leaders X b (n, b) b < n << p REGRESSION on Leaders β = (X b X b ) 1 X by (1, b) Step2=denoising the coefficients ˆβ (1, Ŝ)
42 Winter forecast 9 x Figure: Forecast (solid blue line) and observed (dashed dark line) electrical consumption for a winter week from Monday February 1 st to Sunday January 7 th 2010.
43 Spring forecast 6 x Figure: Forecast (solid blue line) and observed (dashed dark line) electrical consumption for a spring week from Monday June 14 th to Sunday June 21 th 2010.
High dimensional statistical learning methods applied to energy data sets
High dimensional statistical learning methods applied to energy data sets Dominique Picard Université Paris-Diderot LPMA Data science challenge : Post-modern Rationality Replace human logic by statistical
More informationNonparametric forecasting of the French load curve
An overview RTE & UPMC-ISUP ISNPS 214, Cadix June 15, 214 Table of contents 1 Introduction 2 MAVE modeling 3 IBR modeling 4 Sparse modeling Electrical context Generation must be strictly equal to consumption
More informationGraph Functional Methods for Climate Partitioning
Graph Functional Methods for Climate Partitioning Mathilde Mougeot - with D. Picard, V. Lefieux*, M. Marchand* Université Paris Diderot, France *Réseau Transport Electrique (RTE) Buenos Aires, 2015 Mathilde
More informationMultivariate Regression Model Results
Updated: August, 0 Page of Multivariate Regression Model Results 4 5 6 7 8 This exhibit provides the results of the load model forecast discussed in Schedule. Included is the forecast of short term system
More informationForecasting the electricity consumption by aggregating specialized experts
Forecasting the electricity consumption by aggregating specialized experts Pierre Gaillard (EDF R&D, ENS Paris) with Yannig Goude (EDF R&D) Gilles Stoltz (CNRS, ENS Paris, HEC Paris) June 2013 WIPFOR Goal
More informationDefining Normal Weather for Energy and Peak Normalization
Itron White Paper Energy Forecasting Defining Normal Weather for Energy and Peak Normalization J. Stuart McMenamin, Ph.D Managing Director, Itron Forecasting 2008, Itron Inc. All rights reserved. 1 Introduction
More informationpeak half-hourly New South Wales
Forecasting long-term peak half-hourly electricity demand for New South Wales Dr Shu Fan B.S., M.S., Ph.D. Professor Rob J Hyndman B.Sc. (Hons), Ph.D., A.Stat. Business & Economic Forecasting Unit Report
More informationElectricity Demand Forecasting using Multi-Task Learning
Electricity Demand Forecasting using Multi-Task Learning Jean-Baptiste Fiot, Francesco Dinuzzo Dublin Machine Learning Meetup - July 2017 1 / 32 Outline 1 Introduction 2 Problem Formulation 3 Kernels 4
More informationThe Dayton Power and Light Company Load Profiling Methodology Revised 7/1/2017
The Dayton Power and Light Company Load Profiling Methodology Revised 7/1/2017 Overview of Methodology Dayton Power and Light (DP&L) load profiles will be used to estimate hourly loads for customers without
More informationpeak half-hourly Tasmania
Forecasting long-term peak half-hourly electricity demand for Tasmania Dr Shu Fan B.S., M.S., Ph.D. Professor Rob J Hyndman B.Sc. (Hons), Ph.D., A.Stat. Business & Economic Forecasting Unit Report for
More informationA Bayesian Perspective on Residential Demand Response Using Smart Meter Data
A Bayesian Perspective on Residential Demand Response Using Smart Meter Data Datong-Paul Zhou, Maximilian Balandat, and Claire Tomlin University of California, Berkeley [datong.zhou, balandat, tomlin]@eecs.berkeley.edu
More informationShort-Term Electrical Load Forecasting for Iraqi Power System based on Multiple Linear Regression Method
Volume 00 No., August 204 Short-Term Electrical Load Forecasting for Iraqi Power System based on Multiple Linear Regression Method Firas M. Tuaimah University of Baghdad Baghdad, Iraq Huda M. Abdul Abass
More informationConsistent high-dimensional Bayesian variable selection via penalized credible regions
Consistent high-dimensional Bayesian variable selection via penalized credible regions Howard Bondell bondell@stat.ncsu.edu Joint work with Brian Reich Howard Bondell p. 1 Outline High-Dimensional Variable
More informationDevelopment of Short-term Demand Forecasting Model And its Application in Analysis of Resource Adequacy. For discussion purposes only Draft
Development of Short-term Demand Forecasting Model And its Application in Analysis of Resource Adequacy For discussion purposes only Draft January 31, 2007 INTRODUCTION In this paper we will present the
More informationWEATHER NORMALIZATION METHODS AND ISSUES. Stuart McMenamin Mark Quan David Simons
WEATHER NORMALIZATION METHODS AND ISSUES Stuart McMenamin Mark Quan David Simons Itron Forecasting Brown Bag September 17, 2013 Please Remember» Phones are Muted: In order to help this session run smoothly,
More informationStepwise Searching for Feature Variables in High-Dimensional Linear Regression
Stepwise Searching for Feature Variables in High-Dimensional Linear Regression Qiwei Yao Department of Statistics, London School of Economics q.yao@lse.ac.uk Joint work with: Hongzhi An, Chinese Academy
More informationDemand Forecasting Models
E 2017 PSE Integrated Resource Plan Demand Forecasting Models This appendix describes the econometric models used in creating the demand forecasts for PSE s 2017 IRP analysis. Contents 1. ELECTRIC BILLED
More informationA Sparse Linear Model and Significance Test. for Individual Consumption Prediction
A Sparse Linear Model and Significance Test 1 for Individual Consumption Prediction Pan Li, Baosen Zhang, Yang Weng, and Ram Rajagopal arxiv:1511.01853v3 [stat.ml] 21 Feb 2017 Abstract Accurate prediction
More informationNon-linear Supervised High Frequency Trading Strategies with Applications in US Equity Markets
Non-linear Supervised High Frequency Trading Strategies with Applications in US Equity Markets Nan Zhou, Wen Cheng, Ph.D. Associate, Quantitative Research, J.P. Morgan nan.zhou@jpmorgan.com The 4th Annual
More informationTime Series and Forecasting
Chapter 8 Time Series and Forecasting 8.1 Introduction A time series is a collection of observations made sequentially in time. When observations are made continuously, the time series is said to be continuous;
More informationA summary of the weather year based on data from the Zumwalt weather station
ZUMWALT PRAIRIE WEATHER 2016 A summary of the weather year based on data from the Zumwalt weather station Figure 1. An unusual summer storm on July 10, 2016 brought the second-largest precipitation day
More informationOslo Class 6 Sparsity based regularization
RegML2017@SIMULA Oslo Class 6 Sparsity based regularization Lorenzo Rosasco UNIGE-MIT-IIT May 4, 2017 Learning from data Possible only under assumptions regularization min Ê(w) + λr(w) w Smoothness Sparsity
More informationSmoothly Clipped Absolute Deviation (SCAD) for Correlated Variables
Smoothly Clipped Absolute Deviation (SCAD) for Correlated Variables LIB-MA, FSSM Cadi Ayyad University (Morocco) COMPSTAT 2010 Paris, August 22-27, 2010 Motivations Fan and Li (2001), Zou and Li (2008)
More informationAnalysis of Fast Input Selection: Application in Time Series Prediction
Analysis of Fast Input Selection: Application in Time Series Prediction Jarkko Tikka, Amaury Lendasse, and Jaakko Hollmén Helsinki University of Technology, Laboratory of Computer and Information Science,
More informationpeak half-hourly South Australia
Forecasting long-term peak half-hourly electricity demand for South Australia Dr Shu Fan B.S., M.S., Ph.D. Professor Rob J Hyndman B.Sc. (Hons), Ph.D., A.Stat. Business & Economic Forecasting Unit Report
More informationA Bootstrap Lasso + Partial Ridge Method to Construct Confidence Intervals for Parameters in High-dimensional Sparse Linear Models
A Bootstrap Lasso + Partial Ridge Method to Construct Confidence Intervals for Parameters in High-dimensional Sparse Linear Models Jingyi Jessica Li Department of Statistics University of California, Los
More informationPredicting Future Energy Consumption CS229 Project Report
Predicting Future Energy Consumption CS229 Project Report Adrien Boiron, Stephane Lo, Antoine Marot Abstract Load forecasting for electric utilities is a crucial step in planning and operations, especially
More informationChapter 3. Regression-Based Models for Developing Commercial Demand Characteristics Investigation
Chapter Regression-Based Models for Developing Commercial Demand Characteristics Investigation. Introduction Commercial area is another important area in terms of consume high electric energy in Japan.
More informationSeptember 2018 Weather Summary West Central Research and Outreach Center Morris, MN
September 2018 Weather Summary The mean temperature for September was 60.6 F, which is 1.5 F above the average of 59.1 F (1886-2017). The high temperature for the month was 94 F on September 16 th. The
More informationLOADS, CUSTOMERS AND REVENUE
EB-00-0 Exhibit K Tab Schedule Page of 0 0 LOADS, CUSTOMERS AND REVENUE The purpose of this evidence is to present the Company s load, customer and distribution revenue forecast for the test year. The
More informationHomogeneity Pursuit. Jianqing Fan
Jianqing Fan Princeton University with Tracy Ke and Yichao Wu http://www.princeton.edu/ jqfan June 5, 2014 Get my own profile - Help Amazing Follow this author Grace Wahba 9 Followers Follow new articles
More information(Part 1) High-dimensional statistics May / 41
Theory for the Lasso Recall the linear model Y i = p j=1 β j X (j) i + ɛ i, i = 1,..., n, or, in matrix notation, Y = Xβ + ɛ, To simplify, we assume that the design X is fixed, and that ɛ is N (0, σ 2
More informationSummary of Seasonal Normal Review Investigations CWV Review
Summary of Seasonal Normal Review Investigations CWV Review DESC 31 st March 2009 1 Contents Stage 1: The Composite Weather Variable (CWV) An Introduction / background Understanding of calculation Stage
More informationRegression Analysis. y t = β 1 x t1 + β 2 x t2 + β k x tk + ϵ t, t = 1,..., T,
Regression Analysis The multiple linear regression model with k explanatory variables assumes that the tth observation of the dependent or endogenous variable y t is described by the linear relationship
More informationBy Gareth M. James, Jing Wang and Ji Zhu University of Southern California, University of Michigan and University of Michigan
The Annals of Statistics 2009, Vol. 37, No. 5A, 2083 2108 DOI: 10.1214/08-AOS641 c Institute of Mathematical Statistics, 2009 FUNCTIONAL LINEAR REGRESSION THAT S INTERPRETABLE 1 arxiv:0908.2918v1 [math.st]
More informationSTAT763: Applied Regression Analysis. Multiple linear regression. 4.4 Hypothesis testing
STAT763: Applied Regression Analysis Multiple linear regression 4.4 Hypothesis testing Chunsheng Ma E-mail: cma@math.wichita.edu 4.4.1 Significance of regression Null hypothesis (Test whether all β j =
More informationThe lasso. Patrick Breheny. February 15. The lasso Convex optimization Soft thresholding
Patrick Breheny February 15 Patrick Breheny High-Dimensional Data Analysis (BIOS 7600) 1/24 Introduction Last week, we introduced penalized regression and discussed ridge regression, in which the penalty
More informationNovember 2018 Weather Summary West Central Research and Outreach Center Morris, MN
November 2018 Weather Summary Lower than normal temperatures occurred for the second month. The mean temperature for November was 22.7 F, which is 7.2 F below the average of 29.9 F (1886-2017). This November
More informationof the 7 stations. In case the number of daily ozone maxima in a month is less than 15, the corresponding monthly mean was not computed, being treated
Spatial Trends and Spatial Extremes in South Korean Ozone Seokhoon Yun University of Suwon, Department of Applied Statistics Suwon, Kyonggi-do 445-74 South Korea syun@mail.suwon.ac.kr Richard L. Smith
More informationHoliday Demand Forecasting
Holiday Demand Forecasting Yue Li Senior Research Statistician Developer SAS #AnalyticsX Outline Background Holiday demand modeling techniques Weekend day Holiday dummy variable Two-stage methods Results
More informationConstruction of an Informative Hierarchical Prior Distribution: Application to Electricity Load Forecasting
Construction of an Informative Hierarchical Prior Distribution: Application to Electricity Load Forecasting Anne Philippe Laboratoire de Mathématiques Jean Leray Université de Nantes Workshop EDF-INRIA,
More informationShort Term Load Forecasting Based Artificial Neural Network
Short Term Load Forecasting Based Artificial Neural Network Dr. Adel M. Dakhil Department of Electrical Engineering Misan University Iraq- Misan Dr.adelmanaa@gmail.com Abstract Present study develops short
More informationCourse Notes for EE227C (Spring 2018): Convex Optimization and Approximation
Course Notes for EE227C (Spring 2018): Convex Optimization and Approximation Instructor: Moritz Hardt Email: hardt+ee227c@berkeley.edu Graduate Instructor: Max Simchowitz Email: msimchow+ee227c@berkeley.edu
More informationSelection of Smoothing Parameter for One-Step Sparse Estimates with L q Penalty
Journal of Data Science 9(2011), 549-564 Selection of Smoothing Parameter for One-Step Sparse Estimates with L q Penalty Masaru Kanba and Kanta Naito Shimane University Abstract: This paper discusses the
More informationMIT 9.520/6.860, Fall 2018 Statistical Learning Theory and Applications. Class 08: Sparsity Based Regularization. Lorenzo Rosasco
MIT 9.520/6.860, Fall 2018 Statistical Learning Theory and Applications Class 08: Sparsity Based Regularization Lorenzo Rosasco Learning algorithms so far ERM + explicit l 2 penalty 1 min w R d n n l(y
More informationChapter 2 Available Solar Radiation
Chapter 2 Available Solar Radiation DEFINITIONS Figure shows the primary radiation fluxes on a surface at or near the ground that are important in connection with solar thermal processes. DEFINITIONS It
More informationBayesian variable selection via. Penalized credible regions. Brian Reich, NCSU. Joint work with. Howard Bondell and Ander Wilson
Bayesian variable selection via penalized credible regions Brian Reich, NC State Joint work with Howard Bondell and Ander Wilson Brian Reich, NCSU Penalized credible regions 1 Motivation big p, small n
More informationColorado State University, Fort Collins, CO Weather Station Monthly Summary Report
Colorado State University, Fort Collins, CO Weather Station Monthly Summary Report Month: December Year: 2017 Temperature: Mean T max was 47.2 F which is 4.4 above the 1981-2010 normal for the month. This
More informationSensing systems limited by constraints: physical size, time, cost, energy
Rebecca Willett Sensing systems limited by constraints: physical size, time, cost, energy Reduce the number of measurements needed for reconstruction Higher accuracy data subject to constraints Original
More informationSummary of Seasonal Normal Review Investigations. DESC 31 st March 2009
Summary of Seasonal Normal Review Investigations DESC 31 st March 9 1 Introduction to the Seasonal Normal Review The relationship between weather and NDM demand is key to a number of critical processes
More informationRegularization and Variable Selection via the Elastic Net
p. 1/1 Regularization and Variable Selection via the Elastic Net Hui Zou and Trevor Hastie Journal of Royal Statistical Society, B, 2005 Presenter: Minhua Chen, Nov. 07, 2008 p. 2/1 Agenda Introduction
More informationMSA220/MVE440 Statistical Learning for Big Data
MSA220/MVE440 Statistical Learning for Big Data Lecture 9-10 - High-dimensional regression Rebecka Jörnsten Mathematical Sciences University of Gothenburg and Chalmers University of Technology Recap from
More informationLesson 2: Analysis of time series
Lesson 2: Analysis of time series Time series Main aims of time series analysis choosing right model statistical testing forecast driving and optimalisation Problems in analysis of time series time problems
More informationFalse Discovery Rate
False Discovery Rate Peng Zhao Department of Statistics Florida State University December 3, 2018 Peng Zhao False Discovery Rate 1/30 Outline 1 Multiple Comparison and FWER 2 False Discovery Rate 3 FDR
More informationGradient descent. Barnabas Poczos & Ryan Tibshirani Convex Optimization /36-725
Gradient descent Barnabas Poczos & Ryan Tibshirani Convex Optimization 10-725/36-725 1 Gradient descent First consider unconstrained minimization of f : R n R, convex and differentiable. We want to solve
More informationProduct and Inventory Management (35E00300) Forecasting Models Trend analysis
Product and Inventory Management (35E00300) Forecasting Models Trend analysis Exponential Smoothing Data Storage Shed Sales Period Actual Value(Y t ) Ŷ t-1 α Y t-1 Ŷ t-1 Ŷ t January 10 = 10 0.1 February
More informationEstimating subgroup specific treatment effects via concave fusion
Estimating subgroup specific treatment effects via concave fusion Jian Huang University of Iowa April 6, 2016 Outline 1 Motivation and the problem 2 The proposed model and approach Concave pairwise fusion
More informationShort-term wind forecasting using artificial neural networks (ANNs)
Energy and Sustainability II 197 Short-term wind forecasting using artificial neural networks (ANNs) M. G. De Giorgi, A. Ficarella & M. G. Russo Department of Engineering Innovation, Centro Ricerche Energia
More informationFlexible Spatio-temporal smoothing with array methods
Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session IPS046) p.849 Flexible Spatio-temporal smoothing with array methods Dae-Jin Lee CSIRO, Mathematics, Informatics and
More informationA Short Introduction to the Lasso Methodology
A Short Introduction to the Lasso Methodology Michael Gutmann sites.google.com/site/michaelgutmann University of Helsinki Aalto University Helsinki Institute for Information Technology March 9, 2016 Michael
More informationCP:
Adeng Pustikaningsih, M.Si. Dosen Jurusan Pendidikan Akuntansi Fakultas Ekonomi Universitas Negeri Yogyakarta CP: 08 222 180 1695 Email : adengpustikaningsih@uny.ac.id Operations Management Forecasting
More informationForecasting: The First Step in Demand Planning
Forecasting: The First Step in Demand Planning Jayant Rajgopal, Ph.D., P.E. University of Pittsburgh Pittsburgh, PA 15261 In a supply chain context, forecasting is the estimation of future demand General
More informationHigh-dimensional Ordinary Least-squares Projection for Screening Variables
1 / 38 High-dimensional Ordinary Least-squares Projection for Screening Variables Chenlei Leng Joint with Xiangyu Wang (Duke) Conference on Nonparametric Statistics for Big Data and Celebration to Honor
More informationMinwise hashing for large-scale regression and classification with sparse data
Minwise hashing for large-scale regression and classification with sparse data Nicolai Meinshausen (Seminar für Statistik, ETH Zürich) joint work with Rajen Shah (Statslab, University of Cambridge) Simons
More informationMathematical introduction to Compressed Sensing
Mathematical introduction to Compressed Sensing Lesson 1 : measurements and sparsity Guillaume Lecué ENSAE Mardi 31 janvier 2016 Guillaume Lecué (ENSAE) Compressed Sensing Mardi 31 janvier 2016 1 / 31
More informationA SARIMAX coupled modelling applied to individual load curves intraday forecasting
A SARIMAX coupled modelling applied to individual load curves intraday forecasting Frédéric Proïa Workshop EDF Institut Henri Poincaré - Paris 05 avril 2012 INRIA Bordeaux Sud-Ouest Institut de Mathématiques
More informationECE G: Special Topics in Signal Processing: Sparsity, Structure, and Inference
ECE 18-898G: Special Topics in Signal Processing: Sparsity, Structure, and Inference Sparse Recovery using L1 minimization - algorithms Yuejie Chi Department of Electrical and Computer Engineering Spring
More informationHomogenization of the Hellenic cloud amount time series
Homogenization of the Hellenic cloud amount time series A Argiriou 1, A Mamara 2, E Dimadis 1 1 Laboratory of Atmospheric Physics, 2 Hellenic Meteorological Service October 19, 2017 A Argiriou 1, A Mamara
More information14 Multiple Linear Regression
B.Sc./Cert./M.Sc. Qualif. - Statistics: Theory and Practice 14 Multiple Linear Regression 14.1 The multiple linear regression model In simple linear regression, the response variable y is expressed in
More informationPredicting the Electricity Demand Response via Data-driven Inverse Optimization
Predicting the Electricity Demand Response via Data-driven Inverse Optimization Workshop on Demand Response and Energy Storage Modeling Zagreb, Croatia Juan M. Morales 1 1 Department of Applied Mathematics,
More informationDAILY QUESTIONS 28 TH JUNE 18 REASONING - CALENDAR
DAILY QUESTIONS 28 TH JUNE 18 REASONING - CALENDAR LEAP AND NON-LEAP YEAR *A non-leap year has 365 days whereas a leap year has 366 days. (as February has 29 days). *Every year which is divisible by 4
More informationLecture 24 May 30, 2018
Stats 3C: Theory of Statistics Spring 28 Lecture 24 May 3, 28 Prof. Emmanuel Candes Scribe: Martin J. Zhang, Jun Yan, Can Wang, and E. Candes Outline Agenda: High-dimensional Statistical Estimation. Lasso
More informationGradient Descent. Ryan Tibshirani Convex Optimization /36-725
Gradient Descent Ryan Tibshirani Convex Optimization 10-725/36-725 Last time: canonical convex programs Linear program (LP): takes the form min x subject to c T x Gx h Ax = b Quadratic program (QP): like
More informationSTAT 462-Computational Data Analysis
STAT 462-Computational Data Analysis Chapter 5- Part 2 Nasser Sadeghkhani a.sadeghkhani@queensu.ca October 2017 1 / 27 Outline Shrinkage Methods 1. Ridge Regression 2. Lasso Dimension Reduction Methods
More information3. If a forecast is too high when compared to an actual outcome, will that forecast error be positive or negative?
1. Does a moving average forecast become more or less responsive to changes in a data series when more data points are included in the average? 2. Does an exponential smoothing forecast become more or
More informationEVALUATION OF ALGORITHM PERFORMANCE 2012/13 GAS YEAR SCALING FACTOR AND WEATHER CORRECTION FACTOR
EVALUATION OF ALGORITHM PERFORMANCE /3 GAS YEAR SCALING FACTOR AND WEATHER CORRECTION FACTOR. Background The annual gas year algorithm performance evaluation normally considers three sources of information
More informationAn Introduction to Sparse Approximation
An Introduction to Sparse Approximation Anna C. Gilbert Department of Mathematics University of Michigan Basic image/signal/data compression: transform coding Approximate signals sparsely Compress images,
More informationStatistics 300B Winter 2018 Final Exam Due 24 Hours after receiving it
Statistics 300B Winter 08 Final Exam Due 4 Hours after receiving it Directions: This test is open book and open internet, but must be done without consulting other students. Any consultation of other students
More informationElectric Load Forecasting Using Wavelet Transform and Extreme Learning Machine
Electric Load Forecasting Using Wavelet Transform and Extreme Learning Machine Song Li 1, Peng Wang 1 and Lalit Goel 1 1 School of Electrical and Electronic Engineering Nanyang Technological University
More information1-bit Matrix Completion. PAC-Bayes and Variational Approximation
: PAC-Bayes and Variational Approximation (with P. Alquier) PhD Supervisor: N. Chopin Bayes In Paris, 5 January 2017 (Happy New Year!) Various Topics covered Matrix Completion PAC-Bayesian Estimation Variational
More informationIJESRT. Scientific Journal Impact Factor: (ISRA), Impact Factor: [Gupta et al., 3(5): May, 2014] ISSN:
IJESRT INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY Short Term Weather -Dependent Load Forecasting using Fuzzy Logic Technique Monika Gupta Assistant Professor, Marwadi Education
More informationFrom Sales to Peak, Getting It Right Long-Term Demand Forecasting
From Sales to Peak, Getting It Right Long-Term Demand Forecasting 12 th Annual Energy Forecasters Meeting Las Vegas, NV April 2 April 3, 2014 Terry Baxter, NV Energy Manager, Forecasting Getting the Peak
More informationConfidence Intervals for Low-dimensional Parameters with High-dimensional Data
Confidence Intervals for Low-dimensional Parameters with High-dimensional Data Cun-Hui Zhang and Stephanie S. Zhang Rutgers University and Columbia University September 14, 2012 Outline Introduction Methodology
More informationLearning with L q<1 vs L 1 -norm regularisation with exponentially many irrelevant features
Learning with L q
More informationHigh-dimensional statistics: Some progress and challenges ahead
High-dimensional statistics: Some progress and challenges ahead Martin Wainwright UC Berkeley Departments of Statistics, and EECS University College, London Master Class: Lecture Joint work with: Alekh
More informationApplication and verification of the ECMWF products Report 2007
Application and verification of the ECMWF products Report 2007 National Meteorological Administration Romania 1. Summary of major highlights The medium range forecast activity within the National Meteorological
More informationThe Adaptive Lasso and Its Oracle Properties Hui Zou (2006), JASA
The Adaptive Lasso and Its Oracle Properties Hui Zou (2006), JASA Presented by Dongjun Chung March 12, 2010 Introduction Definition Oracle Properties Computations Relationship: Nonnegative Garrote Extensions:
More informationSOLVING NON-CONVEX LASSO TYPE PROBLEMS WITH DC PROGRAMMING. Gilles Gasso, Alain Rakotomamonjy and Stéphane Canu
SOLVING NON-CONVEX LASSO TYPE PROBLEMS WITH DC PROGRAMMING Gilles Gasso, Alain Rakotomamonjy and Stéphane Canu LITIS - EA 48 - INSA/Universite de Rouen Avenue de l Université - 768 Saint-Etienne du Rouvray
More informationMulti-Plant Photovoltaic Energy Forecasting Challenge with Regression Tree Ensembles and Hourly Average Forecasts
Multi-Plant Photovoltaic Energy Forecasting Challenge with Regression Tree Ensembles and Hourly Average Forecasts Kathrin Bujna 1 and Martin Wistuba 2 1 Paderborn University 2 IBM Research Ireland Abstract.
More informationAsymptotic Equivalence of Regularization Methods in Thresholded Parameter Space
Asymptotic Equivalence of Regularization Methods in Thresholded Parameter Space Jinchi Lv Data Sciences and Operations Department Marshall School of Business University of Southern California http://bcf.usc.edu/
More informationVarious types of likelihood
Various types of likelihood 1. likelihood, marginal likelihood, conditional likelihood, profile likelihood, adjusted profile likelihood 2. semi-parametric likelihood, partial likelihood 3. empirical likelihood,
More informationA Blockwise Descent Algorithm for Group-penalized Multiresponse and Multinomial Regression
A Blockwise Descent Algorithm for Group-penalized Multiresponse and Multinomial Regression Noah Simon Jerome Friedman Trevor Hastie November 5, 013 Abstract In this paper we purpose a blockwise descent
More informationChampaign-Urbana 2000 Annual Weather Summary
Champaign-Urbana 2000 Annual Weather Summary ILLINOIS STATE WATER SURVEY 2204 Griffith Dr. Champaign, IL 61820 wxobsrvr@sws.uiuc.edu Maria Peters, Weather Observer January: January started on a mild note,
More informationOnline Learning and Sequential Decision Making
Online Learning and Sequential Decision Making Emilie Kaufmann CNRS & CRIStAL, Inria SequeL, emilie.kaufmann@univ-lille.fr Research School, ENS Lyon, Novembre 12-13th 2018 Emilie Kaufmann Online Learning
More informationZONAL AND REGIONAL LOAD FORECASTING IN THE NEW ENGLAND WHOLESALE ELECTRICITY MARKET: A SEMIPARAMETRIC REGRESSION APPROACH
ZONAL AND REGIONAL LOAD FORECASTING IN THE NEW ENGLAND WHOLESALE ELECTRICITY MARKET: A SEMIPARAMETRIC REGRESSION APPROACH Jonathan T. Farland and Bernard J. Morzuch * Abstract We use a regression-based
More informationNonconcave Penalized Likelihood with A Diverging Number of Parameters
Nonconcave Penalized Likelihood with A Diverging Number of Parameters Jianqing Fan and Heng Peng Presenter: Jiale Xu March 12, 2010 Jianqing Fan and Heng Peng Presenter: JialeNonconcave Xu () Penalized
More informationPredicting wildfire ignitions, escapes, and large fire activity using Predictive Service s 7-Day Fire Potential Outlook in the western USA
http://dx.doi.org/10.14195/978-989-26-0884-6_135 Chapter 4 - Fire Risk Assessment and Climate Change Predicting wildfire ignitions, escapes, and large fire activity using Predictive Service s 7-Day Fire
More informationFORECAST ACCURACY REPORT 2017 FOR THE 2016 NATIONAL ELECTRICITY FORECASTING REPORT
FORECAST ACCURACY REPORT 2017 FOR THE 2016 NATIONAL ELECTRICITY FORECASTING REPORT Published: November 2017 Purpose The National Electricity Rules (Rules) require AEMO to report to the Reliability Panel
More informationThe retrieval of the atmospheric humidity parameters from NOAA/AMSU data for winter season.
The retrieval of the atmospheric humidity parameters from NOAA/AMSU data for winter season. Izabela Dyras, Bożena Łapeta, Danuta Serafin-Rek Satellite Research Department, Institute of Meteorology and
More informationHigh-dimensional regression
High-dimensional regression Advanced Methods for Data Analysis 36-402/36-608) Spring 2014 1 Back to linear regression 1.1 Shortcomings Suppose that we are given outcome measurements y 1,... y n R, and
More information