When One Size No Longer Fits All Electric Load Forecasting with a Geographic Hierarchy WHITE PAPER

Similar documents
DRIVING ROI. The Business Case for Advanced Weather Solutions for the Energy Market

Integrated Electricity Demand and Price Forecasting

SHORT TERM LOAD FORECASTING

About Nnergix +2, More than 2,5 GW forecasted. Forecasting in 5 countries. 4 predictive technologies. More than power facilities

MetConsole AWOS. (Automated Weather Observation System) Make the most of your energy SM

Use of Normals in Load Forecasting at National Grid

The Vaisala AUTOSONDE AS41 OPERATIONAL EFFICIENCY AND RELIABILITY TO A TOTALLY NEW LEVEL.

Defining Normal Weather for Energy and Peak Normalization

Put the Weather to Work for Your Company

Components for Accurate Forecasting & Continuous Forecast Improvement

Chapter 13: Forecasting

Analyzing the effect of Weather on Uber Ridership

Holiday Demand Forecasting in the Electric Utility Industry

AESO Load Forecast Application for Demand Side Participation. Eligibility Working Group September 26, 2017

2006 & 2007 Pre-Hurricane Scenario Analyses

SMART GRID FORECASTING

TRAITS to put you on the map

Assistant Prof. Abed Schokry. Operations and Productions Management. First Semester

Effect of Weather on Uber Ridership

wind power forecasts

Report on System-Level Estimation of Demand Response Program Impact

ArcGIS. for Server. Understanding our World

Introduction to Weather Data Cleaning

Crime Analysis. GIS Solutions for Intelligence-Led Policing

Integration of Behind-the-Meter PV Fleet Forecasts into Utility Grid System Operations

elgian energ imports are managed using forecasting software to increase overall network e 칁 cienc.

Electricity Demand Probabilistic Forecasting With Quantile Regression Averaging

I N T R O D U C T I O N : G R O W I N G I T C O M P L E X I T Y

The North American Electric Reliability Corporation hereby submits Informational Filing of the North American Electric Reliability Corporation.

Design of a Weather- Normalization Forecasting Model

CHARTING SPATIAL BUSINESS TRANSFORMATION

A Unified Framework for Defining and Measuring Flexibility in Power System

Renewables and the Smart Grid. Trip Doggett President & CEO Electric Reliability Council of Texas

CustomWeather Statistical Forecasting (MOS)

Map your way to deeper insights

A Unified Framework for Near-term and Short-term System Load Forecasting

Systems Operations. PRAMOD JAIN, Ph.D. Consultant, USAID Power the Future. Astana, September, /6/2018

Predicting the Electricity Demand Response via Data-driven Inverse Optimization

Exposure Disaggregation: Introduction. By Alissa Le Mon

ANN and Statistical Theory Based Forecasting and Analysis of Power System Variables

WIND INTEGRATION IN ELECTRICITY GRIDS WORK PACKAGE 3: SIMULATION USING HISTORICAL WIND DATA

California Independent System Operator (CAISO) Challenges and Solutions

POWER SYSTEM OPERATING PROCEDURE LOAD FORECASTING

WEATHER NORMALIZATION METHODS AND ISSUES. Stuart McMenamin Mark Quan David Simons

Machine Learning Using Machine Learning to Improve Demand Response Forecasts

The SAB Medium Term Sales Forecasting System : From Data to Planning Information. Kenneth Carden SAB : Beer Division Planning

No. of Days. Building 3D cities Using Esri City Engine ,859. Creating & Analyzing Surfaces Using ArcGIS Spatial Analyst 1 7 3,139

Regional Wind Vulnerability. Extratropical Cyclones Differ from Tropical Cyclones in Ways That Matter

Impact on Agriculture

FORECASTING OF WIND GENERATION The wind power of tomorrow on your screen today!

A Multistage Modeling Strategy for Demand Forecasting

Normalization of Peak Demand for an Electric Utility using PROC MODEL

How Accurate is My Forecast?

Predicting Future Energy Consumption CS229 Project Report

AWOS. Automated Weather Observing Systems COASTAL

Geospatial natural disaster management

How to shape future met-services: a seamless perspective

At the Midpoint of the 2008

GIS Capability Maturity Assessment: How is Your Organization Doing?

PUBLIC SAFETY POWER SHUTOFF POLICIES AND PROCEDURES

AUTOMATED METERED WATER CONSUMPTION ANALYSIS

Member Level Forecasting Using SAS Enterprise Guide and SAS Forecast Studio

DI CHEM ANALYTICS FOR CHEMICAL MANUFACTURERS

Forecasting demand in the National Electricity Market. October 2017

Commercialisation. Lessons learned from Dutch weather market

Enhancements and Validation of a Demand Forecast Tool for South Australian Water Corporation

POWER SYSTEM OPERATING PROCEDURE LOAD FORECASTING

URD Cable Fault Prediction Model

Data Aggregation with InfraWorks and ArcGIS for Visualization, Analysis, and Planning

Putting the U.S. Geospatial Services Industry On the Map

Integration for Informed Decision Making

Short Term Load Forecasting Using Multi Layer Perceptron

Advance signals that can be used to foresee demand response days PJM SODRTF - March 9, 2018

Recent US Wind Integration Experience

A Fuzzy Logic Based Short Term Load Forecast for the Holidays

Oregon Department of Transportation. Geographic Information Systems Strategic Plan

The AIR Severe Thunderstorm Model for the United States

2006 IRP Technical Workshop Load Forecasting Tuesday, January 24, :00 am 3:30 pm (Pacific) Meeting Summary

STATISTICAL LOAD MODELING

ASSESSMENT. Industry Solutions Harness the Power of GIS for Property Assessment

HSC Research Report. Electric load forecasting with recency effect: A big data approach. Pu Wang 1 Bidong Liu 2 Tao Hong 2. SAS - R&D, Cary, NC, USA 2

SYMBIOSIS CENTRE FOR DISTANCE LEARNING (SCDL) Subject: production and operations management

Integration of WindSim s Forecasting Module into an Existing Multi-Asset Forecasting Framework

Short-Term Demand Forecasting Methodology for Scheduling and Dispatch

FirstStreet.org. Rising Seas Swallow $403 Million in New England Home Values For Immediate Release: January 22, 2019

FORECAST ACCURACY REPORT 2017 FOR THE 2016 NATIONAL ELECTRICITY FORECASTING REPORT

Digitalization in Shipping

Let's Make Our Cities Breathable BECAUSE CLEAN AIR BELONGS TO EVERYONE.

The Road to Improving your GIS Data. An ebook by Geo-Comm, Inc.

For the IT professional Information management solutions IBM Spatial Solutions

Are You Maximizing The Value Of All Your Data?

Jean-Marie Painvin Chairman & Chief Executive Officer

Rheological Testing Equipment for Polymers

Better Topographic Map Production Using ArcGIS. A Comprehensive Solution for Mapping Organizations

RESERVE LEVEL DECLARATION GUIDELINES

Automatic Singular Spectrum Analysis and Forecasting Michele Trovero, Michael Leonard, and Bruce Elsheimer SAS Institute Inc.

Gorge Area Demand Forecast. Prepared for: Green Mountain Power Corporation 163 Acorn Lane Colchester, Vermont Prepared by:

MineScape Geological modelling, mine planning and design

MeteoGroup RoadMaster. The world s leading winter road weather solution

Multi-Plant Photovoltaic Energy Forecasting Challenge: Second place solution

Transcription:

When One Size No Longer Fits All Electric Load Forecasting with a Geographic Hierarchy WHITE PAPER

SAS White Paper Table of Contents Executive Summary.... 1 Electric Load Forecasting Challenges... 1 Selected Forecasting Approach.... 1 Independent System Operator of New England (ISO-NE).... 2 Three Challenges, Three Steps... 3 Challenge 1: Examine Nonseparable Load.... 3 Challenge 2: Examine Separable Load in a State Where Load Is Split into Multiple Time Series.... 5 Challenge 3: Examine Separable Load for a Different Historical Range (Applying a Regional Grouping Technique) and Examine Separable Load at the System Level.................................... 7 Conclusions.... 13 Looking Ahead.... 13 For More Information... 14 This paper was written by Sen-Hao Lai and Tao Hong, with contributions from Steve Becker, Michael Beckmann and Jingrui (Rain) Xie. Sen-Hao Lai is a Senior Analytical Consultant at SAS where he is part of the global forecasting solutions practice. His expertise is in deploying forecasting and other analytical solutions. He is a solution architect and developer for SAS Energy Forecasting, and he is involved with a number of energy forecasting customers. Tao Hong is an Industry Consultant at SAS where he leads forecasting for the energy business unit. He has provided consulting services to more than a hundred utility industry organizations worldwide. He is chair of the IEEE Working Group on Energy Forecasting. He is also general chair of the Global Energy Forecasting Competition (GEFCom2012) and guest editor-in-chief of IEEE Transactions on Smart Grid Special Issue on Analytics for Energy Forecasting with Applications to the Smart Grid. Contributors are: Steve Becker, a Senior Consulting Manager and a leader in SAS Professional Services Global Services Factory team; Michael Beckmann, a Senior Technical Architect in the Global Services Factory who supports SAS Energy Forecasting; and Jingrui (Rain) Xie, an Associate Analytical Consultant at SAS who brings expertise in statistical analysis and forecasting. NOTE: The combination of our analysis with SAS Energy Forecasting and JMP models led to the forecasts presented in this paper. Our purpose was to test the impact of geographic hierarchies on forecast accuracy. We did not seek to produce the best possible MAPE.

When One Size No Longer Fits All Electric Load Forecasting with a Geographic Hierarchy Executive Summary With the deployment of smart grid technologies, many utilities can now take advantage of hourly or sub-hourly data from millions of smart meters. There are many upsides to this such as the fact that utilities can potentially charge customers different rates based on the time of day they use electricity. However, there are downsides as well: Many forecasting methodologies are outdated. The days of one-size-fits-all models are gone for the utility forecaster. This paper tackles these considerations through an electric load-forecasting case study. In particular, the paper: Investigates how a number of approaches using geographic hierarchy and weather station data can improve the predictive analytics used to determine future electric usage. Demonstrates why using geographic hierarchies is now imperative for utilities. It also illustrates why utility forecasters must be equipped with solutions that allow them to retrain models multiple times each year. Electric Load Forecasting Challenges To forecast electric load accurately, models must include the attributes of trend, yearly and daily seasonality, as well as exogenous effects such as temperature and day of week. The models also need to account for any effects caused by interaction among the variables. Several modeling approaches in the utilities industry such as neural networks, regression and support vector machines take advantage of these attributes and effects. One method for improving forecast accuracy is to refine such models; another is to improve the model inputs, namely temperature and load. Selected Forecasting Approach To improve forecast accuracy in this case study, SAS used multiple temperatures and incorporated geographic hierarchy. Our question was: How can we take advantage of geographic hierarchies and weather station data to improve load forecast accuracy? To conduct the investigation, we incorporated two types of load data: nonseparable and separable. Two Types of Load Data Nonseparable: Load data aggregated into a single time series; at lowest level of geographic hierarchy. Example in our study: West Central Massachusetts. Separable: Load data split into multiple time series; at middle or top level of geographic hierarchy. Examples in our study: Massachusetts and ISO-NE. 1

SAS White Paper We conducted this project in three steps, represented by three challenges: Challenge 1 Bottom level: Examine nonseparable load. Challenge 2 Middle level: Examine separable load in a state where load can be split into multiple time series. Challenge 3 Top level: Examine separable load for a different historical range (applying a region grouping technique) and examine separable load at the system level. To conduct our investigation, we used SAS Energy Forecasting and JMP as well as publicly available system-level and region-level hourly load data from our case study utility, the Independent System Operator of New England (ISO-NE). We also purchased temperature data from a number of weather stations through WeatherBank.com. Independent System Operator of New England (ISO-NE) The ISO-NE includes 6.5 million households and businesses, with a population of 14 million. The utility s all-time peak demand was 28,130 megawatts. The ISO-NE has more than 350 generators and 32,000 megawatts of generating capacity. It includes six state regions: Maine (ME), Vermont (VT), Massachusetts (MA), New Hampshire (NH), Connecticut (CT) and Rhode Island (RI). Massachusetts is split into three regions: Northeast Massachusetts Boston (NEMASSBOST), Southeast Massachusetts (SEMASS) and West Central Massachusetts (WCMASS). The forecast inputs in this case study included four years of historical hourly demand from each ISO-NE region. An additional year of hourly demand was set aside as the out-of-sample range used in model testing. We assigned one weather station to each region. For West Central Massachusetts, we purchased temperature data from five regional weather stations to support the investigation at the lowest level of the hierarchy. 2

When One Size No Longer Fits All Electric Load Forecasting with a Geographic Hierarchy ISO-NE ME VT MA NH CT RI Six State Regions NEMASSBOST SEMASS WCMASS Three Massachusetts Regions KBAF KAQW KORH Five Weather Stations KCEF KPSF Figure 1: Organization of the ISO-NE region. Three Challenges, Three Steps As part of our investigation, our challenge was to address three main issues. We set up a three-step approach to find answers to these questions: 1. Does taking the average temperature from multiple weather stations in the region provide a better forecast accuracy than the temperature taken from a single station? 2. For a single, statewide load forecast, should separate region forecasts be summed or should the temperature stations be averaged? 3. Can similar regions be grouped together to get a better forecast accuracy? Will the same business rule from the second challenge work for a different historical range? Challenge 1: Examine Nonseparable Load In this step, we wanted to determine whether taking the average temperature from multiple regional weather stations would provide better forecast accuracy than using the temperature from a single station. To answer that question, we used temperature data from five weather stations in the West Central Massachusetts region: Barnes Municipal (KBAF), Harriman and West (KAQW), Worcester Regional (KORH), Westover Air Reserve Base/Metropolitan Airport (KCEF), and Pittsfield Municipal (KPSF). 3

SAS White Paper We chose these weather stations to cover the variations in temperature that occur in different parts of the West Central Massachusetts region. ISO-NE ME VT MA NH CT RI Six State Regions NEMASSBOST SEMASS WCMASS Three Massachusetts Regions KBAF KAQW KORH Five Weather Stations KCEF KPSF Figure 2: ISO-NE weather stations used in our analysis. Using this data, we examined two scenarios. Scenario 1: Forecast the entire West Central Massachusetts region using only temperatures from the KORH weather station (WCMASS_KORH). Scenario 2: Forecast the entire West Central Massachusetts region using the average temperature from the five weather stations (WCMASS_WCMASS). Figure 3 shows the hourly forecast MAPE by scenario, using 2011 as the out-ofsample range. As shown here, the forecast accuracy is improved by using the average temperature from the five weather stations. Figure 3: In the West Central Massachusetts region, forecast accuracy improved by using the average temperature from five weather stations. 4

When One Size No Longer Fits All Electric Load Forecasting with a Geographic Hierarchy Based on the nonseparable load of West Central Massachusetts, we concluded that taking the average temperature from multiple weather stations gives a better forecast than taking the temperature from only one weather station. Challenge 2: Examine Separable Load in a State Where Load Is Split into Multiple Time Series In this step, we wanted to determine whether separate regional forecasts should be summed or whether the temperature stations should be averaged for a single, statewide load forecast. Here, the region we studied was the entire state of Massachusetts. This data includes the sum of loads from Northeast Massachusetts Boston, Southeast Massachusetts and West Central Massachusetts. ISO-NE ME VT MA NH CT RI Six State Regions NEMASSBOST SEMASS WCMASS Three Massachusetts Regions KBAF KAQW KORH Five Weather Stations KCEF KPSF Figure 4: Organization of the state of Massachusetts. To support this analysis, we used data from Worcester Regional (KORH), Green State (KPVD) and Logan Airport (KBOS) weather stations. We chose these weather stations to cover the temperature variations that occur in the three regions that make up Massachusetts. Using this data, we examined five scenarios. Scenario 1: Sum the separate forecasts of three regions, using one weather station for each region (MASS_SUM). Scenario 2: Forecast the entire state of Massachusetts, using only the KBOS weather station (MASS_KBOS). 5

SAS White Paper Scenario 3: Forecast the entire state of Massachusetts, using the average temperature from the three weather stations (MASS_KMASS). Scenario 4: Forecast the entire state of Massachusetts, using the temperature weighted by load from the three regions. Here, we used the sum of load over the historical range as the weight of a given region (MASS_WLOADSUM). Scenario 5: Forecast the entire state of Massachusetts, using the temperature weighted by economic output from the three regions. Here, we used the sum of economic output over the historical range as the weight of a given region (MASS_WECONSUM). Figure 5 shows the hourly forecast MAPE by scenario, using 2011 as the out-of-sample range. It shows that forecast accuracy improves by using the average temperature from the three weather stations. The forecast using only the KBOS weather station was the least accurate. We expected this result, because Logan Airport alone cannot provide a good representation of weather behavior for the entire state of Massachusetts. Interestingly, the temperatures weighted by load and economic output did not provide as accurate a forecast as the average temperature from the three weather stations. This indicated that creating a weighted temperature from load or economic output does not necessarily provide a better representation of weather than taking a simple average of temperatures. What was surprising was that the sum of separate forecasts from the three regions, using one weather station for each region, did not provide the best forecast accuracy. To understand why, we needed to conduct another experiment (shown in the third challenge). Figure 5: Hourly forecast MAPE by scenario shows that forecast accuracy improves by using the average temperature from the three weather stations. 6

When One Size No Longer Fits All Electric Load Forecasting with a Geographic Hierarchy Our initial conclusion (based on the separable load of Massachusetts) was that taking the average temperature from multiple weather stations seemed to provide a better forecast than summing the separate forecasts of the regions. It was difficult to determine if this is because averaging helps to cancel out errors in temperature readings, or if it s because the historical range chosen for the analysis can have a noticeable influence on which modeling approach comes out better. What could be surmised is that weather across a state like Massachusetts is better represented by multiple weather stations spread across the area. The data from a single weather station like KBOS cannot provide as good a representation of weather across the entire state. This last observation was in line with our first challenge, where we examined nonseparable load. Next, we examined separable load for a different historical range and for a higher level in the geographic hierarchy. Challenge 3: Examine Separable Load for a Different Historical Range (Applying a Regional Grouping Technique) and Examine Separable Load at the System Level In this step, we wanted to answer two main questions: Can similar regions be grouped together to get a better forecast accuracy? Will the same business rule from the second challenge work for a different historical range? At first, we studied the state of Massachusetts, whose load is the sum of loads from Northeast Massachusetts Boston, Southeast Massachusetts and West Central Massachusetts. Then, we applied the same questions to ISO-NE, whose load is the sum of loads from eight regions. ISO-NE ME VT MA NH CT RI Six State Regions NEMASSBOST SEMASS WCMASS Three Massachusetts Regions KBAF KAQW KORH Five Weather Stations KCEF KPSF Figure 6: Organization of the ISO-NE, system-level. 7

SAS White Paper Grouping regions together into one super-region provides the same advantage illustrated in the first challenge (i.e., average temperature from multiple weather stations gives a better forecast). In certain cases, the load and temperature profiles between adjacent regions are similar enough that averaging data from multiple weather stations within the super-region might help to cancel out temperature reading errors. In other cases, the load and temperature profiles between regions are dissimilar, so the average temperature from those regions may not provide an appropriate representation of weather. Our investigation tested these possibilities for the ISO-NE regions by first applying a technique for identifying similar regions. There are several methods for determining if regions can be considered similar. For example, JMP allows us to compare load versus temperature contours among different regions. The following chart shows that Connecticut has the highest load among the eight regions. It also shows that load-versus-temperature profiles for certain regions overlap, indicating that demand for electricity can be similar between different regions. ISO-NE Region Loads vs Temperature Contour Load (MVV) Temperature (F) Figure 7: ISO-NE regional loads versus temperature contour. 8

When One Size No Longer Fits All Electric Load Forecasting with a Geographic Hierarchy SAS/ETS software has a procedure called PROC SIMILARITY that determines if two time series are similar. It computes the distance between a given time series and other series. In our investigation, we used PROC SIMILARITY to determine if any two load profiles or any two temperature profiles were similar. The distance threshold at which two time series are considered similar is adjustable. By increasing the threshold, more pairs of time series can be flagged as similar. By decreasing the threshold, fewer pairs can be flagged as similar. The following tables show which pairs of load and temperature profiles among the ISO-NE regions were flagged as similar after adjusting the thresholds and finding a balance. The regions flagged as similar are indicated by the number 1. Figure 8: Regions with similar loads using PROC SIMILARITY (threshold=2,000,000,000). Figure 9: Regions with similar temperature using PROC SIMILARITY (threshold=300,000). The following chart from JMP shows a side-by-side comparison of the load-versustemperature profiles of the eight regions. A visual analysis of the chart confirmed the results from PROC SIMILARITY. As in the previous tables, it illustrates that Maine and New Hampshire are similar. It also illustrates that Southeast Massachusetts and West Central Massachusetts are similar in load-versus-temperature profile. 9

SAS White Paper ISO-NE Region Loads vs Temperature Profiles Load (MVV) Temperature (F) Figure 10: ISO-NE regional load versus temperature profiles. We used the technique just described to identify similar regions, and we employed the following criteria for grouping regions into a super-region: Regions must be adjacent. Load profiles are similar from PROC SIMILARITY. Temperature profiles are similar from PROC SIMILARITY. The only super-regions that met these criteria were: A super-region including Southeast Massachusetts and West Central Massachusetts. A super-region including Maine and New Hampshire. With these new super-regions and a different historical period, we examined three scenarios for Massachusetts using 2008 as the out-of-sample range: Scenario 1: Sum the separate forecasts of three regions, using one weather station for each region (MASS_SUM2008). Scenario 2: Forecast the entire state of Massachusetts, using the average temperature from the three weather stations (MASS_KMASS2008). Scenario 3: Forecast the entire state of Massachusetts, with Southeast Massachusetts and West Central Massachusetts grouped as a single super-region and Northeast Massachusetts Boston kept as a separate region. For the superregion, we based the forecast on the average temperature from KPVD and KORH (MASS_SEWC2008). 10

When One Size No Longer Fits All Electric Load Forecasting with a Geographic Hierarchy Figure 11 shows the hourly forecast MAPE by scenario, using 2008 as the out-ofsample range. As shown here, summing the individual forecasts of the three regions provided a better accuracy than the other two approaches. The result contradicted what we saw in the second challenge using 2011 as the out-of-sample range, where averaging the temperatures from the three regions provided the better forecast. Our new results indicated that the best approach depends on the historical range used for the analysis. Our results also showed that grouping similar regions together provided a forecast that was almost as good as summing the individual forecasts; however, it did not provide the best forecast. This indicated that Southeast Massachusetts and West Central Massachusetts were too dissimilar in load and temperature behavior to be accurately represented by an average temperature. Figure 11: Hourly forecast MAPE by scenario for super-regions provides better accuracy. Our first conclusion for separable load was that forecast accuracy can be close when comparing a summed forecast across different regions and a combined forecast with similar regions grouped into a super-region. In our investigation, the regions were too dissimilar for an average temperature to represent the weather across the entire super-region. Our second conclusion, based on the separable load of Massachusetts, was that the best method depends on the historical data range. In certain cases, taking the average temperature of multiple weather stations provides a better forecast than summing the separate forecasts of the regions. In other cases, the opposite may be true. This means that the historical range chosen for the analysis can have a noticeable influence on which modeling approach works best. Third, and more importantly, a utility forecaster must retest the methodology for choosing the input temperature on a regular basis and retrain models multiple times each year. The forecaster cannot assume that one methodology will provide the best forecast from one year to the next. The final phase of our investigation involved examining separable load at the system level. In this phase, we had two questions to answer: Can grouping help for a large area like the eight ISO-NE regions? Are the conclusions for Massachusetts applicable to the systemwide separable load of ISO-NE? 11

SAS White Paper To answer these questions, we chose one weather station for each of the eight regions. The stations included Logan Airport (KBOS), Green State (KPVD), Worcester Regional (KORH), Bradley International (KBDL), Portland International (KPWM), Concord Municipal (KCON) and Burlington International (KBTV). We chose stations that were located within or close to the assigned region. For Rhode Island and Southeast Massachusetts, KPVD was shared between the two regions due to its proximity to both. Using the same super-regions defined from the grouping criteria above and the assigned weather stations, we examined four scenarios for ISO-NE using 2008 as the out-of-sample range: Scenario 1: Forecast the entire ISO-NE using the sum of six separate forecasts, with Southeast Massachusetts and West Central Massachusetts grouped as a single super-region, and Maine and New Hampshire grouped as a single superregion. For each super-region, we based the forecast on the average temperature from weather stations assigned to those regions (NEISO_SUM6). Scenario 2: Forecast the entire ISO-NE, using only the weighted temperature provided by ISO-NE (NEISO_NEISOW). Scenario 3: Forecast the entire ISO-NE, using the simple average temperature from weather stations assigned to each region (NEISO_NEISOA). Scenario 4: Sum the separate forecasts of eight regions, using one weather station for each region (NEISO_NEISOS). Figure 12 lists the hourly forecast MAPE by scenario, using 2008 as the out-of-sample range. Summing the individual forecasts of the eight regions provided a better accuracy than the other approaches. This result showed that grouping similar regions together provided a forecast that was almost as good as summing the individual forecasts; however, it did not provide the best forecast. This indicated that the grouped regions were too dissimilar in load and temperature behavior to be accurately represented by an average temperature. Also, the results show that using a simple average or weighted temperature produced significantly worse forecasts than the other approaches. This indicated that a single temperature cannot provide a good representation of weather across the eight ISO-NE regions. Figure 12: Hourly forecast MAPE by scenario reveals that summing the individual forecasts of the eight regions provides better accuracy than the other approaches. 12

When One Size No Longer Fits All Electric Load Forecasting with a Geographic Hierarchy Conclusions The results of our three challenges, based on using ISO-NE as the case study, led to a number of interesting conclusions: For nonseparable load, or load data at the lowest level of the geographic hierarchy that has been aggregated into a single time series, the average of temperatures from multiple weather stations in a given area provides a better representation of weather than a single station. Also, averaging temperatures from multiple weather stations helps to cancel out errors in temperature readings from individual weather stations. For separable load, or load data at the middle or top level of the geographic hierarchy that can be split into multiple time series, the best approach depends on the historical range under examination. The historical range chosen for the analysis can have a noticeable influence on which modeling approach comes out better. More importantly, utility forecasters must retest the methodology for choosing the input temperature on a regular basis and must retrain models multiple times each year. The other conclusion for separable load is that forecast accuracy can be close when comparing a summed forecast across different regions with a combined forecast that uses similar regions grouped into a super-region. In this investigation, the regions were too dissimilar for an average temperature to represent the weather across multiple regions. This case study clearly illustrates why the days of one-size-fits-all models are gone for the utility forecaster. Looking Ahead Forecasters cannot assume that one methodology will provide the best forecast from one year to the next. Relying on the status quo approach can result in forecasts that are less accurate and can lead to adverse financial or operational consequences such as over- or under-investment in new facilities, suboptimal energy trading decisions or suboptimal rate cases. To improve forecast performance, reduce uncertainties and generate value in the new data-intensive environment, forecasters must be able to decide which models or combinations of models are best. In addition, they must be able to determine more indicators of the factors that affect load. To succeed, forecasters need solutions that enable them to: Define flexible hierarchies and models. Choose the level of forecast automation that s best for them, including automatically performing large-scale enterprise forecasting tasks. Produce repeatable, scalable, traceable and defensible energy forecasts. Get statistical and visual indication of the likely range of forecast outcomes. 13

SAS White Paper These solutions must adapt to the utility s infrastructure and environment. Only then will utilities be able to make the right decisions proactively while increasing forecast reliability and reducing financial and operational risk. For More Information Read more about our views on forecasting in the SAS white paper: How Does Forecasting Enhance Smart Grid Benefits? See: sas.com/reg/wp/corp/59140. Get more information about SAS Forecasting for utilities at: sas.com/industry/utilities/ demand-forecasting. Learn more about SAS Energy Forecasting at: sas.com/energy-forecast-server. 14

About SAS SAS is the leader in business analytics software and services, and the largest independent vendor in the business intelligence market. Through innovative solutions, SAS helps customers at more than 60,000 sites improve performance and deliver value by making better decisions faster. Since 1976 SAS has been giving customers around the world THE POWER TO KNOW. For more information on SAS Business Analytics software and services, visit sas.com. SAS Institute Inc. World Headquarters +1 919 677 8000 To contact your local SAS office, please visit: sas.com/offices SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries. indicates USA registration. Other brand and product names are trademarks of their respective companies. Copyright 2013, SAS Institute Inc. All rights reserved. 106226_S95998_0413