David John Gagne II, NCAR

Similar documents
A Community Gridded Atmospheric Forecast System for Calibrated Solar Irradiance

Retrieval of Cloud Top Pressure

A SOLAR AND WIND INTEGRATED FORECAST TOOL (SWIFT) DESIGNED FOR THE MANAGEMENT OF RENEWABLE ENERGY VARIABILITY ON HAWAIIAN GRID SYSTEMS

The document was not produced by the CAISO and therefore does not necessarily reflect its views or opinion.

Importance of Numerical Weather Prediction in Variable Renewable Energy Forecast

4.3. David E. Rudack*, Meteorological Development Laboratory Office of Science and Technology National Weather Service, NOAA 1.

Improving the accuracy of solar irradiance forecasts based on Numerical Weather Prediction

P1.34 MULTISEASONALVALIDATION OF GOES-BASED INSOLATION ESTIMATES. Jason A. Otkin*, Martha C. Anderson*, and John R. Mecikalski #

Techniques for Improving Wind to Power Conversion

2.5 ANALYZING THE EFFECTS OF LOW LEVEL BOUNDARIES ON TORNADOGENESIS THROUGH SPATIOTEMPORAL RELATIONAL DATA MINING

Multi-Plant Photovoltaic Energy Forecasting Challenge with Regression Tree Ensembles and Hourly Average Forecasts

National Center for Atmospheric Research Research Applications Laboratory Renewable Energy

Hierarchical models for the rainfall forecast DATA MINING APPROACH

Multi-Model Ensemble for day ahead PV power forecasting improvement

WMO Aeronautical Meteorology Scientific Conference 2017

Postprocessing of Numerical Weather Forecasts Using Online Seq. Using Online Sequential Extreme Learning Machines

Remote Ground based observations Merging Method For Visibility and Cloud Ceiling Assessment During the Night Using Data Mining Algorithms

Tom Durrant Frank Woodcock. Diana Greenslade

Machine learning comes from Bayesian decision theory in statistics. There we want to minimize the expected value of the loss function.

Solar Nowcasting with Cluster-based Detrending

2014 HIGHLIGHTS. SHC Task 46 is a five-year collaborative project with the IEA SolarPACES Programme and the IEA Photovoltaic Power Systems Programme.

An Adaptive Multi-Modeling Approach to Solar Nowcasting

Toward improved initial conditions for NCAR s real-time convection-allowing ensemble. Ryan Sobash, Glen Romine, Craig Schwartz, and Kate Fossell

Short and medium term solar irradiance and power forecasting given high penetration and a tropical environment

Electricity Demand Probabilistic Forecasting With Quantile Regression Averaging

Electrical and Computer Engineering Department University of Waterloo Canada

810 A Comparison of Turbine-based and Farm-based Methods for Converting Wind to Power

CustomWeather Statistical Forecasting (MOS)

Current best practice of uncertainty forecast for wind energy

Chart types and when to use them

Weather Forecasting: Lecture 2

2015 Springer International Publishing Switzerland

Zachary Holden - US Forest Service Region 1, Missoula MT Alan Swanson University of Montana Dept. of Geography David Affleck University of Montana

Creation of a 30 years-long high resolution homogenized solar radiation data set over the

Using an Artificial Neural Network to Predict Parameters for Frost Deposition on Iowa Bridgeways

Validation of operational NWP forecasts for global, diffuse and direct solar exposure over Australia

The future role of statistical post-processing in weather forecasting. Klaus Knüpffer Philip Lorenz

PREDICTING SOLAR GENERATION FROM WEATHER FORECASTS. Chenlin Wu Yuhan Lou

Speedwell High Resolution WRF Forecasts. Application

GL Garrad Hassan Short term power forecasts for large offshore wind turbine arrays

EXAM IN STATISTICAL MACHINE LEARNING STATISTISK MASKININLÄRNING

Holdout and Cross-Validation Methods Overfitting Avoidance

Probabilistic forecasting of solar radiation

Making Our Cities Safer: A Study In Neighbhorhood Crime Patterns

UniResearch Ltd, University of Bergen, Bergen, Norway WinSim Ltd., Tonsberg, Norway {catherine,

Discovery Through Situational Awareness

Data Analytics for Solar Energy Management

The Pennsylvania State University. The Graduate School. Department of Meteorology ARTIFICIAL INTELLIGENCE TECHNIQUES FOR SHORT-RANGE SOLAR

Climate services in support of the energy transformation

Introduction to Machine Learning and Cross-Validation

Assignment 3. Introduction to Machine Learning Prof. B. Ravindran

Modelling Wind Farm Data and the Short Term Prediction of Wind Speeds

Country scale solar irradiance forecasting for PV power trading

Forecasting AOSC 200 Tim Canty. Class Web Site: Lecture 26 Nov 29, Weather Forecasting

FORECAST OF ENSEMBLE POWER PRODUCTION BY GRID-CONNECTED PV SYSTEMS

Forecasting Solar Irradiance from Time Series with Exogenous Information

EXAM IN STATISTICAL MACHINE LEARNING STATISTISK MASKININLÄRNING

Calibration of short-range global radiation ensemble forecasts

Using artificial neural network for solar energy level predicting

ANSWER KEY. Part I: Synoptic Scale Composite Map. Lab 12 Answer Key. Explorations in Meteorology 54

What is one-month forecast guidance?

Machine Learning Approaches to Crop Yield Prediction and Climate Change Impact Assessment

Combining Deterministic and Probabilistic Methods to Produce Gridded Climatologies

Application and verification of the ECMWF products Report 2007

Understanding Travel Time to Airports in New York City Sierra Gentry Dominik Schunack

Improving Efficiency of PV Systems Using Statistical Performance Monitoring

FORECASTING: A REVIEW OF STATUS AND CHALLENGES. Eric Grimit and Kristin Larson 3TIER, Inc. Pacific Northwest Weather Workshop March 5-6, 2010

Solar Resource Mapping in South Africa

An Integrated Approach to the Prediction of Weather, Renewable Energy Generation and Energy Demand in Vermont

A Novel Method for Predicting the Power Output of Distributed Renewable Energy Resources

Using Multivariate Adaptive Constructed Analogs (MACA) data product for climate projections

An Operational Solar Forecast Model For PV Fleet Simulation. Richard Perez & Skip Dise Jim Schlemmer Sergey Kivalov Karl Hemker, Jr.

Real-time PBL analysis system using profilers observations from The New York State Mesonet

3TIER Global Solar Dataset: Methodology and Validation

Post-processing of solar irradiance forecasts from WRF Model at Reunion Island

NOTES AND CORRESPONDENCE. Improving Week-2 Forecasts with Multimodel Reforecast Ensembles

Essence of Machine Learning (and Deep Learning) Hoa M. Le Data Science Lab, HUST hoamle.github.io

Heat Load Forecasting of District Heating System Based on Numerical Weather Prediction Model

p = ρrt p = ρr d = T( q v ) dp dz = ρg

Radiative Climatology of the North Slope of Alaska and the Adjacent Arctic Ocean

WEATHER NORMALIZATION METHODS AND ISSUES. Stuart McMenamin Mark Quan David Simons

MxVision WeatherSentry Web Services Content Guide

The benefits and developments in ensemble wind forecasting

4.5 Comparison of weather data from the Remote Automated Weather Station network and the North American Regional Reanalysis

Assimilation of MSG visible and near-infrared reflectivity in KENDA/COSMO

Deep Convolutional Neural Networks for Pairwise Causality

Bucharest Romania.

J8.4 NOWCASTING OCEANIC CONVECTION FOR AVIATION USING RANDOM FOREST CLASSIFICATION

2. Fargo, North Dakota receives more snow than Charleston, South Carolina.

Internet Engineering Jacek Mazurkiewicz, PhD

AN ARTIFICIAL NEURAL NETWORK BASED APPROACH FOR ESTIMATING DIRECT NORMAL, DIFFUSE HORIZONTAL AND GLOBAL HORIZONTAL IRRADIANCES USING SATELLITE IMAGES

Predicting MTA Bus Arrival Times in New York City

The prediction of house price

USING GRIDDED MOS TECHNIQUES TO DERIVE SNOWFALL CLIMATOLOGIES

Gridded Localized Aviation MOS Program (LAMP) Guidance for Aviation Forecasting

Application and verification of ECMWF products 2012

No. 8/2018 ISSN Meteorology. METreport. StrålInn. Evaluation of Predicted Shortwave Radiation Åsmund Bakketun, Jørn Kristiansen

GIFTS SOUNDING RETRIEVAL ALGORITHM DEVELOPMENT

Data Mining und Maschinelles Lernen

The Relative Contributions of ECMWF Deterministic and Ensemble Forecasts in an Automated Consensus Forecasting System

Transcription:

The Performance Impacts of Machine Learning Design Choices for Gridded Solar Irradiance Forecasting Features work from Evaluating Statistical Learning Configurations for Gridded Solar Irradiance Forecasting, Solar Energy, Under Review. David John Gagne II, NCAR Sue Ellen Haupt, NCAR Amy McGovern, University of Oklahoma John Williams, The Weather Company, an IBM Business Seth Linden, NCAR Doug Nychka, NCAR 1

Motivation: Solar Irradiance Solar electricity generation continues to grow rapidly and decrease in cost Accurate solar irradiance predictions needed by electric utilities to balance supply with expected demand Solar power is being generated more at sites that do not have observations or historical records of irradiance Contributions Developed a Gridded Atmospheric Forecasting System (GRAFS) for solar irradiance Evaluated different machine learning model configurations for predictive accuracy at unobserved sites for day ahead solar irradiance forecasts Solar irradiance predictions are needed for sites without historical data (Source: http://www.adventurecats.org/cat-tales/maine-coon-deaf-sailors-ears-sea/) 2

Solar Forecasting Ingredients Position of sun in sky Scattering by atmosphere & aerosols Cloud cover effects Sun Position Cloud Properties Height Coverage Transparency Aerosols and Water Vapor (Turbidity) Panel Obstructions Precipitation Non-meteorological obstructions Precipitation Shading Panel Orientation Panel Temperature Solar factors diagram from Gagne (2014) 3

Solar Data NOAA Global Forecast System (GFS) Interpolated to 4 km grid 3 hourly output interpolated in time to hourly output Variables: Solar irradiance, temperature, cloud cover, sun angles, spatial statistics Evaluation Period: June-August 2015 Oklahoma Mesonet (McPherson et al. 2007) Sites record solar irradiance every 5 minutes with a Li-Cor pyranometer Hourly-averaged irradiance and clearness index computed from raw observations Clearness index: ratio of observed irradiance to topof-atmosphere irradiance 4

Machine Learning Configurations: Solar NWP Model Output Extract input variables at each site Calculate neighborhood statistics for each variable Training site data Match model and observation data Split sites into training and testing sets Oklahoma Mesonet 5-minute solar irradiance Calculate solar position and clear sky irradiance at each site and time Calculate clearness index and hourly means Testing site data Mesonet stations randomly split into training and testing sites Evaluation period split into training and testing days: every 3 rd day used for testing Models: Random Forest, Gradient Boosting, Lasso Linear Regression Multi Site Training One machine learning model fitted with all training sites data Applied at testing sites using input data collocated with site Single Site Training Separate machine learning models fitted at each training site Predictions made at training sites and interpolated to testing sites with Cressman interpolation (Cressman 1959) Similar to approach used by Gridded MOS (Glahn et al. 2009) Train machine learning models to predict clearness index Apply machine learning models at testing sites 5

Gradient Boosting Regression Irradiance >500? + 0.1 * Temperature>30? + 0.1 * Dewpoint>2? 0.1 0.8-0.1 0.3 0.05-0.03 Stagewise, additive decision tree ensemble Initial tree predicts exact value, subsequent trees predict residuals of total predictions from all previous trees Used by top 4 finishers of AMS Solar Energy Prediction Contest 6

Detailed Configuration Random Forest Default: 500 trees, min samples split 10, features=sqrt Short Trees: max depth 3 All Features: features = all Gradient Boosting Default: loss= lad, 500 trees, max depth 5, features = sqrt, learning rate=0.1 Least Squares: loss = ls Big Trees: min sample split = 10 All Features: features = all Slow Learning Rate: learn rate = 0.01 Lasso Linear Regression Top 16 variables by F-Score, Alpha=0.5 7

Solar: GFS Clearness Index Error Gradient Boosting: Optimizes with MAE, Tree Depth of 5, Samples subset of features Gradient Boosting Least Squares: Uses MSE instead of MAE Gradient Boosting All Features: Evaluates all input features Gradient Boosting Slow Learning Rate: Uses a learning rate of 0.01 instead of 0.1 Gradient Boosting Big Trees: Allows trees to grow to minimize training samples in each branch Random Forest: fully grown trees, evaluates subset of features Random Forest All Features: evaluates all features Random Forest Short Trees: tree depth of 3 Linear Regression: Lasso with top 16 variables Raw GFS: Downward shortwave irradiance Persistence: Interpolated irradiance at test sites based on observations from 24 hours before 8

GFS Solar Distributions 9

GFS Forecast Distributions 10

GFS Solar Station Errors 11

Next Steps: Deep Learning Investigating the use of deep learning models for weather feature and regime identification Goal: Train models to recognize multiscale features in NWP output Potential application for improved solar irradiance mean and variability forecasts based on weather regime Many other weather and climate applications Deep Convolutional Generative Adversarial Network architecture from Radford et al. (2016) 12

Generative Adversarial Networks Unsupervised method of learning complex feature representations from data Requires 2 deep neural networks Discriminator: determines which samples are from the training set and which are not Both networks have a battle of wits either to the death or until the discriminator is fooled often enough Generator: Creates synthetic examples similar to training data to fool discriminator Advantages Unsupervised pre-training: learn features without needing a large labeled dataset Dimensionality reduction: reduce image to smaller vector Learns sharper, more detailed features than autoencodermodels Do not need to specify a complex loss function 13

Preliminary Results: Mean Sea Level Pressure Trained on 4096 GEFS pressure forecasts Produces realistic pressure fields after 100 epochs of training Generator uses 100-value vector as input Each input adjust different parts of field 14

Summary Developed gridded statistical forecasting system for solar irradiance Evaluated different machine learning models and configurations on their ability to predict irradiance at multiple sites Gradient Boosting consistently showed lowest errors All machine learning models underestimated cloud cover frequency ML models had lower errors at sites with fewer clouds Generative Adversarial Networks show potential for extracting information from weather data Acknowledgements Rich Loft Tom Hamill The Oklahoma Mesonet Contact Me Email: dgagne@ucar.edu Twitter: @DJGagneDos Github: github.com/djgagne 15