SVR-Based Model to Forecast PV Power Generation under Different Weather Conditions

Similar documents
COMPARISON OF CLEAR-SKY MODELS FOR EVALUATING SOLAR FORECASTING SKILL

Prediction of Hourly Solar Radiation in Amman-Jordan by Using Artificial Neural Networks

A Support Vector Regression Model for Forecasting Rainfall

A Hybrid ARIMA and Neural Network Model to Forecast Particulate. Matter Concentration in Changsha, China

SOLAR ENERGY FORECASTING A PATHWAY FOR SUCCESSFUL RENEWABLE ENERGY INTEGRATION. (An ANN Based Model using NARX Model for forecasting of GHI)

Short and medium term solar irradiance and power forecasting given high penetration and a tropical environment

CHAPTER 6 CONCLUSION AND FUTURE SCOPE

Linear & nonlinear classifiers

Linear & nonlinear classifiers

RBF Neural Network Combined with Knowledge Mining Based on Environment Simulation Applied for Photovoltaic Generation Forecasting

AN INTERNATIONAL SOLAR IRRADIANCE DATA INGEST SYSTEM FOR FORECASTING SOLAR POWER AND AGRICULTURAL CROP YIELDS

Solar Nowcasting with Cluster-based Detrending

Solar Irradiance Prediction using Neural Model

Short-term water demand forecast based on deep neural network ABSTRACT

MODELLING ENERGY DEMAND FORECASTING USING NEURAL NETWORKS WITH UNIVARIATE TIME SERIES

Research Article Hybrid Power Forecasting Model for Photovoltaic Plants Based on Neural Network with Air Quality Index

Forecasting Solar Irradiance from Time Series with Exogenous Information

Application of Fully Recurrent (FRNN) and Radial Basis Function (RBFNN) Neural Networks for Simulating Solar Radiation

The Neural Support Vector Machine

ESTIMATION OF SOLAR RADIATION USING LEAST SQUARE SUPPORT VECTOR MACHINE

Day-Ahead Solar Forecasting Based on Multi-level Solar Measurements

ARTIFICIAL NEURAL NETWORK PART I HANIEH BORHANAZAD

Deep Learning Architecture for Univariate Time Series Forecasting

CS 229: Final Paper Wind Prediction: Physical model improvement through support vector regression Daniel Bejarano

Improving the accuracy of solar irradiance forecasts based on Numerical Weather Prediction

An Adaptive Multi-Modeling Approach to Solar Nowcasting

This paper presents the

Multi-Model Ensemble for day ahead PV power forecasting improvement

Solar irradiance forecasting for Chulalongkorn University location using time series models

Short-term wind forecasting using artificial neural networks (ANNs)

PREDICTING SOLAR GENERATION FROM WEATHER FORECASTS. Chenlin Wu Yuhan Lou

ANN and Statistical Theory Based Forecasting and Analysis of Power System Variables

Short Term Solar Radiation Forecast from Meteorological Data using Artificial Neural Network for Yola, Nigeria

NON-FIXED AND ASYMMETRICAL MARGIN APPROACH TO STOCK MARKET PREDICTION USING SUPPORT VECTOR REGRESSION. Haiqin Yang, Irwin King and Laiwan Chan

Short-Term Wind Speed Forecasting Using Regularization Extreme Learning Machine Da-cheng XING 1, Ben-shuang QIN 1,* and Cheng-gang LI 2

Modelling and Prediction of 150KW PV Array System in Northern India using Artificial Neural Network

Robust Multiple Estimator Systems for the Analysis of Biophysical Parameters From Remotely Sensed Data

Wind Power Forecasting using Artificial Neural Networks

Neural networks and support vector machines

Jeff Howbert Introduction to Machine Learning Winter

Support Vector Regression (SVR) Descriptions of SVR in this discussion follow that in Refs. (2, 6, 7, 8, 9). The literature

Explanatory Information Analysis for Day-Ahead Price Forecasting in the Iberian Electricity Market

Feature Selection Optimization Solar Insolation Prediction Using Artificial Neural Network: Perspective Bangladesh

Multi-Plant Photovoltaic Energy Forecasting Challenge with Regression Tree Ensembles and Hourly Average Forecasts

A Novel 2-D Model Approach for the Prediction of Hourly Solar Radiation

Improved the Forecasting of ANN-ARIMA Model Performance: A Case Study of Water Quality at the Offshore Kuala Terengganu, Terengganu, Malaysia

Solar Power Forecasting Using Support Vector Regression

Neural Networks. Prof. Dr. Rudolf Kruse. Computational Intelligence Group Faculty for Computer Science

Support Vector Machines. Introduction to Data Mining, 2 nd Edition by Tan, Steinbach, Karpatne, Kumar

Learning with kernels and SVM

A new method for short-term load forecasting based on chaotic time series and neural network

Solar irradiance forecasting for Chulalongkorn University location using time series models

Reading, UK 1 2 Abstract

Forecasting of Solar Photovoltaic System Power Generation using Wavelet Decomposition and Biascompensated

Radial basis function neural networks model to estimate global solar radiation in semi-arid area

Reservoir Computing and Echo State Networks

About Nnergix +2, More than 2,5 GW forecasted. Forecasting in 5 countries. 4 predictive technologies. More than power facilities

SARIMA-ELM Hybrid Model for Forecasting Tourist in Nepal

Outliers Treatment in Support Vector Regression for Financial Time Series Prediction

Data and prognosis for renewable energy

Short Term Load Forecasting Based Artificial Neural Network

Introduction to Neural Networks

Review: Support vector machines. Machine learning techniques and image analysis

Application of Artificial Neural Networks to Predict Daily Solar Radiation in Sokoto

Systems Operations. PRAMOD JAIN, Ph.D. Consultant, USAID Power the Future. Astana, September, /6/2018

Temporal global solar radiation forecasting using artificial neural network in Tunisian climate

Predicting MTA Bus Arrival Times in New York City

Support Vector Machine Regression for Volatile Stock Market Prediction

SEASONAL MODELING OF HOURLY SOLAR IRRADIATION SERIES

Data Mining. Linear & nonlinear classifiers. Hamid Beigy. Sharif University of Technology. Fall 1396

Importance of Numerical Weather Prediction in Variable Renewable Energy Forecast

On the Performance of Forecasting Models in the Presence of Input Uncertainty

COMPARISON OF CLEAR-SKY MODELS FOR EVALUATING SOLAR FORECASTING SKILL

Lecture 18: Kernels Risk and Loss Support Vector Regression. Aykut Erdem December 2016 Hacettepe University

Recurrent Neural Network based Deep Learning for Solar Radiation Prediction

Square Penalty Support Vector Regression

SELF-ADAPTING BUILDING MODELS FOR MODEL PREDICTIVE CONTROL

Gated Ensemble Learning Method for Demand-Side Electricity Load Forecasting

Frequency Forecasting using Time Series ARIMA model

WMO Aeronautical Meteorology Scientific Conference 2017

A comparative study of ANN and angstrom Prescott model in the context of solar radiation analysis

Forecasting demand in the National Electricity Market. October 2017

Prashant Pant 1, Achal Garg 2 1,2 Engineer, Keppel Offshore and Marine Engineering India Pvt. Ltd, Mumbai. IJRASET 2013: All Rights are Reserved 356

Load Forecasting Using Artificial Neural Networks and Support Vector Regression

CS:4420 Artificial Intelligence

Retrieval of Cloud Top Pressure

Current best practice of uncertainty forecast for wind energy

Engineering Part IIB: Module 4F10 Statistical Pattern Processing Lecture 6: Multi-Layer Perceptrons I

A Recommendation for Tropical Daily Rainfall Prediction Based on Meteorological Data Series in Indonesia

not for commercial-scale installations. Thus, there is a need to study the effects of snow on

Unit III. A Survey of Neural Network Model

A Comparison of the Forecast Performance of. Double Seasonal ARIMA and Double Seasonal. ARFIMA Models of Electricity Load Demand

Nonlinear System Identification using Support Vector Regression

Multi-Plant Photovoltaic Energy Forecasting Challenge: Second place solution

Electric Load Forecasting Using Wavelet Transform and Extreme Learning Machine

RAINFALL RUNOFF MODELING USING SUPPORT VECTOR REGRESSION AND ARTIFICIAL NEURAL NETWORKS

Probabilistic Energy Forecasting

Direct Method for Training Feed-forward Neural Networks using Batch Extended Kalman Filter for Multi- Step-Ahead Predictions

Support Vector Machines Explained

Introduction to SVM and RVM

Transcription:

energies Article SVR-Based Model to Forecast PV Power Generation under Different Weather Conditions Utpal Kumar Das 1, Kok Soon Tey 1, *, Mehdi Seyedmahmoudian 2, Mohd Yamani Idna Idris 1, Saad Mekhilef 3, Ben Horan 2 and Alex Stojcevski 4 1 Department of Computer System and Technology, Faculty of Computer Science and Information Technology, University of Malaya, Kuala Lumpur 50603, Malaysia; utpal@duet.ac.bd (U.K.D.); yamani@um.edu.my (M.Y.I.I.) 2 School of Engineering, Deakin University, Melbourne 3216, Australia; mehdis@deakin.edu.au (M.S.); ben.horan@deakin.edu.au (B.H.) 3 Power Electronics and Renewable Energy Research Laboratory (PEARL), Department of Electrical Engineering, Faculty of Engineering, University of Malaya, Kuala Lumpur 50603, Malaysia; saad@um.edu.my 4 School of Software and Electrical Engineering, Swinburne University of Technology, Melbourne, Victoria 3122, Australia; astojcevski@swin.edu.au * Correspondence: koksoon@um.edu.my; Tel.: +60-3-7967-6347 Received: 24 May 2017; Accepted: 27 June 2017; Published: 29 June 2017 Abstract: Inaccurate forecasting of photovoltaic (PV) power generation is a great concern in the planning and operation of stable and reliable electric grid systems as well as in promoting large-scale PV deployment. The paper proposes a generalized PV power forecasting model based on support vector regression, historical PV power output, and corresponding meteorological data. Weather conditions are broadly classified into two categories, namely, normal condition (clear sky) and abnormal condition (rainy or cloudy day). A generalized day-ahead forecasting model is developed to forecast PV power generation at any weather condition in a particular region. The proposed model is applied and experimentally validated by three different types of PV stations in the same location at different weather conditions. Furthermore, a conventional artificial neural network ()-based forecasting model is utilized, using the same experimental data-sets of the proposed model. The analytical results showed that the proposed model achieved better forecasting accuracy with less computational complexity when compared with other models, including the conventional model. The proposed model is also effective and practical in forecasting existing grid-connected PV power generation. Keywords: photovoltaic power forecasting; support vector regression; support vector machine; artificial neural network; different weather conditions 1. Introduction Energy, especially electrical energy, plays a key role in a country s development; it also improves the living standard of people. Therefore, the demand for electrical energy has been increasing day by day due to industrialization and modernization. However, the conventional generation of electrical energy has caused global warming and significant climate changes. Subsequently, the modern world has undertaken different renewable energy initiatives to mitigate global warming and meet the growing electricity demand. Nowadays, many countries have produced a significant portion of their energy demands from renewable energy resources, particularly solar generation power plants [1]. Among the potential renewable energies, photovoltaics (PV) have undergone enormous growth over the last few years. The total installed capacity of PV systems has reached around 227 GW worldwide, an increase Energies 2017, 10, 876; doi:10.3390/en10070876 www.mdpi.com/journal/energies

Energies 2017, 10, 876 2 of 17 of more than 28% in 2015 (International Energy Agency (IEA) report, 2016); such growth is expected to continue at similar or higher rates in the future. Decreasing prices (i.e., the lowest at below $1.5/W P for fixed tilt systems) and improved PV technology can also boost PV system installations [2,3]. However, the generation of PV power fully depends on random and ungovernable solar irradiance and other metrological factors, such as atmospheric temperature, module temperature, wind speed, wind direction, and humidity. The power output of a PV system dynamically changes with time due to the variability of environmental factors. Unpredictable PV power output adversely affects system stability and reliability, the scheduling of system operations, and related economic benefits [4,5]. Meanwhile, accurate forecasting of PV power generation can reduce the impact of PV power uncertainty on the grid, improve system reliability, maintain power quality, and increase the penetration level of PV systems [6]. Therefore, accurate forecasting of PV power generation has become an important task for researchers at present. In a significant number of previous research, solar irradiance on different time scales was forecasted using various approaches, including numerical weather prediction methods, image-based methods, and statistical methods [7 10]. The forecasted solar irradiance and other associated data are used as inputs for PV power generation by commercial PV simulation software, such as TRNSYSM, PVFORM, and HOMER. In [11], first the fuzzy theorem was used to forecast the solar irradiance levels and then the recurrent neural network (RNN) method was used to forecast the 24 hour ahead output power of the PV system. However, most previous research on this problem employed direct methods to forecast PV power generation based on historical time-series data, such as historical PV power output and corresponding meteorological data. The research by Kudo et al. [12] demonstrated that the direct method of forecasting next-day PV power generation is better when compared with indirect methods. In direct PV power forecasting, persistence modeling [13] is generally conducted to justify and select other models, and to decide on benchmarks. modeling is mainly used for one-hour ahead forecasting; hence, as the time range of forecasting increases, the accuracy of persistence modeling decreases [14]. Both autoregressive and moving-average modeling and their generalizations (i.e., autoregressive-moving-average models) [15] are widely used in statistical and time-series data analyses. These are based on classical time-series analysis such as following the Box Jenkins method [16]. Yang et al. [17] proposed an autoregressive method, with an exogenous input (ARX)-based spatio-temporal (ST) model, in order to improve the accuracy of the developed PV output power forecasting technique. However, these time-series models have limitations because they require stationary data-sets [18]. A stationary time series is one whose statistical properties, such as mean, variance, and autocorrelation, are all constant over time. However, the PV power output and related meteorological data are non-stationary. By contrast, autoregressive-integrated-moving-average (ARIMA) models integrate non-stationarity elements from time-series data [19], but these models are computationally intensive because of the inclusion of a summation/integration function. The most general time series analysis model is called NARX (Nonlinear Autoregressive with Exogenous Input). In [20], NARX was chosen as a dynamic artificial neural network () to forecast the PV power generation, and the result showed that the NARX is more efficient because of its capability to learn and generalize formulas, compared with other models. However, the NARX models have limitations in learning long time dependences because of the vanishing gradient. Furthermore, similar to any dynamical system, the models were affected by instability and lack a procedure for optimizing embedded memory. Moreover, time series models need lots of data when the model is structured, and the parameters are difficult to update when new data are uploaded. modeling is excellent for complicated and non-linear data analysis, and it does not require any prior assumption. Neural networks are widely used in PV power generation forecasting, and forecasted results are better when compared with those from regression analysis methods [21]. NARX and feed-forward neural networks with tapped delay lines have been used to forecast PV energy production, and they result in errors (MAPE) of less than 5% [22]. However, the

Energies 2017, 10, 876 3 of 17 performance of this forecasting model depends on meteorological factors and the correlation between explanatory variables and dependent variables. If the correlation factor between variables is less significant, then the forecasting result will be inaccurate. Multilayer perceptron (MLP) [23] is an example of the feed-forward NN model that has been widely deployed in PV power generation forecasting. In this model, the input of each neuron is transformed into output through the sigmoid function. The performance of this model is better than the Box Jenkins method due to its non-linear approximation. The radial basis function neural network (RBFNN) [24] is a type of model that makes use of linear combinations of radial basis functions. For the radial basis function, Gaussian functions are often used to transform inputs into outputs. If tuned accurately, the RBFNN has better performance than MLP. However, the tuning procedures of the RBFNN sometimes cause problems, such that good results are not obtained due to poor parameter adjustments. Among the -based forecasting methods, back propagation NN (BPNN) has been widely used because of its excellent nonlinear mapping function, which is especially suitable for solving complex regression problems [25]. BPNN has the advantages of a complex nonlinear systems simulation ability, a strong learning ability, good approximation performance, and a large fault data tolerance. However, inherent defects are found, such as a slow convergence rate as well as a susceptibility to easily fall into local minimum values; thus BPNN is unable to obtain the global optimal solution [26]. Accuracy is a main consideration when developing models for PV power generation forecasting. For short-term forecasting, errors should be less than 20% [6]. However, for changing conditions (e.g., morning and evening, or rainy or cloudy weather), forecasting accuracy decreases to a point where the relative mean square errors (RMSE) are sometimes higher than 50%. To build better forecasting models, some studies classified forecasted days into different categories on the basis of weather conditions. Kang et al. [27] developed an algorithm by utilizing k-means clustering; Chen et al. [28] presented a RBFNN model; Shi et al. [6] proposed a model based on weather classification and support vector machines (SVM); Yang et al. [29] presented a weather-based hybrid method; and Liu et al. [26] proposed a back-propagation NN model to forecast PV power generation. All of these works classified days into different categories like sunny, cloudy, foggy, and rainy, and then built separate forecasting models for each classification. This approach implies that sub-models should be chosen on the basis of the weather condition of a forecasted day, apart from applying meteorological data to the model. However, although the accuracy is satisfactory for forecasting, sub-modeling is limited by complexity and computational costs. In this paper, a generalized PV power forecasting model is proposed on the basis of support vector regression (SVR), historical data of PV power output, and meteorological data. In particular, SVR is supervised as a learning method and utilized for model development. In the study, PV power output characteristics and influential factors are analyzed to achieve forecasting accuracy. Subsequently, forecasted days are classified according to Malaysian weather conditions and historical PV power output data. The two types of weather in Malaysia are normal days (clear sky) and abnormal days (cloudy or rainy sky). Consequently, a generalized SVR-based model is introduced to forecast PV power generation accurately for any Malaysian weather condition. The proposed model is applied to and validated by three PV power stations situated in an institutional building of the University of Malaya in Kuala Lumpur. A generalized hourly resolution and day-ahead forecasting model is established to compare the forecasting accuracies of different weather conditions. This proposed single model, which is applicable for different weather conditions, is very simple to use and can reduce the complexities and computational costs. The paper is organized as follows: Section 2 presents a brief description of the methods applied to forecast the PV power generation including real PV plant data collection and analysis; Section 3 indicates the performance metrics to evaluate the forecasting models; and Section 4 discusses the results of the proposed model including comparison and validations. Finally, Section 5 summarizes and concludes the study.

Energies 2017, 10, 876 4 of 17 Energies 2017, 10, 876 4 of 17 2. Methodology Three PV power generation systems were selected to collect PV power output and related meteorological 2.1. Data Collection data. and These Analysis systems are located in Kuala Lumpur (latitude = 03 09 N; longitude Three = 101 41 PV power E). The generation PV systems systems were were installed selected on the to rooftop collect PV of an power institutional output building and related of the meteorological University of Malaya. data. Table These1 presents systems the aredetails located of these in Kuala PV systems. Lumpur The (latitude three PV plants = 03 09 are of N; monocrystalline, longitude = 101 polycrystalline, 41 E). The PV and systems thin-film weretypes installed with on installed the rooftop capacities of anof institutional 1875, 2000, building and 2700 of the University Wp, respectively. of Malaya. PV power Table 1outputs presentswere the details collected of these individually PV systems. from The each three plant; PVhowever, plants are meteorological of monocrystalline, data (i.e., polycrystalline, solar irradiance, and atmospheric thin-film types temperature, with installed module capacities temperature, of 1875, and 2000, wind pressure) and 2700 were Wp, collected respectively. as a PVsingle power dataset outputs for were all collected plants because individually of their fromsimilar each plant; geographical however, locations. meteorological data (i.e., solar irradiance, atmospheric temperature, module temperature, and wind pressure) were collected as a single dataset for all plants because of their similar geographical locations. Table 1. Details of photovoltaic (PV) power plants utilized in the study. Table 1. Details of photovoltaic (PV) power plants utilized in the study. Data Sources Nature of Plant Installed Capacity Measurement Items Data Sources Nature of Plant Installed Capacity (i) PV power generation; Plant-1 Monocrystalline 1875 Wp (75 W 25 pcs) Measurement Items (ii) Solar irradiance (W/m2); Plant-2 Plant-1 Polycrystalline Monocrystalline 2000 Wp 1875 (125 Wp W (75 16 W pcs) 25 pcs) (iii) Atmospheric (i) PV powertemperature generation; ( C); (ii) Solar irradiance (W/m 2 Plant-2 Polycrystalline 2000 Wp (125 W 16 pcs) (iv) PV module temperature ( C); Plant-3 Thin-Film 2700 Wp (135 W 20 pcs) (iii) Atmospheric temperature ( C); (v) Wind Plant-3 Thin-Film 2700 Wp (135 W 20 pcs) (iv) speed PV module (m/s) temperature ( C); (v) Wind speed (m/s) Table 2 shows the collected data for each unit parameter identified for the study. The PV power output Table and 2 related shows meteorological the collected data data for were each collected unit parameter by an automatic identified data for the acquisition study. The system PV power in 5 min output durations and related from 1 meteorological January 2016 to data 31 December were collected 2016. by an automatic data acquisition system in 5 min durations from 1 January 2016 to 31 December 2016. Table 2. Set of collected input/output variables of the PV system. Input/Output Table Parameters 2. Set of collected input/output Unit variables of Resolution the PV system. Variables Solar Irradiance W/m 2 5 min Irradiance Input Input/Output Ambient Parameters Temperature C Unit Resolution 5 min Variables Temp_amb Parameters Module Temperature Solar Irradiance W/m C 2 5 min 5 min Irradiance Temp_module Wind Ambient Speed Temperature m/s Input Parameters C 5 min 5 min Temp_amb Wind_speed Output Module Temperature C 5 min Temp_module PV Output Wind Power Speed Watt Parameter m/s 5 min 5 min Wind_speed Power_pv Output Parameter PV Output Power Watt 5 min Power_pv Figure 1a illustrates the mean daily PV power output generated by plant-1 for January 2016, and Figure Figure 1b shows 1a illustrates the daily the energy mean daily generated PV power by output each PV generated plant for by May plant-1 2016. for January Average 2016, power and generation Figure 1b shows differed the across daily days energy due generated to variations by each in the PV plant weather for conditions. May 2016. Average power generation differed across days due to variations in the weather conditions. (a) (b) Figure1. 1. (a) (a) Daily Daily average power generatedby bypv PV plant-1; (b) (b) Daily Daily generated energyof ofpv PV plants. For plant-1 (Figure 1a), the three days of 1, 15, and 16 January generated comparatively low power due to cloudy or rainy weather conditions (abnormal day). Meanwhile, due to clear skies

Energies 2017, 10, 876 5 of 17 Energies 2017, 10, 876 5 of 17 (normal For day), plant-1 average (Figure power 1a), the generation three days was ofcomparatively 1, 15, and 16 January higher generated for the days comparatively of 2, 7, 10, 11, low20, power 24, and due25 to cloudy January. or In rainy Figure weather 1b, conditions the daily (abnormal average PV day). energy Meanwhile, production due toshows clear skies a rather (normal stable day), fluctuation average power in relation generation to the was different comparatively weather higher conditions for the in Malaysia, days of 2, 7, which 10, 11, suggests 20, 24, and the 25 potential January. of Inthe Figure country 1b, to thegenerate daily average PV energy. PV energy production shows a rather stable fluctuation in relation to thephotovoltaic different weather power conditions generation in Malaysia, is closely which related suggests to meteorological the potential parameters, of the country such toas generate solar irradiance, PV energy. ambient/atmospheric temperature, module temperature, and wind speed, among others. Figure Photovoltaic 2a shows the power patterns generation of solar irradiance is closely related and PV topower meteorological output of parameters, different PV such plants ason solar a particular irradiance, day. ambient/atmospheric During clear-sky days temperature, (normal day), module PV power temperature, output very andstrongly wind speed, matched among the solar others. irradiance Figure 2acurve. showssimilarly, the patterns for cloudy of solaror irradiance rainy days and (abnormal PV power day), output a pattern of different harmony PVis plants observed on a between particular PV day. power During output clear-sky and solar days irradiance (normal as shown day), PV in Figure power2a. output A similar verypattern strongly for matched PV power the output solar irradiance and solar irradiance curve. Similarly, was observed for cloudy for the or different rainy days weather (abnormal conditions day), because a pattern the generation harmony is of observed PV power between fully depends PV power on output solar irradiance. and solar irradiance If the irradiance as shownincreases, Figurethen 2a. APV similar power pattern will be for increased, PV powerand output vice and versa solar for irradiance any weather was observed conditions. forin themalaysian different weather weather conditions conditions, because a linear the relationship generation occurs of PV power between fully them. depends Therefore, on solar a high irradiance. correlation If thecoefficient irradiancebetween increases, PV then power PV power and solar will be irradiance increased, has and been viceobserved. versa for any Figure weather 2b shows conditions. a strong In Malaysian positive correlation weather conditions, between asolar linear irradiance relationship and occurs PV power between output. them. Therefore, asolar high correlation irradiance coefficient is an important betweeninput PV power vector and when solar developing irradiance has an appropriate been observed. PV Figure power 2bforecasting shows a strong model, positive as evidenced correlationby between the high solar correlation irradiance coefficient and PV power of R 2 = output. 0.9888. Therefore, solar irradiance is an important input vector when developing an appropriate PV power forecasting model, as evidenced by the high correlation coefficient of R 2 = 0.9888. (a) (b) Figure Figure2. 2. (a) (a) Pattern Patternof of solar solar irradiance irradianceand andpv PVpower poweroutput output for forabnormal days; days; (b) (b) Correlation between betweensolar solar irradianceand andpv PVpower output. output. The analytical results also indicate that the other meteorological variables, such as ambient The analytical results also indicate that the other meteorological variables, such as ambient temperature, module temperature, and wind pressure, were also correlated with PV power temperature, module temperature, and wind pressure, were also correlated with PV power generation. generation. A comparatively weak correlation was established between PV power output and A comparatively weak correlation was established between PV power output and atmospheric atmospheric temperature, while an extremely weak correlation was observed between PV power temperature, while an extremely weak correlation was observed between PV power output and output and wind speed. However, wind speed has been considered an input of the proposed model wind speed. However, wind speed has been considered an input of the proposed model to build to build an appropriate model. By contrast, a strong correlation between PV power output and an appropriate model. By contrast, a strong correlation between PV power output and module module temperature was observed. Nevertheless, it has been ignored in the selection as an input temperature was observed. Nevertheless, it has been ignored in the selection as an input vector vector of the proposed model because the module temperature is highly dependent on other of the proposed model because the module temperature is highly dependent on other variables, variables, such as atmospheric temperature, wind speed, wind direction, humidity, and the amount such as atmospheric temperature, wind speed, wind direction, humidity, and the amount of PV of PV power generation. power generation. 2.2. 2.2. Data Data Preparation Preparation Some Some meteorological variables variables showed showed extremely extremely weak weak correlations correlations with the with PV power the PV generation. power generation. In this study, In this the study, influence the on influence PV power on generation PV power by generation all of theby aforementioned all of the aforementioned independent independent meteorological meteorological variables was variables considered. was considered. Sample data on actual PV power generated and related meteorological variables were collected at 5 min intervals. The obtained PV power output data and meteorological data were averaged as

Energies 2017, 10, 876 6 of 17 Energies 2017, 10, 876 6 of 17 hourly Sample datasets. on However, actual PV during power data generated collection, and PV related power meteorological output samples variables may were be lost collected due to at recording 5 min intervals. errors or The other obtained special PV events. power If abnormal output data data and are meteorological collected, the training data were of SVR averaged becomes as hourly unstable. datasets. Thus, before However, averaging, during missing data collection, data should PVbe power replaced output by same-hours samples may data be from lostthe due latest to recording similar day. errors or other special events. If abnormal data are collected, the training of SVR becomes unstable. Thus, before averaging, missing data should be replaced by same-hours data from the latest similar 2.3. Pre-Processing day. of Data The nonlinear SVR model can map nonlinear inputs into higher-dimensional space to make 2.3. Pre-Processing of Data them linear. However, a wider data range results in imprecisions both in terms of fitting and regression. The nonlinear If data are SVRpre-processed model can map into nonlinear smaller ranges inputs into before higher-dimensional they are inputted space into the to model, make them then linear. regression However, precision a wider can be data increased. range results A well-known in imprecisions solution both to the in terms aforementioned of fitting and limitation regression. is a If normalization data are pre-processed process, wherein into smaller data ranges are restricted before they to the are range inputted of 0 and into1. the Normalization model, then regression minimizes precision regression canerror, be increased. improves A well-known precision, and solution maintains to thecorrelation aforementioned in the limitation data-set. isthe a normalization formula for process, normalization whereinis data [30]: are restricted to the range of 0 and 1. Normalization minimizes regression error, improves precision, and maintains correlation in the data-set. The formula for normalization is [30]: = (1) D Normal = D actual D min (1) where is the normalized input data; D max is the D min original input data (PV power output and meteorological data); and where D Normal is the normalized and input data; are the minimum and maximum values of the utilized D actual is the original input data (PV power output and input data, respectively. meteorological data); and D min and D max are the minimum and maximum values of the utilized input data, respectively. 2.4. Support Vector Regression 2.4. Support SVM [31] Vector is a Regression supervised machine-learning method that follows the structural risk minimization principle. SVM [31] SVMs is a have supervised greater machine-learning generalization ability method compared that follows with the other structural approaches, risk minimization and they are principle. widely used SVMs in resolving have greater classification generalization and regression ability compared problems. with SVMs other are approaches, excellent for and time-series they are widely analysis used due in to resolving its global classification minima. When and applied regression to time-series problems. prediction, SVMs are SVM excellent modeling for time-series is referred analysis to as support due to its vector global regression minima. When (SVR). applied Forecasting to time-series PV power prediction, generation SVM is modeling a typical is referred time-series to as analysis support problem; vector regression in this case, (SVR). SVR Forecasting is an appropriate PV power method. generation is a typical time-series analysis problem; The in SVR this algorithm case, SVR is is a an nonlinear appropriate regression method. algorithm. Inputs from time-series data samples are mapped The SVR into algorithm high-dimensional is a nonlinear feature regression space for algorithm. nonlinear mapping. Inputs from Subsequently, time-series data linear samples regression are mapped is conducted into high-dimensional (Figure 3). feature space for nonlinear mapping. Subsequently, linear regression is conducted (Figure 3). Figure 3. 3. Changing nonlinear regression into into linear linear regression. A set of of training data data {(x (, ), (, ),,(, ) is 1, y 1 ), (x 2, y 2 ),......, (x l, y l )} is considered, where x i R n is the input vector (meteorological data), and y i R 1 is isthe the corresponding output value value (PV (PV power power output). output). The The estimation function function ( ) f (x) is is shown in in Function (2): (2): = ( ) = ( ) + (2) y = f (x) = w ψ(x) + b (2) where is a weight vector, and is the bias term, which can be estimated by minimizing where the regularized w R n isrisk a weight function. vector, and b R is the bias term, which can be estimated by minimizing the regularized Using the risk -insensitive function. loss function in SVR, the regression problem can be changed into the following optimization problem:

Energies 2017, 10, 876 7 of 17 Using the ε-insensitive loss function in SVR, the regression problem can be changed into the following optimization problem: 1 min : 2 w 2 + C l ( ξi + ξ ) i Subject to : y i f (x) ε + ξi ; f (x) y i ε + ξ i ; and ξi, ξ i 0 (3) where ε is the radius of the tube (margin of tolerance), which refers to the data inside the tube that should be ignored during regression. The feature vector, which lies on the boundary of the tube, is known as the support vector. In line with the Lagrange multiplier method, the Lagrange function is acquired as follows: L(w, b, ξ, ξ, α, α, γ, γ ) = 2 1 w 2 + C l (ξ + ξ ) l α i ξ i + ε y i + f (x i ) l αi ξ i + ε + y i f (x i ) l ( ξi γ i + ξ ) i γ i (4) where ξ and ξ are the slack variables representing the distance from actual values to the corresponding boundary values of the ε-tube. Hence, α, α, γ, γ 0. Next, the saddle point of L is calculated. The following equations can thus be obtained: w L = 0 w = l (α i αi )ψ(x i) b L = 0 l (α i α i ) = 0 ξ i L = 0 C α i γ i = 0 (5) By substituting the original Function (4) with Equation (5), the following model can be obtained: max : w(α, α ) w,b,ξ,ξ = 1 2 l ( αi α ) ( ) i α j α j < ψ(x i ), ψ(x j ) > i,j=1 l (α i + αi )ε + l (α i αi )y i l Subject to : (α i αi ) = 0; 0 α i, αi C (6) where C determines the penalties of the estimation errors. Subsequently, the kernel function k ( x i, x j ) is introduced with a mercer condition to replace the original function < ψ(x i ), ψ ( x j ) >, and the following model can thus be obtained: max : w(α, α ) w,b,ξ,ξ = 1 2 l ) (α i αi (α ) j α j k(x i, x j ) i,j=1 l (α i + αi )ε + l (α i αi )y i l Subject to : (α i αi ) = 0; 0 α i, αi C (7) The kernel function is one of the key factors of SVR. The performance of SVR is largely dependent on the selection of the kernel function and its parameters. Four traditional kernel functions are commonly used in SVR: Liner kernel function: k ( x i, x j ) = x T i x j ; Polynomial: k ( x i, x j ) = (γx T i x j + r) d, γ > 0;

Energies 2017, 10, 876 8 of 17 Energies 2017, 10, 876 8 of 17 Liner kernel function:, = ; Polynomial: Gaussian RBF (radial bias function): k ( ( ), =( + ), > ) 0; x i x j = exp x i x j 2 ( = exp γ x 2σ 2 i x j 2), Gaussian RBF (radial bias function):, = ǀǀ ǀǀ = ǀǀ γ > 0; ǀǀ, > 0; Sigmoid: Sigmoid: k ( ) ( x i, x j = tanh γx T i x j + r ), = h +.. The radial bias function (RBF) is often used as the kernel in SVR because it requires only one The radial bias function (RBF) is often used as the kernel in SVR because it requires only one parameter and it has a wide scope of application. RBFs also have the ability to universally parameter and it has a wide scope of application. RBFs also have the ability to universally approximate approximate any distribution in feature space. Therefore, the RBF was used as kernel in this study. any distribution in feature space. Therefore, the RBF was used as kernel in this study. For the RBF, σ 2 For the RBF, was set as the bandwidth of the kernel function. was set as the bandwidth of the kernel function. To optimally solve the problems, the best α i and αi To optimally solve the problems, the best should be obtained. The best-fit regression should be obtained. The best-fit regression function can be expressed as follows: function can be expressed as follows: ( ) = ( l f (x) = (α i αi ) ) k(, ) + (8) x i, x j + b (8) where and are the Lagrange multipliers; and where α i and αi are the Lagrange multipliers; and k ( (, ) ) is the kernel function. The nonlinear separable cases can be easily transformed into linear cases x i, x j by is mapping the kernel the function. original The variable nonlinear into a separable new high-dimension cases can be feature easily transformed space using into linear cases by mapping the original variable into a new high-dimension feature space using k ( ( ), ). x i, x j. 2.5. SVR-Based Model to Forecast PV Power Generation 2.5. SVR-Based Model to Forecast PV Power Generation To establish the generalized SVR-based model for PV power generation forecasting, the LIBSVM package To establish [32] proposed the generalized by Chang SVR-based and Lin model (2001) for was PVadopted power generation in present forecasting, study. For the theproposed LIBSVM package generalized [32] day-ahead proposed by hourly Chang resolution and Lin model (2001) using was adopted the SVR inapproach, present study. the average For thehourly proposed data generalized samples were day-ahead considered hourly for training resolution and model testing using purposes. the SVR The diagram approach, flow theof average the proposed hourlymodel data samples is shown were in Figure considered 4. for training and testing purposes. The diagram flow of the proposed model is shown in Figure 4. Figure 4. Flowchart of the proposed support vector regression (SVR)-based forecasting model. Figure 4. Flowchart of the proposed support vector regression (SVR)-based forecasting model. Historical data of meteorological variables (i.e., solar irradiance, atmospheric temperature, and Historical data of meteorological variables (i.e., solar irradiance, atmospheric temperature, and wind speed) were considered as the input data-set of the model. Meanwhile, historical data of PV wind speed) were considered as the input data-set of the model. Meanwhile, historical data of PV power output were considered as the output data-set for training the proposed model. All of the power output were considered as the output data-set for training the proposed model. All of the original historical PV power data and meteorological data were normalized within the range of [0, 1] to original historical PV power data and meteorological data were normalized within the range of [0, 1]

Energies 2017, 10, 876 9 of 17 minimize regression error. Subsequently, the datasets for training (i.e., more than 70% of the total data) and testing were separated. To develop the model, the initialization values of the three dominating parameters (C, ε, and γ) in SVR should be also included. The training and the testing of the model were conducted first-time using the stipulated historical data-set. A non-optimization model implies that the dominating parameter values should be changed, and model training and testing should be conducted again; this procedure should be continued until the model is optimized. In the present study, the parameters were chosen on the basis of experience-based trial and error. An optimized model implies that PV power generation can be forecasted for a particular day. To forecast PV power generation using the proposed model, the hourly averages of normalized meteorological data for a given forecasted day (i.e., data collected from the Malaysian meteorological department or numerical predicted data) should be applied. Given that the input data of this model were normalized, the output data should be anti-normalized to extract the original values of PV power; this process consolidates the performance analysis of the proposed model. A standard three-layer back-propagation with the number of epochs set to 1000 was utilized as the alternative model for a comparative evaluation of the performance of the proposed model. To evaluate for better performance of the model, the number of hidden neurons of the model was changed to be between 5 and 20. To establish the benchmark of the PV power forecasting model, a persistence model was used to validate the proposed model. 3. Evaluation of Forecasting Accuracy Several types of performance metrics were used to evaluate the accuracy of the forecasting model. In this study, normalized root-mean-square error (nrmse) (i.e., the most commonly used metric), mean absolute error (MAE), and mean bias error (MBE) were adopted for the proposed model: nrmse % = 1 N N MAE = 1 N MBE = 1 N (W Predicted W actual ) 2 100/W actual (max) (9) N N W Predicted W actual (10) (W Predicted W actual ) (11) where W predicted and W actual are the forecasted value and actual measured value of the PV power output at each time point, respectively. In addition, W actual(max) is the maximum value of the actual measured PV power output, and N is the number of test data samples. The datasets for training and testing were normalized; accordingly, the forecasted PV power output data of the model have to be anti-normalized before calculating the nrmse, MAE, and MBE. 4. Result and Discussion In this research, a generalized forecasting model based on SVR was developed. The model was applied for hourly resolution day-ahead PV power generation forecasting. The actual PV power generation data of three different PV plants and their related meteorological data (i.e., solar irradiance, atmospheric temperature, and wind speed) were utilized in the study. In Malaysia, solar irradiance is available from 08:00 to 19:00 at almost all seasons of the year; accordingly, daily PV power outputs are received during these times only. Experimental data covering three months (January, May, and September 2016) were used to verify the proposed model. However, Malaysian weather conditions have not changed drastically over the year. Nevertheless, these three months have been selected in this analysis because all of the weather conditions throughout the year were considered. Sample data were selected randomly for training and test purposes due to different weather conditions. The forecasted

Energies 2017, 10, 876 10 of 17 Energies 2017, 10, 876 10 of 17 extract the actual value of the PV power forecast. Consequently, the forecasted values of the proposed model were compared with those for the standard back-propagation NN model and the persistence PV power of the proposed model should be anti-normalized to extract the actual value of the PV power model. forecast. Consequently, the forecasted values of the proposed model were compared with those for the Figure 5a shows the measured PV power outputs (actual) and forecasted PV power outputs of standard back-propagation NN model and the persistence model. the different models (i.e., proposed model,, and persistence model) of PV plant-1 in normal Figure 5a shows the measured PV power outputs (actual) and forecasted PV power outputs of the weather conditions. In this case, 21 and 22 May were set as the forecast days. As shown in Figure 1b different models (i.e., proposed model,, and persistence model) of PV plant-1 in normal weather and based on the analysis of the PV power output patterns, these days (21 and 22 May) are clear-sky conditions. In this case, 21 and 22 May were set as the forecast days. As shown in Figure 1b and based days (normal day). Hence, the hourly PV power output average and related metrological data of on the analysis of the PV power output patterns, these days (21 and 22 May) are clear-sky days (normal 14 days (7 20 May) were used to train the model. As shown in Figure 5a, the forecasted curve of the day). Hence, the hourly PV power output average and related metrological data of 14 days (7 20 May) proposed model almost matched the actual measured curve of PV power generation. were used to train the model. As shown in Figure 5a, the forecasted curve of the proposed model almost matched the actual measured curve of PV power generation. (a) (b) Figure 5. PV PV power outputof of plant-1 in in (a) normal weather conditions; (b) abnormal weather conditions. means artificial neural network. Figure 5b represents the actual measured PV power outputs and forecasted PV power outputs Figure 5b represents the actual measured PV power outputs and forecasted PV power outputs of of the proposed model, model, and persistence model of plant-1 in abnormal weather the proposed model, model, and persistence model of plant-1 in abnormal weather conditions. conditions. In this figure, 26 and 27 May were set as forecast day. As shown in Figure 1b and based In this figure, 26 and 27 May were set as forecast day. As shown in Figure 1b and based on the analysis on the analysis of PV power output patterns, these days are cloudy or rainy days (abnormal day). of PV power output patterns, these days are cloudy or rainy days (abnormal day). Hence, the hourly Hence, the hourly PV power output average and related meteorological data of 14 days (12 25 May) PV power output average and related meteorological data of 14 days (12 25 May) were used to train were used to train the model. As shown in Figure 5b, the forecasted curve of the proposed model the model. As shown in Figure 5b, the forecasted curve of the proposed model almost matched the almost matched the actual measured curve of PV power generation. However, a significant deviation actual measured curve of PV power generation. However, a significant deviation for the persistence for the persistence model curve was observed, which might have been affected by abnormal weather model curve was observed, which might have been affected by abnormal weather conditions. conditions. The same meteorological data-set and related PV power output data of PV plant-2 were used The same meteorological data-set and related PV power output data of PV plant-2 were used to to train and test the models in different weather conditions. Similarly to plant-1, normal forecast train and test the models in different weather conditions. Similarly to plant-1, normal forecast days days (21 and 22 May) and abnormal forecast days (26 and 27 May) were selected. Figure 6a,b shows (21 and 22 May) and abnormal forecast days (26 and 27 May) were selected. Figure 6a,b shows the the actual measured PV power output and forecasted PV power output of the different models actual measured PV power output and forecasted PV power output of the different models (i.e., the (i.e., the proposed model, the model, and the persistence model) for normal and abnormal proposed model, the model, and the persistence model) for normal and abnormal weather weather conditions, respectively. As shown by Figure 6a,b, the forecasted result of the proposed model conditions, respectively. As shown by Figure 6a,b, the forecasted result of the proposed model almost almost matched the actual measured PV power output of plant-2. matched the actual measured PV power output of plant-2.

Energies 2017, 10, 876 11 of 17 Energies 2017, 10, 876 11 of 17 Energies 2017, 10, 876 11 of 17 (a) (b) Figure 6. PV power output of plant-2 in (a) normal weather conditions; (b) abnormal weather (a) (b) conditions. Figure6. 6. PV power output of of plant-2 plant-2 in (a) in normal (a) normal weather weather conditions; conditions; (b) abnormal (b) abnormal weather conditions. weather conditions. Similarly to the previous approach, the same meteorological data-set and related PV power output data of plant-3 were used to train and test the proposed model, the model, and the Similarly to the previous approach, the same meteorological data-set and related PV power output persistence Similarly model to the in different previous weather approach, conditions. the same Normal meteorological and abnormal data-set forecast and days related were PV selected, power data of plant-3 were used to train and test the proposed model, the model, and the persistence output similarly data to plant-1 of plant-3 and were plant-2, used in to consideration train and test of the same-month proposed data. model, Figure the 7a,b shows model, the and actual the model in different weather conditions. Normal and abnormal forecast days were selected, similarly persistence measured PV model power in different output and weather forecasted conditions. PV power Normal output and of abnormal the different forecast models days for were normal selected, and to plant-1 and plant-2, in consideration of same-month data. Figure 7a,b shows the actual measured similarly abnormal to weather plant-1 conditions, and plant-2, respectively. in consideration The of figures same-month show that data. the Figure proposed 7a,b shows PV power the actual curve PV power output and forecasted PV power output of the different models for normal and abnormal measured almost matched PV power the actual output measured and forecasted PV power PV power curve output of plant-3. of the different models for normal and weather conditions, respectively. The figures show that the proposed PV power curve almost matched abnormal weather conditions, respectively. The figures show that the proposed PV power curve the actual measured PV power curve of plant-3. almost matched the actual measured PV power curve of plant-3. (a) (b) Figure7. 7. PV power output of of plant-3 plant-3 in (a) in normal (a) normal weather weather conditions; conditions; (b) abnormal (b) abnormal weather conditions. weather (a) (b) conditions. Figure The performances 7. PV power output of the of proposed plant-3 in model, (a) normal the weather model, conditions; and the (b) persistence abnormal weather model were conditions. The performances of the proposed model, the model, and the persistence model were evaluated using the experimental data of May 2016. In this case, nrmse, MAE, and MBE were adopted evaluated using the experimental data of May 2016. In this case, nrmse, MAE, and MBE were for the forecasting performance evaluation. Calculation results are shown in Table 3. adopted The performances the forecasting of the performance proposed evaluation. model, the Calculation model, results and the are persistence shown in Table model 3. Similar work has been done by using other practical datasets collected from the months of January were evaluated and September using the of 2016 experimental for further data validation of May of 2016. the proposed In this case, model. nrmse, In this MAE, case, the and analyses MBE were have adopted for the forecasting Table 3. Summary performance of the forecasting evaluation. performance Calculation for results the data are of May shown 2016. been completed by considering the normal and abnormal weather conditions separately. in Table 3. From the analysis of Metrics the PV power Models output pattern Plant-1 of January, Plant-2 it is clear Plant-3 that 24 andmean 25 January Average are normal days, Table 3. Summary of the N = forecasting 2.77 N performance = 2.56 N for = 2.58 the data N of = May 2.64 2016. and 15 and 16 January are abnormal days. On the other hand, it is also clear from3.06 the analysis of Metrics Models A Plant-1 = 3.66 A Plant-2 = 3.53 A Plant-3 = 3.21 A Mean = 3.47 the PV power output pattern of September that 29 and 30 September are normal Average days and 21 and nrmse (%) N = 2.77 3.53 N = 2.56 3.32 N = 2.58 3.45 N = 2.64 3.43 22 September are abnormal days. From this study, it has been found that the actual 3.06 3.83 measured output A = 3.66 4.58 A = 3.53 4.47 A = 3.21 3.65 A = 3.47 4.23 and forecasting output of the proposed model are almost matched in all PV plants for the data-sets of N = 15.97 3.53 N = 15.64 3.32 N = 14.81 3.45 N = 15.47 both months. 3.43 nrmse The(%) detail of the error 16.22 A calculation 3.83 = 17.25 4.58 A result = 16.97 is shown 4.47 A = 16.68 in Table 3.65 A 4 = for 16.97 the month of January and 4.23 Table 5 for the month of September. N = 15.97 N = 15.64 N = 14.81 N = 15.47 16.22 A = 17.25 A = 16.97 A = 16.68 A = 16.97

Energies 2017, 10, 876 12 of 17 Table 3. Summary of the forecasting performance for the data of May 2016. Metrics Models Plant-1 Plant-2 Plant-3 Mean Average nrmse (%) MAE (W) MBE (W) N = 2.77 N = 2.56 N = 2.58 N = 2.64 A = 3.66 A = 3.53 A = 3.21 A = 3.47 N = 3.53 N = 3.32 N = 3.45 N = 3.43 A = 4.58 A = 4.47 A = 3.65 A = 4.23 N = 15.97 N = 15.64 N = 14.81 N = 15.47 A = 17.25 A = 16.97 A = 16.68 A = 16.97 N = 17.12 N = 24.87 N = 46.99 N = 29.66 A = 23.81 A = 34.68 A = 46.65 A = 35.05 N = 25.51 N = 40.74 N = 66.62 N = 44.29 A = 27.94 A = 46.48 A = 58.27 A = 44.23 N = 72.40 N = 120.43 N = 170.47 N = 121.10 A = 64.04 A = 106.50 A = 158.30 A = 109.61 N = 6.91 N = 23.38 N = 6.56 N = 12.28 A = 13.86 A = 16.59 A = 29.35 A = 19.93 N = 14.89 N = 23.79 N = 34.77 N = 24.48 A = 3.95 A = 4.68 A = 4.41 A = 4.35 N = 23.93 N = 44.13 N = 57.38 N = 41.81 A = 45.56 A = 73.30 A = 118.05 A = 78.97 3.06 3.83 16.22 32.36 44.26 115.36 Hence, N means normal day and A means abnormal day. nrmse means normalized root-mean-square error. MAE means mean absolute error. MBE means mean bias error. 16.11 14.42 60.39 Table 4. Summary of the forecasting performance for the data of January 2016. Metrics Models Plant-1 Plant-2 Plant-3 Mean Average nrmse (%) MAE (W) MBE (W) N = 2.78 N = 2.27 N = 2.96 N = 2.67 A = 3.96 A = 3.46 A = 3.30 A = 3.57 3.12 N = 4.16 N = 3.24 N = 3.17 N = 3.52 A = 4.27 A = 4.11 A = 4.20 A = 4.19 3.86 N = 12.99 N = 13.34 N = 13.61 N = 13.31 A = 13.32 A = 12.49 A = 12.41 A = 12.74 13.03 N = 19.71 N = 23.41 N = 36.93 N = 26.68 A = 24.57 A = 36.49 A = 60.66 A = 40.57 33.63 N = 28.90 N = 39.42 N = 67.48 N = 45.27 A = 32.36 A = 50.37 A = 85.57 A = 56.10 50.69 N = 68.48 N = 108.33 N = 170.47 N = 115.76 A = 60.89 A = 96.21 A = 150.60 A = 102.57 109.17 N = 10.62 N = 5.65 N = 11.98 N = 9.42 A = 10.43 A = 2.32 A = 7.45 A = 6.73 8.08 N = 9.62 N = 6.58 N = 14.62 N = 10.27 A = 20.79 A = 8.93 A = 14.58 A = 14.77 12.52 N = 28.99 N = 49.81 N = 76.00 N = 51.60 A = 25.47 A = 37.49 A = 57.52 A = 40.16 45.88

Energies 2017, 10, 876 13 of 17 Table 5. Summary of the forecasting performance for the data of September 2016. Metrics Models Plant-1 Plant-2 Plant-3 Mean Average nrmse (%) MAE (W) MBE (W) N = 2.85 N = 2.67 N = 2.72 N = 2.75 A = 3.71 A = 3.36 A = 3.04 A = 3.37 3.06 N = 3.31 N = 3.03 N = 3.11 N = 3.15 A = 5.02 A = 4.73 A = 4.16 A = 4.64 3.90 N = 10.80 N = 11.47 N = 11.42 N = 11.23 A = 8.95 A = 10.16 A = 9.88 A = 9.66 10.45 N = 22.02 N = 34.64 N = 53.82 N = 36.83 A = 23.24 A = 40.70 A = 51.89 A = 38.61 37.72 N = 26.39 N = 41.45 N = 65.14 N = 44.33 A = 35.68 A = 60.63 A = 79.95 A = 58.75 51.54 N = 44.27 N = 75.78 N = 113.62 N = 77.89 A = 40.94 A = 77.90 A = 117.00 A = 78.61 78.25 N = 2.40 N = 11.92 N = 21.83 N = 12.05 A = 8.43 A = 9.61 A = 4.71 A = 7.58 9.82 N = 11.81 N = 23.67 N = 27.10 N = 20.86 A = 3.85 A = 3.47 A = 12.93 A = 6.75 13.81 N = 25.31 N = 48.19 N = 75.27 N = 49.59 A = 4.87 A = 20.42 A = 32.13 A = 19.14 34.37 The performances of the proposed model, the model, and the persistence model were calculated by averaging the performance values for the three-month period (Table 6). This table shows the average nrmse, MAE, and MBE result for the different models. The overall average result for the different models and those for other -based forecasting models [33] are also presented in Table 6. Table 6. Summary of overall forecasting performance. Metrics nrmse (%) MAE (W) MBE (W) Methods Average (January) Average (May) Average (September) 3.12 3.06 3.06 3.86 3.83 3.90 13.03 16.22 10.45 33.63 32.36 37.72 50.69 44.26 51.54 109.17 115.36 78.25 8.08 16.11 9.82 12.52 14.42 13.81 45.88 60.39 34.37 Average ( Model) Average ( Model) Average of Model Other Models [33] 3.08 3.86 13.23 5.95 34.57 48.83 100.93-11.34 13.58 46.88 - Based on the figures, tables, and discussions presented in this paper, the proposed generalized SVR-based model performed very well for PV power generation forecasting in different weather conditions. The results also showed that the model could forecast the PV power generation accurately in various seasonalities of Malaysian weather conditions. The errors obtained for all of the plants using the proposed model are very similar for normal and abnormal weather conditions. In the proposed model, the errors computed in normal weather conditions are always slightly lower compared with those for the abnormal weather conditions. The reason for this situation is that the model can be fitted well in normal weather conditions because of less variation in sample data. Hence, the forecasted results are nearly the same in various months because of the absence of drastic changes in the weather conditions of Malaysia. The average forecasting results of the proposed model were 3.08% in nrmse, 34.57 W in MAE, and 11.34 W in MBE, which were better compared with the model and the persistence model. The proposed model also outperformed other -based forecasting models [33].

Energies 2017, 10, 876 14 of 17 Energies 2017, 10, 876 14 of 17 situation. Therefore, the proposed model can forecast PV power generation accurately in any weather condition. Figure 8 shows the actual measured PV power output and the forecasted PV power output of the proposed model. In this case, 14 and 15 January were set as forecast days. As shown in Figure 1a and based on the PV power output pattern of plant-1, these two days have cloudy or rainy weather conditions (abnormal day), which suggest comparatively low electrical energy. The shape of the second part of this figure is quite different from the other figures because some hours during these days have Energies 2017, 10, 876 14 of 17 clear skies while the rest are cloudy or rainy. However, the highest peak of the first part of the figure is comparatively low due to the existence of some clouds that day. Nonetheless, the forecasting PV power situation. Therefore, the proposed model can forecast PV power generation accurately in any weather of the proposed model almost matched the actual measured PV power in any situation. Therefore, condition. the proposed model can forecast PV power generation accurately in any weather condition. Figure 8. PV power output of plant-1 in abnormal weather conditions (January 2016). Figure 9a shows the deviation of the actual measured PV power output and the PV power output of the proposed forecasting model of plant-2 in different weather conditions. The deviation of the measured and forecasted PV power generation at each particular point was not higher than 10%, a percentage in the allowable range. The figure also shows that deviations in different weather conditions do not change drastically. The deviations for all plants were then evaluated on the basis of all datasets. In all cases, the results were almost similar to the deviation. Figure Figure 8. 8. PV PV power power output output of of plant-1 plant-1 in in abnormal abnormal weather weather conditions conditions (January (January 2016). For the proposed model, Figure 9b shows the correlation of the actual measured 2016). PV power output and the forecasted PV power of plant-2 in different weather conditions. The correlation coefficient Figure of 9a 9a shows = 0.988 the in deviation normal weather of actual conditions measured and PV PV power power = 0.9746 output output in and abnormal and the PV thepower PV weather power output conditions output of the of proposed explains the proposed forecasting the intensity forecasting model of the model of correlation; plant-2 of plant-2 in different that in different is, the weather correlation weather conditions. conditions. between The the deviation The measured deviation of the power ofmeasured the and measured the and forecasted forecasted and forecasted power PV power PVis power very generation good. generation The each at positive each particular particular sign point of this point was value not washigher not expresses higher than 10%, the than a proportional 10%, percentage a percentage relationship in the in the allowable between range. the range. two The The powers. figure Based also shows on this thatanalysis, deviations the in proposed different weather model performs conditions very do do well. not not change drastically. The The deviations for for all plants all plants were were then then evaluated evaluated on the on basis the basis of allof datasets. all datasets. In all In cases, all cases, the results the results were were almost almost similar similar to the to deviation. the deviation. For the proposed model, Figure 9b shows the correlation of the actual measured PV power output and the forecasted PV power of plant-2 in different weather conditions. The correlation coefficient of = 0.988 in normal weather conditions and = 0.9746 in abnormal weather conditions explains the intensity of the correlation; that is, the correlation between the measured power and the forecasted power is very good. The positive sign of this value expresses the proportional relationship between the two powers. Based on this analysis, the proposed model performs very well. (a) (b) Figure9. 9. (a) Point-wise deviation of of PV PV plant-2 plant-2 in different in different weather weather conditions; conditions; (b) Correlation (b) Correlation between between measured measured and forecasted and forecasted PV power. PV power. 5. Conclusions For the proposed model, Figure 9b shows the correlation of the actual measured PV power output andaccurate the forecasted forecasting PV power of PV of power plant-2 generation in different has weather become conditions. a key point The in the correlation reliable operation coefficient of of electrical R 2 = 0.988 grids in normal due to weather related conditions rapid connection and R 2 = 0.9746 of PV insystems abnormal to weather grid. conditions In this study, explains a generalized the intensity SVR-based of the correlation; forecasting (a) thatmodel is, thewas correlation proposed between to accurately the measured forecast power (b) PV power and the generation forecasted in different weather conditions. The proposed model was applied to three different PV power Figure 9. (a) Point-wise deviation of PV plant-2 in different weather conditions; (b) Correlation between measured and forecasted PV power. 5. Conclusions Accurate forecasting of PV power generation has become a key point in the reliable operation of

Energies 2017, 10, 876 15 of 17 power is very good. The positive sign of this value expresses the proportional relationship between the two powers. Based on this analysis, the proposed model performs very well. 5. Conclusions Accurate forecasting of PV power generation has become a key point in the reliable operation of electrical grids due to the related rapid connection of PV systems to the grid. In this study, a generalized SVR-based forecasting model was proposed to accurately forecast PV power generation in different weather conditions. The proposed model was applied to three different PV power stations, and then its performance was analyzed. The generalized model could forecast PV power generation in both normal days (clear-sky) and abnormal days (cloudy or rainy days) with nearly the same accuracy. The proposed model could also accurately forecast PV power generation in various seasonalities and different abnormal weather conditions. Deviations in the actual measured and forecasted PV powers at each particular point were in an acceptable range (i.e., not greater than 10%), and no drastic fluctuations at any point were observed. The correlation between the actual measured PV power and the forecasted PV power for the proposed model was also very good. Subsequently, this simple model could significantly reduce computational complexity yet increase model accuracy. The model was experimentally validated by deploying it to three different types of PV plants in the same location. The average forecasting errors of the model were 3.08% in nrmse, 34.57 W in MAE, and 11.34 W in MBE. Finally, the proposed model outperformed the model, the persistence model, and other conventional models. Acknowledgments: This work was supported by the Fundamental Research Grant Scheme (FRGS), Project No: FP061-2016. Author Contributions: Utpal Kumar Das, Kok Soon Tey and Mehdi Seyedmahmoudian conceived the theoretical approaches and contributed in designing and developing the proposed forecasting method. Mohd Yamani Idna Idris, Saad Mekhilef, Ben Horan and Alex Stojcevski contributed in analyzing the simulation results and provided constructive inputs in order to develop the proposed method and implement the simulation model. In terms of writing the paper all authors contributed jointly to preparing this manuscript and all have read and approved the manuscript. Conflicts of Interest: The authors declare no conflict of interest. References 1. Bizzarri, F.; Bongiorno, M.; Brambilla, A.; Gruosso, G.; Gajani, G.S. Model of photovoltaic power plants for performance analysis and production forecast. IEEE Trans. Sustain. Energy 2013, 4, 278 285. [CrossRef] 2. Byrne, J.; Kurdgelashvili, L. The role of policy in PV industry growth: Past, present and future. In Handbook of Photovoltaic Science and Engineering, 2nd ed.; John Wiley & Sons Ltd.: New York, NY, USA, 2011. 3. Roselund, C. The Latest Report by National Renewable Energy Laboratory. Available online: http://www. nrel.gov/ (accessed on 28 June 2017). 4. Strzalka, A.; Alam, N.; Duminil, E.; Coors, V.; Eicker, U. Large scale integration of photovoltaics in cities. Appl. Energy 2012, 93, 413 421. [CrossRef] 5. Woyte, A.; Van Thong, V.; Belmans, R.; Nijs, J. Voltage fluctuations on distribution level introduced by photovoltaic systems. IEEE Trans. Energy Conv. 2006, 21, 202 209. [CrossRef] 6. Shi, J.; Lee, W.J.; Liu, Y.Q.; Yang, Y.P.; Wang, P. Forecasting power output of photovoltaic systems based on weather classification and support vector machines. IEEE Trans. Ind. Appl. 2012, 48, 1064 1069. [CrossRef] 7. Wang, F.; Mi, Z.Q.; Su, S.; Zhao, H.S. Short-term solar irradiance forecasting model based on artificial neural network using statistical feature parameters. Energies 2012, 5, 1355 1370. [CrossRef] 8. Jang, H.S.; Bae, K.Y.; Park, H.-S.; Sung, D.K. Solar power prediction based on satellite images and support vector machine. IEEE Trans. Sustain. Energy 2016, 7, 1255 1263. [CrossRef] 9. Mellit, A.; Pavan, A.M. A 24-h forecast of solar irradiance using artificial neural network: Application for performance prediction of a grid-connected PV plant at Trieste, Italy. Sol. Energy 2010, 84, 807 821. [CrossRef] 10. Hocaoglu, F.O.; Gerek, O.N.; Kurban, M. Hourly solar radiation forecasting using optimal coefficient 2-d linear filters and feed-forward neural networks. Sol. Energy 2008, 82, 714 726. [CrossRef]

Energies 2017, 10, 876 16 of 17 11. Yona, A.; Senjyu, T.; Funabashi, T.; Kim, C.-H. Determination method of insolation prediction with fuzzy and applying neural network for long-term ahead PV power output correction. IEEE Trans. Sustain. Energy 2013, 4, 527 533. [CrossRef] 12. Kudo, M.; Takeuchi, A.; Nozaki, Y.; Endo, H.; Sumita, J. Forecasting electric power generation in a photovoltaic power system for an energy network. Electr. Eng. Jpn. 2009, 167, 16 23. [CrossRef] 13. Yang, X.; Ren, J.; Yue, H. Photovoltaic power forecasting with a rough set combination method. In Proceedings of the 2016 UKACC 11th International Conference on Control (CONTROL), Belfast, UK, 31 August 2 September 2016; pp. 1 6. 14. Perez, R.; Kivalov, S.; Schlemmer, J.; Hemker, K.; Renne, D.; Hoff, T.E. Validation of short and medium term operational solar radiation forecasts in the US. Sol. Energy 2010, 84, 2161 2172. [CrossRef] 15. Hassanzadeh, M.; Etezadi-Amoli, M.; Fadali, M.S. Practical approach for sub-hourly and hourly prediction of PV power output. In Proceedings of the North American Power Symposium 2010, Arlington, TX, USA, 26 28 September 2010; pp. 1 5. 16. Box, G.E.P.; Jenkins, G.M. Time Series Analysis: Forecasting and Control; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2015. 17. Yang, C.; Thatte, A.A.; Xie, L. Multitime-scale data-driven spatio-temporal forecast of photovoltaic generation. IEEE Trans. Sustain. Energy 2015, 6, 104 112. [CrossRef] 18. Diagne, M.; David, M.; Lauret, P.; Boland, J.; Schmutz, N. Review of solar irradiance forecasting methods and a proposition for small-scale insular grids. Renew. Sustain. Energy Rev. 2013, 27, 65 76. [CrossRef] 19. Wan Ahmad, W.K.A.; Ahmad, S.; Ishak, A.; Hashim, I.; Ismail, E.S.; Nazar, R. Arima model and exponential smoothing method: A comparison. AIP Conf. Proc. 2013, 1522, 1312 1321. 20. Sansa, I.; Missaoui, S.; Boussada, Z.; Bellaaj, N.M.; Ahmed, E.M.; Orabi, M. PV power forecasting using different artificial neural networks strategies. In Proceedings of the 2014 International Conference on Green Energy, Sfax, Tunisia, 25 27 March 2014; pp. 54 59. 21. Oudjana, S.H.; Hellal, A.; Mahamed, I.H. Short term photovoltaic power generation forecasting using neural network. In Proceedings of the 2012 11th International Conference on Environment and Electrical Engineering, Venice, Italy, 18 25 May 2012; pp. 706 711. 22. Cococcioni, M.; Andrea, E.D.; Lazzerini, B. 24-hour-ahead forecasting of energy production in solar PV systems. In Proceedings of the 2011 11th International Conference on Intelligent Systems Design and Applications, Cordoba, Spain, 22 24 November 2011; pp. 1276 1281. 23. Ting-Chung, Y.; Hsiao-Tse, C. The forecast of the electrical energy generated by photovoltaic systems using neural network method. In Proceedings of the 2011 International Conference on Electric Information and Control Engineering, Wuhan, China, 15 17 April 2011; pp. 2758 2761. 24. Atsushi, Y.; Tomonobu, S.; Saber, A.Y.; Toshihisa, F.; Hideomi, S.; Kim, C.H. Application of neural network to 24-hour-ahead generating power forecasting for PV system. In Proceedings of the 2008 IEEE Power and Energy Society General Meeting Conversion and Delivery of Electrical Energy in the 21st Century, Chicago, IL, USA, 20 24 July 2008; pp. 1 6. 25. Smolensky, P.; Mozer, M.C.; Rumelhart, D.E. Mathematical Perspectives on Neural Networks; Psychology Press: Hove, UK, 2013. 26. Liu, J.; Fang, W.; Zhang, X.; Yang, C. An improved photovoltaic power forecasting model with the assistance of aerosol index data. IEEE Trans. Sustain. Energy 2015, 6, 434 442. [CrossRef] 27. Kang, M.C.; Sohn, J.M.; Park, J.Y.; Lee, S.K.; Yoon, Y.T. Development of algorithm for day ahead PV generation forecasting using data mining method. In Proceedings of the 2011 IEEE 54th international midwest symposium on circuits and systems, Seoul, Korea, 7 10 August 2011; pp. 1 4. 28. Chen, C.S.; Duan, S.X.; Cai, T.; Liu, B.Y. Online 24-h solar power forecasting based on weather type classification using artificial neural network. Sol. Energy 2011, 85, 2856 2870. [CrossRef] 29. Yang, H.-T.; Huang, C.-M.; Huang, Y.-C.; Pai, Y.-S. A weather-based hybrid method for 1-day ahead hourly forecasting of PV power output. IEEE Trans. Sustain. Energy 2014, 5, 917 926. [CrossRef] 30. Changsong, C.; Shanxu, D.; Jinjun, Y. Design of photovoltaic array power forecasting model based on neutral network. Trans. China Electrotech. Soc. 2009, 24, 153 158. 31. Muller, K.R.; Smola, A.J.; Rätsch, G.; Schölkopf, B.; Kohlmorgen, J.; Vapnik, V. Predicting Time Series with Support Vector Machines. Available online: http://www.svms.org/regularization/msrs97a.pdf (accessed on 28 June 2017).

Energies 2017, 10, 876 17 of 17 32. Chang, C.C.; Lin, C.J. Libsvm: A Library for Support Vector Machines. Available online: http://www.csie. ntu.edu.tw/~cjlin/papers/libsvm.pdf (accessed on 28 June 2017). 33. Yan, X.Y.; Francois, B.; Abbes, D. Operating power reserve quantification through PV generation uncertainty analysis of a microgrid. In Proceedings of the 2015 IEEE Eindhoven PowerTech, Eindhoven, The Netherlands, 29 June 2 July 2015; pp. 1 6. 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).