Thunderstorm Forecasting by using Artificial Neural Network

Similar documents
Three Structures of a Multilayer Artificial Neural Network for Predicting the Solar Radiation of Baghdad City- Iraq

ESTIMATION OF HOURLY MEAN AMBIENT TEMPERATURES WITH ARTIFICIAL NEURAL NETWORKS 1. INTRODUCTION

Name Class Date. 3. In what part of the water cycle do clouds form? a. precipitation b. evaporation c. condensation d. runoff

A Feature Based Neural Network Model for Weather Forecasting

SCI-4 Mil-Brock-Weather Exam not valid for Paper Pencil Test Sessions

Using an Artificial Neural Network to Predict Parameters for Frost Deposition on Iowa Bridgeways

Prediction of Monthly Rainfall of Nainital Region using Artificial Neural Network (ANN) and Support Vector Machine (SVM)

Temp 54 Dew Point 41 Relative Humidity 63%

Module 11: Meteorology Topic 5 Content: Weather Maps Notes

Weather Unit Part 2: Meteorology

Forecasting River Flow in the USA: A Comparison between Auto-Regression and Neural Network Non-Parametric Models

III. Section 3.3 Vertical air motion can cause severe storms

Short-term wind forecasting using artificial neural networks (ANNs)

FORMATION OF AIR MASSES

Name: Date: Part I Weather Tools Match Directions: Correctly identify each weather tool by writing the name on the line.

Optimum Neural Network Architecture for Precipitation Prediction of Myanmar

Mr. P s Science Test!

Holdout and Cross-Validation Methods Overfitting Avoidance

A Hybrid ARIMA and Neural Network Model to Forecast Particulate. Matter Concentration in Changsha, China

Application of Artificial Neural Network for Short Term Load Forecasting

The most abundant gas in the atmosphere by volume is. This gas comprises 78% of the Earth atmosphere by volume.

ANN and Statistical Theory Based Forecasting and Analysis of Power System Variables

A Hybrid Model of Wavelet and Neural Network for Short Term Load Forecasting

Thunderstorm Downburst Prediction: An Integrated Remote Sensing Approach. Ken Pryor Center for Satellite Applications and Research (NOAA/NESDIS)

Use the terms from the following list to complete the sentences below. Each term may be used only once.

Chapter Introduction. Weather. Patterns. Forecasts Chapter Wrap-Up

Evaporation - Water evaporates (changes from a liquid to a gas) into water vapor due to heat from the Sun.

DESIGN AND DEVELOPMENT OF ARTIFICIAL INTELLIGENCE SYSTEM FOR WEATHER FORECASTING USING SOFT COMPUTING TECHNIQUES

A comparative study of ANN and angstrom Prescott model in the context of solar radiation analysis

Copyright 2015 Edmentum All rights reserved.

SOIL MOISTURE MODELING USING ARTIFICIAL NEURAL NETWORKS

A Comparative Study Review of Soft Computing Approach in Weather Forecasting

Unit 5 Lesson 3 How is Weather Predicted? Copyright Houghton Mifflin Harcourt Publishing Company

Guided Notes Weather. Part 2: Meteorology Air Masses Fronts Weather Maps Storms Storm Preparation

Wind Power Forecasting using Artificial Neural Networks

Weather Maps. Name:& & &&&&&Advisory:& & 1.! A&weather&map&is:& & & & 2.! Weather&fronts&are:& & & & & &

Foundations of Earth Science, 6e Lutgens, Tarbuck, & Tasa

Chapter 14 Thunderstorm Fundamentals

Unit 5 Part 2 Test PPT

National Maritime Center

May 17, earthsciencechapter24.notebook. Apr 8 10:54 AM Review. Grade:9th. Subject:Earth Science. Date:4/8.

Thunderstorm: a cumulonimbus cloud or collection of cumulonimbus clouds featuring vigorous updrafts, precipitation and lightning

Rainfall Prediction using Back-Propagation Feed Forward Network

Weather Studies Introduction to Atmospheric Science

ATS 351, Spring 2010 Lab #11 Severe Weather 54 points

Forecasting of Rain Fall in Mirzapur District, Uttar Pradesh, India Using Feed-Forward Artificial Neural Network

Moisture, Clouds, and Precipitation: Clouds and Precipitation. Dr. Michael J Passow

Reading. What meteorological conditions precede a thunderstorm? Thunderstorms: ordinary or single cell storms, multicell storms, supercell storms

25.1 Air Masses. Section 25.1 Objectives

Retrieval of Cloud Top Pressure

Weather Patterns and Severe Storms

Pilots watch the clouds, because clouds can indicate the kind of weather in store for a flight.

Clouds. What they tell us about the weather

Journal of of Computer Applications Research Research and Development and Development (JCARD), ISSN (Print), ISSN

Short Term Load Forecasting Using Multi Layer Perceptron

Aviation Hazards: Thunderstorms and Deep Convection

Key Concept Weather results from the movement of air masses that differ in temperature and humidity.

The Potential of Artificial Neural Network Technique in Daily and Monthly Ambient Air Temperature Prediction

Ch. 3: Weather Patterns

Chapter 21. Weather Patterns and Severe Storms

Application of Text Mining for Faster Weather Forecasting

An ANN based Rotor Flux Estimator for Vector Controlled Induction Motor Drive

a. Air is more dense b. Associated with cold air (more dense than warm air) c. Associated with sinking air

Chapter 12 Section 12.1 The causes of weather

Created by Mrs. Susan Dennison

Feature Selection Optimization Solar Insolation Prediction Using Artificial Neural Network: Perspective Bangladesh

1 What Is Climate? TAKE A LOOK 2. Explain Why do areas near the equator tend to have high temperatures?

Earth/Space Systems and Cycles (SOL 4.6)

ARTIFICIAL NEURAL NETWORK PART I HANIEH BORHANAZAD

Local Prediction of Precipitation Based on Neural Network

Artificial Neural Network

Thunderstorm: a cumulonimbus cloud or collection of cumulonimbus clouds featuring vigorous updrafts, precipitation and lightning

WEATHER. rain. thunder. The explosive sound of air as it is heated by lightning.

Local Precipitation Variability

Observing Weather: Making the Invisible Visible. Dr. Michael J. Passow

Weather Systems. The air around high-pressure weather systems tends to swirl in a clockwise direction, and usually brings clear skies.

Section 13-1: Thunderstorms

3 Severe Weather. Critical Thinking

Unit 4 Review Guide: Weather

Meteorology. I. The Atmosphere - the thin envelope of gas that surrounds the earth.

Guided Reading Chapter 18: Weather Patterns

Predicting Floods in North Central Province of Sri Lanka using Machine Learning and Data Mining Methods

Weather - is the state of the atmosphere at a specific time & place

Malaysian Journal of Civil Engineering 21(1) : 1-16 (2009) Corresponding Author :

Weather. Chapter Test A. Multiple Choice. Write the letter of the correct answer on the line at the left.

4 Forecasting Weather

Journal of Urban and Environmental Engineering, v.3, n.1 (2009) 1 6 ISSN doi: /juee.2009.v3n

Weather: Air Patterns

Modelling and Prediction of 150KW PV Array System in Northern India using Artificial Neural Network

1. Introduction. 2. Artificial Neural Networks and Fuzzy Time Series

Day Ahead Hourly Load and Price Forecast in ISO New England Market using ANN

Three things necessary for weather are Heat, Air, Moisture (HAM) Weather takes place in the Troposphere (The lower part of the atmosphere).

Weather Permitting/Meteorology. North Carolina Science Olympiad Coaches Clinic October 6, 2018 Michelle Hafey

Application of an artificial neural network to typhoon rainfall forecasting

HAZARDOUS WEATHER 1. Dr. Julie Laity Geography 266

Data and prognosis for renewable energy

DEPARTMENT OF GEOSCIENCES SAN FRANCISCO STATE UNIVERSITY. Metr Fall 2012 Test #1 200 pts. Part I. Surface Chart Interpretation.

EE-588 ADVANCED TOPICS IN NEURAL NETWORK

Implementation of Artificial Neural Network for Short Term Load Forecasting

Forecasting Weather: Air Masses and Fronts

Transcription:

Thunderstorm Forecasting by using Artificial Neural Network N.F Nik Ismail, D. Johari, A.F Ali, Faculty of Electrical Engineering Universiti Teknologi MARA 40450 Shah Alam Malaysia nikfasdi@yahoo.com.my Abstract: - Thunderstorm is a form of weather characteristic containing strong wind, lightning, heavy rain and sometimes snow or hail. It can be associated with a cloud type of cumulonimbus. Depending on its type, thunderstorm has great potential to produce serious damage to human life and property. Therefore, there are many sophisticated instrument used to record weather data such as Doppler radar and satellite. By using these data, based on the statistical, mathematical or soft computing technique can be done in order to predict the occurrence the weather characteristic. This project presents the application of artificial neural network (ANN) in forecasting the thunderstorm occurrence in Shah Alam based on the meteorological data. Therefore, several three-layer feed-forward back-propagation ANNs were developed in Matlab and each network was evaluated by using cross-validation technique. A network with the best performance in terms of its R-value was selected as the best design. Thus, the forecasting of thunderstorm occurrence can be successfully done. Key-Words: - artificial neural network, cross validation, thunderstorm forecasting 1 Introduction Thunderstorm is one of the global phenomena that can occur anywhere in the world at anytime. It is also known as electrical storm, lightning storm or hailstorm. This storm is a form of weather characteristic containing strong wind, lightning, heavy rain and sometimes snow or hail [1]. Although thunderstorm is generally very short-lived phenomena, it has great potential to produce serious damage to human life and property such as lightning, damaging straight-line wind, large sized hail, heavy precipitation and flooding [2]. Basically, thunderstorm can be associated with the cloud called cumulonimbus cloud. Cumulonimbus cloud can be categorized into family D of cloud which developed in vertical way. It can be recognized by very tall and large cloud appearance. This cloud usually formed from cumulus cloud which grows vertically instead of horizontally [3]. Therefore, some of the parameter that produces this cloud is used as the input for neural network based model. Artificial neural network (ANN) is a branch of artificial intelligence (AI). It is a mathematical model or computational model that tries to simulate the structure or functional inspired by biological neural network [4]. It consists of an interconnected group of artificial neurons and processes information using connectionist approach to computation. In most cases, ANN is an adaptive system that changes its structure based on external or internal information that flows through the network during the learning phase. Therefore, ANN is suitable for prediction and forecasting tasks. ANN has several advantages. As stated in [5], the advantages of the ANN are performing task that is a linear program cannot and does not need to be reprogrammed. Besides, ANN can be implemented in any application without any problem. But, ANN also comes with several disadvantages such as it needs training to operate and require high processing time for large network. It also needs to be emulated due to different architecture with microprocessor. ANN has several advantages. As stated in [5], the advantages of the ANN are performing task that is a linear program cannot and does not need to be reprogrammed. Besides, ANN can be implemented in any application without any problem. But, ANN also comes with several disadvantages such as it needs training to operate and require high processing time for large network. It also needs to be emulated due to different architecture with microprocessor. ISSN: 1792-5088 143 ISBN: 978-960-474-233-2

2 Problem Formulation The research design for developing the forecasting of thunderstorm occurrence consists of two stages. They are data collection and development of the ANN. 2.1 Data Collection There are two types of data be used in designing the ANN, the meteorological data which acts as the input and thunderstorm occurrence as the target output. These data is taken at city of Shah Alam which cover the whole month of April because of highest number of thunderstorm occurs in this month for every year. Besides, the data used is hourly data which be collected from Malaysian Meteorological Services (MMS). Meteorological parameters such as pressure, moisture difference, wind and type of cloud were chosen as they play important role in shaping the occurrence of thunderstorm. From Figure 1, ANN is a network that formed in three layers. The first layer is the input layer where the selected parameter comes into ANN and followed by hidden layer and output later which the output is produced. The input for ANN is the meteorological data while the output is the thunderstorm occurrence. Each layer consists of one or more nodes, represented in this diagram by the small circles. The lines between the nodes indicate the flow of information from one node to the next. 2.1.1 Pressure Pressure is one the important parameters that determine the weather condition. This is due to variation of pressure for certain time will lead to wind. Therefore, the pressure measurement is needed to predict the atmosphere phenomena. 2.1.2 Moisture Different Moisture different is the estimation of the content of the water of the atmosphere. That s mean the different between dry bulb temperature (T) and dew-point temperature (T d ) is actually the moisture different. Furthermore, the moisture different is inversely proportional with relative humidity which mean when moisture different is high, the relative humidity is low and vice-versa. Moisture different is important because it transfers the heat from lower level to higher level. Therefore, dry bulb temperature, dew point temperature and relative humidity must be considered as an input for ANN model. 2.1.3 Wind Wind has the ability to impact surface transportation. This ability is one of the factors that cause various hazardous weather conditions Therefore, the measurement of speed of wind must be considered in order to forecasting the thunderstorm occurrence. 2.1.4 Type of Cloud Cloud type is use to determine either associated with thunderstorm or not. Cumulonimbus cloud is the one that associated with the thunderstorm. Therefore, type of cloud is important to be considering as input for ANN because it shows the existence of thunderstorm. 2.2 Development of the ANN Fig 1: Schematic Diagram of ANN In order to determine the best network, several feedforward back-propagation of ANN designs were constructed and each network was evaluated by using cross validation technique. Cross validation is a model evaluation method that estimates the generalization error based on re-sampling. 2.2.1 Training Process In order to determine the best network, each network must be trained and tested to estimate its performance configuration. The best result in terms of its R-value is selected as the best ANN model. This could be done by using cross validation technique which estimates the generalization error based on re-sampling. It involves partitioning a sample of data into complementary subset where analysis is performed on one subset (training set) and validating the analysis on the other subset (evaluation set). The cross validation technique can be divided into two types. They are the holdout cross validation and K-fold cross validation. For holdout technique, available data is split into two sets called training set and testing set respectively. The testing set is held out and not been ISSN: 1792-5088 144 ISBN: 978-960-474-233-2

looked during training process. That means, holdout technique use the training set only. Then, the training set is subdivided into training set and evaluation set. The training set is used to train the ANN model while evaluation set is used to evaluate the ANN model by estimating its mean absolute error (MSE). This process is repeated for each ANN model. The advantage of this technique is that it avoids the overlap between training and testing data and takes shorter time to compute the ANN model. developed ANN model receives the input data and target output data to produce the network output. Start Load Training Data Normalize Data Design ANN Fig 2: Partitionig of Data for K=4 In K-fold cross validation, the training data is partitioned into K subset. Of the K subset, a single subset is retained as the evaluation set while the remaining subsets are used as training set as shown in Figure 2. The training set is used to train the ANN model while evaluation set is used to evaluate the ANN model by estimating its mean absolute error (MSE). This process is then repeated K times with each of the K subset used once as the evaluation set. Then, the averaged of the each K is been obtained to produce a single estimate for ANN model. This process then is done for each ANN model in order to determine the best network. 2.2.2 Testing Process Testing process is the process that measure the performance of the trained model. It can be measured by performing a linear regression analysis between model output and target ouuput. The regression coefficient, R with value close to one shows that there is a strong correlation between both outputs. Then, the best model is trained once again by using the whole training data and testing data. It is suppose, this best model should be able to forecast the thunderstotm occurrence by measuring its R-value. Figure 3 shows the whole process of development of the ANN in flowchart. No Check Testing R-value Set Network Parameter Train the ANN R close to 1 Yes Save the Result Load Testing Data Testing Process R close to 1 No 3 Problem Solution The data were selected from meteorological based on city of Shah Alam, Malaysia which cover the whole month of April. There were 720 patterns selected, 480 were utilized for training the model and 240 were employed for the testing process. The meteorological will act as the input data while the thunderstorm occurrence data as the target output. Therefore, the End Yes Fig3: ANN Algorithm Several three-layer feed-forward back-propagation ANN ISSN: 1792-5088 145 ISBN: 978-960-474-233-2

models were developed in Matlab. The network configuration such as number of neuron and transfer function were determined heuristically. The learning rate and momentum constant were chosen as 0.5 and 0.9 respectively. These values were kept constant for each model design at this stage in order to observe the effect of different number of neuron and transfer function. In holdout cross validation, the training data is split into training and evaluation sets. The training set is used to train the ANN models while the evaluation set is used to evaluated the ANN models. For K-fold validation, the training data is split into four equally sets. One of these sets is used as evaluation set while the remaining sets as the training set. Then, the same process as in holdout cross validation are done in order to determine the best ANN model. The comparison between holdout cross validation and K-fold cross validation depend on their R-value. The ANN model with the highest R-value is selected as the best network. This is due to R-value which closed to one shows that there is a strong correlation between target outputs and network outputs while R-value close to zero indicates otherwise. From Table 1, the best R-value for holdout is 0.8349 while for K-fold is 0.8655. Based on these, it can be concluded that the ANN model trained using K-fold cross validation gives better result that holdout cross validation. It can also be seen that the best ANN model for forecasting the thunderstorm occurrence was a [8,5,1] configuration with tansig, logsig and purelin as the transfer function trained using Levernberg Marquardt technique. Figure 4, 5, 6, 7 and 8 show the best result in terms of R-value for holdout cross validation and R-value for each partition for K-fold cross validation respectively. Table 1: Comparison Between Holdout and K-fold Cross Validation No. of Neuron Transfe r Functio n Holdout Validatio n R-value K-fold Validatio n 0.8349 0.7998 0.8314 0.8203 0.8297 0.8202 0.7646 0.8381 0.7430 0.8313 0.7701 0.8294 0.8221 0.8655 0.7458 0.8653 Fig5: The R-value for Partitioning for K=1 Fig4: The Best R-value for Holdout Cross Validation ISSN: 1792-5088 146 ISBN: 978-960-474-233-2

Fig6: The R-value for Partitioning for K=2 Fig7: The R-value for Partitioning for K=3 TABLE 2: Performance of Momentum Constant (mc) Momentum Learning Constant Rate (lr) (mc) R-value 0.1 0.8307 0.2 0.8877 0.3 0.8346 0.4 0.8033 0.9 0.5 0.8377 0.6 0.8433 0.7 0.8033 0.8 0.8868 0.9 0.8441 1.0 0.8776 TABLE 3: Performance of Learning Rate (lr) Momentum Constant (mc) 0.2 Learning Rate (lr) R-Value 0.1 0.7996 0.2 0.8455 0.3 0.8152 0.4 0.8191 0.5 0.8069 0.6 0.8583 0.7 0.8237 0.8 0.8907 0.9 0.8783 1.0 0.8564 Figure 8: The R-value for Partitioning for K=4 After the cross validation technique, the best ANN model is trained using both training and evaluation sets as the training data and evaluated by using the testing data. For this time, the learning rate (lr) and momentum constant (mc) were varied between 0 to 1 in order to improve the performance of the ANN model. Table 2 and 3 show that the R-value varies when the values of the learning rate and momentum constant were varied. The best performance configuration for forecasting the thunderstorm occurrence are 0.8 for learning rate and 0.2 for momentum constant. The R- value obtained by this model was 0.8907. Table 4 shows the best properties for ANN model in order to forecast the thunderstorm occurrence while Figure 9 shows the best R-value for this properties. Table 4: Properties to Developed Network for Thunderstorm Forecasting ANN Properties Properties Network Configuration [8, 5. 1] Transfer Function,, Learning Rate 0.8 Momentum 0.2 Constant Training Levernberg- Technique Marquardt Epochs 100 Regression Coefficeint, R 0.8907 Accuracy 89.07% Training Pattern 480 Testing Pattern 240 ISSN: 1792-5088 147 ISBN: 978-960-474-233-2

Fig 9: The Best of R-Value for ANN Model for Thunderstorm Forecasting 4 Conclusion This project mainly presents an approach to forecast the thunderstorm occurrence based on the meteorological data. Several three-layer feed-forward back-propagation networks were developed by using Matlab. Then, cross validation technique was employed in order to determine the best ANN design by evaluating its mean absolute error. The result shows that the K-fold cross validation produced a better result in terms of the R-value than holdout cross validation. The optimal values of momentum constant and learning rate were found to 0.2 and 0.8 respectively. They were obtained by testing all possible values within the range of 0 to 1. Therefore, the forecasting of thunderstorm occurrence can be successfully done. In future, it is recommended to use large data in order to improve the thunderstorm forecasting. It is either by adding some new parameter such as low pressure area and vorticity for input data or increase the number of data by using data for a year. References: [1] S. Choudry, S. Sitra and H. Chakraborty, A Connectionist Aapproach to Thunderstorm Forecasting. [2] Ed. Richard C. Dorf, Principe, J.C., "Artificial Neural Networks," The Electrical Engineering Handbook, Boca Raton: CRC Press LLC, 2000. [3] J. Schneider and A. W. Moore, A Locally Weighted Learning Tutorial using Vizier 1.0, 1997. [4] D. Johari, T.K.A Rahman, I. Musirin, ANN Model Selection and Performance Evaluation for Lightning Prediction System. [5] S.N. Sivanandam, S. Sumathi and S.N. Deepa, Introduction to Neural Networks using MATLAB 6.0, Feed Forward Network, pp.184-219. Tata McGraw Hill Publishing Company Limited, 2006. ISSN: 1792-5088 148 ISBN: 978-960-474-233-2