Application of Chaos Theory and Genetic Programming in Runoff Time Series

Similar documents
Is correlation dimension a reliable indicator of low-dimensional chaos in short hydrological time series?

Abstract. Introduction. B. Sivakumar Department of Land, Air & Water Resources, University of California, Davis, CA 95616, USA

Predictability and Chaotic Nature of Daily Streamflow

The Effects of Dynamical Noises on the Identification of Chaotic Systems: with Application to Streamflow Processes

We are IntechOpen, the world s leading publisher of Open Access books Built by scientists, for scientists. International authors and editors

Dynamics of Sediment Transport in the Mississippi River Basin: A Temporal Scaling Analysis

Is the streamflow process chaotic?

WHEELSET BEARING VIBRATION ANALYSIS BASED ON NONLINEAR DYNAMICAL METHOD

Monthly River Flow Prediction Using a Nonlinear Prediction Method

SINGAPORE RAINFALL BEHAVIOR: CHAOTIC?

Aggregation and sampling in deterministic chaos: implications for chaos identification in hydrological processes

ESTIMATING THE ATTRACTOR DIMENSION OF THE EQUATORIAL WEATHER SYSTEM M. Leok B.T.

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and

On embedding dimensions and their use to detect deterministic chaos in hydrological processes

ARTICLE IN PRESS. Testing for nonlinearity of streamflow processes at different timescales. Accepted 8 February 2005

Chaotic analysis of daily and weekly rainfall timeseries over Chennai

The Research of Railway Coal Dispatched Volume Prediction Based on Chaos Theory

Employing nonlinear dynamic concepts for catchment classi cation using runo response of catchments

The Behaviour of a Mobile Robot Is Chaotic

Local polynomial method for ensemble forecast of time series

This is the Pre-Published Version.

Stochastic Hydrology. a) Data Mining for Evolution of Association Rules for Droughts and Floods in India using Climate Inputs

Local polynomial method for ensemble forecast of time series

Dynamics of partial discharges involved in electrical tree growth in insulation and its relation with the fractal dimension

Available online at AASRI Procedia 1 (2012 ) AASRI Conference on Computational Intelligence and Bioinformatics

An Investigation on Changes and Prediction of Urmia Lake water Surface Evaporation by Chaos Theory

Application of Physics Model in prediction of the Hellas Euro election results

Journal of Urban and Environmental Engineering, v.3, n.1 (2009) 1 6 ISSN doi: /juee.2009.v3n

Deep bore well water level fluctuations in the Koyna region, India: the presence of a low order dynamical system in a seismically active environment

Investigation of Chaotic Nature of Sunspot Data by Nonlinear Analysis Techniques

Understanding the dynamics of snowpack in Washington State - part II: complexity of

Information Mining for Friction Torque of Rolling Bearing for Space Applications Using Chaotic Theory

Chaos, Complexity, and Inference (36-462)

Nonlinear dynamics, delay times, and embedding windows

Estimating Lyapunov Exponents from Time Series. Gerrit Ansmann

Multifractal Models for Solar Wind Turbulence

Attractor of a Shallow Water Equations Model

Chaotic dynamics in shear-thickening surfactant solutions

EM-algorithm for Training of State-space Models with Application to Time Series Prediction

Neuroevolution methodologies applied to sediment forecasting

Deterministic chaos versus stochasticity in analysis and modeling of point rainfall series

Reconstruction Deconstruction:

Linear genetic programming for time-series modelling of daily flow rate

Quantitative Description of Robot-Environment Interaction Using Chaos Theory 1

Rainfall attractors and predictability

Determination of fractal dimensions of solar radio bursts

Review of existing statistical methods for flood frequency estimation in Greece

Effects of data windows on the methods of surrogate data

Two Decades of Search for Chaos in Brain.

Table (6): Annual precipitation amounts as recorded by stations X and Y. No. X Y

Solute transport in a heterogeneous aquifer: a search for nonlinear deterministic dynamics

The tserieschaos Package

Revista Economica 65:6 (2013)

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.

Majid Sodagar, 1 Patrick Chang, 1 Edward Coyler, 1 and John Parke 1 School of Physics, Georgia Institute of Technology, Atlanta, Georgia 30332, USA

Does the transition of the interval in perceptional alternation have a chaotic rhythm?

DOMINANT DISCHARGE IN THE KOR RIVER, FARS PROVINCE, IRAN

Predictive uncertainty of chaotic daily streamflow using ensemble wavelet networks approach

SUPPLEMENTARY INFORMATION

Nonlinear Analysis of Experimental Noisy Time Series in Fluidized Bed Systems

DISCUSSION of Evidence of chaos in the rainfall runoff process * Which chaos in the rainfall runoff process?

Appendix D. Model Setup, Calibration, and Validation

Calibration of a Rainfall Runoff Model to Estimate Monthly Stream Flow in an Ungauged Catchment

TABLE OF CONTENTS. 3.1 Synoptic Patterns Precipitation and Topography Precipitation Regionalization... 11

River Flow Forecasting with ANN

Calculating the suspended sediment load of the Dez River

Research Article Multivariate Local Polynomial Regression with Application to Shenzhen Component Index

GIS in Water Resources Midterm Exam Fall 2008 There are 4 questions on this exam. Please do all 4.

Climate variability and its effects on regional hydrology: a case study for the Baltic Sea drainage basin

GIS in Water Resources Midterm Quiz Fall There are 5 questions on this exam. Please do all 5. They are of equal credit.

The relationship between catchment characteristics and the parameters of a conceptual runoff model: a study in the south of Sweden

A combination of neural networks and hydrodynamic models for river flow prediction

Research Article Symplectic Principal Component Analysis: A New Method for Time Series Analysis

On the modelling of extreme droughts

Interdependence and Predictability of Human Mobility and Social Interactions

Investigating volcanoes behaviour. With novel statistical approaches. UCIM-UNAM - CONACYT b. UCIM-UNAM. c. Colegio de Morelos

Tropics & Sub-Tropics. How can predictive approaches be improved: Data Sparse Situations

Complex Dynamics of Microprocessor Performances During Program Execution

arxiv: v2 [physics.soc-ph] 16 Jul 2013

No. 6 Determining the input dimension of a To model a nonlinear time series with the widely used feed-forward neural network means to fit the a

Biological Forum An International Journal 7(1): (2015) ISSN No. (Print): ISSN No. (Online):

Forecasting River Flow in the USA: A Comparison between Auto-Regression and Neural Network Non-Parametric Models

Effect of land use/land cover changes on runoff in a river basin: a case study

MODELLING OF METALLURGICAL PROCESSES USING CHAOS THEORY AND HYBRID COMPUTATIONAL INTELLIGENCE

Characterization of phonemes by means of correlation dimension

CE 394K.3 GIS in Water Resources Midterm Quiz Fall There are 5 questions on this exam. Please do all 5. They are of equal credit.

APPENDIX A. Watershed Delineation and Stream Network Defined from WMS

ESTIMATION OF DISCHARGE FOR UNGAUGED CATCHMENTS USING RAINFALL-RUNOFF MODEL IN DIDESSA SUB-BASIN: THE CASE OF BLUE NILE RIVER BASIN, ETHIOPIA.

SPATIAL AND TEMPORAL VARIATION OF RAINFALL AND RUNOFF IN HONG KONG

Optimal Artificial Neural Network Modeling of Sedimentation yield and Runoff in high flow season of Indus River at Besham Qila for Terbela Dam

Combining Satellite And Gauge Precipitation Data With Co-Kriging Method For Jakarta Region

Predicting monthly streamflow using data-driven models coupled. with data-preprocessing techniques

INTRODUCTION TO CHAOS THEORY T.R.RAMAMOHAN C-MMACS BANGALORE

Optimizing system information by its Dimension

Heihe River Runoff Prediction

Suspended sediment yields of rivers in Turkey

K. Pyragas* Semiconductor Physics Institute, LT-2600 Vilnius, Lithuania Received 19 March 1998

Phase-Space Reconstruction. Gerrit Ansmann

13 Watershed Delineation & Modeling

Results of Intensity-Duration- Frequency Analysis for Precipitation and Runoff under Changing Climate

Transcription:

Application of Chaos Theory and Genetic Programming in Runoff Time Series Mohammad Ali Ghorbani 1, Hossein Jabbari Khamnei, Hakimeh Asadi 3*, Peyman Yousefi 4 1 Associate Professor, Department of Water Engineering, Tabriz University, Tabriz, Iran, E-mail: cusp004@yahoo.com Assistant Professor, Department of Statistics, Tabriz University, Tabriz, Iran, E-mail: h_jabbari@tabrizu.ac.ir 3* Master of Science, Department of Water Engineering, Tabriz University, Tabriz, Iran, E-mail: tu.i006@yahoo.com 4 Master of Science, Department of Civil Engineering, Tabriz University, Tabriz, Iran, E-mail: pe.yousefi@yahoo.com Abstract. Nowadays, prediction of runoff is very important in water resources management and their permanent development. Along with scientific advances in recent years, various intelligent methods and regression and mathematical methods have been presented to estimate the runoff. In this paper, Two different methods are used, Chaos analysis and genetic programming. The performances of models are analyzed and result showed that runoff have had chaotic behavior. Application of genetic programming models in estimating the runoff is also studied in this paper. The data that has been used has chaotic behavior and a mathematical model of genetic programming with rainfall and runoff as model inputs was result. Keywords: Chaos, Genetic Programming, Runoff, Lighvan Basin 1 Introduction Chaotic behaviors refer to the time history of a single variable of a deterministic dynamical system undergoing a loss of temporal correlation in response to small perturbations in initial conditions. Hydrologic and other water-related time series have been modeled by chaos theory over the past two decades and specific applications outlined as follows. (a) The presence of low-dimensional deterministic behaviors in the river flow processes were investigated by Jayawardena and Lai [1], Porporato and Ridolfi [], Krasovskaia et al. [3], Stehlik[4], Sivakumar et al. [5]. (b) Nonlinear deterministic approaches were used to detect the presence of chaos and achieve more accurate river flow predictions by Islam and Sivakumar [6], Lisi and Villi [7], Liu et al. [8]. (c) Alternative mathematical formulations have been developed to investigate 6

water-related problem, e.g. Qingfang and Yuhua who developed a new local linear prediction model for chaotic river flow series [9]. Genetic Programming (GP) is among heuristic algorithms all of which are based on Darwin's evolution theory. The mentioned algorithms define a target function in the shape of qualitative standards and then make use of the mentioned function to measure and compare different solution methods in a step by step process of data source correction and finally present the appropriate solution method. Kalra and Deo applied the GP for the completion of missing data in wave records along the west coast of India [10]. Ustoorikar and Deo used the GP for filling up gaps in datasets of wave heights [11]. Aytek and Kishi used GP approach to suspended sediment modeling for two stations on the Tongue River in Montana, USA, and indicate that the GP formulation performs quite well compared to sediment rating curves and multi linear regression models [1]. Gaur and Deo applied the GP for real-time wave forecasting [13]. Materials and Methods.1 Chaos Chaos theory is a method of nonlinear time series analysis and involves a host of methods, essentially based on the phase-space reconstruction of the process, from scalar or multivariate measurements of physical observables. Phase Space Reconstruction. One way of characterizing dynamical systems is by the concept of phase-space, according to which given a set of physical variables and an analytical model describing their interactions where each of its points corresponds to a state of the system. The delay embedding method reconstructs phase-space from a univariate or multivariate time series, which is assumed to be generated by a deterministic dynamical system [14]. The Takens theorem states that the underlying dynamics can be fully recovered by building a m-dimensional space wherein the components of each state vector Y are defined through the delay coordinates: t Y t X,,,..., t X t X t X t( m1) (1) where m is known as embedding dimension, as delay time and X t ={x 1, x,...x N } with N-observed values. This delay-embedding method is sensitive to both embedding parameters of and m, which are unknown a-priori. As suggested by Cellucci et al 7

[15], Average Mutual Information (AMI) is used to estimate. AMI defines how the measurements X(t) at time t are related, from an information theoretic point of view, to measurements X(t + ) at time t +. The average mutual information is defined as [16]: I( ) P( X ( i), X ( i )) log P( X ( i), X ( i )) X ( i), X ( i ) P( X ( i)) P( X ( i )) () where the sum is extended to the total number of samples in the times series. P(X(i)) and P(X(i + )) are the marginal probabilities for measurements X(i) and X(i+ ), respectively, whereas P(X(i), X(i+ )) is their joint probability. The optimal delay time minimizes the function I( ): for t =, X(i+ ) adds the maximum information on X(i). Correlation Dimension. Correlation dimension is a nonlinear measure of the correlation between pairs lying on the attractor. For time series whose underlying dynamics is chaotic whereas for stochastic systems it is infinite. For an m -dimensional phase-space, the correlation function C m (r) is defined as the fraction of states closer than r [17, 18]: Cm ( r) lim N ( N w)( N w 1) N N i1 ji1 w H( r Y Y ) i j (3) where H is the Heaviside step function, Y i is the i th state vector, and N is the number of points on the reconstructed attractor. For stochastic time series m C ( r) r holds, whereas for chaotic time series the correlation function scales m with r as: C m ( r) r D (4) where D, correlation exponent, quantifies the degrees of freedom of the process, and defined by: 8

D ln Cm( r) lim r0 ln r (5) and can be reliably estimated as the slope in the ln C m ( r) vs. ln(r ) plot.. Genetic Programming The GP is similar to Genetic Algorithm (GA) but employs a parse tree structure for the search of its solutions, whereas the GA employs bite strips. The technique is truly a bottom up process, as there is no assumption made on the structure of the relationship between the independent and dependent variables but an appropriate relationship is identified for any given time series. The relationship can be logical statements or it is normally a mathematical expression, which may be in some familiar mathematical format or it may assemble a mathematical function in a completely unfamiliar format. The construction of the relationship is made possible by two components: (i) a parse tree, which is a functional set of basic operators and those selected in this study are: {,, } (6-a) {,,, xx, } (7-b) which emulates the role of RNA; and (ii) the actual components of the functions and their parameters (referred to as the terminal set), which emulates which emulates the role of RNA..3 Study Area and Data The runoff time series of Lighvan basin, Iran ( 46 0 30 to 46 7 30 east latitude and 37 4 55 to 37 49 30 north longitude) was used in the study. This watershed with a drainage area 76.19 Km is important part of the catchment of Talkheh River watershed. The maximum and minimum elevations of the area are around 3500 and 000 m, respectively. The length of longest stream is 17 km. The 9

average stream slope is 11%. The Lighvan River drains into Talkheh River and Urmia Lake, respectively. For the present investigation, rainfall-runoff data observed for composed storm. Figure 1 shows the variations of rainfall-runoff data. The entire dataset was divided into two parts. The first 80% of data was used in training for the phase space reconstruction, but the subsequent 0% of data was used as observed data in the prediction phase. Fig. 1. Time series plot of runoff data in the Lighvan basin 30

3 Results Two methods are used to identify a possible existence of chaos in the runoff time series in the Lighvan basin. Using the AMI method, the delay time, is estimated for the time series in the Lighvan basin as the intercept with the x-axis of the curves by plotting the values of the AMI evaluated by the TISEAN package against delay times progressively increased from 1 to 100 [19]. As shown in Figure this method shows well-defined first minima at delay time of 10. Fig.. Average mutual information function of runoff data from the Lighvan basin The correlation function is calculated for the dataset using the delay times ( =10), determined by the AMI method in the previous section, and embedding dimensions, m, by allowing it to vary from 1 to 0. Figure 3-a shows the relationship between correlation function C(r) and radius r (i.e. lnc(r) versus ln(r)) for increasing m, whereas Figure 3-b shows the relationship between the correlation dimension values D (m) and the embedding dimension values m. It can be seen from Figure 3-b. that the value of correlation exponent increases with the embedding dimension up to certain value and then saturates beyond it. The saturation of the correlation exponent is an indication of the existence of deterministic dynamics. The saturated correlation dimension is 1.5, (D =1.5). The value of correlation dimension also suggests the possible presence of chaotic behavior in the dataset. 31

Fig. 3. a) Convergence of logc(r) versus log(r) b) Saturation of correlation dimension D (m) with embedding dimension m saturation signifies chaotic signals in the Lighvan basin Two different combinations of arithmetic function set were used to this problem: a) The first set {,, }; b) The second set{,,, xx, }. Genetic programming with two combination of arithmetic function set and three different combination of runoff time series;1: {P t, P t-1, Q t-1, Q t }; : {P t, P t-1, P t-, Q t-1, Q t } and 3: {P t, P t-1, P t-, Q t-1, Q t-, Q t } applied for training and testing data. Comparison of the statistical parameters (RMSE=0.1446, R =0.9989) of GP resulted from training and testing step shows that combination no.3 of runoff time series with first (6-a) arithmetic function set is the best combination of input data and arithmetic function where shown in Table 1. The comparison between observed and computed runoff from GP model is shown in Figure 4. Table 1. The results of GP model for the training and testing steps Model Operators R RMSE 1 Train 0.9948 0.7158 Test 0.9891 0.4584 {,, } Train 0.9947 0.705 Test 0.9891 0.4584 3 Train 0.999 0.70 Test 0.9989 0.1446 1 Train 0.9948 0.7119 Test 0.9891 0.4584 Train 0.9947 0.718 {,,, xx, } Test 0.9891 0.4568 3 Train 0.999 0.733 Test 0.9989 0.1446 3

Fig. 4. Comparison between observed and computed runoff time series by GP model The mathematical model that obtained by GP, According to Equation 7 is combination of the best input and output parameters with P t, P t-1, P t-, Q t-1, Q t- as input and Q t as output parameters. Q Q 0.007843( 0.007843 P ) P Q Q t t1 t1 t t1 t (7) 4 Conclusion This paper investigated possible chaotic behaviours in the runoff time series of the Lighvan basin. The analysis was performed on runoff time series records on an event. The analysis was based on using phase space reconstruction, correlation dimension method. The correlation dimension value is 1.5. Predictions of these time series using a GP are found to be acceptable. In addition, results obtained from GP were compared with observed data. It was documented that prediction with GP is good. References 1. Jayawardena, W., Lai, F.: Analysis and prediction of chaos in rainfall and streamflow time series, J. of Hydrology. 153, 3--5 (1994).. Porporato, Ridolfi, L.: Nonlinear analysis of river flow time sequences, Water Resources Research. 33(6), 1353--1367 (1997). 3. Krasovskaia, L., Gottsehalk, Z., Kundzewicz, W.: Dimensionality of Scandinavian river flow regimes, Hydrol. Sci. J. 44(5),705--73 (1999). 33

4. Stehlík, J.: Deterministic chaos in runoff series, J. of Hydrology and Hydrodynamics. 47(4), 71--87 (1999). 5. Sivakumar, B., Berndtsson, R., Persson, M.: Monthly runoff prediction using phase-space reconstruction, Hydrological Sciences Journal. 46(3), 377--388 (001b). 6. Sivakumar, B.: A phase-space reconstruction approach to prediction of suspended sediment concentration in rivers, J. of Hydrology. 58(1-4), 149--16 (00). 7. Lisi, V., Villi, V.: Chaotic forecasting of discharge time series: A case study, J. of the American Water Resources Association. 37(), 71--79 (001). 8. Liu, Q., Islam, S., Rodriguez-lturbe, V., Lee, Y.: Phase-space analysis of daily streamflow: characterization and prediction, Adv. Wat. Resour. 1, 463--475 (1998). 9. M. Qingfang and P. Yuhua P.: A new local linear prediction model for chaotic time series, Physics Letters A. 370: 465-470 (007). 10. Kalra, R., Deo, M. C.: Genetic programming for retrieving missing information in wave records along the west coast of India, App. Ocean Res. 9(3):99--111(007). 11. Ustoorikar, V., Deo, M. C.: Filling up gaps in wave data with genetic programming, Marine Structures. 1:177--195 (008). 1. Aytek, A., Kisi, O.: A genetic programming approach to suspended sediment modelling, J. of Hydrology. 351: 88--98 (008). 13. Gaur, V., Deo, M. C.: Real-time wave forecasting using genetic programming, Ocean Engineering. 35 (11 1):1166--117 (008). 14. Takens, F.: Detecting strange attractors in turbulence, in Lectures Notes in Mathematics, edited by D.A. Rand and L.S. Young, 898, 366--381, Springer-Verlag, New York (1981). 15. Cellucci, V., Albano, A. M., Rapp, P. E.: Comparative study of embedding methods, Physical Review E. 67, No. 6, 6610 (003). 16. Fraser, M., Swinney, H. L.: Independent coordinates for strange attractors from mutual information, Physical Review A, 33(), 1134--1140 (1986). 17. Grassberger, P., Procaccia, I.: Characterization of strange attractors, Physical review letters. Vol. 50, No. 5, 346--349 (1983). 18. Theiler, J.: Spurious dimension from correlation algorithms applied to limited timeseries data, Phys. Rev. A 34, 47--43 (1986). 19. Hegger, R., Kantz, V., Schreiber, T.: Practical implementation of nonlinear time series methods: The TISEAN package. Chaos. 9, 413--435(1999). 34