Detection of Nonlinearity and Stochastic Nature in Time Series by Delay Vector Variance Method

Similar documents
TESTING FOR NONLINEARITY IN

Algorithms for generating surrogate data for sparsely quantized time series

1 Random walks and data

No. 6 Determining the input dimension of a To model a nonlinear time series with the widely used feed-forward neural network means to fit the a

Phase-Space Reconstruction. Gerrit Ansmann

Effects of data windows on the methods of surrogate data

Discrimination power of measures for nonlinearity in a time series

Information Dynamics Foundations and Applications

NONLINEAR TIME SERIES ANALYSIS, WITH APPLICATIONS TO MEDICINE

Predictability and Chaotic Nature of Daily Streamflow

Investigation of Chaotic Nature of Sunspot Data by Nonlinear Analysis Techniques

SINGAPORE RAINFALL BEHAVIOR: CHAOTIC?

Signal Nonlinearity in fmri: A Comparison Between BOLD and MION

Nonlinear Prediction for Top and Bottom Values of Time Series

Application of Physics Model in prediction of the Hellas Euro election results

Modeling and Predicting Chaotic Time Series

Dynamics of partial discharges involved in electrical tree growth in insulation and its relation with the fractal dimension

Studies in Nonlinear Dynamics & Econometrics

How much information is contained in a recurrence plot?

Understanding the dynamics of snowpack in Washington State - part II: complexity of

The Effects of Dynamical Noises on the Identification of Chaotic Systems: with Application to Streamflow Processes

Testing for Chaos in Type-I ELM Dynamics on JET with the ILW. Fabio Pisano

Sjoerd Verduyn Lunel (Utrecht University)

The Behaviour of a Mobile Robot Is Chaotic

physics/ v2 21 Jun 1999

DYNAMIC ECONOMETRIC MODELS Vol. 10 Nicolaus Copernicus University Toruń Witold Orzeszko Nicolaus Copernicus University in Toruń

Comparison of Resampling Techniques for the Non-Causality Hypothesis

On the characterization of the deterministic/stochastic and linear/nonlinear nature of time series

What is Chaos? Implications of Chaos 4/12/2010

Is correlation dimension a reliable indicator of low-dimensional chaos in short hydrological time series?

Evaluating nonlinearity and validity of nonlinear modeling for complex time series

arxiv: v2 [physics.soc-ph] 16 Jul 2013

Nonlinear dynamics, delay times, and embedding windows

SURROGATE DATA PATHOLOGIES AND THE FALSE-POSITIVE REJECTION OF THE NULL HYPOTHESIS

ISSN Article. Simulation Study of Direct Causality Measures in Multivariate Time Series

Detection of Nonlinearity and Chaos in Time. Series. Milan Palus. Santa Fe Institute Hyde Park Road. Santa Fe, NM 87501, USA

Detecting chaos in pseudoperiodic time series without embedding

Ensembles of Nearest Neighbor Forecasts

A note on the Gamma test analysis of noisy input/output data and noisy time series

A note on the Gamma Test analysis of noisy input/output data and noisy time series

Applications of Methods and Algorithms of Nonlinear. Dynamics in Economics and Finance

Quantitative Description of Robot-Environment Interaction Using Chaos Theory 1

Complex Dynamics of Microprocessor Performances During Program Execution

K. Pyragas* Semiconductor Physics Institute, LT-2600 Vilnius, Lithuania Received 19 March 1998

Available online at AASRI Procedia 1 (2012 ) AASRI Conference on Computational Intelligence and Bioinformatics

Testing for nonlinearity in high-dimensional time series from continuous dynamics

Alternative Methodology for Turning-Point Detection in Business Cycle : A Wavelet Approach

Abstract. Introduction. B. Sivakumar Department of Land, Air & Water Resources, University of California, Davis, CA 95616, USA

If we want to analyze experimental or simulated data we might encounter the following tasks:

A Dynamical Systems Approach to Modeling Input-Output Systems

Determinism, Complexity, and Predictability in Computer Performance

Estimating the predictability of an oceanic time series using linear and nonlinear methods

Biological Systems Modeling & Simulation. Konstantinos P. Michmizos, PhD

Chaos, Complexity, and Inference (36-462)

SIMULATED CHAOS IN BULLWHIP EFFECT

NONLINEAR DYNAMICS PHYS 471 & PHYS 571

Nonlinear Characterization of Activity Dynamics in Online Collaboration Websites

Chaos in GDP. Abstract

Ice-Core Data from Greenland and Near-Term Climate Prediction. Sergey R. Kotov

Commun Nonlinear Sci Numer Simulat

Determination of fractal dimensions of solar radio bursts

PHONEME CLASSIFICATION OVER THE RECONSTRUCTED PHASE SPACE USING PRINCIPAL COMPONENT ANALYSIS

Creating Order Out of Chaos

Interdependence and Predictability of Human Mobility and Social Interactions

The Pattern Recognition System Using the Fractal Dimension of Chaos Theory

CHAOS THEORY AND EXCHANGE RATE PROBLEM

Nonlinearities in Exchange-Rate Dynamics: Chaos?

Vulnerability of economic systems

Chaotic dynamics in shear-thickening surfactant solutions

Documents de Travail du Centre d Economie de la Sorbonne

arxiv: v1 [stat.me] 26 Mar 2013

SUPPLEMENTARY INFORMATION

Phase Space Reconstruction from Economic Time Series Data: Improving Models of Complex Real-World Dynamic Systems

Problem set 1 - Solutions

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and

Research Article Multivariate Local Polynomial Regression with Application to Shenzhen Component Index

Distinguishing chaos from noise

ESTIMATING THE ATTRACTOR DIMENSION OF THE EQUATORIAL WEATHER SYSTEM M. Leok B.T.

A benchmark test A benchmark test of accuracy and precision in estimating dynamical systems characteristics from a time series

Minimum Sample Size for Reliable Causal Inference Using Transfer Entropy

Introduction to Dynamical Systems Basic Concepts of Dynamics

Is the streamflow process chaotic?

Chaotic motion. Phys 750 Lecture 9

Financial Econometrics

Estimating the predictability of economic and financial time series

Forecasting Chaotic time series by a Neural Network

Two Decades of Search for Chaos in Brain.

Revista Economica 65:6 (2013)

Abstract: Complex responses observed in an experimental, nonlinear, moored structural

Does the transition of the interval in perceptional alternation have a chaotic rhythm?

arxiv: v1 [physics.data-an] 16 Nov 2016

On the use of nearest neighbors in finance

arxiv: v1 [stat.co] 9 Feb 2010

Information Mining for Friction Torque of Rolling Bearing for Space Applications Using Chaotic Theory

arxiv:cond-mat/ v2 [cond-mat.stat-mech] 29 Apr 2008

A Test of the GARCH(1,1) Specification for Daily Stock Returns

ONE DIMENSIONAL CHAOTIC DYNAMICAL SYSTEMS

Forecasting Financial Indices: The Baltic Dry Indices

Optimizing system information by its Dimension

Mechanisms of Chaos: Stable Instability

Transcription:

International Journal of Engineering & Technology IJET-IJENS Vol:10 No:02 11 Detection of Nonlinearity and Stochastic Nature in Time Series by Delay Vector Variance Method Imtiaz Ahmed Abstract-- This paper investigates the suitability of Delay Vector Variance (DVV) algorithm in determining the presence of nonlinear and stochastic nature of time series both in the presence and absence of chaos. A differential entropy based method is used to find the optimum embedding dimension and time lag which are needed to represent the time series in phase space. Results obtained from the simulation indicate that the DVV method gives quantitative and easy to interpret output to characterize the underlying times series in terms of non-linear and stochastic nature. Index Term-- Delay Vector Variance (DVV), Entropy Ratio(ER), Nonlinearity, Phase Space. I. INTRODUCTION Atime series becomes chaotic due to the presence of determinism, non-linearity and strange attractor in it. So chaos implies non-linearity but not vice-versa. Since many nonlinearity analysis techniques rest upon chaos theory, the properties such as determinism and presence of strange attractor have often been confounded with the non-linear behavior. So the presence of strange attractor would lead to the conclusion that the time series is non-linear while it is not necessarily so. As a result determinism and non-linearity have been confounded in time series signal analysis. Methods introduced by Kaplan[1] and by Kennel et al.[2] both rest upon the examination of the predictability of a time series. Furthermore, Kaplan s δ-ε method has been used in combination with the surrogate data [3] strategy for examining the linear or non-linear nature of time series. Kaplan s method has short comings. It's main shortcoming is in the large spread of results for surrogate data which tends to decrease the significance of results found with delta-epsilon. There is also Surrogate method [4] of testing non-linearity but there is a chance of false rejection of the null hypothesis due to stringent definition of linearity of the time series itself. Another classic method to detect non-linearity is Deterministic versus Stochastic (DVS) [5] plots. But DVS method does not allow for a quantitative analysis. Another approach to non-linearity detection is Correlation Exponents Imtiaz Ahmed is with the the Department of Applied Physics Electronics and Communication Engineering, University of Dhaka, Bangladesh (Mobile: +88-01912025741, e-mail:imtiaz@univdhaka.edu) (COR) [6].It examines the local structure of a strange attractor. But linear time series which do exhibit strange attractor or geometric structure in phase space can be erroneously judged non-linear.to this cause aunified approach to analyze the predictability and the degree of nonlinearity in time series signal was needed. This paper investigates such a unified approach to characterize time series namely the Delay Vector Variance (DVV) method [7]. This method requires the underlined time series to be represented in phase space with optimum embedding dimension and time lag. A Differential Entropy based method [8] has been used here to find the optimum embedding dimension and time lag for time series representation in phase space. II. BACKGROUND A time series is a collection of observations made sequentially in time. There are four fundamental properties that describe the nature of the time series: linear, non-linear, deterministic and stochastic properties. Time series should be characterized in terms of these properties so that modeling, prediction and estimation can be done to different physical phenomena. By definition, A time series is considered linear if it can be generated by passing Gaussian white noise through a linear time invariant system.a time series is called deterministic if it can be uniquely defined by an explicit mathematical expression, a table of data, or a well defined rule. All past, present and future values of this type of time series are known precisely without any uncertainty. A time series is considered stochastic if its analysis and description involve uncertainty and unpredictability and need statistical techniques instead of explicit formulas. III. DELAY VECTOR VARIANCE METHOD Time Delay Embedding: It is possible to represent a time series in so called phase space by the method of time delay embedding. When time delay is embedded into a time series it can be represented by a set of delay vectors (DVs) of a given dimension. If m is the dimension of the delay vectors then it can be expressed as x (k) = [x k-mη,, x k-η ] T, where η is the time lag. Now for every DV x (k), there is a corresponding target, namely the next sample x k. Characterization of Time series: A set λ k is generated by grouping those DVs that are within a certain Euclidean

International Journal of Engineering & Technology IJET-IJENS Vol:10 No:02 12 distance to DV x(k).this Euclidean distance will be varied in a manner standardized with respect to the distribution of pairwise distances between DVs. Now for a given embedding dimension m, a measure of unpredictability ζ *2 is computed over all sets of λ k. The variation of the standardized distance enables the complete range of pairwise distances to be examined. Now the algorithm for Delay Vector Variance(DVV) Method is summarized below: For a given embedding dimension m, delay vectors (DVs) x (k)= [x k-m,, x k-1 ] T and corresponding targets x k. are generated: The mean μ d and the standard deviation ζ d are computed over all pair wise Euclidean distances between DVs given by d( i, j). (1) The sets λ k (r d ) are generated such as that λ k (r d )=* +, i.e., sets which consist of all DVs that lie closer to x(k) than a certain distance r d, taken from the interval [min{0, μ d -n d ζ d }; μ d +n d ζ d ] where n d is a parameter controlling the span over which to perform DVV analysis. For every set λ k (r d ), the variance of the corresponding targets ζ k 2 (r d ) is computed. The average over all sets λ k (r d ), normalized by the variance of the time series signal, ζ k 2, yields the target variance : where N = Total number of sets λ k (r d ) Variance measurement is assumed to be valid, if the set λ k (r d ) contains at least 30 DVs, since having too few points for computing a sample variance yields unreliable estimates of the true population variance. This is considered as a minimum set size N o. As a result of the standardization of the distance axis r d is replaced by which will have zero mean and unit variance. Then the DVV plots which are target variance as a function of the standardized distance are quite easy to interpret.the minimum target variance, =, - is a measure of noise present in the time series. The amount of noise is the prevalence of the stochastic component. The presence of strong deterministic component will lead to small target variance for small spans. At the extreme right, the DVV plots smoothly converges to unity, since for maximum spans, all DVs belong to the same universal set, and the variance of the targets is equal to the variance of the time series. In order to generate surrogate time series, Iterative Amplitude Adjusted Fourier Transform (iaaft) [9] has been used. With the help of the original and the surrogate time series DVV scatter diagram can be produced, where the horizontal axis corresponds to the DVV plot of the original time series and the vertical to that of the surrogate time series. DVV plots can be interpreted as follows. If the surrogate time series signal yields DVV plots similar to that of the original time series, the DVV scatter diagram (2) coincides with the bisector line and original time series is judged to be linear. If the surrogate time series yield DVV plots not similar to that of the original time series, the curve will deviate from the bisector line and original time series is judged to be non-linear. Thus the deviation from the bisector line is an indication of non-linearity, and can be quantified as root mean square error (RMSE) between the s of the original time series and the averaged over the DVV plots of the surrogate time series : ( ) (3) Where is the target variance at span r d for the i-th surrogate and the average is taken over all span of r d that is valid in all surrogate and DVV plots. IV. THE ENTROPY RATIO(ER) METHOD The amount of disorder in time series can be measured by the Kozachenko-Leonenko(K-L)[10] estimate of the differential entropy (4) where N is the number of samples in the time series, j is the Euclidean distance of the j-th delay vector to its nearest neighbor, and C E ( 0.5772) is the Euler constant. For a given embedding dimension, m, and time lag, η, let H(x, m, η) denote the differential entropies estimated for time delay embedded versions of a time series, x. The set of optimal parameters, { m opt, η opt }, yields a phase space representation which best reflects the dynamics of the underlying signal production system. Therefore, it is expected that this representation has minimal differential entropy (minimal disorder). Thus, optimization of the differential entropy (Eq. 4) for m and η is achieved and the minimum of H(x, m, η) yielding the optimal set of embedding parameters { m opt, η opt } is obtained. The K-L estimates for the time delay embedded versions of the original time series H(x, m, η), and its surrogates H(x s,i, m, η) are computed using Eq. 4 for increasing m and η (index i refers to the i-th surrogate). To determine the optimal embedding parameters, the ratio needs to he minimized, where denotes the average over i. To penalise for higher embedding dimensions, the minimum description length (MDL) method is superimposed, yielding the entropy ratio (ER): (6) where N is the number of delay vectors, which is kept constant for all values of m and under consideration. V. CONSTRUCTION OF TIME SERIE To test the effectiveness of the Delay Vector Variance Method, a number of time series [11],[12] have been

International Journal of Engineering & Technology IJET-IJENS Vol:10 No:02 13 generated which have different degrees of nonlinearity and stochastic nature. Model 1: Chaotic Mackey-Glass time series which is extensively used in non-linear dynamical system modeling is defined by: We use x 0 =0.2 and η =17 to generate the time series. Model 2: Chaotic Lorenz time series which is widely used in continuous dynamical system is considered. This time series is defined by with X 0 =1, X 1 =0.7 and the sequence { n } is sampled from standard normal distribution. This model contains both moving average and autoregressive terms. VI. SIMULATIONS In this section, the Differential entropy based method is used to find out the optimum embedding dimension m opt and the time lag η opt in order to represent the time series generated by model1 to 7 in phase space. The Entropy-Ratio plot(er-plot) of the different time series mentioned above is shown in Fig.1 to Fig. 7. These optimum values will be used in DVV method for determining the non-linear and stochastic nature in the time series. where,.the sampling frequency is taken to be 100 Hz (oversampled) to generate the time series. Model 3: Hénon map can be written in terms of a single variable with two time delays: x n+1 = 1 + ax n 2 + bx n-1 (9) (8) Fig. 1. ER-plot for Mackey-Glass where a=1.4 and b=0.3. This ensures the Hénon map to be chaotic. This map is widely used in weather forecasting. Model 4: A time is generated by a fully deterministic chaotic Feigenbaum recursion of the form X n =3.57 X n-1 (1-X n-1 ) (10) where the initial condition is, X o = 0.7.The stochastic nature is absent here since the model is noise free. Also this model exhibits third order non-linear dependence. Model 5: A time series is generated by a Autoregressive Conditional Heteroscedastic (ARCH) process of the form Fig. 2. ER-plot for Lorenz X n = (1+0.5X n-1 2 ) 1/2 n (11) with the value of the initial observation set to X 0 = 0.The sequence { n } are sampled from standard normal distribution. This model is a statistical forecasting model for volatility. It considers the variance of the current error term to be a function of the variances of the previous time periods error terms. It is employed commonly in modeling financial time series that exhibits time varying volatility clustering, i.e. periods of swing followed by periods of relative calm. Model 6: A time series is generated by a nonlinear moving average process (NLMA) of the form X n = n+0.8 n-1 n-2 (12) where the sequence { n } are sampled from standard normal distribution. Model 7: A time series is generated by an linear stochastic autoregressive moving average process (ARMA) of the form: X n =0.8X n-1 +0.15X n-2 + n+0.3 n-1 (13) Fig. 3. ER-plot for Hénon map Fig. 4. ER-plot for Feigenbaum

International Journal of Engineering & Technology IJET-IJENS Vol:10 No:02 14 Fig. 5. ER-plot for ARCH F ig. 8. DVV plot for Mackey-Glass Fig. 6. ER-plot for NLMA Fig. 9. DVV plot for Lorenz Fig. 7. ER-plot for ARMA T ABLE.I OPTIMUM VALUES FOR EMBEDDING DIMENSION M AND TIME LAG Τ. Fig. 10. DVV plot for Hénon Time Series m opt η opt Mackey-Glass 4 6 Lorenz 3 1 Hénon 3 1 Feigenbaum 2 4 ARCH 2 1 NLMA 2 2 ARMA 2 2 Table I. gives the { m opt, η opt } for all the time series. Now using these parameters we use DVV method to find out the presence of non-linearity and stochastic nature. The simulation parameters for DVV method are given as: Minimal Size of Set,λ k (r d ),N 0 =30,Number of surrogates, N s = 19,Maximal Span, n d = 3, Number of Reference, DVs = 500, Embedding Dimension, m and Time Delay, τ are chosen according to Table.I, Sample point number=2000. Fig. 11. DVV plot for Feigenbaum Fig. 12. DVV plot for ARCH

International Journal of Engineering & Technology IJET-IJENS Vol:10 No:02 15 Fig. 13. DVV plot for NLMA Fig. 18. DVV scatter diagram for Feigenbaum Fig. 19. DVV scatter diagram for ARCH Fig. 14. DVV plot for ARMA Fig. 20. DVV scatter diagram for NLMA Fig. 15. DVV scatter diagram for Mackey-Glass Fig. 16. DVV scatter diagram for Lorenz Fig. 17. DVV scatter diagram for Hénon Fig. 21. DVV scatter diagram for ARMA VII. DISCUSSION The first four time series namely Mackey-Glass, Lorenz, Hénon and Feigenbaum are chaotic. It ensures the presence of strange attractor in their phase space representation, strong deterministic nature and the non-linearity. In the DVV plots of these time series in Fig.8 to Fig.11, very small value for the target variance (0.09469, 0.1785, 0.005027, 0.0005037 successively )for original series indicates the presence of strong deterministic component since is a measure of noise present in the time series and the amount of noise is the prevalence of the stochastic component. Specially for chaotic Feigenbaum Recursion target variance 0.This indicates fully deterministic behavior. From the DVV scatter diagram of

International Journal of Engineering & Technology IJET-IJENS Vol:10 No:02 16 these time series which are shown in Fig.15 to Fig.18, we can conclude that the DVV method has successfully detected the presence of non-linearity as the curves in these figures deviate significantly from the bisector line. This is because the for surrogates are different than that of original time series. For the rest of the three time series namely ARCH, NLMA and ARMA do not have chaotic behavior but non-linearity and stochastic component are present in ARCH and NLMA. DVV method can still successfully indicates the presence of strong stochastic nature in these time series in Fig.12 and Fig.13 by giving a large value of target variance Also, in Fig.19 and Fig.20, the DVV scatter diagram significantly deviates from the bisector line which indicate the non-linear nature in NLMA and ARCH. Finally the time series generated by ARMA process is stochastic but linear. This is clearly shown in Fig.14 where the is 0.1041 which indicates the presence stochastic nature and in Fig.21 linearity is judged as the DVV scatter diagram coincides with the bisector line. So Delay Vector Variance method can successfully determine the presence of nonlinear and stochastic nature in a time series regardless of the fact that whether or not chaos is present. This is possible because DVV method depends on the local predictability of the time series in phase space which can be observed by the target variance of delay vectors (DVs). 2004. The Delay Vector Variance Method for Detecting Determinism and Nonlinearity in Time Series, Physica D,190, 3-4,167-176. [8] Gautama, T., Mandic,D.P.,& Van Hulle,M., 2003. A Differential Entropy Based Method for Determining the Optimal Embedding Parameter of a Signal. In Proceedings of ICASSP, Hong-Kong, VI, 29-32. [9] Kugiumtzis, D., 1999. Test Your Surrogate before you test your Non-linearity, Phys. Rev. E, 60, 2808-2816. [10] Beirlant, E.J, Gyorfi, L, Meulen, E.C., 1997. Nonparametric Entropy Estimation: An overview, International J. Mathematical and Statistical Sciences, 6, 17-39. [11] Barnett, W., Gallant, A., Hinich, M., Jungeilges, J., Kapalan, D. and Jensen, M. 1997 A single Blind Controlled Competition Among Tests for Nonlinearity and Chaos, J. of Econometrics, 77,297-302. [12] Hou,Shumin., Li,Yourong and Zhao, Sanxing. Detecting the Nonlinearity in Time Series from Continuous Dynamic Systems based on Delay Vector Variance Method, J. of Mathematical, Physical and Engineering Sciences. REFERENCES [1] Kaplan, D.,1994. Exceptional Events as Evidence for Determinism, Physica D, 73,38-48. [2] Kennel, M.B, Brown, R., & Abarbanel, H.D.I, 1992. Determining Embedding Dimension for Phase Space Reconstruction using a Geometrical Construction, Phys. Rev. A,45, 3403-3411. [3] Schreiber, T & Schmitz, A, 2000. Surrogate Time Series, Physica D., 142,346-382. [4] Kaplan, D.,1997. Nonlinearity and Nonstationarity: The Use of Surrogate Data in Interpreting Fluctuations in M. Di Rienzo, G. Mancia, G. Parati., A. Pedotti and A. Zanchetti, eds, Frontiers of Blood Pressure and Heart Rate Analysis IOS press, Amsterdam. [5] Casdagli, M.C. & Weigend, A.S. 1994. Exploring the continuum between deterministic and stochastic modeling, in Time Series Prediction: Forecasting the Future and Understanding the Past, Reading, MA: Addison-Wesley, 347-367. [6] Grassberger, P and Procaccia, I., 1983, Measuring the Strangeness of Strange Attractors, Physica D,9, 189-208. [7] Guatama,T., Mandic,D.P & Van Hulle,M.M.