arXiv:cond-mat/0302507v1 [cond-mat.stat-mech] 25 Feb 2003

SIGNIFICANCE OF LOG-PERIODIC SIGNATURES IN CUMULATIVE NOISE

HANS-CHRISTIAN GRAF V. BOTHMER

Date: April 20, 2008.

Abstract. Using methods introduced by Scargle we derive a cumulative version of the Lomb periodogram that exhibits frequency-independent statistics when applied to cumulative noise. We show how this cumulative Lomb periodogram allows us to estimate the significance of log-periodic signatures in the S&P 500 anti-bubble that started in August 2000.

1. Introduction

In recent years there has been a heated discussion about the presence of log-periodic signatures in financial data [Fei01b], [SJ01], [Fei01a]. One tool for detecting these signatures is the Lomb periodogram introduced by Lomb [Lom76] and improved by Scargle in [Sca82]. An important property of Scargle's periodogram is that the individual Lomb powers of independently normally distributed noise follow approximately an exponential distribution.

Unfortunately this property is lost when one calculates Scargle's periodogram for cumulative noise, where the differences between successive observations are independently normally distributed. In this Brownian-motion case the expected Lomb powers are much greater for small frequencies than for large ones. Therefore the significance of large Lomb powers at small frequencies is difficult to estimate. Huang, Saleur, Sornette and Zhou tackle this problem and several related ones with extensive Monte Carlo simulations in [HSS00] and [ZS02]. Here we present an analytic approach.

In the first part of this paper we introduce a small correction to Scargle's Lomb periodogram that makes the distribution of Lomb powers exactly exponential for independently normally distributed noise. In the second part we use the same methods to derive a normalisation of the Lomb periodogram that ensures a frequency-independent exponential distribution of Lomb powers for cumulative noise.

In the last section we apply these new methods to estimate the significance of log-periodic signatures in the so-called S&P 500 anti-bubble after the crash of 2000. We show how our methods greatly simplify the whole analysis and find that there is about a 6% chance that a signature like the one detected by Sornette and Zhou in [SZ02] arises by chance if one only considers frequencies smaller than 10.0. If one searches all frequencies up to the Nyquist frequency, peaks of this height become much more common. Furthermore we detect equally significant peaks at harmonics of the fundamental frequency of Sornette and Zhou. This complements evidence for a more sophisticated modelling of the S&P 500 anti-bubble by Sornette and Zhou in [ZS03].

We would like to thank Volker Moeller for hosting our database of financial data and donating CPU time for the Monte Carlo simulations that led to this paper.

2. Independent Noise

Consider the classical periodogram

$$P(\omega) = \frac{1}{N_0}\left[\Bigl(\sum_{j=1}^{N_0} X_j \cos\omega t_j\Bigr)^2 + \Bigl(\sum_{j=1}^{N_0} X_j \sin\omega t_j\Bigr)^2\right].$$

This is a generalization of the Fourier transform to the case of unevenly spaced measurements. Unfortunately this classical form has difficult statistical behavior for uneven spacing. Scargle therefore proposed in [Sca82] a normalized form of the periodogram that restores good statistical properties in the case where all $X_j$ are independently normally distributed with mean zero and variance $\sigma_0^2$. For this he observes that

$$C(\omega) = \sum_{j=1}^{N_0} X_j \cos\omega t_j \quad\text{and}\quad S(\omega) = \sum_{j=1}^{N_0} X_j \sin\omega t_j$$

are again normally distributed, with variances

$$\sigma_c^2 = \sigma_0^2 \sum_{j=1}^{N_0} \cos^2\omega t_j \quad\text{and}\quad \sigma_s^2 = \sigma_0^2 \sum_{j=1}^{N_0} \sin^2\omega t_j.$$

He also notes that the sum of squares of two independent normally distributed random variables with unit variance is exponentially distributed. Therefore he proposes to look at

$$P(\omega) = \frac{1}{2}\left[\frac{\bigl(\sum_{j=1}^{N_0} X_j \cos\omega t_j\bigr)^2}{\sigma_c^2} + \frac{\bigl(\sum_{j=1}^{N_0} X_j \sin\omega t_j\bigr)^2}{\sigma_s^2}\right].$$

Strictly speaking this is only correct when $S(\omega)$ and $C(\omega)$ are independent. This is approximately true when the observation times $t_j$ are not too badly bunched. In other cases the whole variance/covariance matrix

$$\Sigma = \begin{pmatrix} \sigma_c^2 & \sigma_{cs} \\ \sigma_{cs} & \sigma_s^2 \end{pmatrix} \quad\text{with}\quad \sigma_{cs} = \sigma_0^2 \sum_{j=1}^{N_0} \cos\omega t_j \sin\omega t_j$$

has to be considered. Then the values of a quadratic form

$$\bigl(C(\omega), S(\omega)\bigr)\, Q \begin{pmatrix} C(\omega) \\ S(\omega) \end{pmatrix}$$

are exponentially distributed if $Q\Sigma Q = Q$, or equivalently $Q = \Sigma^{-1}$ if $\Sigma$ is invertible. In our case we have

$$\Sigma^{-1} = \frac{1}{\sigma_c^2\sigma_s^2 - \sigma_{cs}^2} \begin{pmatrix} \sigma_s^2 & -\sigma_{cs} \\ -\sigma_{cs} & \sigma_c^2 \end{pmatrix}.$$

This means the natural form of the periodogram is

$$P(\omega) = \frac{1}{2}\,\frac{\sigma_s^2 C^2(\omega) + \sigma_c^2 S^2(\omega) - 2\sigma_{cs} C(\omega) S(\omega)}{\sigma_c^2\sigma_s^2 - \sigma_{cs}^2},$$

which reduces to Scargle's case for $C$ and $S$ independent ($\sigma_{cs} = 0$), and to the classical case for even spacing ($\sigma_c = \sigma_s$). This $P(\omega)$ is always exponentially distributed. Like Scargle we also replace $t_j$ by $t_j - \tau$ with

$$\tau = \frac{1}{2\omega} \arctan\frac{\sum_{j=1}^{N_0} \sin 2\omega t_j}{\sum_{j=1}^{N_0} \cos 2\omega t_j}$$

in all formulas, to make the periodogram time-translation invariant.

3. Cumulative Noise

Assume now that the differences $Y_j = X_j - X_{j-1}$ are independently normally distributed with zero mean and variance $\sigma_0^2$. Then the distribution of powers in Scargle's periodogram depends on the angular frequency $\omega = 2\pi f$. See figure 1 for the result of a Monte Carlo simulation of this case. At small frequencies much higher Lomb powers can occur by chance than at high frequencies. This reflects the well-known fact that the expected power at frequency $f$ of the evenly spaced Fourier transform of a Brownian motion is proportional to $1/f^2$.

Our idea is now to normalize the periodogram by adjusting Scargle's methods to this case. First notice that

$$X_j - X_0 = \sum_{k=1}^{j} Y_k$$

in this situation. From this we obtain

$$C(\omega) = \sum_{j=1}^{n} (X_j - X_0)\cos\omega t_j = \sum_{j=1}^{n} \Bigl(\sum_{k=1}^{j} Y_k\Bigr)\cos\omega t_j = \sum_{k=1}^{n} Y_k \sum_{j=k}^{n} \cos\omega t_j$$

and a similar formula for $S(\omega)$. Notice that $C(\omega)$ and $S(\omega)$ are still normally distributed, with

$$\sigma_c^2 = \langle C^2(\omega)\rangle = \sum_{k,l} \langle Y_k, Y_l\rangle \Bigl(\sum_{j=k}^{n}\cos\omega t_j\Bigr)\Bigl(\sum_{j=l}^{n}\cos\omega t_j\Bigr) = \sigma_0^2 \sum_{k=1}^{n} \Bigl(\sum_{j=k}^{n}\cos\omega t_j\Bigr)^2$$

and

$$\sigma_s^2 = \langle S^2(\omega)\rangle = \sum_{k,l} \langle Y_k, Y_l\rangle \Bigl(\sum_{j=k}^{n}\sin\omega t_j\Bigr)\Bigl(\sum_{j=l}^{n}\sin\omega t_j\Bigr) = \sigma_0^2 \sum_{k=1}^{n} \Bigl(\sum_{j=k}^{n}\sin\omega t_j\Bigr)^2$$

and

$$\sigma_{cs} = \langle C(\omega) S(\omega)\rangle = \sum_{k,l} \langle Y_k, Y_l\rangle \Bigl(\sum_{j=k}^{n}\sin\omega t_j\Bigr)\Bigl(\sum_{j=l}^{n}\cos\omega t_j\Bigr) = \sigma_0^2 \sum_{k=1}^{n} \Bigl(\sum_{j=k}^{n}\cos\omega t_j\Bigr)\Bigl(\sum_{j=k}^{n}\sin\omega t_j\Bigr).$$

With these values

$$P(\omega) = \frac{1}{2}\,\frac{\sigma_s^2 C^2(\omega) + \sigma_c^2 S^2(\omega) - 2\sigma_{cs} C(\omega) S(\omega)}{\sigma_c^2\sigma_s^2 - \sigma_{cs}^2}$$

is again exponentially distributed:

$$\text{Probability}\bigl(P(\omega) > z\bigr) = \exp(-z).$$

In particular the distribution is now independent of $\omega$, as exemplified by figure 2, which shows the cumulative Lomb periodogram for a Monte Carlo simulation as above.

Notice that it is essential to consider the correlation between $C(\omega)$ and $S(\omega)$ in this case. Figure 3 shows the cumulative Lomb periodograms for 1000 random walks of length 500 in logarithmic time without the correlation term above. Notice how this introduces spurious peaks at several frequencies.

Again we replace $t_j$ by $t_j - \tau$ with

$$\tau = \frac{1}{2\omega} \arctan\frac{\sum_{j=1}^{N_0} \sin 2\omega t_j}{\sum_{j=1}^{N_0} \cos 2\omega t_j}$$

in all formulas, to make the periodogram time-translation invariant.

4. Application to log-periodicity in financial data

Recently Sornette and Zhou have suggested that there is a log-periodic signature in the S&P 500 index after the bursting of the new economy bubble [SZ02]. Among other methods they use Scargle's periodogram with the logarithmic time $\log(t_j - t_c)$ in place of $t_j$ and with $X_j$ the logarithm of the index price $j$ days after the critical date $t_c$. To account for a nonlinear trend of the form

$$A + B(t_j - t_c)^\alpha$$

they first detrend the index values. For this they use a value of $A$ obtained from a nonlinear fit of

$$A + B(t_j - t_c)^\alpha + C(t_j - t_c)^\alpha \cos\bigl(\omega \ln(t_j - t_c) + \varphi\bigr)$$

to the logarithm of the index price and then determine $B$ and $\alpha$ for different choices of $t_c$ by a linear regression. We eliminate the dependence on the nonlinear fitting procedure by using a slightly different method: fixing a critical date $t_c$, we optimise $A$ for the best correlation between the logarithmic time $\log(t_j - t_c)$ and $\log(x_j - A)$.

Figure 4 shows the highest Lomb powers obtained by this method for 2-year intervals starting from August 1st to September 5th, 2000. The highest peak in our dataset is observed for the critical date August 22nd, 2000. This is in reasonable agreement with the critical date found by Sornette and Zhou (August 9th, 2000).

To estimate the significance of this peak we calculated the 203 Lomb periodograms of 2-year intervals starting on the 22nd of each month from January 1984 to November 2000. Figure 5 shows a comparison of these periodograms with the one from August 22nd, 2000. Notice that the distribution of Lomb powers depends strongly on the frequency.
For small frequencies high Lomb powers are much more probable than at high frequencies. The absence of large powers for very small frequencies is due to the detrending procedure. These facts have also been observed by [HSS00] in a somewhat different setting. Notice also that the peak at $f \approx 1.6$ is nevertheless quite large; in fact it is the largest one observed at this frequency.

But how probable is it that we have a peak of this relative height for any of the tested frequencies? The S&P 500 data set is not large enough to estimate this probability accurately, but a count shows that there are 39 datasets that have at least one frequency with a peak that is higher than that of any other dataset at the same frequency. This gives a naive estimate of $39/202 \approx 19.3\%$.

The cumulative Lomb periodogram proposed above improves and greatly simplifies this analysis. Figure 6 shows the cumulative Lomb periodogram of the S&P 500 anti-bubble together with the cumulative Lomb periodograms of all other datasets. A detrending of the data as above was not necessary. Notice that there are now several peaks of comparable size at frequencies 1.7, 3.4, 7.4 and 8.4. The fact that these lie close to the harmonics of 1.7 compares well to the results of [ZS03]. We have included the theoretical 99.9%, 99% and 95% quantiles for each frequency derived above together with the 95% quantiles of the actual S&P 500 data. Notice the excellent agreement for frequencies greater than 1.

Since the height of the peaks is now largely independent of the frequency, we can estimate the global significance of the peak at 1.7 with cumulative Lomb power 5.61 by counting the number of cumulative Lomb periodograms with at least one peak of higher power. We find 12 of those, which implies a significance of approximately $12/203 \approx 5.9\%$. In figure 7 we compare the global significance of peaks in cumulative Lomb periodograms of the S&P 500 with those obtained from a Monte Carlo simulation of Brownian motion. There is a reasonable agreement for significances smaller than 0.1.

Notice that this result depends strongly on the number of frequencies considered. If we consider all frequencies up to the Nyquist frequency for the average sampling interval,

$$f_N = \frac{N_0}{2T} = \frac{500}{2\log(500)} \approx 40.2,$$

a peak of height 5.61 as above is found in 27.6% of the periodograms (estimated again by a Monte Carlo simulation of 1000 random walks with 500 steps each). So it seems that one needs some a priori reason for considering only small frequencies to justify the claim of a significant log-periodic signature in this case.

5. Conclusion

We have introduced a new version of the Lomb periodogram that exhibits good statistical properties when applied to cumulative noise. With this we were able to detect the log-periodic signature in the S&P 500 anti-bubble with better significance than with the usual periodogram, even without a detrending procedure. More importantly, this method allows us to estimate the significance of the log-periodic signature found, which is a reasonable 5.9% if we only consider frequencies smaller than 10.0, and a disappointing 27.6% if one considers all frequencies up to the Nyquist frequency for the average sampling interval. We also detect cumulative Lomb peaks of similar significance at harmonics of the fundamental frequency 1.7.

References

[Fei01a] James A. Feigenbaum. More on a statistical analysis of log-periodic precursors to financial crashes. Quantitative Finance, 1(5):527–532, 2001, cond-mat/0107445.
[Fei01b] James A. Feigenbaum. A statistical analysis of log-periodic precursors to financial crashes. Quantitative Finance, 1(3):346–360, 2001, cond-mat/0101031.
[HSS00] Y. Huang, H. Saleur, and D. Sornette. Reexamination of log-periodicity observed in the seismic precursors of the 1989 Loma Prieta earthquake. J. Geophysical Research, 105(B12):28111–28123, 2000, cond-mat/9911421.
[Lom76] N. R. Lomb. Least-squares frequency analysis of unequally spaced data. Astrophysics and Space Science, 39:447–462, 1976.
[Sca82] Jeffrey D. Scargle. Studies in astronomical time series analysis. II. Statistical aspects of spectral analysis of unevenly spaced data. Astrophysical Journal, 263:835–853, 1982.
[SJ01] D. Sornette and A. Johansen. Significance of log-periodic precursors to financial crashes. Quantitative Finance, 1(4):452–471, 2001, cond-mat/0106520.
[SZ02] D. Sornette and W.-X. Zhou. The US 2000–2002 market descent: How much longer and deeper? Quantitative Finance, 2(6):468–481, 2002, cond-mat/0209065.
[ZS02] Wei-Xing Zhou and Didier Sornette. Statistical significance of periodicity and log-periodicity with heavy-tailed correlated noise. Int. J. Mod. Phys. C, 13(2):137–170, 2002, cond-mat/0110445.
[ZS03] Wei-Xing Zhou and Didier Sornette. Renormalization group analysis of the 2000–2002 anti-bubble in the US S&P 500 index: Explanation of the hierarchy of 5 crashes and prediction. Physica A, 2003, physics/0301023.

Figure 1. Scargle's periodogram for 1000 random walks with normally distributed innovations of zero mean and unit variance. Each random walk has 500 steps, which are assumed to occur at times $\log(1), \ldots, \log(500)$.
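The kind of simulation behind this figure can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used for the paper: the function name `scargle_periodogram`, the five-point frequency grid, the seed, the reduced number of walks (200 instead of 1000), the centring of each walk and the normalisation by its sample variance are all choices made here. `arctan2` is used in place of the arctangent of the quotient; this can change $\tau$ by a quarter period, which does not alter the value of Scargle's periodogram.

```python
import numpy as np

def scargle_periodogram(t, x, freqs, sigma0_sq):
    """Scargle's normalised periodogram of x (assumed zero mean) at times t."""
    t = np.asarray(t, dtype=float)
    x = np.asarray(x, dtype=float)
    powers = np.empty(len(freqs))
    for i, f in enumerate(freqs):
        w = 2.0 * np.pi * f
        # Time shift tau that makes the periodogram time-translation invariant.
        tau = np.arctan2(np.sin(2 * w * t).sum(), np.cos(2 * w * t).sum()) / (2 * w)
        c = np.cos(w * (t - tau))
        s = np.sin(w * (t - tau))
        powers[i] = 0.5 * ((x @ c) ** 2 / (c @ c) + (x @ s) ** 2 / (s @ s)) / sigma0_sq
    return powers

rng = np.random.default_rng(0)                 # fixed seed, for reproducibility only
t = np.log(np.arange(1, 501))                  # 500 steps at times log(1), ..., log(500)
freqs = np.array([0.5, 1.0, 2.0, 4.0, 8.0])    # illustrative frequency grid

walk_powers = []
for _ in range(200):                           # 200 walks instead of 1000, to keep it quick
    x = np.cumsum(rng.standard_normal(t.size)) # random walk with unit-variance innovations
    x = x - x.mean()                           # assumption: centre each walk
    walk_powers.append(scargle_periodogram(t, x, freqs, x.var()))
mean_power = np.mean(walk_powers, axis=0)

for f, p in zip(freqs, mean_power):
    print(f"f = {f:4.1f}   mean Lomb power = {p:8.2f}")
```

The printed mean powers should fall off with increasing frequency, which is the frequency dependence that makes significance estimates difficult for cumulative noise.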

Figure 2. Cumulative Lomb periodogram for 1000 random walks with innovations of zero mean and unit variance. Each random walk has 500 steps, which are assumed to occur at times $\log(1), \ldots, \log(500)$. For the normalisation $\sigma_0 = 1$ has been used.
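A minimal sketch of the cumulative Lomb periodogram of Section 3, written directly from the formulas for $C(\omega)$, $S(\omega)$, $\sigma_c^2$, $\sigma_s^2$ and $\sigma_{cs}$, may help to make the construction concrete. The function name `cumulative_lomb`, its argument order, the default `x0 = 0`, the frequency grid and the seed are assumptions of this sketch. The inner sums $\sum_{j=k}^{n}$ are computed as reversed cumulative sums.

```python
import numpy as np

def cumulative_lomb(t, x, freqs, sigma0_sq, x0=0.0):
    """Cumulative Lomb periodogram of X_1..X_n at times t_1..t_n (start value x0)."""
    t = np.asarray(t, dtype=float)
    d = np.asarray(x, dtype=float) - x0            # X_j - X_0
    powers = np.empty(len(freqs))
    for i, f in enumerate(freqs):
        w = 2.0 * np.pi * f
        tau = np.arctan2(np.sin(2 * w * t).sum(), np.cos(2 * w * t).sum()) / (2 * w)
        cos_wt = np.cos(w * (t - tau))
        sin_wt = np.sin(w * (t - tau))
        C = d @ cos_wt
        S = d @ sin_wt
        # Tail sums a_k = sum_{j>=k} cos(w t_j) and b_k = sum_{j>=k} sin(w t_j).
        a = np.cumsum(cos_wt[::-1])[::-1]
        b = np.cumsum(sin_wt[::-1])[::-1]
        var_c = sigma0_sq * (a @ a)
        var_s = sigma0_sq * (b @ b)
        cov_cs = sigma0_sq * (a @ b)
        num = var_s * C**2 + var_c * S**2 - 2.0 * cov_cs * C * S
        powers[i] = 0.5 * num / (var_c * var_s - cov_cs**2)
    return powers

rng = np.random.default_rng(1)                     # fixed seed, for reproducibility only
t = np.log(np.arange(1, 501))                      # times log(1), ..., log(500)
freqs = np.array([0.5, 1.0, 2.0, 4.0, 8.0])        # illustrative frequency grid
powers = np.array([
    cumulative_lomb(t, np.cumsum(rng.standard_normal(t.size)), freqs, 1.0)
    for _ in range(1000)
])

print("mean power per frequency:", powers.mean(axis=0).round(2))
print("fraction above z = 3:", (powers > 3.0).mean(), " exp(-3) =", np.exp(-3.0))
```

If the normalisation works as derived, the mean power should be close to 1 at every frequency and the fraction of powers above $z$ close to $\exp(-z)$.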

Figure 3. Cumulative Lomb periodograms without the correction for correlation between $C(\omega)$ and $S(\omega)$ of 1000 random walks with innovations of zero mean and unit variance. Each of the random walks has 500 steps, which are assumed to occur at times $\log(1), \ldots, \log(500)$. $\sigma_0$ was estimated for each dataset. The omission of the correction terms introduces spurious peaks at several frequencies.

Figure 4. Highest peaks in Scargle's periodograms of 2-year windows of S&P 500 data starting from different days in August and September 2000. Before calculating the periodogram the price data has been detrended according to the procedure described in the text.
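The detrending procedure referred to in this caption (fix $t_c$, then choose $A$ for the best correlation between logarithmic time and $\log(x_j - A)$) could be sketched as follows. The text does not say how the optimisation is carried out, so several things here are assumptions: the helper name `detrend_power_law`, the grid search over $A$, the restriction to candidates below $\min(x)$ so that the logarithm is defined, and the definition of the detrended series as $x_j - A - B(t_j - t_c)^\alpha$. The data are synthetic with made-up parameters, not S&P 500 prices.

```python
import numpy as np

def detrend_power_law(days, x, a_grid):
    """Hypothetical helper: remove a trend A + B * days**alpha from x.

    days holds the times t_j - t_c (> 0) after the fixed critical date, and
    every candidate A in a_grid is assumed to lie below min(x).
    """
    log_days = np.log(days)
    best_r, a_opt = 0.0, None
    for a in a_grid:
        r = np.corrcoef(log_days, np.log(x - a))[0, 1]
        if a_opt is None or abs(r) > abs(best_r):
            best_r, a_opt = r, a
    # Linear regression of log(x - A) on log(days): slope alpha, intercept log(B).
    alpha, log_b = np.polyfit(log_days, np.log(x - a_opt), 1)
    trend = a_opt + np.exp(log_b) * days**alpha
    return x - trend, a_opt, alpha

# Synthetic check with made-up parameters A = 2.0, B = 0.3, alpha = 0.4.
days = np.arange(1.0, 501.0)
x = 2.0 + 0.3 * days**0.4
residual, a_opt, alpha = detrend_power_law(days, x, np.linspace(0.0, 2.2, 221))
print("A =", round(a_opt, 3), " alpha =", round(alpha, 3))
```

For a pure power-law trend, $\log(x_j - A)$ is exactly linear in logarithmic time at the true $A$, which is why maximising the correlation recovers it in this toy case.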

Figure 5. 202 Scargle periodograms for 2-year windows of S&P 500 data starting from the 22nd of each month. Before calculating the periodogram the price data has been detrended according to the procedure described in the text. For one window the detrending procedure did not converge.

Figure 6. 203 cumulative periodograms for 2-year windows of S&P 500 data starting from the 22nd of each month. The price data has not been detrended. $\sigma_0^2 = 0.000119403$ has been estimated from the total 18 years of data.
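The significance estimates discussed in Section 4 rest on two small computations: the theoretical quantiles that follow from $\text{Probability}(P(\omega) > z) = \exp(-z)$, and the counting estimate of global significance. A hedged sketch is given below; the function names `exponential_quantile` and `global_significance` and the three-row toy array are illustrative choices, not part of the paper.

```python
import numpy as np

def exponential_quantile(q):
    """Power z for which Probability(P > z) = exp(-z) equals 1 - q."""
    return -np.log(1.0 - q)

def global_significance(peak, reference_periodograms):
    """Fraction of reference periodograms with at least one power above peak."""
    highest = np.max(np.asarray(reference_periodograms, dtype=float), axis=1)
    return np.mean(highest > peak)

# Theoretical per-frequency quantiles of the cumulative Lomb power.
for q in (0.95, 0.99, 0.999):
    print(f"{100 * q:.1f}% quantile: z = {exponential_quantile(q):.3f}")

# Single-frequency tail probability of the observed power 5.61.
print("exp(-5.61) =", np.exp(-5.61))

# Nyquist frequency for 500 steps spread over logarithmic time T = log(500).
print("f_N =", 500 / (2 * np.log(500)))

# Toy example: three made-up periodograms, two of which exceed 5.61 somewhere.
toy = [[1.0, 2.0], [3.0, 7.0], [0.5, 6.0]]
print("global significance (toy data):", global_significance(5.61, toy))
```

The quantiles come out at roughly 3.00, 4.61 and 6.91. The single-frequency tail probability of a power of 5.61 is far smaller than the global estimate of $12/203$, which reflects the number of frequencies searched.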

Figure 7. Highest peaks of cumulative periodograms for 1000 random walks compared with those of 203 cumulative periodograms for 2-year windows of S&P 500 data starting from the 22nd of each month. The price data has not been detrended. $\sigma_0^2 = 0.000119403$ has been estimated from the total 18 years of data.

Institut für Mathematik (C), Universität Hannover, Welfengarten 1, D-30167 Hannover
E-mail address: bothmer@math.uni-hannover.de