Applying least absolute deviation regression to regressiontype estimation of the index of a stable distribution using the characteristic function

Similar documents
A Note on Box-Cox Quantile Regression Estimation of the Parameters of the Generalized Pareto Distribution

A goodness-of-fit test based on the empirical characteristic function and a comparison of tests for normality

The performance of univariate goodness-of-fit tests for normality based on the empirical characteristic function in large samples

G. R. Pasha Department of Statistics Bahauddin Zakariya University Multan, Pakistan

Double Stage Shrinkage Estimator of Two Parameters. Generalized Exponential Distribution

Confidence Interval for Standard Deviation of Normal Distribution with Known Coefficients of Variation

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

WEIGHTED LEAST SQUARES - used to give more emphasis to selected points in the analysis. Recall, in OLS we minimize Q =! % =!

Investigating the Significance of a Correlation Coefficient using Jackknife Estimates

Properties and Hypothesis Testing

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

Correlation Regression

Estimation of Gumbel Parameters under Ranked Set Sampling

Lecture 2: Monte Carlo Simulation

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

Linear Regression Models

Trimmed Mean as an Adaptive Robust Estimator of a Location Parameter for Weibull Distribution

MOMENT-METHOD ESTIMATION BASED ON CENSORED SAMPLE

A NEW METHOD FOR CONSTRUCTING APPROXIMATE CONFIDENCE INTERVALS FOR M-ESTU1ATES. Dennis D. Boos

Approximate Confidence Interval for the Reciprocal of a Normal Mean with a Known Coefficient of Variation

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. Comments:

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

n n i=1 Often we also need to estimate the variance. Below are three estimators each of which is optimal in some sense: n 1 i=1 k=1 i=1 k=1 i=1 k=1

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Chapter 8: Estimating with Confidence

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015

A statistical method to determine sample size to estimate characteristic value of soil parameters

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

17. Joint distributions of extreme order statistics Lehmann 5.1; Ferguson 15

This is an introductory course in Analysis of Variance and Design of Experiments.

Since X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain

Sample Size Estimation in the Proportional Hazards Model for K-sample or Regression Settings Scott S. Emerson, M.D., Ph.D.

Chapter 11 Output Analysis for a Single Model. Banks, Carson, Nelson & Nicol Discrete-Event System Simulation

Random Variables, Sampling and Estimation

1 Inferential Methods for Correlation and Regression Analysis

A proposed discrete distribution for the statistical modeling of

Estimation of the Mean and the ACVF

Assessment and Modeling of Forests. FR 4218 Spring Assignment 1 Solutions

On an Application of Bayesian Estimation


Comparison of Minimum Initial Capital with Investment and Non-investment Discrete Time Surplus Processes

11 Correlation and Regression

Linear Regression Demystified

Lecture Note 8 Point Estimators and Point Estimation Methods. MIT Spring 2006 Herman Bennett

THE SYSTEMATIC AND THE RANDOM. ERRORS - DUE TO ELEMENT TOLERANCES OF ELECTRICAL NETWORKS

3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N.

It should be unbiased, or approximately unbiased. Variance of the variance estimator should be small. That is, the variance estimator is stable.

Maximum likelihood estimation from record-breaking data for the generalized Pareto distribution

There is no straightforward approach for choosing the warmup period l.

Approximating the ruin probability of finite-time surplus process with Adaptive Moving Total Exponential Least Square

(7 One- and Two-Sample Estimation Problem )

In this section we derive some finite-sample properties of the OLS estimator. b is an estimator of β. It is a function of the random sample data.

Section 14. Simple linear regression.

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

Optimally Sparse SVMs

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

Stochastic Simulation

Confidence Intervals for the Population Proportion p

Statistical Analysis on Uncertainty for Autocorrelated Measurements and its Applications to Key Comparisons

Statistical Inference Based on Extremum Estimators

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

SYSTEMATIC SAMPLING FOR NON-LINEAR TREND IN MILK YIELD DATA

Economics 241B Relation to Method of Moments and Maximum Likelihood OLSE as a Maximum Likelihood Estimator

Maximum Likelihood Estimation

STAT 350 Handout 19 Sampling Distribution, Central Limit Theorem (6.6)

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors

A General Family of Estimators for Estimating Population Variance Using Known Value of Some Population Parameter(s)

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test.

Chapter If n is odd, the median is the exact middle number If n is even, the median is the average of the two middle numbers

5.4 The spatial error model Regression model with spatially autocorrelated errors

CALCULATING FIBONACCI VECTORS

CEU Department of Economics Econometrics 1, Problem Set 1 - Solutions

Extreme Value Charts and Analysis of Means (ANOM) Based on the Log Logistic Distribution

Goodness-Of-Fit For The Generalized Exponential Distribution. Abstract

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

For nominal data, we use mode to describe the central location instead of using sample mean/median.

Mixtures of Gaussians and the EM Algorithm

Average Number of Real Zeros of Random Fractional Polynomial-II

Sequences of Definite Integrals, Factorials and Double Factorials

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

GUIDELINES ON REPRESENTATIVE SAMPLING

Lecture 11 Simple Linear Regression

Simple Linear Regression

Bootstrap Intervals of the Parameters of Lognormal Distribution Using Power Rule Model and Accelerated Life Tests

Robust Default Correlation for Cost Risk Analysis

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1

Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE. Part 3: Summary of CI for µ Confidence Interval for a Population Proportion p

ANALYSIS OF EXPERIMENTAL ERRORS

Monte Carlo Integration

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Stat 139 Homework 7 Solutions, Fall 2015

A new distribution-free quantile estimator

Output Analysis (2, Chapters 10 &11 Law)

Simple Random Sampling!

Mathematical Modeling of Optimum 3 Step Stress Accelerated Life Testing for Generalized Pareto Distribution

Information-based Feature Selection

Quantile regression with multilayer perceptrons.

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance

Chapter 10: Power Series

Transcription:

Applyig least absolute deviatio regressio to regressiotype estimatio of the idex of a stable distributio usig the characteristic fuctio J. MARTIN VAN ZYL Departmet of Mathematical Statistics ad Actuarial Sciece, Uiversity of the Free State, PO Box 339, Bloemfotei, South Africa Abstract Least absolute deviatio regressio is applied usig a fixed umber of poits for all values of the idex to estimate the idex ad scale parameter of the stable distributio usig regressio methods based o the empirical characteristic fuctio. The recogized fixed umber of poits estimatio procedure uses te poits i the iterval zero to oe, ad least squares estimatio. It is show that usig the more robust least absolute regressio based o iteratively re-weighted least squares outperforms the least squares procedure with respect to bias ad also mea square error i smaller samples. Keywords Stable Distributio; Idex; Characteristic fuctio; Estimatio. Mathematical Subject Classificatio 62F10; 60E07; 62E10; 1. Itroductio A procedure usig robust LAD regressio based o the empirical characteristic fuctio (c.f.) evaluated at a fixed umber of poits to estimate parameters of the symmetric stable distributio is proposed. Deote the c.f. by φ ( t) Page 1. Let x,..., 1 x deote a sample of size i.i.d. observatios. The sample c.f. is estimated for a 1 give value of t as ˆ itx ( ) j φ t = e. Suppose the c.f. is estimated at K poits 1 j= 1 t,..., t, = 1,..., K. (1980) showed that the trasformatio K ˆ ca be used to costruct liear regressio equatios to 2 log( log( φ ( t ) )) estimate the parameters. The resultig regressio equatio is highly heteroscedastic ad there is also a much more complicated autocorrelatio

structure tha a simple autoregressive process of low order. (1980) foud that the optimal values for calculatig the empirical c.f. are t,..., t, t = π / 25, = 1,..., K. A complicatio is that K depeds o the 1 K uow value of the idex of the stable distributio ad the estimatio results are very sesitive to the umber of poits. Thus the two problems whe usig the empirical c.f. regressio approach is to fid the optimal value of K ad which poits must be chose. Methods were derived usig a fixed umber of poits ad other usig a umber which is a fuctio of the uow parameters. This wor will focus o comparig the LAD regressio estimatio procedure usig a fixed umber of poits for all values of the uow parameters with that of Kogo ad Williams (1998), which also mae use of a fixed umber of poits. The followig aspects will be tae ito accout i this wor: The iterval where the residual variace of the regressio reaches a miimum ad is most costat. It was foud that this iterval is approximately for t [0.5,1.0] ad usig poits chose i this iterval leads to excellet results with respect to the bias of the estimated parameters but performs reasoable with respect to MSE. The more robust least absolute regressio (LAD) maig use of iteratively reweighted least squares (IRLS) is tested. IRLS for LAD estimatio use weigths with are iversely proportioal to the absolute value of the residuals ad may perform good i regressio problems where heteroscedasticity is preset. It was foud that if a fixed umber of poits K=20, i the iterval [0.1,1.0] is used, the excellet estimatio results with respect to both MSE ad bias were foud over the whole rage of parameters. The sesitivity of the various procedures with respect to the umber of poits ad which poits are chose. It was foud that the LAD procedure with poits chose o the iterval [0.1,1.0], the results is robust with Page 2

respect to the choice of K, but for the poits t,..., t, t = π / 25, = 1,..., K, the wrog choice of K ca lead to very 1 K biased estimators of the idex whe least squares estimatio is performed. The more robust LAD regressio usig the IRLS method ad a fixed umber of poits outperforms the Kogo ad Williams (1998) procedure i samples with up to a few hudred observatios. Kogo ad Williams (1998) suggested te poits [0.1, 0.2,,1.0] usig least squares regressio. Some sewess i the data does have a small ifluece o the estimatio results. LAD estimatio outperforms the (1980) estimatio method where the umber ad choice of poits where the c.f. is calculated is chose usig iitial estimatio of the parameters. For a give sample, the c.f., φ ( t), regressio equatios are formed based o calculatig the empirical characteristic fuctio at poits t 1,..., t, t = π / 25, = 1,..., K. I practice the parameters ad specifically K the idex is uow ad K is a fuctio of the uow idex of the stable distributio. Whe the c.f. is calculated at the optimal poits this method performs excellet, but the method of (1980) is very biased whe choosig K icorrectly. This wor will focus o the estimatio of the idex of symmetrically stable distributed data. Such data are ofte used i maret ris aalysis ad especially whe worig with log returs of assets traded i a maret. Some of this is reviewed i the boos by Cize, Härdle ad Wero, eds. (2011), Getle, Härdle, Mori, eds, 2004. Deote the c.f. of the stable distributio by φ ( t) where log ( t) α α φ = σ t {1 iβsig( t) ta( πα / 2)} + iµ t, α 1, ad log φ( t) = σ t {1 + iβsig( t)(2 / π )log( t )} + iµ t, α = 1. Page 3

The parameters are the idex α (0,2], scale parameter σ > 0, coefficiet of sewess β [ 1,1] ad mode µ. The symmetric case with µ = 0, β = 0 will be cosidered i this wor. (1980) made use of the properties of the c.f. 2 α α ad usig the fact that φ( t) = exp( 2 σ t ) derived the model which does ot ivolve β ad µ whe estimatig the idex α ad σ : φ t = σ + α t, (1.1) 2 α log( log( ( ) )) log(2 ) log( ) a simple liear regressio model ca be formed y = m + αω + ε. (1.2) The c.f. is estimated for a give value of t, for a sample of size i.i.d. observatios,..., 1 x1 x, as ˆ itx ( ) j φ t = e, ad j= 1 y = ˆ φ, 2 log( log( ( t ) )) α m = log(2 σ ), ω = log( t ), ε a error term. (1980) derived the optimal poits t = π / 25, = 1,..., K, ad optimal values of K was suggested for various sample sizes ad α ' s. I practice for a specific sample size α is uow ad choosig it icorrectly leads to icorrect estimatio results. A expressio for the covariace ˆ φ ˆ φ ad thus the variace of 2 2 cov( ( t j ), ( t ) ) ˆ ( ) 2 φ t j is give by (1980). This expressio depeds o the 2 uow parameters, ad thus also Var(log( log( ˆ φ ( t ) ))) maig weighted regressio problematic. j Paulso et al. (1997) showed that by usig stadardized data estimatio results ca be improved ad all estimatio i this wor will be performed o stadardized data. (1980) foud that this regressio equatio does ot deped o the locatio parameter. (1980) suggested usig a trucated mea of 25% ad the Fama ad Roll ( 1971) estimator of the scale parameter σ : ˆ σ = ( x x ) /1.654,.72.28 Page 4

where the.72 ad.28 deote percetiles of the data, to stadardize the data. The same stadardizatio will be used before applyig the LAD regressio. Kogo ad Williams (1998) used iitial estimators usig the method of McCullogh (1986) to perform stadardizatio. The LAD results will be compared o stadardized data as described above agaist the method assumig K ow, K estimated ad also the Kogo-Williams procedure (Bora ad Wero (2010c)). The optimal K for the procedure of (1980) is chose usig a iitial estimate of the idex usig the Mcullogh (1986) estimate as applied by Bora ad Wero (1910a). Much research was doe o usig a approximate covariace matrix ad geeralized least squares to estimate the parameters. A excellet overview of this approach is give by Besbeas ad Morga (2008). They suggested usig arithmetic spacig of t s which performs very well but the optimal umber of poits chose is also ot idepedet of the uow parameters. The wor of Feuerverger ad McDuough (1981a, 1981b,1981c), ad Bauer (1982) is also of importace where weighted least squares ad geeralised least squares were applied. But also o defiite umber of samplig poits which will perform good over the whole rage of the idex. It is show i sectio 2 that the residual variace is highly heteroscedastistic with respect to t. This might lead to a decrease i the efficiecy, ad also icorrect estimates of the variaces of the estimated parameters. The variace of the residuals, ε 's for a give t ad the true parameters, is estimated usig simulated samples. Residuals of a sample was calculated usig the true parameters as ε = y m αω, ad from these residuals the variace, Var( ε ) = Var( ε t, σ, α), was estimated. The sesitivity of the procedure with respect to K is also ivestigated. I sectio 3 simulatio studies were coducted to compare the various estimators. Page 5

2. The residual variace whe usig regressio type methods based o the empirical characteristic fuctio The residual variace, autocorrelatio structure whe usig least squares ad the sesitivity whe choosig K icorrectly with respect to the optimal K will be ivestigated i this sectio. The programs of Bora ad Wero (2010a), (2010b), (2010c), Bora et al (2011) were used whe applyig the method of Kogo ad Williams (1998) ad also whe applyig the method of (1980) usig iitial estimated parameters. Their iitial estimatio ad also stadardizatio was accordig to the wor of McCullogh (1986). I figure 1 the estimated residual variace based o the true parameters usig (1.2) ad o-stadardized data is plotted for various values of t. The variace of the residuals of M = 1000 samples of size = 200 each, with respect to t is show. The data was simulated with α = 1.5, σ = 0.1, β = 0, µ = 0. Similar patters was observed for other values of α, σ. The error variace is smallest i the iterval with t betwee 0.5 ad 1.0. The true parameters are used i the calculatio of the error variace, for example 2 2 ˆ ˆ t t t,, var( 1,..., M t,, ), σ = σ σ α = ε ε α σ, where ε α, m, t = y m α log( t), y = ˆ φ t j = M. j 2 log( log( ( ) )), 1,..., j j 0.11 0.1 0.09 residual variace for a give t 0.08 0.07 0.06 0.05 0.04 0.03 0.02 0.01 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 t Figure 1 Estimated residual variaces for give values of t, α = 1.5, σ = 0.1, = 200. Page 6

I figure 2 the estimated residual variace based o the estimated parameters usig (1.2) ad stadardized data is plotted for various values of t. The data stadardized usig the program by Bora ad Wero (2010a). Similar patters of the error variaces were observed for differet values of the idex. The autocorrelatio fuctio plot of the residuals for a specific sample is show i figure 3, which shows that a simple autoregressive type model will ot fit the residuals. Experimetatio showed that the patter is a complicated ARMA type model with terms of high order which is ot easily idetifiable, maig the use of regressio models with autocorrelated residuals very difficult. 0.03 0.025 residual variace 0.02 0.015 0.01 0.005 0 0.5 1 1.5 2 2.5 3 3.5 4 t Figure 2 Estimated residual variaces for K=28 values of t, α = 0.9, σ = 1.0, = 200. Stadardized data ad variaces calculated from 500 regressios. Page 7

1 0.8 Sample Autocorrelatio 0.6 0.4 0.2 0-0.2-0.4 0 2 4 6 8 10 12 14 16 18 20 Lag Figure 3 ACF of residuals, α = 0.9, σ = 1.0, = 200. Stadardized data. The bias whe usig the icorrect umber of poits whe usig the (1980) procedure o stadardized data is show. The estimated parameters was calculated based o 500 estimated values of the idex α = 1.3 for which the optimal K=22. The LAD estimated idex for various values of K is also show, ad it ca be see that the result is ot very sesitive with respect to K. 1.35 1.3 1.25 estimated idex 1.2 1.15 1.1 1.05 1 0.95 10 15 20 25 30 35 40 K Figure 4 Estimated idex, various values of K, α = 1.3, σ = 1.0, β = 0.0, = 200. The solid lie whe usig π / 25, = 1,..., K. Average of 500 estimated idexes at each K. ad the dashed lie K LAD regressio usig K poits i the iterval [0.1:1.0]. Page 8

This small error variace together with the research of Kogo ad Williams (1998) motivated the use of iterval [0.1, 1.0] whe usig LAD regressio. Experimetatio showed that 20 uiformly distributed poits yielded good estimatio results whe usig LAD, with respect to bias ad MSE. 3. Compariso betwee estimatio procedures I this sectio a simulatio study was coducted. Stadardizatio was performed o all the data before estimatio ad the locatio parameter µ = 0, scale parameter σ = 1.0 was used for all the simulatios. Samples from symmetric distributios with locatio parameter zero were cosidered. For the LAD method, data was stadardized usig the Fama ad Roll (1971) estimator ad a 25% trimmed mea. K=20 poits were used for all values of the idex, ad the poits used were t = 0.1+ 0.05( 1), = 1,..., 20. Iteratively reweighted least squares (IRLS) ca be used to miimize with respect to the absolute value orm if the weight matrix is a diagoal matrix with diagoal elemets equal to the iverse of the absolute value of the residuals. This techique was applied to fid the LAD estimators. The method ivolves a weighted least squares multiple liear regressio form, calculated iteratively, usig a diagoal weight matrix ( j) W with diagoal elemets with w = 1/(1 + e ), i = 1,..., 20, where e i is the i-th residual at iteratio j. This form is chose to avoid divisio by zero ad it ca be see that it is weighted regressio with weights iversely related to the size of the absolute value of the residuals. The results are based o 10000 simulated samples each time. No adjustmet was made if the LAD estimate was larger tha 2. ( j) i i For the Kogo ad Williams (1998) method the poits t = 0.1+ 0.1( 1), = 1,...,10 were used. The results of the (1980) procedure choosig K usig the true idex is icluded as a referece. It should be oted this is ot comparable to the other three methods with respect to practical problems where there is o prior owledge of the parameters ad all are Page 9

parameters are calculated usig the sample. The Bora ad Wero (2010a) method base the umber of poits o estimated iitial values. 3.1 Results for the estimatio of the idex The assumptio is ot made that the mea is zero ad stadardizatio is carried out also with respect to the mea i the simulatios. For the idex estimatio, the LAD methods outperforms the other procedures with respect to MSE i smaller samples ad much the same whe it was tested o sewed data where β = 0.5. It seems that i larger samples fidig iitial estimates ad the usig the optimal poits yields the best results. It ca be oted that the Fama ad Roll (1971) estimatio method which is used for stadardizatio, is strictly speaig valid for α 1, which may explai the wea estimatio results whe usig these estimates to stadardize the data whe α < 1, ad thus leads to poor results especially whe α = 0.7 ad usig the method of (1980). This poit was ot icluded i the figures. I table 1 the performace of usig least squares estimatio usig te poits uiformly chose i the iterval [0.5,1] is show. It ca be see that it yields excellet results with respect to bias, but the MSE is weaer tha the Kogo- Williams procedure. Overall LAD regressio is best to use whe both bias ad MSE is tae ito accout with a MSE cosidered as least or more importat tha bias. α LAD Least Squares [0.5,1.0] Kogo-Williams [0.1,1.0] ˆα Bias MSE ˆα Bias MSE ˆα Bias MSE 1.9 1.9043 0.0043 0.0069 1.8934-0.0066 0.0101 1.9046 0.0046 0.0073 1.5 1.5103 0.0103 0.0159 1.4947-0.0053 0.0271 1.5131 0.0131 0.0175 1.3 1.3087 0.0087 0.0141 1.2958-0.0042 0.0329 1.3110 0.0110 0.0150 1.1 1.1070 0.0070 0.0122 1.0971-0.0029 0.0387 1.1100 0.0100 0.0127 0.9 0.9034 0.0034 0.0094 0.8978-0.0022 0.0420 0.9065 0.0065 0.0095 0.7 0.7035 0.0035 0.0075 0.6990-0.0010 0.0440 0.7070 0.0070 0.0075 Table 1 Compariso of estimatio procedures of α with respect to bias ad MSE, Page 10

σ = 1.0, µ = 0, β = 0, = 100. LAD usig 20 poits o the iterval [0.1,1.0], least squares 10 poits o the iterval [0.5,1], ad the Kogo-Williams procedure 10 poits o the iterval [0.1,1]. α LAD (K optimal true α ) (Bora ad Wero) Kogo-Williams ˆα Bias MSE ˆα Bias MSE ˆα Bias MSE ˆα Bias MSE 1.9 1.9053 0.0053 0.0125 1.8965-0.0035 0.0133 1.8921-0.0079 0.0121 1.9039 0.0039 0.0121 1.5 1.5230 0.0230 0.0320 1.5151 0.0151 0.0287 1.4986-0.0014 0.0369 1.5283 0.0283 0.0347 1.3 1.3185 0.0185 0.0299 1.3129 0.0129 0.0269 1.2807-0.0193 0.0344 1.3245 0.0245 0.0319 1.1 1.1120 0.0120 0.0250 1.1090 0.0090 0.0229 1.0692-0.0308 0.0278 1.1181 0.0181 0.0258 0.9 0.9073 0.0073 0.0203 0.9059 0.0059 0.0188 0.8538-0.0462 0.0310 0.9160 0.0160 0.0207 0.7 0.7059 0.0059 0.0156 0.7049 0.0049 0.0148 0.5767-0.1233 0.0416 0.7128 0.0128 0.0158 Table 2 Compariso of estimatio procedures of α with respect to bias ad MSE, σ = 1.0, µ = 0, β = 0, = 100. 0.04 0.035 MSE estimated idex 0.03 0.025 0.02 0.015 0.01 0.9 1 1.1 1.2 1.3 1.4 1.5 1.6 1.7 1.8 1.9 idex α Figure 5 MSE of three procedures for various values of the idex α. Symmetric data ad estimatio performed o stadardized data, =100, 10000 estimated samples. Solid lie - LAD, dash dot - Kogo-Williams method ad dashed the method of with the umber of poits chose usig the method of McCullogh. Page 11

α LAD (K optimal true α ) (Bora ad Wero) Kogo-Williams ˆα Bias MSE ˆα Bias MSE ˆα Bias MSE ˆα Bias MSE 1.9 1.9033 0.0033 0.0069 1.9010 0.0010 0.0070 1.8981-0.0019 0.0065 1.9034 0.0034 0.0073 1.5 1.5104 0.0104 0.0154 1.5038 0.0038 0.0132 1.5001 0.0001 0.0167 1.5135 0.0135 0.0169 1.3 1.3093 0.0093 0.0143 1.2542-0.0458 0.0139 1.2916-0.0084 0.0152 1.3123 0.0123 0.0153 1.1 1.1071 0.0071 0.0119 1.0687-0.0313 0.0104 1.0846-0.0154 0.0121 1.1095 0.0095 0.0123 0.9 0.9039 0.0039 0.0096 0.8830-0.0170 0.0074 0.8914-0.0086 0.0097 0.9069 0.0069 0.0097 0.7 0.7031 0.0031 0.0075 0.7002 0.0002 0.0054 0.6340-0.0660 0.0171 0.7062 0.0062 0.0074 Table 3 Compariso of estimatio procedures of α with respect to bias ad MSE, σ = 1.0, µ = 0, β = 0, = 200. The MSE of the various procedures is show i figure 6. 0.02 0.018 0.016 MSE estimated idex 0.014 0.012 0.01 0.008 0.006 0.9 1 1.1 1.2 1.3 1.4 1.5 1.6 1.7 1.8 1.9 idex α Figure 6 MSE of three procedures for various values of the idex α. Symmetric data ad estimatio performed o stadardized data, =200, 10000 estimated samples. Solid lie - LAD, dash dot - Kogo-Williams method ad dashed the method of with the umber of poits chose usig the method of McCullogh. Page 12

α LAD (K optimal true α ) Kogo-Williams ˆα Bias MSE ˆα Bias MSE ˆα Bias MSE ˆα Bias MSE 1.9 1.9007 0.0007 0.0019 1.9002 0.0002 0.0019 1.8999-0.0001 0.0019 1.9010 0.0010 0.0022 1.5 1.5035 0.0035 0.0039 1.5020 0.0020 0.0033 1.5034 0.0034 0.0037 1.5041 0.0041 0.0043 1.3 1.3024 0.0024 0.0035 1.2987-0.0013 0.0031 1.2970-0.0030 0.0034 1.3030 0.0030 0.0038 1.1 1.1014 0.0014 0.0030 1.0990-0.0010 0.0025 1.0969-0.0031 0.0030 1.1019 0.0019 0.0031 0.9 0.9015 0.0015 0.0024 0.9001 0.0001 0.0021 0.8983-0.0017 0.0022 0.9022 0.0022 0.0024 0.7 0.7005 0.0005 0.0018 0.7010 0.0010 0.0014 0.6794-0.0206 0.0034 0.7014 0.0014 0.0018 Table 4 Compariso of estimatio procedures of α with respect to bias ad MSE, σ = 1.0, µ = 0, β = 0, = 800. 5 x 10-3 4.5 4 MSE estimated idex 3.5 3 2.5 2 1.5 0.9 1 1.1 1.2 1.3 1.4 1.5 1.6 1.7 1.8 1.9 idex α Figure 7 MSE of three procedures for various values of the idex α. Symmetric data ad estimatio performed o stadardized data, =800, 10000 estimated samples. Solid lie - LAD, dash dot - Kogo-Williams method ad dashed the method of with the umber of poits chose usig the method of McCullogh. Page 13

α LAD (K optimal true α ) Kogo-Williams ˆα Bias MSE ˆα Bias MSE ˆα Bias MSE ˆα Bias MSE 1.9 1.9034 0.0034 0.0070 1.8960-0.0040 0.0074 1.8982-0.0018 0.0065 1.9034 0.0034 0.0073 1.5 1.5119 0.0119 0.0159 1.5058 0.0058 0.0134 1.5021 0.0021 0.0162 1.5141 0.0141 0.0170 1.3 1.3102 0.0102 0.0151 1.2667-0.0333 0.0119 1.2909-0.0091 0.0154 1.3123 0.0123 0.0155 1.1 1.1088 0.0088 0.0129 1.0879-0.0121 0.0086 1.0875-0.0125 0.0123 1.1135 0.0135 0.0125 0.9 0.9078 0.0078 0.0107 0.8949-0.0051 0.0064 0.8853-0.0147 0.0107 0.9157 0.0157 0.0100 0.7 0.7066 0.0066 0.0085 0.7031 0.0031 0.0048 0.6080-0.0920 0.0226 0.7164 0.0164 0.0081 Table 5 Compariso of estimatio procedures of α with respect to bias ad MSE, σ = 1.0, µ = 0, β = 0.5, = 200. 0.02 0.018 0.016 MSE estimated idex 0.014 0.012 0.01 0.008 0.006 0.9 1 1.1 1.2 1.3 1.4 1.5 1.6 1.7 1.8 1.9 idex α Figure 8 MSE of three procedures for various values of the idex α. β = 0.5 ad estimatio performed o stadardized data, =200, 10000 estimated samples. Solid lie - LAD, dash dot - Kogo-Williams method ad dashed the method of with the umber of poits chose usig the method of McCullogh. 3.2 Results for the estimatio of the scale parameter There is ot much differece betwee the estimators of the scale parameters. Agai it seems that LAD will be best to apply whe the samples are smaller ad i large samples the other procedures performs well. Page 14

LAD (K optimal) Kogo-Williams ˆ σ Bias MSE ˆ σ Bias MSE ˆ σ Bias MSE ˆ σ Bias MSE 1.9 0.9914-0.0086 0.0073 0.9867-0.0133 0.0073 0.9877-0.0123 0.0121 0.9920-0.0080 0.0073 1.5 0.9942-0.0058 0.0135 0.9891-0.0109 0.0139 0.9806-0.0194 0.0369 0.9954-0.0046 0.0139 1.3 0.9917-0.0083 0.0177 0.9875-0.0125 0.0178 0.9695-0.0305 0.0344 0.9940-0.0060 0.0178 1.1 0.9868-0.0132 0.0249 0.9840-0.0160 0.0248 0.9660-0.0340 0.0278 0.9915-0.0085 0.0248 0.9 0.9869-0.0131 0.0375 0.9854-0.0146 0.0367 0.9763-0.0237 0.0310 0.9952-0.0048 0.0367 0.7 0.9903-0.0097 0.0640 0.9888-0.0112 0.0600 1.3340 0.3340 0.0416 0.9987-0.0013 0.0600 Table 6 Compariso of estimatio procedures of σ with respect to bias ad MSE, σ = 1.0, µ = 0, β = 0, = 100. 0.07 0.06 MSE estimated scale parameter 0.05 0.04 0.03 0.02 0.01 0 0.6 0.8 1 1.2 1.4 1.6 1.8 2 idex α Figure 9 MSE of three procedures for various values of the scale parameterσ. Symmetric data ad estimatio performed o stadardized data, =100, 10000 samples. Solid lie - LAD, dash dot - Kogo-Williams method ad dashed the method of with the umber of poits chose usig the method of McCullogh. Page 15

LAD (K optimal) Kogo-Williams ˆ σ Bias MSE ˆ σ Bias MSE ˆ σ Bias MSE ˆ σ Bias MSE 1.9 0.9954-0.0046 0.0036 0.9943-0.0057 0.0036 0.9939-0.0061 0.0065 0.9957-0.0043 0.0036 1.5 0.9958-0.0042 0.0066 0.9918-0.0082 0.0068 0.9900-0.0100 0.0167 0.9965-0.0035 0.0068 1.3 0.9968-0.0032 0.0090 0.9668-0.0332 0.0092 0.9865-0.0135 0.0152 0.9979-0.0021 0.0092 1.1 0.9957-0.0043 0.0122 0.9727-0.0273 0.0123 0.9816-0.0184 0.0121 0.9977-0.0023 0.0123 0.9 0.9926-0.0074 0.0182 0.9809-0.0191 0.0181 0.9875-0.0125 0.0097 0.9958-0.0042 0.0181 0.7 0.9961-0.0039 0.0307 0.9930-0.0070 0.0296 0.9967-0.0033 0.0171 0.9999-0.0001 0.0296 Table 7 Compariso of estimatio procedures of σ with respect to bias ad MSE, σ = 1.0, µ = 0, β = 0, = 200. A plot of the bias of the various procedures is show i figure 6. It ca be see that overall all the methods performs well with respect to the estimatio of the scale parameter ad there is little differece betwee the methods. I the followig tables the procedures were compared usig sewed data with β = 0.5. It ca be see that usig LAD yielded good results. LAD (K optimal) Kogo-Williams ˆ σ Bias MSE ˆ σ Bias MSE ˆ σ Bias MSE ˆ σ Bias MSE 1.9 0.9960-0.0040 0.0037 0.9948-0.0052 0.0037 0.9943-0.0057 0.0066 0.9963-0.0037 0.0037 1.5 0.9963-0.0037 0.0067 0.9929-0.0071 0.0068 0.9912-0.0088 0.0162 0.9966-0.0034 0.0068 1.3 0.9964-0.0036 0.0090 0.9713-0.0287 0.0087 0.9849-0.0151 0.0154 0.9977-0.0023 0.0087 1.1 0.9949-0.0051 0.0134 0.9807-0.0193 0.0123 0.9813-0.0187 0.0123 1.0000-0.0000 0.0123 0.9 0.9991-0.0009 0.0214 0.9867-0.0133 0.0182 0.9953-0.0047 0.0107 1.0079 0.0079 0.0182 0.7 0.9987-0.0013 0.0399 0.9926-0.0074 0.0294 0.9897-0.0103 0.0226 1.0112 0.0112 0.0294 Table 8 Compariso of estimatio procedures of σ with respect to bias ad MSE, σ = 1.0, µ = 0, β = 0.5, = 200. I table 9 results are give for sample size =800, 10000 simulated samples. Page 16

LAD (K optimal) Kogo-Williams ˆ σ Bias MSE ˆ σ Bias MSE ˆ σ Bias MSE ˆ σ Bias MSE 1.9 0.9995-0.0005 0.0009 0.9993-0.0007 0.0009 0.9992-0.0008 0.0019 0.9996-0.0004 0.0009 1.5 0.9995-0.0005 0.0016 0.9987-0.0013 0.0017 0.9996-0.0004 0.0037 0.9996-0.0004 0.0017 1.3 0.9983-0.0017 0.0022 0.9960-0.0040 0.0023 0.9950-0.0050 0.0034 0.9985-0.0015 0.0023 1.1 0.9982-0.0018 0.0031 0.9966-0.0034 0.0031 0.9953-0.0047 0.0030 0.9986-0.0014 0.0031 0.9 0.9989-0.0011 0.0045 0.9977-0.0023 0.0046 0.9969-0.0031 0.0022 0.9996-0.0004 0.0046 0.7 0.9972-0.0028 0.0074 0.9988-0.0012 0.0074 1.0048 0.0048 0.0034 0.9985-0.0015 0.0074 Table 9 Compariso of estimatio procedures of σ with respect to bias ad MSE, σ = 1.0, µ = 0, β = 0, = 800. It ca be see that the regressio usig LAD ad a fixed umber of poits to evaluate the empirical characteristic fuctio performs almost everywhere better that the Kogo-Williams ad the procedure where the umber of poits is based o iitial estimated parameters. 4. Coclusios LAD regressio performs better tha usig least squares Kogo ad Williams (1998). The estimated ˆα ca also be used to guess the best value of K, ad the performig estimatio based o the poits t = π / 25, = 1,..., K. Because the bias is very small, also whe usig least squares, usig the iterval [0.5,1.0], might be a very easy to calculate good estimate of K before usig the optimal poits derived by (1980). Refiemets ca be made to this procedure, with respect to the umber of t s used i the regressio. The simulatio study shows that the method performs well with respect to bias ad MSE over the whole rage of parameters commoly ecoutered i practical problems. Page 17

Refereces Besbeas, P, Morga, BJT (2008). Improved estimatio of the stable laws. Stat Comput 18: 219 231. Bora, S., Misiore, A. ad Wero, R. (2011). Models for Heavy-tailed Asset Returs, I Statistical Tools for Fiace ad Isurace, P.Cize, W.Härdle, R.Wero (Eds.), 2d editio, Heidelberg : Spriger-Verlag. Bora, S. ad Wero, R. (2010a). STABLEREG: MATLAB fuctio to estimate stable distributio parameters usig the regressio method of, Statistical Software Compoets M429005, Bosto College Departmet of Ecoomics. Bora, S. ad Wero, R. (2010b). STABLECULL: MATLAB fuctio to estimate stable distributio parameters usig the quatile method of McCulloch, Statistical Software Compoets M429004, Bosto College Departmet of Ecoomics. Bora, S. ad Wero, R. (2010c). STABLEREGKW: MATLAB fuctio to estimate stable distributio parameters usig the regressio method of Kogo ad Williams, Statistical Software Compoets M429004, Bosto College Departmet of Ecoomics. Cize,P., Härdle, W., Wero, R., Eds. (2011). Statistical Tools for Fiace ad Isurace, 2d editio, Spriger-Verlag: Heidelberg Fama, E.F., Roll, R. (1971). Parameter Estimatio for Symmetric Stable Distributios, J. of the Am. Stat Assoc., 66, 331 338. Feuerverger, A. ad McDuough, P. (1981a). O the efficiecy of empirical characteristic fuctio procedures. J. Roy. Stat. Soc. Ser B, 43, 20-27. Feuerverger, A. ad McDuough, P. (1981b). O some Fourier methods for iferece. J. Am. Stat. Assoc., 76, 379 387. Feuerverger, A. ad McDuough, P. (1981c). O efficiet iferece i symmetric stable laws ad processes. I Csörgö, M., Dawso, D., Rao, J., Saleh, A. (eds.). Statistics ad Related Topics, vol. 99, pp. 109 112. North-Hollad, Amsterdam. Getle, J.E., Härdle, W., Mori, Y., Eds. (2004). Hadboo of Computatioal Statistics: Cocepts ad Methods. New Yor: Spriger. Kogo, S.M. ad Williams, D.B. (1998). Characteristic fuctio based estimatio of stable distributios parameters. I: Adler, R.J, Feldma, R.E, Taqqu, M.S. (Eds.). A Practical Guide to Heavy Tails, pp. 311 355. Bosto: Birhäuser., I.A. (1980). Regressio-type Estimatio of the Parameters of Stable Laws. J Am Stat Ass 75: 918 928., I.A. ad Bauer, D.F. (1982). Asymptotic distributio of regressio-type estimators of stable laws. Commu. Stat. - Theory ad Methods, 11, 2715 2730. McCulloch, J.H. (1986). Simple Cosistet Estimators of Stable Distributios, Comm. i Statistics Simulatio ad Computatio, 15, 1109 1136. Paulso, A.S., Holcomb, W.E., Leitch, R.A. (1975). The Estimatio of the Parameters of the Stable Laws, Biometria, 62, 163 170. Page 18