THE DATA-BASED CHOICE OF BANDWIDTH FOR KERNEL QUANTILE ESTIMATOR OF VAR

Similar documents
Bull. Korean Math. Soc. 36 (1999), No. 3, pp. 451{457 THE STRONG CONSISTENCY OF NONLINEAR REGRESSION QUANTILES ESTIMATORS Seung Hoe Choi and Hae Kyung

Bayesian and E- Bayesian Method of Estimation of Parameter of Rayleigh Distribution- A Bayesian Approach under Linex Loss Function

Double Stage Shrinkage Estimator of Two Parameters. Generalized Exponential Distribution

A note on self-normalized Dickey-Fuller test for unit root in autoregressive time series with GARCH errors

Precise Rates in Complete Moment Convergence for Negatively Associated Sequences

A NEW METHOD FOR CONSTRUCTING APPROXIMATE CONFIDENCE INTERVALS FOR M-ESTU1ATES. Dennis D. Boos

Confidence interval for the two-parameter exponentiated Gumbel distribution based on record values

MOMENT-METHOD ESTIMATION BASED ON CENSORED SAMPLE

Minimax Estimation of the Parameter of Maxwell Distribution Under Different Loss Functions

A Note on Box-Cox Quantile Regression Estimation of the Parameters of the Generalized Pareto Distribution

Chapter 6 Sampling Distributions

A new distribution-free quantile estimator

Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence

Kernel density estimator

Mathematical Modeling of Optimum 3 Step Stress Accelerated Life Testing for Generalized Pareto Distribution

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

CEU Department of Economics Econometrics 1, Problem Set 1 - Solutions

A Weak Law of Large Numbers Under Weak Mixing

Research Article On the Strong Laws for Weighted Sums of ρ -Mixing Random Variables

Self-normalized deviation inequalities with application to t-statistic

A statistical method to determine sample size to estimate characteristic value of soil parameters

Random Variables, Sampling and Estimation

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Confidence Interval for Standard Deviation of Normal Distribution with Known Coefficients of Variation

An Extreme Value Theory Approach for Analyzing the Extreme Risk of the Gold Prices

Composite Quantile Generalized Quasi-Likelihood Ratio Tests for Varying Coefficient Regression Models Jin-ju XU 1 and Zhong-hua LUO 2,*

Approximate Confidence Interval for the Reciprocal of a Normal Mean with a Known Coefficient of Variation

SOME THEORY AND PRACTICE OF STATISTICS by Howard G. Tucker

Bootstrap Intervals of the Parameters of Lognormal Distribution Using Power Rule Model and Accelerated Life Tests

An Introduction to Asymptotic Theory

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. Comments:

Maximum likelihood estimation from record-breaking data for the generalized Pareto distribution

ESTIMATION AND PREDICTION BASED ON K-RECORD VALUES FROM NORMAL DISTRIBUTION

Review Article Complete Convergence for Negatively Dependent Sequences of Random Variables

A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS

Product Mix Problem with Radom Return and Preference of Production Quantity. Osaka University Japan

Lecture 19: Convergence

Akaike Information Criterion and Fourth-Order Kernel Method for Line Transect Sampling (LTS)

Lecture 33: Bootstrap

A proposed discrete distribution for the statistical modeling of

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

Trimmed Mean as an Adaptive Robust Estimator of a Location Parameter for Weibull Distribution

Investigating the Significance of a Correlation Coefficient using Jackknife Estimates

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

A Note on the Kolmogorov-Feller Weak Law of Large Numbers

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Estimation of the essential supremum of a regression function

Estimation of Gumbel Parameters under Ranked Set Sampling

STAT 350 Handout 19 Sampling Distribution, Central Limit Theorem (6.6)

The Distribution of the Concentration Ratio for Samples from a Uniform Population

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015

Introducing a Novel Bivariate Generalized Skew-Symmetric Normal Distribution

Probability and statistics: basic terms

Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE. Part 3: Summary of CI for µ Confidence Interval for a Population Proportion p

Journal of Multivariate Analysis. Superefficient estimation of the marginals by exploiting knowledge on the copula

THE EFFICIENCY OF BIAS CORRECTED ESTIMATORS FOR NONPARAMETRIC KERNEL ESTIMATION BASED ON LOCAL ESTIMATING EQUATIONS

Power Weighted Quantile Regression and Its Application

1.010 Uncertainty in Engineering Fall 2008

Resampling modifications for the Bagai test

This is an introductory course in Analysis of Variance and Design of Experiments.

Asymptotic distribution of products of sums of independent random variables

CSE 527, Additional notes on MLE & EM

A Note on the Distribution of the Number of Prime Factors of the Integers

Nonparametric Conditional Density Estimation

The estimation of the m parameter of the Nakagami distribution

Stochastic Simulation

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1

Adaptive Estimation of Density Function Derivative

Basis for simulation techniques

Comparison of Minimum Initial Capital with Investment and Non-investment Discrete Time Surplus Processes

z is the upper tail critical value from the normal distribution

Intermittent demand forecasting by using Neural Network with simulated data

Lecture 2: Monte Carlo Simulation

Estimation of Population Mean Using Co-Efficient of Variation and Median of an Auxiliary Variable

POWER COMPARISON OF EMPIRICAL LIKELIHOOD RATIO TESTS: SMALL SAMPLE PROPERTIES THROUGH MONTE CARLO STUDIES*

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.

Kernel Adjusted Conditional Density Estimation

Lecture 01: the Central Limit Theorem. 1 Central Limit Theorem for i.i.d. random variables

Research Article Moment Inequality for ϕ-mixing Sequences and Its Applications

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

The Sampling Distribution of the Maximum. Likelihood Estimators for the Parameters of. Beta-Binomial Distribution

Estimating the Population Mean using Stratified Double Ranked Set Sample

Modeling and Estimation of a Bivariate Pareto Distribution using the Principle of Maximum Entropy

New Entropy Estimators with Smaller Root Mean Squared Error

Research Article Gamma Kernel Estimators for Density and Hazard Rate of Right-Censored Data

5. Likelihood Ratio Tests

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.

ANOTHER WEIGHTED WEIBULL DISTRIBUTION FROM AZZALINI S FAMILY

MAXIMAL INEQUALITIES AND STRONG LAW OF LARGE NUMBERS FOR AANA SEQUENCES

Control Charts for Mean for Non-Normally Correlated Data

Section 14. Simple linear regression.

GG313 GEOLOGICAL DATA ANALYSIS

ECE 901 Lecture 12: Complexity Regularization and the Squared Loss

G. R. Pasha Department of Statistics Bahauddin Zakariya University Multan, Pakistan

Estimation for Complete Data

CONTROL CHARTS FOR THE LOGNORMAL DISTRIBUTION

PARAMETER ESTIMATION BASED ON CUMU- LATIVE KULLBACK-LEIBLER DIVERGENCE

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Expectation and Variance of a random variable

LECTURE 8: ASYMPTOTICS I

Transcription:

Iteratioal Joural of Iovative Maagemet, Iformatio & Productio ISME Iteratioal c2013 ISSN 2185-5439 Volume 4, Number 1, Jue 2013 PP. 17-24 THE DATA-BASED CHOICE OF BANDWIDTH FOR KERNEL QUANTILE ESTIMATOR OF VAR XIN YANG 1, SHANCHAO YANG 2 College of Iteratioal Educatio 1 GuagXi Uiversity of Fiace ad Ecoomics Naig, 530003, P. R. Chia College of Mathematical Sciece 2 Guagxi Normal Uiversity Guili, 541004, P. R. Chia ABSTRACT. Value-at-Risk is a importat risk measure, has bee wildly applied i market practice ad fiacial risk measuremet. We use the smooth kerel estimator for the quatile proposed by Parze (1979) as a VaR estimator ad propose a data-based choice method of optimal badwidth via ormal referece distributio, which is easy to compute. The simulatios show that the choice method of badwidth is effective, ad the smooth kerel estimator has better performace tha the sample quatile estimator. Keywords: VaR; Kerel Quatile Estimator; Choice of Badwidth 1. Itroductio. Suppose that { Xi, i 1} is a sequece of idepedet ad idetically distributed radom variables. Deote the distributio fuctio of X 1 by F( x ) ad the 1 p-quatile of the distributio fuctio by Q( p) = F ( p) = i { fx: F( x) p} for 0< p < 1. Let the empirical distributio fuctio of a sample X1, X2,, X be 1 F( x) = i= 1I( Xi x) ad the empirical quatile fuctio be 1 Q( p) = F ( p) = i { fx: F( x) p}. Assume that X(1), X(2),, X( ) are the order statistics of the sample. The Q ( p) = X ([ p] + 1), where [ x ] is the iteger part of x. It is well-kow that Q ( p ) is a sample estimator of the p -quatile Q( p ). The sample quatile is a very importat statistics. Its applicatios are beyod the area of statistics. For example, the sample quatile is used to estimate Value-at-Risk (VaR), which is a risk measure ad has bee used widely i fiacial istitutios for their risk maagemet. Let X be a radom variable describig the retur of a asset or a portfolio. Give a positive value p (0,1), the 1 p level VaR is VaRp = Q( p), which measures the maximum potetial loss of a give asset or portfolio over a prescribed holdig period at a give cofidece level. Early estimators of VaR are based o parametric models for the retur distributio F, for istace, Gaussia or t distributios, oe ca refer to Dowd (2000), Giot ad Lauret (2003), Lauret ad Peters (2002), ad the refereces therei. The advatage of parametric methods is that it is easy to calculate, but actual distributio of retur serials is t always

XIN YANG AND SHANCHAO YANG 18 kow. No-parametric methods, without assumig statistical distributio, ca deal with asymmetry ad fat-tail problems of retur serials effectively. This has caused wide attetio amog the world of fiacial maagemet. Dowd (2001) used the sample quatile X ([ p]) to estimate VaR. Of course, X ([ p]) ca be also used as a VaR estimator because it has the same asymptotical properties as X ([ p] + 1). However, Berkowitz ad O Brie (2002) ad Iui et al.(2005) reported that X ([ p]) teds to be coservative. I fact, the estimator X ([ p] + 1) teds to be less tha the real value of VaR while X ([ p]) teds to be more tha the value of VaR. For other oparametric methods, Gourieroux et al.(2000) first itroduced VaR o-parametric kerel estimator. Che ad Tag (2005) proposed VaR o-parametric estimatio based o kerel quatile estimator. Wei et al.(2010) used the kerel quatile estimator proposed by Parze(1979) as a VaR estimator ad studied its Bahadur represetatio, asymptotical mea square error uder strogly mixig assumptio. I this paper, we will cosider the estimator. Let us to recall Parze s smooth kerel estimator for the quatile Q( p) as follows where h is a smoothig badwidth ad K() is a kerel fuctio. The estimator is a liear combiatio of order statistics with kerel weights. Falk (1984) ivestigated the asymptotical mea square error of the estimator. Yag(1985) established a Bahadur represetatio i seses of probability ad a mea squared covergecy rate. Later, Sheather ad Marro (1990) improved Falk s asymptotical mea square error, gave a asymptotical optimal badwidth, ad studied the data-based choice method of the optimal badwidth by usig the estimators of Q ' ( p ) ad Q '' ( p ) as follows ad where K () * is a kerel fuctio of order m ad symmetric about 0, ad a, b are also two badwidths. At the same time, Sheather ad Marro (1990) compared the performace of the kerel quatile estimator with the estimators of Harrel ad Davis (1982) ad Kaigh ad Lachebruch (1982) by usig the samples of size 50 ad 100 from the double-expoetial, expoetial, logormal ad ormal distributios. Uder radom cesorship or radomly trucated data, Xiag(1995) ad Zhou (2006) established a Bahadur represetatio of T ( p ), the latter also gave the strog cosistece, asymptotic ormality of the estimator. For α -mixig radom variables, Wei et al.(2010) showed Bahadur represetatio i seses of almost surely covergece, strog cosistece ad mea square error. By simulatios, they also reported that the mea absolute biases of

THE DATA-BASED CHOICE OF BANDWIDTH 19 the estimator are much less tha those of sample quatile for samples of size 100, 200, 500 ad 1000 from ormal distributio ad studet t distributio, wheever samples are idepedet or depedet, ad from light tail distributio or fat tail distributio. Recetly, Ajami et al.(2011) established the Bahadur-type represetatio of the kerel smooth estimator uder strog mixig ad cesored data, ad showed that this estimator is strogly cosistet from the Bahadur represetatio. These studies metioed above idicate that the kerel estimator possesses may ice properties ad its efficiecy is better tha that of the sample quatile. Therefore, it is a reasoable estimator for the quatile. It is well kow that badwidth selectio for kerel estimator is a hard work. For the estimator T ( p ), the data-based choice method of the asymptotical optimal badwidth by usig the estimators (2) ad (3) was studied i Sheather ad Marro (1990). However, ' because the estimators Qˆ ( p ) '' ad Qˆ ( p ) are also kerel estimatio, oe further faces ew work to choose the badwidths. To simplify the procedure, i this paper, we propose a data-based choice method of badwidth via ormal referece distributio. Our simulatio results report that the choice method of badwidth has good performace. 2. Mea Square Error ad Optimal Badwidth. Here let us recall the results about asymptotical mea square error ad asymptotical optimal badwidth i Sheather ad Marro (1990). Theorem 1: (Sheather ad Marro, 1990, Theorem 1) Suppose that Q" is cotiuous i a eighborhood of p ad that K is a desity with compactly supported set [ cc, ], x symmetric about 0. Let K( x) = K() t dt. The for all fixed p (0,1), apart from p = 0.5 whe F is symmetric, ad for p = 0.5 whe F is symmetric, Theorem 2: (Sheather ad Marro, 1990, Corollary 1) Supposed that the coditios give i Theorem 1 hold. The for all fixed p (0,1), apart from p = 0.5 whe F is symmetric, the asymptotically optimal badwidth is give by Ad whe h = hopt,

XIN YANG AND SHANCHAO YANG 20 Supposed that the distributio fuctio F( x ) has a desity fuctio f( x ). The Hece Because F is symmetric, so f( x ) has a maximal value at x = 0, which implies that f ' ( Q (0.5)) = 0. From (9), Q ' ( p) Q " ( p ) will be ifiity at p = 0.5. Therefore, these theorems require that p is apart from 0.5. 3. Data-based Choice Method via Normal Referece Distributio. From Theorem 2, we see that the asymptotical optimal badwidth h opt depeds o the first ad secod derivatives of the quatile fuctio, ad also o the kerel fuctio. Here, we will use ormal distributio as the populatio distributio to estimate the the quatile fuctio Q( p ), while quadratic fuctio as kerel fuctio. 3.1. Quadratic Kerel Fuctio ad Weight Fuctio. Let the quadratic kerel fuctio x x ad 2 3 The K( x) = K( u) du = 3 (1 u ) du = 3 ( x 1 x + 2 ) 1 1 4 4 3 3 i Deote the weight ω, ( p i ) = K (( p t ) h ) dt. Let a 1, i = max{( i 1), p h} ad a2, i = mi{ i, p+ h}. The ( i 1) It is clear that ω i, ( p) = 0 whe a1, i a2, i. Ad whe a1, i < a2, i we have

THE DATA-BASED CHOICE OF BANDWIDTH 21 where gx ( ) = hx+ ( p x). 2 1 3 3 3.2. Optimal Badwidth Based o Normal Distributio. We use ormal distributio as a referece distributio of the populatio distributio. That is to assume that 2 F( x) N ( µσ, ). I the case, ' ( ) µ x f x = ( ) 2 f x. From (9), σ Thus, the optimal badwidth Therefore, we have the estimator of the optimal badwidth where X ad S are sample mea ad sample stadard deviatio, respectively, ad Q ( p ) is the quatile of 2 N( X, S ). 4. Simulatios. I this sectio, some umerical simulatios are carried out to evaluate the performace of the kerel quatile estimator (1) by usig the data-based badwidth via ormal referece distributio (16), ad to compare it with the sample quatile estimator. We kow that ormal distributio is light tail distributio, while studet t distributio is fat tail distributio. So we cosider the samples that are from ormal distributio with stadard deviatios 1 ad 3, studet t distributio with degrees of freedom 3 ad 5. The sample sizes are 200, 500, 800 ad 1000, ad repeated 1000 times for each oe. The probability level p is from 0.01 to 0.4 by step 0.01, i.e. pi = 0.01 i for i = 1, 2,, m, where m = 40.

XIN YANG AND SHANCHAO YANG 22 FIGURE 1. The estimatio of Quatile for stadard ormal populatio. For stadard ormal populatio, Figure 1 reports the kerel quatile estimator T ( p ) (dashed lie), the sample quatile estimator Q ( p ) (logdash lie) ad the quatile Q( p ) (solid lie) for four cases of the sample sizes 200, 500, 800 ad 1000. The figures illustrate that the estimator values are ear to the real values, ad they are gettig better as sample size grows. Moreover, the kerel quatile estimator is more smooth ad closer to the real value tha the sample quatile estimator. For other populatios, there are similar figures. Due to the space, we do t show the figures. Now we observe the relatively absolute biases of the estimators. Let for i = 1, 2,, m, ad RQ m 1 m RT i= 1 i =, which is the mea of the relatively absolute biases over p for the kerel quatile estimator. Aalogously let for i = 1, 2,, m, ad 1 m RQ m RQ i= 1 i =. It is the mea of the relatively absolute biases over p for the sample quatile estimator. The the meas of the relatively absolute biases for each populatio ad each sample size are listed i Table 1. These show that the meas of the relatively absolute biases are less tha 5% for most cases, whether light or fat tail distributio. Ad the meas of the

THE DATA-BASED CHOICE OF BANDWIDTH 23 relatively absolute biases of the kerel quatile estimator are less tha those of the sample quatile estimator. These results show that the data-based badwidth via ormal referece distributio (16) is effective. TABLE 1: Mea of the relatively absolute biases (%) 5. Coclusio. Value-at-Risk, a very importat risk measure, has bee wildly applied i market practice ad fiacial risk measuremet. Ad estimatig VaR has received a cosiderable amout of attetio i literature. I this paper, we cosider the smooth kerel estimator for the quatile proposed by Parze (1979) as a VaR estimator. I view of the complexity to choose badwidth for the kerel estimator, we propose a data-based choice method of optimal badwidth via ormal referece distributio. The choice method is easy to compute ad the simulatios show that it is effective. Ackowledgmet. This project is supported by the Natioal Sciece Foudatio of Chia(11061007), the Social Sciece Foudatio of Chia(10CTJ004) ad the Natioal Sciece Foudatio of Guaxi(2011GXNSFA018133). REFERENCES [1] B. M. Falk (1984), Relative deficiecy of kerel type estimators of quatile, The Aals of Statistics, vol.12, pp.261-268. [2] C. Gourieroux, O. Scaillet ad J. P. Lauret (2000), Sesitivity aalysis of values at risk, Joural of Empirical Fiace, vol.7, pp.225-245. [3] E. Parze (1979), Noparametric statistical data modelig, America Statistical Associatio, vol.74, pp.105-121. [4] F. E. Harrell ad C. E. Davis (1982), A ew distributio-free quatile estimator, Biometrika, vol.69, pp.635-640.

XIN YANG AND SHANCHAO YANG 24 [5] J. Berkowitz ad J. O Brie (2002), How accurate are value-at-risk models at commercial baks, J. Fiace, vol.57, pp.1093-1111. [6] K. Dowd (2000), Assessig VaR accuracy, Derivatives Quarterly, vol.6, o.3, pp.61-63. [7] K. Dowd (2001), Estimatig VaR with order statistics, Joural of Derivatives, vol.8, o.3, pp.23-30. [8] K. Iui, M. Kijima ad A. Kitao (2005), VaR is subject to a sigificat positive bias, Statistics ad Probability Letters, vol.72, pp.299-311. [9] M. Ajami, V. Fakoor ad S. Jomhoori (2011), The Bahadur represetatio for kerel-type estimator of the quatile fuctio uder strog mixig ad cesored data, Statistics ad Probability Letters, vol.81, pp.1306-1310. [10] P. Giot ad S. Lauret (2003), Value-at-risk for log ad short tradig positios, Joural of Applied Ecoometrics, vol.18, pp.641-664. [11] S. Lauret ad J. P. Peters (2002), GARCH 2.2: A Ox package for estimatig ad forecastig various ARCH models, Joural of Ecoomic Surveys, vol.16, pp.4474851. [12] S. Sheather ad J. Marro (1990), Kerel quatile estimators, Joural of the America Statistical Associatio, vol.85, pp.410-416. [13] S. S. Yag (1985), A smooth oparametric estimator of a quatile fuctio, Joural of America Statistical Associatio, vol.80, pp.1004-1011. [14] S. X. Che ad C. Y. Tag (2005), Noparametric iferece of value-at risk for depedet fiacial returs, Joural of Fiacial Ecoometrics, vol.3, pp.227-255. [15] W. D. Kaigh ad P. A. Lachebruch (1982), A geeralized quatile estimator, Commuicatios i Statistics, Part A-Theory ad Methods, vol.11, pp.2217-2238. [16] X. J. Xiag (1995), Bahadur represetatio of the kerel quatlie estimator uder radom cesorship, Joural of Multivariate Aalysis, vol.54, pp.193-209. [17] X. Wei, S. Yag, K. Yu, X. Yag ad G. Xig (2010), Bahadur represetatio of liear kerel quatile estimator of VaR uder α-mixig assumptios, Joural of Statistical Plaig ad Iferece, vol.140, pp.1620-1634. [18] Y. Zhou (2006), A kerel-type estimator of quatile fuctio uder radomly trucated data, Acta Mathematica Scietia, vol.26, pp.585-594.