Local linear multivariate. regression with variable. bandwidth in the presence of. heteroscedasticity

Size: px
Start display at page:

Download "Local linear multivariate. regression with variable. bandwidth in the presence of. heteroscedasticity"

Transcription

1 Model ISSN X Department of Econometrics and Business Statistics Local linear multivariate regression with variable bandwidth in the presence of heteroscedasticity AzhongYe, RobJHyndman and ZinaiLi May 2006 Working Paper 8/06

2 Local linear multivariate regression with variable bandwidth in the presence of heteroscedasticity Azhong Ye College of Management Fuzhou University, Fuzhou, China. Rob J Hyndman Department of Econometrics and Business Statistics, Monash University, VIC 3800 Australia. Rob.Hyndman@buseco.monash.edu Zinai Li School of Economics and Management, Tsinghua University, Beijing, , China. lizinai@mail.tsinghua.edu.cn 18 May2006 JEL classification: C12,C15,C52

3 Local linear multivariate regression with variable bandwidth in the presence of heteroscedasticity Abstract: We present a local linear estimator with variable bandwidth for multivariate nonparametric regression. We prove its consistency and asymptotic normality in the interior of the observed data and obtain its rates of convergence. This result is used to obtain practical direct plug-in bandwidth selectors for heteroscedastic regression in one and two dimensions. We show that the local linear estimator with variable bandwidth has better goodness-of-fit properties than the local linear estimator with constant bandwidth, in the presence of heteroscedasticity. Keywords: heteroscedasticity; kernel smoothing; local linear regression; plug-in bandwidth, variable bandwidth. 1

4 1 Introduction We are interested in the problem of heteroscedasticity in nonparametric regression, especially when applied to economic data. Heteroscedasticity is very common in economics, and heteroscedasticity in linear regression is covered in almost every econometrics textbook. Applications of nonparametric regression in economics are growing, and Yatchew(1998) argued that it will become an indispensable tool for every economist because it typically assumes little about the shape of the regression function. Consequently, we believe that heteroscedasticity in nonparametric regression is an important problem that has received limited attention to date. We seek to develop new estimators that have better goodness of fit than the common estimators in nonparametric econometric models. In particular, we are interested in using heteroscedasticity to improve nonparametric regression estimation. There have been a few papers on related topics. Testing for heteroscedasticity in nonparametric regression has been discussed by Eubank and Thomas (1993) and Dette and Munk (1998). Ruppert and Wand (1994) discussed multivariate locally weighted least squares regression when the variances of the disturbances are not constant. Ruppert et al. (1997) presented the local polynomial estimator of the conditional variance function in a heteroscedastic, nonparametric regression model using linear smoothing of squared residuals. Sharp-optimal and adaptive estimators for heteroscedastic nonparametric regression using the classical trigonometric Fourier basis are given by Efromovich and Pinsker (1996). Our approach is to exploit the heteroscedasticity by using variable bandwidths in local linear regression. Müller and Stadtmüller (1987) discussed variable bandwidth kernel estimators of regression curves. Fan and Gijbels(1992, 1995, 1996) discussed the local linear estimator with variable bandwidth for nonparametric regression models with a single covariate. In this paper, we extend these papers by presenting a local linear estimator with variable bandwidth for nonparametric multiple regression models. We demonstrate that the local linear estimator has optimal conditional mean squared error when its variable bandwidth is a function of the density of the explanatory variables and conditional variances. Numerical simulation shows that the local linear estimator with this variable 2

5 bandwidth has better goodness of fit than the local linear estimator with constant bandwidth for the heteroscedastic models. 2 Local linear regression with a variable bandwidth Suppose we have a univariate response variable Y and a d-dimensional set of covariates X, andweobservetherandomvectors(x 1, Y 1 ),...,(X n, Y n )whichareindependentandidentically distributed. Itisassumedthateachvariablein X hasbeenscaledsotheyhavesimilarmeasures of spread. Our aimistoestimatetheregression function m(x)=e(y X= x). Wecanregardthe dataas being generated from the model Y= m(x)+u, where E(u X)=0,Var(u X= x)=σ 2 (x) and the marginaldensity of X is denoted by f(x). We assume the second-order derivatives of m(x) are continuous, f(x) is bounded above 0 and σ 2 (x) is continuous and bounded. Let K bead-variatekernelwhichissymmetric,nonnegative,compactlysupported, K(u) du= 1 and uu T K(u) du=µ 2 (K)I whereµ 2 (K) 0 and I is the d d identity matrix. In addition, l all odd-order moments of K vanish, that is, u ul d K(u) du=0 for all nonnegative integers d l 1,..., l d such that their sum is odd. Let K h (u)= K(u/h). Then the local linear estimator of m(x) with variable bandwidth is ˆm n (x,h n,α)=e T 1 (X T xw x,α X x ) 1 X T xw x,α Y, (1) where h n = cn 1/(d+4), c is a constant that depends only on K,α(x) is the variable bandwidth function, e T 1 =(1,0,...,0), X x=(x x,1,...,x x,n ) T, X x,i =(1,(X i x)) T and W x,α = diag Khn α(x 1 )(X 1 x),..., K hn α(x n )(X n x). We assumeα(x) is continuously differentiable. We now state our main result. A proof is given in the Appendix. 3

6 Theorem 1 Let x be a fixed point in the interior of{x f(x)>0}, H m (x)= let s(h m (x)) be the sum of the elements of H m (x). Then 2 m(x) and x i x j d d 1 E[ˆm n (x,h n,α) X 1,...,X n ] m(x)=0.5h 2 nµ 2 (K)α 2 (x)s(h m (x))+ o p (h 2 n); 2 Var[ˆm n (x,h n,α) X 1,...,X n ]=n 1 h d n R(K)α d (x)σ 2 (x)f 1 (x)+ o p (n 1 h d n ), where R(K)= K 2 (u)du; and 3 n 2/(d+4) [ˆm n (x,h n,α) m(x)] d N 0.5c 2 µ 2 (K)α 2 (x)s(h m (x)), c d R(K)α d (x)σ 2 (x)f 1 (x). Whenα(x)=1, results 1 and 2 coincide with Theorem 2.1 of Ruppert and Wand (1994). By result 2 and the law of large numbers, we find thatˆm n (x,h n,α) is consistent. From result 3 we know that the rate of convergence of ˆm n (x,h n,α) in interior points is O(n 2/(d+4) ) which, according to Stone(1980, 1982), is the optimal rate of convergence for nonparametric estimation of a smooth function m(x). 3 Using heteroscedasticity to improve local linear regression Although Fan and Gijbels (1992) and Ruppert and Wand (1994) discuss the local linear estimator of m(x), nobody has previously developed an improved estimator using the information of heteroscedasticity. We now show how this can be achieved. UsingTheorem 1,wecangiveanexpressionfortheconditionalmeansquarederrorofthelocal linear estimator with variable bandwidth. MSE=E [ˆmn (x,h n,α) m(x)] 2 X 1,...,X n = 1 4 [h nα(x)] 4 µ 2 2 (K)s2 (H m (x))+ R(K)σ2 (x) n[h n α(x)] d f(x) + o p(h 2 n)+ o p (n 1 h d n ). Minimizing MSE, ignoring the higher order terms, we obtain the optimal variable bandwidth α opt (x)= c 1 dr(k)σ 2 (x) µ 2 2 (K)f(x)s2 (H m (x)) 1/(d+4). 4

7 Note that the constant c will cancel out in the bandwidth expression h n α opt (x). Therefore, without loss of generality, we can set c= dr(k)µ 2 2 (K) 1/(d+4) which simplifies the above expression to σ 2 1/(d+4) (x) α opt (x)= f(x)s 2. (H m (x)) Toapplythisnew estimator,weneed toreplace f(x), H m (x)andσ 2 (x)withestimators. There are several potential ways to do this, depending on the dimension d. Some proposals for d= 1 and d= 2, are outlined below. 3.1 Univariate regression When d= 1, we first use the direct plug-in methodology of Sheather and Jones (1991) to select the bandwidth of a kernel density estimate for f(x). Second, we estimateσ 2 (x)=e(u 2 X= x) using local linear regression with the model û 2 i=σ 2 (X i )+ v i where û i = Y i ˆm n (X i,ĥn,1), v i are iid with zero mean and ĥn is chosen by the direct plug-in methodology of Ruppert et al. (1995). Third, we estimate m(x) by fitting the quartic m(x)=α 1 +α 2 x+α 3 x 2 +α 4 x 3 +α 5 x 4, usingordinaryleastsquaresregressionandsoobtaintheestimateˆ m(x)=2ˆα 3 +6ˆα 4 x+12ˆα 5 x 2. Then, our direct plug-in bandwidth for univariate regression (d = 1) is ĥ(x)= ˆσ 2 (x) 2n πˆf(x)ˆ m 2 (x) 1/ Bivariate regression When d= 2, we use a bivariate kernel density estimator (Scott, 1992) of f(x), with the direct plug-in methodology of Wand and Jones (1995) for the bandwidth. To estimateσ 2 (x), we first calculate û i = Y i Ŷ i, where Ŷ i = ˆm n (X i,ĥn,1), ĥn = min(ĥ1,ĥ2), and ĥ1 and ĥ2 are chosen by the direct plug-in methodology of Ruppert et al. (1995) for Y = m 1 (X 1 )+u 1 and Y= m 2 (X 2 )+u 2 respectively. Then we estimateσ 2 (x 1 ) using local linear regression with the model û 2 i=σ 2 (X 1i )+v i,where v i areiidwithzeromean. Again,thedirectplug-inmethodology of Ruppert et al. (1995) is used for bandwidth selection. 5

8 To estimate the second derivative of m(x 1, x 2 ), we fit the model m(x 1, x 2 )=α 1 +α 2 x 1 +α 3 x 2 +α 4 x 2 1 +α 5x 1 x 2 +α 6 x 2 2 +α 7x 3 1 +α 8x 2 1 x 2+α 9 x 1 x 2 2 +α 10 x 3 2 +α 11x 4 1 +α 12 x 3 1 x 2+α 13 x 2 1 x2 2 +α 14x 1 x 3 2 +α 15x 4 2, using ordinary least squares, and so obtain estimates of α. Hence, estimators for the second derivatives of m(x 1, x 2 ) are obtained using 2 m(x) x 2 = 2α 4 + 6α 7 x 1 + 2α 8 x α 11 x α 12x 1 x 2 + 2α 13 x 2 2, 1 2 m(x) =α 5 + 2α 8 x 1 + 2α 9 x 2 + 3α 12 x 2 1 x 1 x + 4α 13x 1 x 2 + 3α 14 x and 2 m(x) x 2 = 2α 6 + 2α 9 x 1 + 6α 10 x 2 + 2α 13 x α 14x 1 x α 15 x Then, our direct plug-in bandwidth for bivariate regression (d = 2) is ĥ(x)= ˆσ 2 (x) nπˆf(x)s 2 (Ĥ m (x)) 1/6. 4 Numerical studies with univariate regression This section examines the performance of the proposed variable bandwidth selection method via several data sets of univariate regression, generated from known functions. For comparison, we also compare the performance of the constant bandwidth method, based on the direct plug-in methodology described by Ruppert et al. (1995). As the true regression function is known in each case, the performances of the bandwidth methods are measured and compared using the root mean squared error, 1/2 RMSE= n 1 [ˆm n (X i,h n,α) m(x i )] 2. Wesimulatedatafromthefollowingfivemodels,eachwith Y= m(x)+σ(x)uwhere u N(0,1) 6

9 and the covariate X has a Uniform( 2, 2) distribution. Model A: m A (x)= x 2 + x σ 2 A (x)=32x Model B: m B (x)=(1+ x) sin(1.5x) σ 2 B (x)=3.2x Model C: m C (x)= x+ 2 exp( 2x 2 ) σ 2 C (x)=16(x2 0.01)I (x 2 >0.01) Model D: m D (x)=sin(2x)+2 exp( 2x 2 ) σ 2 D (x)=16(x2 0.01)I (x 2 >0.01) Model E: m E (x)=exp( (x+ 1) 2 )+2 exp( 2x 2 ) σ 2 E (x)=32(x2 0.01)I (x 2 >0.01) We draw 1000 random samples of size 200 from each model. Table 1 presents a summary of the results and shows that the variable bandwidth method has smaller RMSE than the constant bandwidth method in each case. For each model, the variable bandwidth method has smaller RMSE than the constant bandwidth method, and is better for more than 50% of samples. We plot the true regression functions (the solid line) and four typical estimated curves in Figure 1. These correspond to the 10th, 30th, 70th and 90th percentiles. For each percentile, the variable bandwidth method (dotted line) is closer to the true regression function than the constant bandwidth method (dashed line). Therefore, we conclude that for heteroscedastic models, the local linear estimator with variable bandwidth has better goodness-of-fit than the local linear estimator with constant bandwidth. 7

10 5 Numerical studies with bivariate regression We now examine the performance of the proposed variable bandwidth selection method via several data sets of bivariate regression, generated from known functions. For comparison, we also compare the performance of a constant bandwidth method given by ĥ= ˆσ 2 π n s2 (Ĥ m (X i )) 1/6 where ˆσ 2 = n 1 û 2 i, and u i and Ĥ m are the same as for the variable bandwidth selector. We simulate data from four models, each with Y= m(x 1, X 2 )+σ(x 1 )u where u N(0,1) and the covariates X 1 and X 2 are independent and have a Uniform( 2,2) distribution. Model F: m A (x 1, x 2 )= x 1 x 2 σ 2 A (x 1, x 2 )=(x )I (x1 2 >0.04) Model G: m B (x 1, x 2 )= x 1 exp( 2x 2 2 ) σ 2 B (x 1, x 2 )=2.5(x )I (x 2 1 >0.04) Model H: m C (x 1, x 2 )= x sin(1.5x 2 ) σ 2 C (x 1, x 2 )=(x )I (x 2 1 >0.04) Model I: m D (x 1, x 2 )=sin(x 1 + x 2 )+2 exp( 2x 2 2 ) σ 2 D (x 1, x 2 )=3(x )I (x1 2 >0.04) We draw 200 random samples of size 400 from each model. Table 2 presents a summary of the results and shows that the variable bandwidth method has smaller RMSE than the constant bandwidth method. For each model, the variable bandwidth method has lower RMSE than the constant bandwidth method and is better for more than 50% of samples. 8

11 We plot the true regression functions with one fixed variable (solid line) and four typical estimated curves in Figures 2 5. These correspond to the 10th, 30th, 70th and 90th percentiles. For each percentile, the variable bandwidth method(dotted line) is closer to the true regression function than the constant bandwidth method (dashed line). Therefore, we conclude that for heteroscedastic models, the local linear estimator with variable bandwidth has better goodnessof-fit than the local linear estimator with constant bandwidth. 6 Summary We have presented a local linear nonparametric estimator with variable bandwidth for multivariate regression models. We have shown that the estimator is consistent and asymptotically normal in the interior of the sample space. We have also shown that its convergence rate is optimal for nonparametric regression (Stone, 1980, 1982). By minimizing the conditional mean squared error of the estimator, we have derived the optimal variable bandwidth as a function of the density of the explanatory variables and the conditional variance. Wehavealsoprovidedaplug-inalgorithmfor computingtheestimatorwhen d= 1or d = 2. Numerical simulation shows that our local linear estimator with variable bandwidth has better goodness-of-fit than the local linear estimator with constant bandwidth for heteroscedastic models. Acknowledgements We thank Xibin Zhang for his helpful comments. 9

12 References Dette, H. and A. Munk (1998) Testing heteroscedasticity in nonparametric regression, Journal of the Royal Statistical Society, Series B, 60(4), Efromovich, S. and M. Pinsker (1996) Sharp-optimal and adaptive estimation for heteroscedastic nonparametric regression, Statistica Sinica, 6, Eubank, R. L. and W. Thomas (1993) Detecting heteroscedasticity in nonparametric regression, Journal of the Royal Statistical Society, Series B, 55(1), Fan, J. and I. Gijbels (1992) Variable bandwidth and local linear regression smoothers, The Annals of Statistics, 20(4), Fan, J. and I. Gijbels(1995) Data-driven bandwidth selection in local polynomial fitting: variable bandwidth and spatial adaptation, Journal of the Royal Statistical Society, Series B, 57(2), Fan, J. and I. Gijbels(1996) Local polynomial modelling and its applications, Chapman and Hall. Müller, H.-G. and U. Stadtmüller (1987) Variable bandwidth kernel estimators of regression curves, The Annals of Statistics, 15(1), Ruppert, D., S. J. Sheather and M. P. Wand(1995) An effective bandwidth selector for local least squares regression (Corr: 96V91 p1380), Journal of the American Statistical Association, 90, Ruppert, D. and M. P. Wand (1994) Multivariate locally weighted least squares regression, The Annals of Statistics, 22(3), Ruppert, D., M. P. Wand, U. Holst and O. Hössjer (1997) Local polynomial variance-function estimation, Technometrics, 39(3), Scott, D. W. (1992) Multivariate Density Estimation: Theory, Practice, and Visualization, John Wiley & Sons. Sheather, S. J. and M. C. Jones (1991) A reliable data-based bandwidth selection method for kernel density estimation, J.R. Statist. Soc. B, 53(3),

13 Stone, C. J. (1980) Optimal convergence rates for nonparametric estimators, The Annals of Statistics, 8, Stone, C. J. (1982) Optimal global rates of convergence for nonparametric regression, The Annals of Statistics, 10, Wand, M. P. and M. C. Jones (1995) Kernel Smoothing, Chapman and Hall, London. White, H. (1984) Asymptotic theory for econometricians, Academic Press. Yatchew, A. (1998) Nonparametric regression techniques in economics, Journal of Economic Literature, 34,

14 Appendix: Proof of Theorem 1 Before we state a lemma that will be used in the proof, note that n 1 XxW T x,α X x = n n 1 K h n α(x i )(X i x) n K h n α(x i )(X i x)(x i x) n K h n α(x i )(X i x)(x i x) T n K h n α(x i )(X i x)(x i x)(x i x) T and e T 1 (X xw T x,α X x ) 1 XxW T x,α X x m(x) =e T 1 m(x) = m(x), D m (x) D m (x) where D m (x)= m(x) x 1 T,..., m(x). Therefore, x d ˆm n (x,h n,α) m(x)=e T 1 (X T xw x,α X x ) 1 X T xw x,α [0.5Q m (x)+u], where Q m (x) = Q m,1 (x),...,q m,n (x) T, Qm,i (x) = (X i x) T H m (z i (x,x i ))(X i x), H m (x)= 2 m(x), z x i x i (x,x i ) x X i x and U=(u 1,...,u n ) j d d T. Wecandeducethat {z i (x,x i )} n are independent because{x i} n are independent. Now we state a lemma using the notation of (1) and Theorem 1. Lemma 1 Let G(α, f,x)= f(x) udα(x)uu T T D K (u)du+µ 2 (K) d f(x)d α (x)+α 1 (x)d f (x), supp(k) B(x,α)= µ 2 (K) 1 f(x) 2 α(x)g(α, f,x) T and 1 be a generic matrix having each entry equal to 1, the dimensions of which will be clear from the context. Then n 1 n 1 K hn α(x i )(X i x)= f(x)+ o p (1) (2) K hn α(x i )(X i x)(x i x)=h 2 nα 3 (x)g(α, f,x)+o p (h 2 n1) (3) 12

15 n 1 K hn α(x i )(X i x)(x i x)(x i x) T =µ 2 (K)h 2 nα 2 (x)f(x)i+ o p (h 2 n1) (4) (XxW T x,α X x ) 1 f(x) 1 + o p (1) B(x,α)+o p (1) = (5) B T (x,α)+ o p (1) µ 2 (K) 1 h 2 n α 2 (x)f(x) 1 I+ o p (h 2 n 1) n 1 XxW T x,α Q m (x)=h 2 f(x)µ 2 (K)α 2 (x)s(h m (x)) n + o p (h 2 n1) (6) 0 and (nh d n) 1/2 (n 1 XxW T d x,α u) N 0, R(K)α d (x)σ 2 (x)f(x) (7) 0 We only prove results (3), (6) and (7) as the other results can be proved similarly. Proof of (3) It is easy to show that n 1 K hn α(x i )(X i x)(x i x)=e Khn α(x 1 )(X 1 x)(x 1 x) + O p ( n 1 Ψ), (8) where Ψ= Var(K hn α(x 1 )(X 1 x)(x 11 x 1 )),...,Var(K hn α(x 1 )(X 1 x)(x d1 x d )) T. Because x is a fixed point in the interior of supp(f)={x f(x) 0}, we have supp(k) {z :(x+h n α(x)z) supp(f)}, provided the bandwidth h n is small enough. Due to the continuity of f, K andα, we have E Khn α(x 1 )(X 1 x)(x 1 x) = supp(f) h d n α d (x)k(h 1 n (α(x 1 )) 1 (X 1 x))(x 1 x)f(x 1 )dx 1 13

16 = (α(x+h n Q)) d K(Q(α(x+h n Q)) 1 )f(x+h n Q)h n QdQ Ω n = h 2 nα(x) 3 G(α, f,x)+o(h 2 n1), (9) whereω n ={Q : x+h n Q supp(f)}. It is easy to see Var Khn α(x 1 )(X 1 x)(x 1 x) = E Khnα(X1)(X 1 x)(x 1 x) K hnα(x1)(x 1 x)(x 1 x) T E Khn α(x 1 )(X 1 x)(x 1 x) E Khn α(x 1 )(X 1 x)(x 1 x) T. (10) Again by the continuity of f, K andα, we have E Khnα(X 1 )(X 1 x)(x 1 x) K hnα(x1 )(X 1 x)(x 1 x) T = E Khn α(x 1 )(X 1 x) 2 (X 1 x)(x 1 x) T = supp(f) = h d+2 n = h d+2 n h d n (α(x)) d K(h 1 n (α(x 1 )) 1 (X 1 x)) 2 (X1 x)(x 1 x) T f(x 1 )dx 1 (α(x+h n Q)) d K(Q(α(x+h n Q)) 1 ) 2 f(x+h n Q)QQ T dq Ω n (α(x)) d K(Q(α(x)) 1 ) 2 f(x)qq T dq+o(h d+2 n 1)=O(h d+2 n 1). (11) Ω n Therefore we have O p ( n 1 Ψ)= o p (h 2 n1). (12) Then (3) follows from (8) (12). Proof of (6) It is straightforward to show that n 1 X T xw x,α Q m (x) 14

17 n 1 n 1 n = K h n α(x i )(X i x)(x i x) T H m (z i (x,x i ))(X i x) n 1, n K h n α(x i )(X i x)(x i x) T H m (z i (x,x i ))(X i x)(x i x) K hn α(x i )(X i x)(x i x) T H m (z i (x,x i ))(X i x) and n 1 = h 2 n f(x)µ 2 (K)(α(x)) 2 s(h m (x))+ o p (h 2 n1), K hn α(x i )(X i x)(x i x) T H m (z i (x,x i ))(X i x)(x i x)=o p (h 3 n1). Therefore (6) holds. Proof of (7) It is obvious that E n 1 XxW T x,α u =E n 1 XxW T x,α u x = 0, and n 1 XxW T n 1 n x,α u= K h n α(x i )(X i x)u i n 1. n K h n α(x i )(X i x)(x i x)u i By the continuity of f, K,σ 2 andα, we have Var n 1 K hn α(x i )(X i x)u i =n 1 Var Khn α(x 1 )(X 1 x)u 1 = n 1 h d n (α(x)) d K(h 1 n (α(x 1 )) 1 (X 1 x)) 2 σ 2 (X 1 )f(x 1 )dx 1 supp(f) = n 1 h d n = n 1 h d n and Var n 1 ((α(x+h n Q)) d K(Q(α(x+h n Q)) 1 )) 2 σ 2 (x+h n Q)f(x+h n Q)dQ Ω n Ω n ((α(x)) d K(Q(α(x)) 1 )) 2 σ 2 (x)f(x)dq+ o(n 1 h d n ) = n 1 h d n R(K)(α(x)) d σ 2 (x)f(x)+ o(n 1 h d n ), Then (7) holds. K hn α(x i )(X i x)(x i x)u i = o(n 1 h d+2 n 1). 15

18 Proof of Theorem 1 By (5) and (6), we have E[ˆm n (x,h n,α) X 1,...,X n ] m(x)=0.5e T 1 (X T xw x,α X x ) 1 X T xw x,α Q m (x) = 0.5h 2 nµ 2 (K)(α(x)) 2 s(h m (x))+ o p (h 2 n). Therefore Theorem 1(1) holds. Let V =diag σ 2 (X 1 ),...,σ 2 (X n ). Then Var[ˆm n (x,h n,α) X 1,...,X n ]=e T 1 (X T xw x,α X x ) 1 X T xw x,α V W x,α X x (X T xw x,α X x ) 1 e 1, (13) and n 1 XxW T a 11 (x,h n,α) (a 21 (x,h n,α)) T x,α V W x,α X x =, a 21 (x,h n,α) a 22 (x,h n,α) where a 11 (x,h n,α)= n 1 (K hn α(x i )(X i x)) 2 σ 2 (X i ) a 21 (x,h n,α)= n 1 (K hn α(x i )(X i x)) 2 (X i x)σ 2 (X i ) and a 22 (x,h n,α)= n 1 (K hn α(x i )(X i x)) 2 (X i x)(x i x) T σ 2 (X i ). It is easy to prove that n 1 n 1 (K hn α(x i )(X i x)) 2 σ 2 (X i )=h d n R(K)(α(x)) d σ 2 (x)f(x)o p (h d n ), (K hn α(x i )(X i x)) 2 (X i x) T σ 2 (X i )=O p (h d+1 n 1) and n 1 (K hn α(x i )(X i x)) 2 (X i x)(x i x) T σ 2 (X i )=O p (h d+2 n 1). (14) By (13) (14) and (5), we have Var[ˆm n (x,h n,α) X 1,...,X n ]=n 1 h d n R(K)(α(x)) d σ 2 (x)f(x) 1 + o p (n 1 h d n ). 16

19 Therefore Theorem 1(2) holds. By (6) and (7) and the central limit theorem, we have n 2/(d+4) n 1 XxW T x,α [0.5Q m (x)+u] d N(0.5c2 µ 2 (K)(α(x)) 2 f(x)s(h m (x)), c d R(K)(α(x)) d σ 2 (x)f(x)). 0 Applying White (1984, Proposition 2.26) and (5), we can easily deduce Theorem 1(3). 17

20 Tables Table 1: The percentage of 1000 samples in which the variable bandwidth method is better than the constant bandwidth method, and the RMSE of the two methods. Model Percentage Root mean squared error better Constant bandwidth Variable bandwidth Model A Model B Model C Model D Model E Table 2: The percentage of 200 samples in which the variable bandwidth method is better than the constant bandwidth method, and the RMSE of the two methods. Model Percentage Root mean squared error better Constant bandwidth Variable bandwidth Model F Model G Model H Model I

21 replacements Figures Figure 1: Results for the simulated univariate regression data of models A E. The true regression functions (the solid line) and four typical estimated curves are presented. These correspond to the 10th, the 30th, the 70th, the 90th percentile. The dashed line is for the constant bandwidth method and the dotted line is for the variable bandwidth method. Model A Model B Model C x x x m(x) m(x) m(x) m(x) m(x) Model D Model E x x 19

22 Figure 2: Results for the simulated bivariate data of model F. The true regression functions (the solid line) and four typical estimated curves are presented. These correspond to the 10th, the 30th, the 70th, the 90th percentile. The dashed line is for the constant bandwidth method and the dotted line is for the variable bandwidth method. Model F: x 2 = 1 Model F: x 1 = 1 m(x 1,0) m(x 1,1) m(x 1, 1) x 1 Model F: x 2 = 0 x 1 Model F: x 2 = 1 m( 2, x 2 ) m(0, x 2 ) m(1, x 2 ) x 2 Model F: x 1 = 0 x 2 Model F: x 1 = 1 x 1 x 2 20

23 Figure 3: Results for the simulated bivariate data of model G. The true regression functions (the solid line) and four typical estimated curves are presented. These correspond to the 10th, the 30th, the 70th, the 90th percentile. The dashed line is for the constant bandwidth method and the dotted line is for the variable bandwidth method. Model G: x 2 = 1 Model G: x 1 = 1 m(x 1,0) m(x 1,1) m(x 1, 1) x 1 Model G: x 2 = 0 x 1 Model G: x 2 = 1 m( 2, x 2 ) m(0, x 2 ) m(1, x 2 ) x 2 Model G: x 1 = 0 x 2 Model G: x 1 = 1 x 1 x 2 21

24 Figure 4: Results for the simulated bivariate data of model H. The true regression functions (the solid line) and four typical estimated curves are presented. These correspond to the 10th, the 30th, the 70th, the 90th percentile. The dashed line is for the constant bandwidth method and the dotted line is for the variable bandwidth method. Model H: x 2 = 1 Model H: x 1 = 1 m(x 1,0) m(x 1,1) m(x 1, 1) x 1 Model H: x 2 = 0 x 1 Model H: x 2 = 1 m( 2, x 2 ) m(0, x 2 ) m(1, x 2 ) x 2 Model H: x 1 = 0 x 2 Model H: x 1 = 1 x 1 x 2 22

25 Figure 5: Results for the simulated bivariate data of model I. The true regression functions (the solid line) and four typical estimated curves are presented. These correspond to the 10th, the 30th, the 70th, the 90th percentile. The dashed line is for the constant bandwidth method and the dotted line is for the variable bandwidth method. Model I: x 2 = 1 Model I: x 1 = 1 m(x 1,0) m(x 1,1) m(x 1, 1) x 1 Model I: x 2 = 0 x 1 Model I: x 2 = 1 m( 2, x 2 ) m(0, x 2 ) m(1, x 2 ) x 2 Model I: x 1 = 0 x 2 Model I: x 1 = 1 x 1 x 2 23

Local linear multiple regression with variable. bandwidth in the presence of heteroscedasticity

Local linear multiple regression with variable. bandwidth in the presence of heteroscedasticity Local linear multiple regression with variable bandwidth in the presence of heteroscedasticity Azhong Ye 1 Rob J Hyndman 2 Zinai Li 3 23 January 2006 Abstract: We present local linear estimator with variable

More information

Local Polynomial Regression

Local Polynomial Regression VI Local Polynomial Regression (1) Global polynomial regression We observe random pairs (X 1, Y 1 ),, (X n, Y n ) where (X 1, Y 1 ),, (X n, Y n ) iid (X, Y ). We want to estimate m(x) = E(Y X = x) based

More information

Nonparametric Regression. Changliang Zou

Nonparametric Regression. Changliang Zou Nonparametric Regression Institute of Statistics, Nankai University Email: nk.chlzou@gmail.com Smoothing parameter selection An overall measure of how well m h (x) performs in estimating m(x) over x (0,

More information

Nonparametric Regression

Nonparametric Regression Nonparametric Regression Econ 674 Purdue University April 8, 2009 Justin L. Tobias (Purdue) Nonparametric Regression April 8, 2009 1 / 31 Consider the univariate nonparametric regression model: where y

More information

Variance Function Estimation in Multivariate Nonparametric Regression

Variance Function Estimation in Multivariate Nonparametric Regression Variance Function Estimation in Multivariate Nonparametric Regression T. Tony Cai 1, Michael Levine Lie Wang 1 Abstract Variance function estimation in multivariate nonparametric regression is considered

More information

Nonparametric Modal Regression

Nonparametric Modal Regression Nonparametric Modal Regression Summary In this article, we propose a new nonparametric modal regression model, which aims to estimate the mode of the conditional density of Y given predictors X. The nonparametric

More information

DEPARTMENT MATHEMATIK ARBEITSBEREICH MATHEMATISCHE STATISTIK UND STOCHASTISCHE PROZESSE

DEPARTMENT MATHEMATIK ARBEITSBEREICH MATHEMATISCHE STATISTIK UND STOCHASTISCHE PROZESSE Estimating the error distribution in nonparametric multiple regression with applications to model testing Natalie Neumeyer & Ingrid Van Keilegom Preprint No. 2008-01 July 2008 DEPARTMENT MATHEMATIK ARBEITSBEREICH

More information

ERROR VARIANCE ESTIMATION IN NONPARAMETRIC REGRESSION MODELS

ERROR VARIANCE ESTIMATION IN NONPARAMETRIC REGRESSION MODELS ERROR VARIANCE ESTIMATION IN NONPARAMETRIC REGRESSION MODELS By YOUSEF FAYZ M ALHARBI A thesis submitted to The University of Birmingham for the Degree of DOCTOR OF PHILOSOPHY School of Mathematics The

More information

Smooth simultaneous confidence bands for cumulative distribution functions

Smooth simultaneous confidence bands for cumulative distribution functions Journal of Nonparametric Statistics, 2013 Vol. 25, No. 2, 395 407, http://dx.doi.org/10.1080/10485252.2012.759219 Smooth simultaneous confidence bands for cumulative distribution functions Jiangyan Wang

More information

Local Polynomial Modelling and Its Applications

Local Polynomial Modelling and Its Applications Local Polynomial Modelling and Its Applications J. Fan Department of Statistics University of North Carolina Chapel Hill, USA and I. Gijbels Institute of Statistics Catholic University oflouvain Louvain-la-Neuve,

More information

Test for Discontinuities in Nonparametric Regression

Test for Discontinuities in Nonparametric Regression Communications of the Korean Statistical Society Vol. 15, No. 5, 2008, pp. 709 717 Test for Discontinuities in Nonparametric Regression Dongryeon Park 1) Abstract The difference of two one-sided kernel

More information

Simple and Efficient Improvements of Multivariate Local Linear Regression

Simple and Efficient Improvements of Multivariate Local Linear Regression Journal of Multivariate Analysis Simple and Efficient Improvements of Multivariate Local Linear Regression Ming-Yen Cheng 1 and Liang Peng Abstract This paper studies improvements of multivariate local

More information

WEIGHTED QUANTILE REGRESSION THEORY AND ITS APPLICATION. Abstract

WEIGHTED QUANTILE REGRESSION THEORY AND ITS APPLICATION. Abstract Journal of Data Science,17(1). P. 145-160,2019 DOI:10.6339/JDS.201901_17(1).0007 WEIGHTED QUANTILE REGRESSION THEORY AND ITS APPLICATION Wei Xiong *, Maozai Tian 2 1 School of Statistics, University of

More information

Nonparametric Regression Härdle, Müller, Sperlich, Werwarz, 1995, Nonparametric and Semiparametric Models, An Introduction

Nonparametric Regression Härdle, Müller, Sperlich, Werwarz, 1995, Nonparametric and Semiparametric Models, An Introduction Härdle, Müller, Sperlich, Werwarz, 1995, Nonparametric and Semiparametric Models, An Introduction Tine Buch-Kromann Univariate Kernel Regression The relationship between two variables, X and Y where m(

More information

Modelling Non-linear and Non-stationary Time Series

Modelling Non-linear and Non-stationary Time Series Modelling Non-linear and Non-stationary Time Series Chapter 2: Non-parametric methods Henrik Madsen Advanced Time Series Analysis September 206 Henrik Madsen (02427 Adv. TS Analysis) Lecture Notes September

More information

Nonparametric confidence intervals. for receiver operating characteristic curves

Nonparametric confidence intervals. for receiver operating characteristic curves Nonparametric confidence intervals for receiver operating characteristic curves Peter G. Hall 1, Rob J. Hyndman 2, and Yanan Fan 3 5 December 2003 Abstract: We study methods for constructing confidence

More information

Nonparametric Density Estimation (Multidimension)

Nonparametric Density Estimation (Multidimension) Nonparametric Density Estimation (Multidimension) Härdle, Müller, Sperlich, Werwarz, 1995, Nonparametric and Semiparametric Models, An Introduction Tine Buch-Kromann February 19, 2007 Setup One-dimensional

More information

Bickel Rosenblatt test

Bickel Rosenblatt test University of Latvia 28.05.2011. A classical Let X 1,..., X n be i.i.d. random variables with a continuous probability density function f. Consider a simple hypothesis H 0 : f = f 0 with a significance

More information

Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics. Jiti Gao

Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics. Jiti Gao Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics Jiti Gao Department of Statistics School of Mathematics and Statistics The University of Western Australia Crawley

More information

Nonparametric Econometrics

Nonparametric Econometrics Applied Microeconometrics with Stata Nonparametric Econometrics Spring Term 2011 1 / 37 Contents Introduction The histogram estimator The kernel density estimator Nonparametric regression estimators Semi-

More information

Estimation of a quadratic regression functional using the sinc kernel

Estimation of a quadratic regression functional using the sinc kernel Estimation of a quadratic regression functional using the sinc kernel Nicolai Bissantz Hajo Holzmann Institute for Mathematical Stochastics, Georg-August-University Göttingen, Maschmühlenweg 8 10, D-37073

More information

Bayesian estimation of bandwidths for a nonparametric regression model with a flexible error density

Bayesian estimation of bandwidths for a nonparametric regression model with a flexible error density ISSN 1440-771X Australia Department of Econometrics and Business Statistics http://www.buseco.monash.edu.au/depts/ebs/pubs/wpapers/ Bayesian estimation of bandwidths for a nonparametric regression model

More information

Automatic Local Smoothing for Spectral Density. Abstract. This article uses local polynomial techniques to t Whittle's likelihood for spectral density

Automatic Local Smoothing for Spectral Density. Abstract. This article uses local polynomial techniques to t Whittle's likelihood for spectral density Automatic Local Smoothing for Spectral Density Estimation Jianqing Fan Department of Statistics University of North Carolina Chapel Hill, N.C. 27599-3260 Eva Kreutzberger Department of Mathematics University

More information

A New Procedure for Multiple Testing of Econometric Models

A New Procedure for Multiple Testing of Econometric Models A New Procedure for Multiple Testing of Econometric Models Maxwell L. King 1, Xibin Zhang, and Muhammad Akram Department of Econometrics and Business Statistics Monash University, Australia April 2007

More information

Multiscale Exploratory Analysis of Regression Quantiles using Quantile SiZer

Multiscale Exploratory Analysis of Regression Quantiles using Quantile SiZer Multiscale Exploratory Analysis of Regression Quantiles using Quantile SiZer Cheolwoo Park Thomas C. M. Lee Jan Hannig April 6, 2 Abstract The SiZer methodology proposed by Chaudhuri & Marron (999) is

More information

41903: Introduction to Nonparametrics

41903: Introduction to Nonparametrics 41903: Notes 5 Introduction Nonparametrics fundamentally about fitting flexible models: want model that is flexible enough to accommodate important patterns but not so flexible it overspecializes to specific

More information

Introduction. Linear Regression. coefficient estimates for the wage equation: E(Y X) = X 1 β X d β d = X β

Introduction. Linear Regression. coefficient estimates for the wage equation: E(Y X) = X 1 β X d β d = X β Introduction - Introduction -2 Introduction Linear Regression E(Y X) = X β +...+X d β d = X β Example: Wage equation Y = log wages, X = schooling (measured in years), labor market experience (measured

More information

Minimax Rate of Convergence for an Estimator of the Functional Component in a Semiparametric Multivariate Partially Linear Model.

Minimax Rate of Convergence for an Estimator of the Functional Component in a Semiparametric Multivariate Partially Linear Model. Minimax Rate of Convergence for an Estimator of the Functional Component in a Semiparametric Multivariate Partially Linear Model By Michael Levine Purdue University Technical Report #14-03 Department of

More information

Nonparametric Inference via Bootstrapping the Debiased Estimator

Nonparametric Inference via Bootstrapping the Debiased Estimator Nonparametric Inference via Bootstrapping the Debiased Estimator Yen-Chi Chen Department of Statistics, University of Washington ICSA-Canada Chapter Symposium 2017 1 / 21 Problem Setup Let X 1,, X n be

More information

New Local Estimation Procedure for Nonparametric Regression Function of Longitudinal Data

New Local Estimation Procedure for Nonparametric Regression Function of Longitudinal Data ew Local Estimation Procedure for onparametric Regression Function of Longitudinal Data Weixin Yao and Runze Li The Pennsylvania State University Technical Report Series #0-03 College of Health and Human

More information

Nonparametric Density Estimation

Nonparametric Density Estimation Nonparametric Density Estimation Econ 690 Purdue University Justin L. Tobias (Purdue) Nonparametric Density Estimation 1 / 29 Density Estimation Suppose that you had some data, say on wages, and you wanted

More information

Optimal bandwidth selection for differences of nonparametric estimators with an application to the sharp regression discontinuity design

Optimal bandwidth selection for differences of nonparametric estimators with an application to the sharp regression discontinuity design Optimal bandwidth selection for differences of nonparametric estimators with an application to the sharp regression discontinuity design Yoichi Arai Hidehiko Ichimura The Institute for Fiscal Studies Department

More information

Density estimation Nonparametric conditional mean estimation Semiparametric conditional mean estimation. Nonparametrics. Gabriel Montes-Rojas

Density estimation Nonparametric conditional mean estimation Semiparametric conditional mean estimation. Nonparametrics. Gabriel Montes-Rojas 0 0 5 Motivation: Regression discontinuity (Angrist&Pischke) Outcome.5 1 1.5 A. Linear E[Y 0i X i] 0.2.4.6.8 1 X Outcome.5 1 1.5 B. Nonlinear E[Y 0i X i] i 0.2.4.6.8 1 X utcome.5 1 1.5 C. Nonlinearity

More information

On a Nonparametric Notion of Residual and its Applications

On a Nonparametric Notion of Residual and its Applications On a Nonparametric Notion of Residual and its Applications Bodhisattva Sen and Gábor Székely arxiv:1409.3886v1 [stat.me] 12 Sep 2014 Columbia University and National Science Foundation September 16, 2014

More information

DESIGN-ADAPTIVE MINIMAX LOCAL LINEAR REGRESSION FOR LONGITUDINAL/CLUSTERED DATA

DESIGN-ADAPTIVE MINIMAX LOCAL LINEAR REGRESSION FOR LONGITUDINAL/CLUSTERED DATA Statistica Sinica 18(2008), 515-534 DESIGN-ADAPTIVE MINIMAX LOCAL LINEAR REGRESSION FOR LONGITUDINAL/CLUSTERED DATA Kani Chen 1, Jianqing Fan 2 and Zhezhen Jin 3 1 Hong Kong University of Science and Technology,

More information

University, Tempe, Arizona, USA b Department of Mathematics and Statistics, University of New. Mexico, Albuquerque, New Mexico, USA

University, Tempe, Arizona, USA b Department of Mathematics and Statistics, University of New. Mexico, Albuquerque, New Mexico, USA This article was downloaded by: [University of New Mexico] On: 27 September 2012, At: 22:13 Publisher: Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 1072954 Registered

More information

Function of Longitudinal Data

Function of Longitudinal Data New Local Estimation Procedure for Nonparametric Regression Function of Longitudinal Data Weixin Yao and Runze Li Abstract This paper develops a new estimation of nonparametric regression functions for

More information

Nonparametric Methods

Nonparametric Methods Nonparametric Methods Michael R. Roberts Department of Finance The Wharton School University of Pennsylvania July 28, 2009 Michael R. Roberts Nonparametric Methods 1/42 Overview Great for data analysis

More information

Nonparametric Function Estimation with Infinite-Order Kernels

Nonparametric Function Estimation with Infinite-Order Kernels Nonparametric Function Estimation with Infinite-Order Kernels Arthur Berg Department of Statistics, University of Florida March 15, 2008 Kernel Density Estimation (IID Case) Let X 1,..., X n iid density

More information

Adaptive Kernel Estimation of The Hazard Rate Function

Adaptive Kernel Estimation of The Hazard Rate Function Adaptive Kernel Estimation of The Hazard Rate Function Raid Salha Department of Mathematics, Islamic University of Gaza, Palestine, e-mail: rbsalha@mail.iugaza.edu Abstract In this paper, we generalized

More information

Bandwidth selection for kernel conditional density

Bandwidth selection for kernel conditional density Bandwidth selection for kernel conditional density estimation David M Bashtannyk and Rob J Hyndman 1 10 August 1999 Abstract: We consider bandwidth selection for the kernel estimator of conditional density

More information

Integral approximation by kernel smoothing

Integral approximation by kernel smoothing Integral approximation by kernel smoothing François Portier Université catholique de Louvain - ISBA August, 29 2014 In collaboration with Bernard Delyon Topic of the talk: Given ϕ : R d R, estimation of

More information

Multivariate Locally Weighted Polynomial Fitting and Partial Derivative Estimation

Multivariate Locally Weighted Polynomial Fitting and Partial Derivative Estimation journal of multivariate analysis 59, 8705 (996) article no. 0060 Multivariate Locally Weighted Polynomial Fitting and Partial Derivative Estimation Zhan-Qian Lu Geophysical Statistics Project, National

More information

Statistica Sinica Preprint No: SS

Statistica Sinica Preprint No: SS Statistica Sinica Preprint No: SS-017-0013 Title A Bootstrap Method for Constructing Pointwise and Uniform Confidence Bands for Conditional Quantile Functions Manuscript ID SS-017-0013 URL http://wwwstatsinicaedutw/statistica/

More information

Does k-th Moment Exist?

Does k-th Moment Exist? Does k-th Moment Exist? Hitomi, K. 1 and Y. Nishiyama 2 1 Kyoto Institute of Technology, Japan 2 Institute of Economic Research, Kyoto University, Japan Email: hitomi@kit.ac.jp Keywords: Existence of moments,

More information

On Asymptotic Normality of the Local Polynomial Regression Estimator with Stochastic Bandwidths 1. Carlos Martins-Filho.

On Asymptotic Normality of the Local Polynomial Regression Estimator with Stochastic Bandwidths 1. Carlos Martins-Filho. On Asymptotic Normality of the Local Polynomial Regression Estimator with Stochastic Bandwidths 1 Carlos Martins-Filho Department of Economics IFPRI University of Colorado 2033 K Street NW Boulder, CO

More information

Local Modal Regression

Local Modal Regression Local Modal Regression Weixin Yao Department of Statistics, Kansas State University, Manhattan, Kansas 66506, U.S.A. wxyao@ksu.edu Bruce G. Lindsay and Runze Li Department of Statistics, The Pennsylvania

More information

SUPPLEMENTARY MATERIAL FOR PUBLICATION ONLINE 1

SUPPLEMENTARY MATERIAL FOR PUBLICATION ONLINE 1 SUPPLEMENTARY MATERIAL FOR PUBLICATION ONLINE 1 B Technical details B.1 Variance of ˆf dec in the ersmooth case We derive the variance in the case where ϕ K (t) = ( 1 t 2) κ I( 1 t 1), i.e. we take K as

More information

ECON The Simple Regression Model

ECON The Simple Regression Model ECON 351 - The Simple Regression Model Maggie Jones 1 / 41 The Simple Regression Model Our starting point will be the simple regression model where we look at the relationship between two variables In

More information

Single Index Quantile Regression for Heteroscedastic Data

Single Index Quantile Regression for Heteroscedastic Data Single Index Quantile Regression for Heteroscedastic Data E. Christou M. G. Akritas Department of Statistics The Pennsylvania State University SMAC, November 6, 2015 E. Christou, M. G. Akritas (PSU) SIQR

More information

SEMIPARAMETRIC ESTIMATION OF CONDITIONAL HETEROSCEDASTICITY VIA SINGLE-INDEX MODELING

SEMIPARAMETRIC ESTIMATION OF CONDITIONAL HETEROSCEDASTICITY VIA SINGLE-INDEX MODELING Statistica Sinica 3 (013), 135-155 doi:http://dx.doi.org/10.5705/ss.01.075 SEMIPARAMERIC ESIMAION OF CONDIIONAL HEEROSCEDASICIY VIA SINGLE-INDEX MODELING Liping Zhu, Yuexiao Dong and Runze Li Shanghai

More information

Goodness-of-fit tests for the cure rate in a mixture cure model

Goodness-of-fit tests for the cure rate in a mixture cure model Biometrika (217), 13, 1, pp. 1 7 Printed in Great Britain Advance Access publication on 31 July 216 Goodness-of-fit tests for the cure rate in a mixture cure model BY U.U. MÜLLER Department of Statistics,

More information

NADARAYA WATSON ESTIMATE JAN 10, 2006: version 2. Y ik ( x i

NADARAYA WATSON ESTIMATE JAN 10, 2006: version 2. Y ik ( x i NADARAYA WATSON ESTIMATE JAN 0, 2006: version 2 DATA: (x i, Y i, i =,..., n. ESTIMATE E(Y x = m(x by n i= ˆm (x = Y ik ( x i x n i= K ( x i x EXAMPLES OF K: K(u = I{ u c} (uniform or box kernel K(u = u

More information

Time Series and Forecasting Lecture 4 NonLinear Time Series

Time Series and Forecasting Lecture 4 NonLinear Time Series Time Series and Forecasting Lecture 4 NonLinear Time Series Bruce E. Hansen Summer School in Economics and Econometrics University of Crete July 23-27, 2012 Bruce Hansen (University of Wisconsin) Foundations

More information

Some Monte Carlo Evidence for Adaptive Estimation of Unit-Time Varying Heteroscedastic Panel Data Models

Some Monte Carlo Evidence for Adaptive Estimation of Unit-Time Varying Heteroscedastic Panel Data Models Some Monte Carlo Evidence for Adaptive Estimation of Unit-Time Varying Heteroscedastic Panel Data Models G. R. Pasha Department of Statistics, Bahauddin Zakariya University Multan, Pakistan E-mail: drpasha@bzu.edu.pk

More information

Local linear hazard rate estimation and bandwidth selection

Local linear hazard rate estimation and bandwidth selection Ann Inst Stat Math 2011) 63:1019 1046 DOI 10.1007/s10463-010-0277-6 Local linear hazard rate estimation and bandwidth selection Dimitrios Bagkavos Received: 23 February 2009 / Revised: 27 May 2009 / Published

More information

Transformation and Smoothing in Sample Survey Data

Transformation and Smoothing in Sample Survey Data Scandinavian Journal of Statistics, Vol. 37: 496 513, 2010 doi: 10.1111/j.1467-9469.2010.00691.x Published by Blackwell Publishing Ltd. Transformation and Smoothing in Sample Survey Data YANYUAN MA Department

More information

Introduction to Regression

Introduction to Regression Introduction to Regression Chad M. Schafer May 20, 2015 Outline General Concepts of Regression, Bias-Variance Tradeoff Linear Regression Nonparametric Procedures Cross Validation Local Polynomial Regression

More information

Discussion of the paper Inference for Semiparametric Models: Some Questions and an Answer by Bickel and Kwon

Discussion of the paper Inference for Semiparametric Models: Some Questions and an Answer by Bickel and Kwon Discussion of the paper Inference for Semiparametric Models: Some Questions and an Answer by Bickel and Kwon Jianqing Fan Department of Statistics Chinese University of Hong Kong AND Department of Statistics

More information

UNIVERSIDADE DE SANTIAGO DE COMPOSTELA DEPARTAMENTO DE ESTATÍSTICA E INVESTIGACIÓN OPERATIVA

UNIVERSIDADE DE SANTIAGO DE COMPOSTELA DEPARTAMENTO DE ESTATÍSTICA E INVESTIGACIÓN OPERATIVA UNIVERSIDADE DE SANTIAGO DE COMPOSTELA DEPARTAMENTO DE ESTATÍSTICA E INVESTIGACIÓN OPERATIVA GOODNESS OF FIT TEST FOR LINEAR REGRESSION MODELS WITH MISSING RESPONSE DATA González Manteiga, W. and Pérez

More information

ECON 721: Lecture Notes on Nonparametric Density and Regression Estimation. Petra E. Todd

ECON 721: Lecture Notes on Nonparametric Density and Regression Estimation. Petra E. Todd ECON 721: Lecture Notes on Nonparametric Density and Regression Estimation Petra E. Todd Fall, 2014 2 Contents 1 Review of Stochastic Order Symbols 1 2 Nonparametric Density Estimation 3 2.1 Histogram

More information

Adaptive Nonparametric Density Estimators

Adaptive Nonparametric Density Estimators Adaptive Nonparametric Density Estimators by Alan J. Izenman Introduction Theoretical results and practical application of histograms as density estimators usually assume a fixed-partition approach, where

More information

Web-based Supplementary Material for. Dependence Calibration in Conditional Copulas: A Nonparametric Approach

Web-based Supplementary Material for. Dependence Calibration in Conditional Copulas: A Nonparametric Approach 1 Web-based Supplementary Material for Dependence Calibration in Conditional Copulas: A Nonparametric Approach Elif F. Acar, Radu V. Craiu, and Fang Yao Web Appendix A: Technical Details The score and

More information

A Bootstrap Test for Conditional Symmetry

A Bootstrap Test for Conditional Symmetry ANNALS OF ECONOMICS AND FINANCE 6, 51 61 005) A Bootstrap Test for Conditional Symmetry Liangjun Su Guanghua School of Management, Peking University E-mail: lsu@gsm.pku.edu.cn and Sainan Jin Guanghua School

More information

Some Theories about Backfitting Algorithm for Varying Coefficient Partially Linear Model

Some Theories about Backfitting Algorithm for Varying Coefficient Partially Linear Model Some Theories about Backfitting Algorithm for Varying Coefficient Partially Linear Model 1. Introduction Varying-coefficient partially linear model (Zhang, Lee, and Song, 2002; Xia, Zhang, and Tong, 2004;

More information

BETA KERNEL SMOOTHERS FOR REGRESSION CURVES

BETA KERNEL SMOOTHERS FOR REGRESSION CURVES Statistica Sinica 1(2), 73-91 BETA KERNEL SMOOTHERS FOR REGRESSION CURVES Song Xi Chen La Trobe University Abstract: This paper proposes beta kernel smoothers for estimating curves with compact support

More information

Optimal bandwidth selection for the fuzzy regression discontinuity estimator

Optimal bandwidth selection for the fuzzy regression discontinuity estimator Optimal bandwidth selection for the fuzzy regression discontinuity estimator Yoichi Arai Hidehiko Ichimura The Institute for Fiscal Studies Department of Economics, UCL cemmap working paper CWP49/5 Optimal

More information

Week 9 The Central Limit Theorem and Estimation Concepts

Week 9 The Central Limit Theorem and Estimation Concepts Week 9 and Estimation Concepts Week 9 and Estimation Concepts Week 9 Objectives 1 The Law of Large Numbers and the concept of consistency of averages are introduced. The condition of existence of the population

More information

A Design Unbiased Variance Estimator of the Systematic Sample Means

A Design Unbiased Variance Estimator of the Systematic Sample Means American Journal of Theoretical and Applied Statistics 2015; 4(3): 201-210 Published online May 29, 2015 (http://www.sciencepublishinggroup.com/j/ajtas) doi: 10.1148/j.ajtas.20150403.27 ISSN: 232-8999

More information

Oberwolfach Preprints

Oberwolfach Preprints Oberwolfach Preprints OWP 202-07 LÁSZLÓ GYÖFI, HAO WALK Strongly Consistent Density Estimation of egression esidual Mathematisches Forschungsinstitut Oberwolfach ggmbh Oberwolfach Preprints (OWP) ISSN

More information

Introduction to Regression

Introduction to Regression Introduction to Regression Chad M. Schafer cschafer@stat.cmu.edu Carnegie Mellon University Introduction to Regression p. 1/100 Outline General Concepts of Regression, Bias-Variance Tradeoff Linear Regression

More information

A Bayesian approach to parameter estimation for kernel density estimation via transformations

A Bayesian approach to parameter estimation for kernel density estimation via transformations A Bayesian approach to parameter estimation for kernel density estimation via transformations Qing Liu,, David Pitt 2, Xibin Zhang 3, Xueyuan Wu Centre for Actuarial Studies, Faculty of Business and Economics,

More information

HOW IS GENERALIZED LEAST SQUARES RELATED TO WITHIN AND BETWEEN ESTIMATORS IN UNBALANCED PANEL DATA?

HOW IS GENERALIZED LEAST SQUARES RELATED TO WITHIN AND BETWEEN ESTIMATORS IN UNBALANCED PANEL DATA? HOW IS GENERALIZED LEAST SQUARES RELATED TO WITHIN AND BETWEEN ESTIMATORS IN UNBALANCED PANEL DATA? ERIK BIØRN Department of Economics University of Oslo P.O. Box 1095 Blindern 0317 Oslo Norway E-mail:

More information

LOCAL POLYNOMIAL REGRESSION ON UNKNOWN MANIFOLDS. Department of Statistics. University of California at Berkeley, USA. 1.

LOCAL POLYNOMIAL REGRESSION ON UNKNOWN MANIFOLDS. Department of Statistics. University of California at Berkeley, USA. 1. LOCAL POLYNOMIAL REGRESSION ON UNKNOWN MANIFOLDS PETER J. BICKEL AND BO LI Department of Statistics University of California at Berkeley, USA Abstract. We reveal the phenomenon that naive multivariate

More information

Proceedings of the 2016 Winter Simulation Conference T. M. K. Roeder, P. I. Frazier, R. Szechtman, E. Zhou, T. Huschka, and S. E. Chick, eds.

Proceedings of the 2016 Winter Simulation Conference T. M. K. Roeder, P. I. Frazier, R. Szechtman, E. Zhou, T. Huschka, and S. E. Chick, eds. Proceedings of the 2016 Winter Simulation Conference T. M. K. Roeder, P. I. Frazier, R. Szechtman, E. Zhou, T. Huschka, and S. E. Chick, eds. THE EFFECTS OF ESTIMATION OF HETEROSCEDASTICITY ON STOCHASTIC

More information

Simple Linear Regression: The Model

Simple Linear Regression: The Model Simple Linear Regression: The Model task: quantifying the effect of change X in X on Y, with some constant β 1 : Y = β 1 X, linear relationship between X and Y, however, relationship subject to a random

More information

Discussion Paper No. 28

Discussion Paper No. 28 Discussion Paper No. 28 Asymptotic Property of Wrapped Cauchy Kernel Density Estimation on the Circle Yasuhito Tsuruta Masahiko Sagae Asymptotic Property of Wrapped Cauchy Kernel Density Estimation on

More information

Quantitative Economics for the Evaluation of the European Policy. Dipartimento di Economia e Management

Quantitative Economics for the Evaluation of the European Policy. Dipartimento di Economia e Management Quantitative Economics for the Evaluation of the European Policy Dipartimento di Economia e Management Irene Brunetti 1 Davide Fiaschi 2 Angela Parenti 3 9 ottobre 2015 1 ireneb@ec.unipi.it. 2 davide.fiaschi@unipi.it.

More information

Optimal Bandwidth Choice for the Regression Discontinuity Estimator

Optimal Bandwidth Choice for the Regression Discontinuity Estimator Optimal Bandwidth Choice for the Regression Discontinuity Estimator Guido Imbens and Karthik Kalyanaraman First Draft: June 8 This Draft: September Abstract We investigate the choice of the bandwidth for

More information

Estimation and Testing for Varying Coefficients in Additive Models with Marginal Integration

Estimation and Testing for Varying Coefficients in Additive Models with Marginal Integration Estimation and Testing for Varying Coefficients in Additive Models with Marginal Integration Lijian Yang Byeong U. Park Lan Xue Wolfgang Härdle September 6, 2005 Abstract We propose marginal integration

More information

Math 118, Handout 4: Hermite functions and the Fourier transform. n! where D = d/dx, are a basis of eigenfunctions for the Fourier transform

Math 118, Handout 4: Hermite functions and the Fourier transform. n! where D = d/dx, are a basis of eigenfunctions for the Fourier transform The Hermite functions defined by h n (x) = ( )n e x2 /2 D n e x2 where D = d/dx, are a basis of eigenfunctions for the Fourier transform f(k) = f(x)e ikx dx on L 2 (R). Since h 0 (x) = e x2 /2 h (x) =

More information

Local Polynomial Estimation for Sensitivity Analysis on Models With Correlated Inputs

Local Polynomial Estimation for Sensitivity Analysis on Models With Correlated Inputs Local Polynomial Estimation for Sensitivity Analysis on Models With Correlated Inputs Sébastien Da Veiga, François Wahl, Fabrice Gamboa To cite this version: Sébastien Da Veiga, François Wahl, Fabrice

More information

SMOOTHED BLOCK EMPIRICAL LIKELIHOOD FOR QUANTILES OF WEAKLY DEPENDENT PROCESSES

SMOOTHED BLOCK EMPIRICAL LIKELIHOOD FOR QUANTILES OF WEAKLY DEPENDENT PROCESSES Statistica Sinica 19 (2009), 71-81 SMOOTHED BLOCK EMPIRICAL LIKELIHOOD FOR QUANTILES OF WEAKLY DEPENDENT PROCESSES Song Xi Chen 1,2 and Chiu Min Wong 3 1 Iowa State University, 2 Peking University and

More information

Relative error prediction via kernel regression smoothers

Relative error prediction via kernel regression smoothers Relative error prediction via kernel regression smoothers Heungsun Park a, Key-Il Shin a, M.C. Jones b, S.K. Vines b a Department of Statistics, Hankuk University of Foreign Studies, YongIn KyungKi, 449-791,

More information

CALCULATING DEGREES OF FREEDOM IN MULTIVARIATE LOCAL POLYNOMIAL REGRESSION. 1. Introduction

CALCULATING DEGREES OF FREEDOM IN MULTIVARIATE LOCAL POLYNOMIAL REGRESSION. 1. Introduction CALCULATING DEGREES OF FREEDOM IN MULTIVARIATE LOCAL POLYNOMIAL REGRESSION NADINE MCCLOUD AND CHRISTOPHER F. PARMETER Abstract. The matrix that transforms the response variable in a regression to its predicted

More information

arxiv: v1 [stat.me] 22 Mar 2010

arxiv: v1 [stat.me] 22 Mar 2010 A longest run test for heteroscedasticity in univariate regression model arxiv:1003.4156v1 [stat.me] 22 Mar 2010 Aubin Jean-Baptiste a, Leoni-Aubin Samuela b a Université de Technologie de Compiègne, Rue

More information

A Note on Data-Adaptive Bandwidth Selection for Sequential Kernel Smoothers

A Note on Data-Adaptive Bandwidth Selection for Sequential Kernel Smoothers 6th St.Petersburg Workshop on Simulation (2009) 1-3 A Note on Data-Adaptive Bandwidth Selection for Sequential Kernel Smoothers Ansgar Steland 1 Abstract Sequential kernel smoothers form a class of procedures

More information

SEMIPARAMETRIC QUANTILE REGRESSION WITH HIGH-DIMENSIONAL COVARIATES. Liping Zhu, Mian Huang, & Runze Li. The Pennsylvania State University

SEMIPARAMETRIC QUANTILE REGRESSION WITH HIGH-DIMENSIONAL COVARIATES. Liping Zhu, Mian Huang, & Runze Li. The Pennsylvania State University SEMIPARAMETRIC QUANTILE REGRESSION WITH HIGH-DIMENSIONAL COVARIATES Liping Zhu, Mian Huang, & Runze Li The Pennsylvania State University Technical Report Series #10-104 College of Health and Human Development

More information

LOCAL POLYNOMIAL AND PENALIZED TRIGONOMETRIC SERIES REGRESSION

LOCAL POLYNOMIAL AND PENALIZED TRIGONOMETRIC SERIES REGRESSION Statistica Sinica 24 (2014), 1215-1238 doi:http://dx.doi.org/10.5705/ss.2012.040 LOCAL POLYNOMIAL AND PENALIZED TRIGONOMETRIC SERIES REGRESSION Li-Shan Huang and Kung-Sik Chan National Tsing Hua University

More information

4 Nonparametric Regression

4 Nonparametric Regression 4 Nonparametric Regression 4.1 Univariate Kernel Regression An important question in many fields of science is the relation between two variables, say X and Y. Regression analysis is concerned with the

More information

A NOTE ON TESTING THE SHOULDER CONDITION IN LINE TRANSECT SAMPLING

A NOTE ON TESTING THE SHOULDER CONDITION IN LINE TRANSECT SAMPLING A NOTE ON TESTING THE SHOULDER CONDITION IN LINE TRANSECT SAMPLING Shunpu Zhang Department of Mathematical Sciences University of Alaska, Fairbanks, Alaska, U.S.A. 99775 ABSTRACT We propose a new method

More information

Nonparametric Estimation of Regression Functions In the Presence of Irrelevant Regressors

Nonparametric Estimation of Regression Functions In the Presence of Irrelevant Regressors Nonparametric Estimation of Regression Functions In the Presence of Irrelevant Regressors Peter Hall, Qi Li, Jeff Racine 1 Introduction Nonparametric techniques robust to functional form specification.

More information

PREWHITENING-BASED ESTIMATION IN PARTIAL LINEAR REGRESSION MODELS: A COMPARATIVE STUDY

PREWHITENING-BASED ESTIMATION IN PARTIAL LINEAR REGRESSION MODELS: A COMPARATIVE STUDY REVSTAT Statistical Journal Volume 7, Number 1, April 2009, 37 54 PREWHITENING-BASED ESTIMATION IN PARTIAL LINEAR REGRESSION MODELS: A COMPARATIVE STUDY Authors: Germán Aneiros-Pérez Departamento de Matemáticas,

More information

O Combining cross-validation and plug-in methods - for kernel density bandwidth selection O

O Combining cross-validation and plug-in methods - for kernel density bandwidth selection O O Combining cross-validation and plug-in methods - for kernel density selection O Carlos Tenreiro CMUC and DMUC, University of Coimbra PhD Program UC UP February 18, 2011 1 Overview The nonparametric problem

More information

SOME CONGRUENCES ASSOCIATED WITH THE EQUATION X α = X β IN CERTAIN FINITE SEMIGROUPS

SOME CONGRUENCES ASSOCIATED WITH THE EQUATION X α = X β IN CERTAIN FINITE SEMIGROUPS SOME CONGRUENCES ASSOCIATED WITH THE EQUATION X α = X β IN CERTAIN FINITE SEMIGROUPS THOMAS W. MÜLLER Abstract. Let H be a finite group, T n the symmetric semigroup of degree n, and let α, β be integers

More information

A simple bootstrap method for constructing nonparametric confidence bands for functions

A simple bootstrap method for constructing nonparametric confidence bands for functions A simple bootstrap method for constructing nonparametric confidence bands for functions Peter Hall Joel Horowitz The Institute for Fiscal Studies Department of Economics, UCL cemmap working paper CWP4/2

More information

Additional results for model-based nonparametric variance estimation for systematic sampling in a forestry survey

Additional results for model-based nonparametric variance estimation for systematic sampling in a forestry survey Additional results for model-based nonparametric variance estimation for systematic sampling in a forestry survey J.D. Opsomer Colorado State University M. Francisco-Fernández Universidad de A Coruña July

More information

3 Nonparametric Density Estimation

3 Nonparametric Density Estimation 3 Nonparametric Density Estimation Example: Income distribution Source: U.K. Family Expenditure Survey (FES) 1968-1995 Approximately 7000 British Households per year For each household many different variables

More information

J. Cwik and J. Koronacki. Institute of Computer Science, Polish Academy of Sciences. to appear in. Computational Statistics and Data Analysis

J. Cwik and J. Koronacki. Institute of Computer Science, Polish Academy of Sciences. to appear in. Computational Statistics and Data Analysis A Combined Adaptive-Mixtures/Plug-In Estimator of Multivariate Probability Densities 1 J. Cwik and J. Koronacki Institute of Computer Science, Polish Academy of Sciences Ordona 21, 01-237 Warsaw, Poland

More information

Local Polynomial Wavelet Regression with Missing at Random

Local Polynomial Wavelet Regression with Missing at Random Applied Mathematical Sciences, Vol. 6, 2012, no. 57, 2805-2819 Local Polynomial Wavelet Regression with Missing at Random Alsaidi M. Altaher School of Mathematical Sciences Universiti Sains Malaysia 11800

More information