Goodness-of-fit tests for the cure rate in a mixture cure model
|
|
- Theodora Powell
- 5 years ago
- Views:
Transcription
1 Biometrika (217), 13, 1, pp. 1 7 Printed in Great Britain Advance Access publication on 31 July 216 Goodness-of-fit tests for the cure rate in a mixture cure model BY U.U. MÜLLER Department of Statistics, Texas A&M University, College Station, Texas , U.S.A. uschi@stat.tamu.edu 5 AND I. VAN KEILEGOM Research Centre for Operations Research and Business Statistics, KU Leuven, Naamsestraat 69, 3 Leuven, Belgium ingrid.vankeilegom@kuleuven.be S. SUPPLEMENTARY ONLINE MATERIAL 1 S.1. Proof of Theorem 1 To derive the desired result for the Härdle Mammen-type test statistic T n from equation (2) we write T n nh 1/2 [{ˆp(x) p(x)} 2 + {p(x) pˆθ(x)} 2 + 2{p(x) pˆθ(x)}{ˆp(x) p(x)}]π(x) dx. Consider local alternatives p(x) p θ (x) + n 1/2 h 1/4 n (x), where θ denotes the true parameter. This covers the null hypothesis as a special case with n. We write T n for the first term of T n. We will show that T n determines the distribution of the test statistic under the null hypothesis and that T n b h converges in distribution, 15 T n b h nh 1/2 {ˆp(x) p(x)} 2 π(x) dx b h N(, V ) (n ), (S.1) i.e., T n is under H approximately normally distributed, with mean b h O(h 1/2 ) tending to infinity and variance V. Before proving (S.1) we consider the second term of T n and, writing p(x) p θ (x) + n 1/2 h 1/4 n (x), we can split it into three parts: nh 1/2 {p θ (x) pˆθ(x) + n 1/2 h 1/4 n (x)} 2 π(x) dx nh 1/2 {p θ (x) pˆθ(x)} 2 π(x) dx + nh 1/2 {n 1/2 h 1/4 n (x)} 2 π(x) dx + 2nh 1/2 {p θ (x) pˆθ(x)}n 1/2 h 1/4 n (x) π(x) dx n (x) 2 π(x) dx + o p (1). The resulting integral comes from the middle part. To see that the first part involving (p θ pˆθ) 2 is asymptotically negligible, use assumptions (A6) and (A7), which ensure that p θ is Lipschitz C 217 Biometrika Trust
2 2 U.U. MÜLLER AND I. VAN KEILEGOM in θ, uniformly in x, and that ˆθ is root-n consistent. This gives the order O p (h 1/2 ) o p (1) as h. The third part in the above display vanishes with rate O p (h 1/4 ) o p (1), which follows by the same arguments, combined with the assumption that n ( ) is uniformly bounded. Now consider the mixed third term of T n. We will show that this term is asymptotically negligible, nh 1/2 {p(x) pˆθ(x)}{ˆp(x) p(x)}π(x) dx o p (1). (S.2) The proof requires some auxiliary results, which we provide when we verify (S.1), the asymptotic normality of T n. It is therefore postponed to the end of this proof. Let Λ denote the cumulative conditional hazard function Λ(t x) log S(t x) and let ˆΛ(t x) be the nonparametric estimator provided in Xu and Peng (214). These authors show that (nh) 1/2 {ˆp(x) p(x)} S(Y 1 (n) x)(nh)1/2 {ˆΛ(Y 1 (n) x) Λ(Y 1 (n) x)} + o p (1). (S.3) They also show that Y 1 (n) is weakly consistent for sup x τ (x), and that (nh) 1/2 {S(Y 1 (n) x) p(x)} o p (1). This yields (nh) 1/2 {ˆp(x) p(x)} p(x)(nh) 1/2 {ˆΛ(Y 1 (n) x) Λ(Y 1 (n) x)} + o p (1). In order to derive asymptotic normality of T n, we will need an approximation of ˆΛ Λ, which is, for example, provided in Du and Akritas (22). They show in their Theorem 3.1 that, uniformly in x, ˆΛ(t x) Λ(t x) w hi (x)ζ i (t x) + r(t, x) i1 with sup x,t t r(t, x) O{(nh) 3/4 (log n) 3/4 } almost surely, where t satisfies sup x H(t x) < 1, with w hi as specified in Section 2, and with ζ i (t x) defined in (6). It follows from arguments in Xu and Peng that ˆΛ(Y(n) 1 x) Λ(Y 1 (n) x) is asymptotically equivalent to n i1 w hi(x)ζ i {τ (x) x}. Then, T n nh 1/2 {ˆp(x) p(x)} 2 π(x) dx + o p (1) [ nh 1/2 w hi (x)ζ i {τ (x) x}] dx i1 [ + nh 1/2 r{τ (x), x} w hi (x)ζ i {τ (x) x}] dx i1 + nh 1/2 [r{τ (x), x}] 2 π(x) dx + o p (1) [ nh 1/2 w hi (x)ζ i {τ (x) x}] dx + op (1). i1 The last two terms in the above decomposition of T n (second equality) are of order o p (1), which can be explained as follows. Consider the approximation (S.3) of ˆp(x) p(x) by Du and Akritas (22), which involves the difference ˆΛ Λ; see the proof of Theorem 3.1 in Du and Akritas
3 Goodness-of-fit tests for the cure rate 3 (22) for the definition of that difference. It depends on a quantity Ĥ whose order is given in Lemma 4.2 of the same paper. This yields ˆp(x) p(x) O p {(nh) 1/2 (log n) 1/2 }. Combining this order for the main term n i1 w hi(x)ζ i {τ (x) x} and the order O p {(nh) 3/4 (log n) 3/4 } for the remainder term yields the order O p {nh 1/2 (nh) 5/4 (log n) 5/4 } for the mixed second 4 term. This is o p (1) since we assume nh 3 (log n) 5. The third term is o p (1) because it contains the squared remainder term and is therefore of smaller order. Inserting the definition of w hi into the above gives Now write where T 1n h1/2 n T 2n h1/2 n T n nh 1/2 [n 1 i1 i1 j1 j i i1 K h (x X i ) ζ i {τ (x) x}] dx + op (1). (S.4) T n T 1n + T 2n + o p (1), Kh 2(x X i) f 2 ζi 2 {τ (x) x}π(x) dx, (x) K h (x X i )K h (x X j ) f 2 ζ i {τ (x) x}ζ j {τ (x) x}π(x) dx. (x) Analogously as in the proof of Proposition 1 in Härdle and Mammen (1993), but now with ζ j in place of heteroscedastic errors ε j, we obtain the bias term as follows: E(T 1n ) h 1/2 f 2 (x) E[K2 h (x X)ζ2 {τ (x) x)}]π(x) dx h 1/2 h 1/2 b h + o(1), f 2 (x) E[ 1 f 2 (x) with, using assumptions (A1) and (A5), b h h 1/2 R(K) ) E[ζ 2 {τ (x) x} X] ] π(x) dx h 2 K2( x X h h K2 (v)f(x + vh)µ xx (x + vh) dv π(x) dx π(x) µ xx (x) dx O(h 1/2 ). We have nh 3, thanks to our assumptions on the bandwidth, and therefore var(t 1n X 1,..., X n ) O p (n 1 h 3 ) o p (1), T 1n b h + o p (1). The desired normal approximation (S.1), T n b h N(, V ), now follows from this statement, 45 combined with the asymptotic normality of T 2n, T 2n N(, V ) in distribution as n, which follows analogously to the arguments outlined in Härdle and Mammen (1993), using The-
4 U.U. MÜLLER AND I. VAN KEILEGOM orem 2.1 by de Jong (1987): write T 2n h1/2 V ij, n V ij i1 j1 j i K h(x X i )K h (x X j ) f 2 ζ i {τ (x) x}ζ j {τ (x) x}π(x) dx. (x) De Jong s theorem requires that T 2n is clean in the sense that the conditional expectations of the V ij given X i (i 1,..., n) vanish. Using similar arguments as in Cao and González- Manteiga (28; pages ), we see that ζ i {τ (x) x} and ζ j {τ (x) x} in the expression of V ij can be replaced by ζ i {τ (X i ) X i } and ζ j {τ (X j ) X j }. Since E[ζ{τ (x) x} X x], it follows that T 2n is indeed clean up to an asymptotically negligible term (and the asymptotic mean is zero). In order to obtain the asymptotic variance, it suffices to calculate the second moment of V ij. We have ( E(Vij) 2 1 E f 2 (x) K h(x X i )K h (x X j )ζ i {τ (x) x}ζ j {τ (x) x}π(x) p2 (z) ) f 2 (z) K h(z X i )K h (z X j )ζ i {τ (z) z}ζ j {τ (z) z}π(z) dx dz p 2 (z)π(x)π(z) f 2 (x)f 2 E[K h (x X i )K h (z X i )ζ i {τ (x) x}ζ i {τ (z) z}] (z) E[K h (x X j )K h (z X j )ζ j {τ (x) x}ζ j {τ (z) z}] dx dz p 2 (z)π(x)π(z) f 2 (x)f 2 (z) ( K h (x u)k h (z u)e[ζ{τ (x) x}ζ{τ (z) z} X u]f(u) du ) 2 dx dz p 2 (z)π(x)π(z) µ xz (x)µ xz (z) { 1 ( z x ) }2 f(z) h K K dx dz + o(1) h as h. Hence, using the fact that T 2n is clean, we obtain with ( h 1/2 ) var(t 2n ) var n 2 V ij V h 2h i<j 4h n 2 cov(v ij, V kl ) 4h n 2 i<j k<l 2hE(V 2 ij) + o(1) V h + o(1), p 2 (z)π(x)π(z){ 1 ( z x ) }2 f(z) h K K h i<j E(V 2 ij) + o(1) E[ζ{τ (x) x}ζ{τ (z) z} X x]e[ζ{τ (x) x}ζ{τ (z) z} X z] dx dz. Since p( ) is continuous and µ xz (z) is continuous in z by assumption (A5), we can approximate V h by V, as on page 1931 of Härdle and Mammen (1993), and obtain the desired formula, {p V 2K (4) 2 (x)π(x)e[ζ 2 {τ (x) x} X x] } 2dx. ()
5 Goodness-of-fit tests for the cure rate 5 It remains to verify equation (S.2), that the third mixed term of T n is asymptotically negligible. To see this use the arguments by Xu and Peng (214) and Du and Akritas (22), which we used above to derive approximation (S.4), namely ˆp(x) p(x) p(x)n 1 i1 K h (x X i ) ζ i {τ (x) x} + O{(nh) 3/4 (log n) 3/4 }. almost surely. Inserting the first term on the right-hand side into (S.2) gives nh 1/2 1 {p(x) pˆθ(x)}p(x)n i1 K h (x X i ) ζ i {τ (x) x}π(x) dx. (S.5) To determine the order of the part involving the remainder term remember that p(x) p θ (x) + n 1/2 h 1/4 n (x) with n uniformly bounded. Now use the assumptions on the bandwidth and assumptions (A6) and (A7) to obtain p(x) pˆθ(x) p θ (x) pˆθ(x) + n 1/2 h 1/4 n (x) O p (n 1/2 h 1/4 ). Hence the second part of the statistic has the rate O p { nh 1/2 (nh) 3/4 (log n) 3/4 n 1/2 h 1/4} O p { n 1/4 h 1/2 (log n) 3/4} o p (1), so it suffices to study statistic (S.5). Using a Taylor expansion we write it as a sum of three terms, T 31 + T 32 + T 33, where T 31 nh 1/2 (ˆθ θ) p(x)n 1 K h (x X i ) ζ i {τ (x) x}ṗ θ (x)π(x) dx, i1 T 32 nh 1/2 (ˆθ θ) p(x)n 1 K h (x X i ) ζ i {τ (x) x}{ṗ ξ (x) ṗ θ (x)}π(x) dx, i1 T 33 nh 1/2 n 1/2 h 1/4 p(x)n 1 K h (x X i ) ζ i {τ (x) x} n (x)π(x) dx. i1 Here ṗ t denotes the vector of partial derivatives with respect to the parameter evaluated at t, and 6 ξ is an intermediate value between θ and ˆθ. Consider the second term first and use the Lipschitz assumption (A6) on the gradient ṗ θ and the root-n consistency of ˆθ, assumption (A7), to obtain T 32 O p (nh 1/2 n 1/2 n 1/2 ) o p (1). The first and the third term, T 31 and T 33 are similar. We consider only T 33, which is more difficult to handle since it contains the additional factor h 1/4 and therefore converges at a slower rate. To get the desired o p (1) rate note that the order of the integral in T 33 is O(h 3 ) using the assumptions on the kernel K, and the formula of E{ζ i (t x) X i } given in equation (3.3) on page 474 in Van Keilegom and Veraverbeke (1997). Our function ζ is the function g in that paper. This gives E(T 33 ) nh 1/2 n 1/2 h 1/4 O(h 3 ) O(n 1/2 h 13/4 ) o(1).
6 6 U.U. MÜLLER AND I. VAN KEILEGOM The second moment, E(T33 2 ), is nh 1/2 n 2 ( E i1 p(x)p(y) K h(x X i ) K h (y X i ) f(y) E [ ] ) ζ i {τ (x) x}ζ i {τ (y) y} X i n (x) n (y)π(x)π(y) dx dy ( + nh 1/2 n 2 1 E i,j1 j i p(x)p(y) K h(x X i ) K h (y X j ) f(y) E [ ] [ ] ) ζ i {τ (x) x} X i E ζj {τ (y) y} X j n (x) n (y)π(x)π(y) dx dy Now use again the order of the integral in the expression of T 33 to obtain E(T 2 33) O(h 1/2 ) + O(nh 1/2 h 6 ) o(1) + O(nh 6.5 ) o(1). Since both the mean and the variance of T 33 tend to zero in probability, we can apply Chebyshev s inequality and obtain the desired T 33 o p (1). This completes the proof of (S.2) 65 S.2. Proof of equation (8) We sketch the proof of the asymptotc equivalence of T n and T n, that is, Tn T n o p (1). Consider T n T n nh 1/2 {ˆp(x) pˆθ(x)} 2 d{ ˆF (x) F (x)} nh 1/2 {ˆp(x) pˆθ(x)} 2 d{ ˆF (x) F (x)} +nh 1/2 {ˆp(x) pˆθ(x)} 2 d{ F (x) F (x)}, (S.6) where F (x) ˆF (x vs) dl(v), L( ) l(u) du, l is a symmetric kernel density function with mean zero and bounded support, and s s n is a bandwidth sequence controlling the smoothness of F (x). We first show sup ˆF (x) F (x) o p (n 1/2 ), (S.7) x 7 assuming ns 4 as n, using arguments similar, but in fact much simpler, than those used in the proof of Lemma A.3 in a 217 preprint by Neumeyer and Van Keilegom on distribution functions of residuals entitled Bootstrap of residual processes in regression: to smooth or not to smooth?. Write F (x) ˆF (x) { ˆF (x vs) ˆF (x)} dl(v) { ˆF (x vs) ˆF (x) F (x vs) + F (x)} dl(v) + {F (x vs) F (x)} dl(v) n 1/2 {E n (x vs) E n (x)} dl(v) + {F (x vs) F (x)} dl(v),
7 Goodness-of-fit tests for the cure rate 7 where E n denotes the empirical process n 1/2 n i1 {1(X i ) F ( )}. Since E n is asymptotically equicontinuous, and since l has a bounded support, it follows that the first term has the 75 order o p (n 1/2 ), uniformly in x. To see that the second term has the order O p (s 2 ) o p (n 1/2 ), use the fact that f is bounded, that ns 4 o(1) and that the kernel l has mean zero. This proves (S.7). Consider the first term of (S.6). Using integration by parts, we can write it as the sum of two integrals.the first integral can be bounded by 2nh 1/2 ˆF (x) F (x) ˆp(x) pˆθ(x) ˆp (x) p ˆθ(x) dx. To determine the rate of ˆp(x) pˆθ(x), we only need to consider ˆp(x) p(x), because it is of larger order than pˆθ(x) p(x), which converges with the parametric root-n rate, thanks to Assumptions (A6) and (A7). As explained in the paragraph after equation (S.4), we have ˆp(x) p(x) O p {(nh) 1/2 (log n) 1/2 }. Further it can be shown that ˆp (x) p ˆθ(x) O p {(nh 3 ) 1/2 (log n) 1/2 }, uniformly in x. This combined with (S.7) gives the order nh 1/2 o p (n 1/2 ) O p {(nh) 1/2 (log n) 1/2 } O p {(nh 3 ) 1/2 (log n) 1/2 } o p {(nh 3 ) 1/2 log n} o p (1) for the above term. For the last step we used the assumptions on the bandwidth h h n in Theorem 1: we assume that nh 3 (log n) 5, which implies (nh 3 ) 1/2 log n O(1). The same arguments yield for the second integral the order nh 1/2 o p (n 1/2 ) [O p {(nh) 1/2 (log n) 1/2 }] 2, which is o p (1), since it is of smaller order than the first one. This yields the desired order o p (1) for the first term on the right-handside of (S.6). 8 The second term of (S.6) is nh 1/2 {ˆp(x) pˆθ(x)} 2 { } dx, where (ns) 1 n i1 l{(x i x)/s} is the standard kernel density estimator with O p {(ns) 1/2 (log n) 1/2 }, uniformly in x. It is of order nh 1/2 (nh) 1 log n (ns) 1/2 (log n) 1/2 n 1/2 h 1/2 s 1/2 (log n) 3/2 o(1), if s is chosen such that nhs(log n) 3. Hence both terms of (S.6) are of order o p (1), which completes the proof of T n T n o p (1). REFERENCES DE JONG, P. (1987). A central limit theorem for generalized quadratic forms. Probab. Theory Related Fields., 75, DU, Y. AND AKRITAS, M.G. (22). Uniform strong representation of the conditional Kaplan Meier process. Math. Methods Statist., 11, HÄRDLE, W. AND MAMMEN, E. (1993) Comparing nonparametric versus parametric regression fits. Ann. Statist., 21, VAN KEILEGOM, I. AND VERAVERBEKE, N. (1997). Estimation and bootstrap with censored data in fixed design 9 nonparametric regression. Ann. Inst. Statist. Math., 49, XU, J. AND PENG, Y. (214). Nonparametric cure rate estimation with covariates. Canad. J. Statist., 42, [Received 2 January 217. Editorial decision on 1 April 217]
Goodness-of-fit tests for the cure rate in a mixture cure model
Biometrika (217), 13, 1, pp. 1 17 Printed in Great Britain Advance Access publication on 31 July 216 Goodness-of-fit tests for the cure rate in a mixture cure model BY U.U. MÜLLER Department of Statistics,
More informationDEPARTMENT MATHEMATIK ARBEITSBEREICH MATHEMATISCHE STATISTIK UND STOCHASTISCHE PROZESSE
Estimating the error distribution in nonparametric multiple regression with applications to model testing Natalie Neumeyer & Ingrid Van Keilegom Preprint No. 2008-01 July 2008 DEPARTMENT MATHEMATIK ARBEITSBEREICH
More informationEstimation of the Bivariate and Marginal Distributions with Censored Data
Estimation of the Bivariate and Marginal Distributions with Censored Data Michael Akritas and Ingrid Van Keilegom Penn State University and Eindhoven University of Technology May 22, 2 Abstract Two new
More informationDensity estimators for the convolution of discrete and continuous random variables
Density estimators for the convolution of discrete and continuous random variables Ursula U Müller Texas A&M University Anton Schick Binghamton University Wolfgang Wefelmeyer Universität zu Köln Abstract
More informationPointwise convergence rates and central limit theorems for kernel density estimators in linear processes
Pointwise convergence rates and central limit theorems for kernel density estimators in linear processes Anton Schick Binghamton University Wolfgang Wefelmeyer Universität zu Köln Abstract Convergence
More informationBootstrap of residual processes in regression: to smooth or not to smooth?
Bootstrap of residual processes in regression: to smooth or not to smooth? arxiv:1712.02685v1 [math.st] 7 Dec 2017 Natalie Neumeyer Ingrid Van Keilegom December 8, 2017 Abstract In this paper we consider
More informationSize and Shape of Confidence Regions from Extended Empirical Likelihood Tests
Biometrika (2014),,, pp. 1 13 C 2014 Biometrika Trust Printed in Great Britain Size and Shape of Confidence Regions from Extended Empirical Likelihood Tests BY M. ZHOU Department of Statistics, University
More informationModel Specification Testing in Nonparametric and Semiparametric Time Series Econometrics. Jiti Gao
Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics Jiti Gao Department of Statistics School of Mathematics and Statistics The University of Western Australia Crawley
More information1 Glivenko-Cantelli type theorems
STA79 Lecture Spring Semester Glivenko-Cantelli type theorems Given i.i.d. observations X,..., X n with unknown distribution function F (t, consider the empirical (sample CDF ˆF n (t = I [Xi t]. n Then
More informationModelling Non-linear and Non-stationary Time Series
Modelling Non-linear and Non-stationary Time Series Chapter 2: Non-parametric methods Henrik Madsen Advanced Time Series Analysis September 206 Henrik Madsen (02427 Adv. TS Analysis) Lecture Notes September
More informationLikelihood ratio confidence bands in nonparametric regression with censored data
Likelihood ratio confidence bands in nonparametric regression with censored data Gang Li University of California at Los Angeles Department of Biostatistics Ingrid Van Keilegom Eindhoven University of
More informationNonparametric Regression. Changliang Zou
Nonparametric Regression Institute of Statistics, Nankai University Email: nk.chlzou@gmail.com Smoothing parameter selection An overall measure of how well m h (x) performs in estimating m(x) over x (0,
More informationSUPPLEMENTARY MATERIAL FOR PUBLICATION ONLINE 1
SUPPLEMENTARY MATERIAL FOR PUBLICATION ONLINE 1 B Technical details B.1 Variance of ˆf dec in the ersmooth case We derive the variance in the case where ϕ K (t) = ( 1 t 2) κ I( 1 t 1), i.e. we take K as
More informationOn variable bandwidth kernel density estimation
JSM 04 - Section on Nonparametric Statistics On variable bandwidth kernel density estimation Janet Nakarmi Hailin Sang Abstract In this paper we study the ideal variable bandwidth kernel estimator introduced
More informationEstimation of a quadratic regression functional using the sinc kernel
Estimation of a quadratic regression functional using the sinc kernel Nicolai Bissantz Hajo Holzmann Institute for Mathematical Stochastics, Georg-August-University Göttingen, Maschmühlenweg 8 10, D-37073
More informationMinimax Rate of Convergence for an Estimator of the Functional Component in a Semiparametric Multivariate Partially Linear Model.
Minimax Rate of Convergence for an Estimator of the Functional Component in a Semiparametric Multivariate Partially Linear Model By Michael Levine Purdue University Technical Report #14-03 Department of
More informationBickel Rosenblatt test
University of Latvia 28.05.2011. A classical Let X 1,..., X n be i.i.d. random variables with a continuous probability density function f. Consider a simple hypothesis H 0 : f = f 0 with a significance
More informationStatistical Modeling and Analysis for Survival Data with a Cure Fraction
Statistical Modeling and Analysis for Survival Data with a Cure Fraction by Jianfeng Xu A thesis submitted to the Department of Mathematics and Statistics in conformity with the requirements for the degree
More informationAsymptotic Nonequivalence of Nonparametric Experiments When the Smoothness Index is ½
University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 1998 Asymptotic Nonequivalence of Nonparametric Experiments When the Smoothness Index is ½ Lawrence D. Brown University
More informationA Bootstrap Test for Conditional Symmetry
ANNALS OF ECONOMICS AND FINANCE 6, 51 61 005) A Bootstrap Test for Conditional Symmetry Liangjun Su Guanghua School of Management, Peking University E-mail: lsu@gsm.pku.edu.cn and Sainan Jin Guanghua School
More informationOberwolfach Preprints
Oberwolfach Preprints OWP 202-07 LÁSZLÓ GYÖFI, HAO WALK Strongly Consistent Density Estimation of egression esidual Mathematisches Forschungsinstitut Oberwolfach ggmbh Oberwolfach Preprints (OWP) ISSN
More informationSMOOTHED BLOCK EMPIRICAL LIKELIHOOD FOR QUANTILES OF WEAKLY DEPENDENT PROCESSES
Statistica Sinica 19 (2009), 71-81 SMOOTHED BLOCK EMPIRICAL LIKELIHOOD FOR QUANTILES OF WEAKLY DEPENDENT PROCESSES Song Xi Chen 1,2 and Chiu Min Wong 3 1 Iowa State University, 2 Peking University and
More informationGOODNESS OF FIT TESTS FOR PARAMETRIC REGRESSION MODELS BASED ON EMPIRICAL CHARACTERISTIC FUNCTIONS
K Y B E R N E T I K A V O L U M E 4 5 2 0 0 9, N U M B E R 6, P A G E S 9 6 0 9 7 1 GOODNESS OF FIT TESTS FOR PARAMETRIC REGRESSION MODELS BASED ON EMPIRICAL CHARACTERISTIC FUNCTIONS Marie Hušková and
More informationNonparametric Function Estimation with Infinite-Order Kernels
Nonparametric Function Estimation with Infinite-Order Kernels Arthur Berg Department of Statistics, University of Florida March 15, 2008 Kernel Density Estimation (IID Case) Let X 1,..., X n iid density
More informationConvergence rates in weighted L 1 spaces of kernel density estimators for linear processes
Alea 4, 117 129 (2008) Convergence rates in weighted L 1 spaces of kernel density estimators for linear processes Anton Schick and Wolfgang Wefelmeyer Anton Schick, Department of Mathematical Sciences,
More informationStatistical Properties of Numerical Derivatives
Statistical Properties of Numerical Derivatives Han Hong, Aprajit Mahajan, and Denis Nekipelov Stanford University and UC Berkeley November 2010 1 / 63 Motivation Introduction Many models have objective
More informationGenerated Covariates in Nonparametric Estimation: A Short Review.
Generated Covariates in Nonparametric Estimation: A Short Review. Enno Mammen, Christoph Rothe, and Melanie Schienle Abstract In many applications, covariates are not observed but have to be estimated
More informationSmooth nonparametric estimation of a quantile function under right censoring using beta kernels
Smooth nonparametric estimation of a quantile function under right censoring using beta kernels Chanseok Park 1 Department of Mathematical Sciences, Clemson University, Clemson, SC 29634 Short Title: Smooth
More informationVariance Function Estimation in Multivariate Nonparametric Regression
Variance Function Estimation in Multivariate Nonparametric Regression T. Tony Cai 1, Michael Levine Lie Wang 1 Abstract Variance function estimation in multivariate nonparametric regression is considered
More informationTESTING FOR THE EQUALITY OF k REGRESSION CURVES
Statistica Sinica 17(2007, 1115-1137 TESTNG FOR THE EQUALTY OF k REGRESSON CURVES Juan Carlos Pardo-Fernández, ngrid Van Keilegom and Wenceslao González-Manteiga Universidade de Vigo, Université catholique
More informationNonparametric Econometrics
Applied Microeconometrics with Stata Nonparametric Econometrics Spring Term 2011 1 / 37 Contents Introduction The histogram estimator The kernel density estimator Nonparametric regression estimators Semi-
More information12 - Nonparametric Density Estimation
ST 697 Fall 2017 1/49 12 - Nonparametric Density Estimation ST 697 Fall 2017 University of Alabama Density Review ST 697 Fall 2017 2/49 Continuous Random Variables ST 697 Fall 2017 3/49 1.0 0.8 F(x) 0.6
More informationEXTENDING THE SCOPE OF EMPIRICAL LIKELIHOOD
The Annals of Statistics 2009, Vol. 37, No. 3, 1079 1111 DOI: 10.1214/07-AOS555 Institute of Mathematical Statistics, 2009 EXTENDING THE SCOPE OF EMPIRICAL LIKELIHOOD BY NILS LID HJORT, IAN W. MCKEAGUE
More informationUniversity, Tempe, Arizona, USA b Department of Mathematics and Statistics, University of New. Mexico, Albuquerque, New Mexico, USA
This article was downloaded by: [University of New Mexico] On: 27 September 2012, At: 22:13 Publisher: Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 1072954 Registered
More informationSupplement to Quantile-Based Nonparametric Inference for First-Price Auctions
Supplement to Quantile-Based Nonparametric Inference for First-Price Auctions Vadim Marmer University of British Columbia Artyom Shneyerov CIRANO, CIREQ, and Concordia University August 30, 2010 Abstract
More informationSupplemental Material for KERNEL-BASED INFERENCE IN TIME-VARYING COEFFICIENT COINTEGRATING REGRESSION. September 2017
Supplemental Material for KERNEL-BASED INFERENCE IN TIME-VARYING COEFFICIENT COINTEGRATING REGRESSION By Degui Li, Peter C. B. Phillips, and Jiti Gao September 017 COWLES FOUNDATION DISCUSSION PAPER NO.
More informationTime Series and Forecasting Lecture 4 NonLinear Time Series
Time Series and Forecasting Lecture 4 NonLinear Time Series Bruce E. Hansen Summer School in Economics and Econometrics University of Crete July 23-27, 2012 Bruce Hansen (University of Wisconsin) Foundations
More informationThe Central Limit Theorem Under Random Truncation
The Central Limit Theorem Under Random Truncation WINFRIED STUTE and JANE-LING WANG Mathematical Institute, University of Giessen, Arndtstr., D-3539 Giessen, Germany. winfried.stute@math.uni-giessen.de
More informationIntroduction. Linear Regression. coefficient estimates for the wage equation: E(Y X) = X 1 β X d β d = X β
Introduction - Introduction -2 Introduction Linear Regression E(Y X) = X β +...+X d β d = X β Example: Wage equation Y = log wages, X = schooling (measured in years), labor market experience (measured
More informationNonparametric Identi cation and Estimation of Truncated Regression Models with Heteroskedasticity
Nonparametric Identi cation and Estimation of Truncated Regression Models with Heteroskedasticity Songnian Chen a, Xun Lu a, Xianbo Zhou b and Yahong Zhou c a Department of Economics, Hong Kong University
More informationEfficiency of Profile/Partial Likelihood in the Cox Model
Efficiency of Profile/Partial Likelihood in the Cox Model Yuichi Hirose School of Mathematics, Statistics and Operations Research, Victoria University of Wellington, New Zealand Summary. This paper shows
More informationOn robust and efficient estimation of the center of. Symmetry.
On robust and efficient estimation of the center of symmetry Howard D. Bondell Department of Statistics, North Carolina State University Raleigh, NC 27695-8203, U.S.A (email: bondell@stat.ncsu.edu) Abstract
More informationRegularity of the density for the stochastic heat equation
Regularity of the density for the stochastic heat equation Carl Mueller 1 Department of Mathematics University of Rochester Rochester, NY 15627 USA email: cmlr@math.rochester.edu David Nualart 2 Department
More informationEstimation of density level sets with a given probability content
Estimation of density level sets with a given probability content Benoît CADRE a, Bruno PELLETIER b, and Pierre PUDLO c a IRMAR, ENS Cachan Bretagne, CNRS, UEB Campus de Ker Lann Avenue Robert Schuman,
More informationNonparametric Density Estimation (Multidimension)
Nonparametric Density Estimation (Multidimension) Härdle, Müller, Sperlich, Werwarz, 1995, Nonparametric and Semiparametric Models, An Introduction Tine Buch-Kromann February 19, 2007 Setup One-dimensional
More information11 Survival Analysis and Empirical Likelihood
11 Survival Analysis and Empirical Likelihood The first paper of empirical likelihood is actually about confidence intervals with the Kaplan-Meier estimator (Thomas and Grunkmeier 1979), i.e. deals with
More informationLocal Polynomial Regression
VI Local Polynomial Regression (1) Global polynomial regression We observe random pairs (X 1, Y 1 ),, (X n, Y n ) where (X 1, Y 1 ),, (X n, Y n ) iid (X, Y ). We want to estimate m(x) = E(Y X = x) based
More informationEfficiency for heteroscedastic regression with responses missing at random
Efficiency for heteroscedastic regression with responses missing at random Ursula U. Müller Department of Statistics Texas A&M University College Station, TX 77843-343 USA Anton Schick Department of Mathematical
More informationGOODNESS-OF-FIT TEST FOR RANDOMLY CENSORED DATA BASED ON MAXIMUM CORRELATION. Ewa Strzalkowska-Kominiak and Aurea Grané (1)
Working Paper 4-2 Statistics and Econometrics Series (4) July 24 Departamento de Estadística Universidad Carlos III de Madrid Calle Madrid, 26 2893 Getafe (Spain) Fax (34) 9 624-98-49 GOODNESS-OF-FIT TEST
More informationNonparametric Regression. Badr Missaoui
Badr Missaoui Outline Kernel and local polynomial regression. Penalized regression. We are given n pairs of observations (X 1, Y 1 ),...,(X n, Y n ) where Y i = r(x i ) + ε i, i = 1,..., n and r(x) = E(Y
More informationAsymptotic normality of conditional distribution estimation in the single index model
Acta Univ. Sapientiae, Mathematica, 9, 207 62 75 DOI: 0.55/ausm-207-000 Asymptotic normality of conditional distribution estimation in the single index model Diaa Eddine Hamdaoui Laboratory of Stochastic
More informationLecture 7 Introduction to Statistical Decision Theory
Lecture 7 Introduction to Statistical Decision Theory I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw December 20, 2016 1 / 55 I-Hsiang Wang IT Lecture 7
More informationSome Theories about Backfitting Algorithm for Varying Coefficient Partially Linear Model
Some Theories about Backfitting Algorithm for Varying Coefficient Partially Linear Model 1. Introduction Varying-coefficient partially linear model (Zhang, Lee, and Song, 2002; Xia, Zhang, and Tong, 2004;
More informationδ -method and M-estimation
Econ 2110, fall 2016, Part IVb Asymptotic Theory: δ -method and M-estimation Maximilian Kasy Department of Economics, Harvard University 1 / 40 Example Suppose we estimate the average effect of class size
More informationEstimating the inter-arrival time density of Markov renewal processes under structural assumptions on the transition distribution
Estimating the inter-arrival time density of Markov renewal processes under structural assumptions on the transition distribution Priscilla E. Greenwood School of Mathematics and Statistics and Mathematical
More informationUNIVERSIDADE DE SANTIAGO DE COMPOSTELA DEPARTAMENTO DE ESTATÍSTICA E INVESTIGACIÓN OPERATIVA
UNIVERSIDADE DE SANTIAGO DE COMPOSTELA DEPARTAMENTO DE ESTATÍSTICA E INVESTIGACIÓN OPERATIVA GOODNESS OF FIT TEST FOR LINEAR REGRESSION MODELS WITH MISSING RESPONSE DATA González Manteiga, W. and Pérez
More informationMATH 220 solution to homework 4
MATH 22 solution to homework 4 Problem. Define v(t, x) = u(t, x + bt), then v t (t, x) a(x + u bt) 2 (t, x) =, t >, x R, x2 v(, x) = f(x). It suffices to show that v(t, x) F = max y R f(y). We consider
More informationTransformation and Smoothing in Sample Survey Data
Scandinavian Journal of Statistics, Vol. 37: 496 513, 2010 doi: 10.1111/j.1467-9469.2010.00691.x Published by Blackwell Publishing Ltd. Transformation and Smoothing in Sample Survey Data YANYUAN MA Department
More informationarxiv: v1 [stat.me] 28 Apr 2016
An Adaptive-to-Model Test for Parametric Single-Index Errors-in-Variables Models Hira L. Koul, Chuanlong Xie, Lixing Zhu Michigan State University, USA Hong Kong Baptist University, Hong Kong, China Abstract
More informationMotivational Example
Motivational Example Data: Observational longitudinal study of obesity from birth to adulthood. Overall Goal: Build age-, gender-, height-specific growth charts (under 3 year) to diagnose growth abnomalities.
More informationIntroduction to Empirical Processes and Semiparametric Inference Lecture 02: Overview Continued
Introduction to Empirical Processes and Semiparametric Inference Lecture 02: Overview Continued Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and Operations Research
More informationDESIGN-ADAPTIVE MINIMAX LOCAL LINEAR REGRESSION FOR LONGITUDINAL/CLUSTERED DATA
Statistica Sinica 18(2008), 515-534 DESIGN-ADAPTIVE MINIMAX LOCAL LINEAR REGRESSION FOR LONGITUDINAL/CLUSTERED DATA Kani Chen 1, Jianqing Fan 2 and Zhezhen Jin 3 1 Hong Kong University of Science and Technology,
More informationPart IB Statistics. Theorems with proof. Based on lectures by D. Spiegelhalter Notes taken by Dexter Chua. Lent 2015
Part IB Statistics Theorems with proof Based on lectures by D. Spiegelhalter Notes taken by Dexter Chua Lent 2015 These notes are not endorsed by the lecturers, and I have modified them (often significantly)
More informationLocal linear multiple regression with variable. bandwidth in the presence of heteroscedasticity
Local linear multiple regression with variable bandwidth in the presence of heteroscedasticity Azhong Ye 1 Rob J Hyndman 2 Zinai Li 3 23 January 2006 Abstract: We present local linear estimator with variable
More informationTest for Discontinuities in Nonparametric Regression
Communications of the Korean Statistical Society Vol. 15, No. 5, 2008, pp. 709 717 Test for Discontinuities in Nonparametric Regression Dongryeon Park 1) Abstract The difference of two one-sided kernel
More informationCURRENT STATUS LINEAR REGRESSION. By Piet Groeneboom and Kim Hendrickx Delft University of Technology and Hasselt University
CURRENT STATUS LINEAR REGRESSION By Piet Groeneboom and Kim Hendrickx Delft University of Technology and Hasselt University We construct n-consistent and asymptotically normal estimates for the finite
More informationA new method of nonparametric density estimation
A new method of nonparametric density estimation Andrey Pepelyshev Cardi December 7, 2011 1/32 A. Pepelyshev A new method of nonparametric density estimation Contents Introduction A new density estimate
More informationAsymptotic statistics using the Functional Delta Method
Quantiles, Order Statistics and L-Statsitics TU Kaiserslautern 15. Februar 2015 Motivation Functional The delta method introduced in chapter 3 is an useful technique to turn the weak convergence of random
More informationCONCEPT OF DENSITY FOR FUNCTIONAL DATA
CONCEPT OF DENSITY FOR FUNCTIONAL DATA AURORE DELAIGLE U MELBOURNE & U BRISTOL PETER HALL U MELBOURNE & UC DAVIS 1 CONCEPT OF DENSITY IN FUNCTIONAL DATA ANALYSIS The notion of probability density for a
More informationEMPIRICAL EDGEWORTH EXPANSION FOR FINITE POPULATION STATISTICS. I. M. Bloznelis. April Introduction
EMPIRICAL EDGEWORTH EXPANSION FOR FINITE POPULATION STATISTICS. I M. Bloznelis April 2000 Abstract. For symmetric asymptotically linear statistics based on simple random samples, we construct the one-term
More informationUNIVERSITÄT POTSDAM Institut für Mathematik
UNIVERSITÄT POTSDAM Institut für Mathematik Testing the Acceleration Function in Life Time Models Hannelore Liero Matthias Liero Mathematische Statistik und Wahrscheinlichkeitstheorie Universität Potsdam
More informationAdditive functionals of infinite-variance moving averages. Wei Biao Wu The University of Chicago TECHNICAL REPORT NO. 535
Additive functionals of infinite-variance moving averages Wei Biao Wu The University of Chicago TECHNICAL REPORT NO. 535 Departments of Statistics The University of Chicago Chicago, Illinois 60637 June
More informationA dimension reduction approach for conditional Kaplan-Meier estimators
A dimension reduction approach for conditional Kaplan-Meier estimators Weiyu Li and Valentin Patilea January 2015 Abstract In many situations, the quantities of interest in survival analysis are smooth,
More informationDiscrete uniform limit law for additive functions on shifted primes
Nonlinear Analysis: Modelling and Control, Vol. 2, No. 4, 437 447 ISSN 392-53 http://dx.doi.org/0.5388/na.206.4. Discrete uniform it law for additive functions on shifted primes Gediminas Stepanauskas,
More informationA Central Limit Theorem for the Sum of Generalized Linear and Quadratic Forms. by R. L. Eubank and Suojin Wang Texas A&M University
A Central Limit Theorem for the Sum of Generalized Linear and Quadratic Forms by R. L. Eubank and Suojin Wang Texas A&M University ABSTRACT. A central limit theorem is established for the sum of stochastically
More informationEstimating the density of a possibly missing response variable in nonlinear regression
Estimating the density of a possibly missing response variable in nonlinear regression Ursula U. Müller 1 Department of Statistics Texas A&M University College Station, TX 77843-3143 USA Abstract This
More informationSDS : Theoretical Statistics
SDS 384 11: Theoretical Statistics Lecture 1: Introduction Purnamrita Sarkar Department of Statistics and Data Science The University of Texas at Austin https://psarkar.github.io/teaching Manegerial Stuff
More information1 Introduction. 2 Measure theoretic definitions
1 Introduction These notes aim to recall some basic definitions needed for dealing with random variables. Sections to 5 follow mostly the presentation given in chapter two of [1]. Measure theoretic definitions
More informationECO Class 6 Nonparametric Econometrics
ECO 523 - Class 6 Nonparametric Econometrics Carolina Caetano Contents 1 Nonparametric instrumental variable regression 1 2 Nonparametric Estimation of Average Treatment Effects 3 2.1 Asymptotic results................................
More informationEstimators for the binomial distribution that dominate the MLE in terms of Kullback Leibler risk
Ann Inst Stat Math (0) 64:359 37 DOI 0.007/s0463-00-036-3 Estimators for the binomial distribution that dominate the MLE in terms of Kullback Leibler risk Paul Vos Qiang Wu Received: 3 June 009 / Revised:
More informationb n x n + b n 1 x n b 1 x + b 0
Math Partial Fractions Stewart 7.4 Integrating basic rational functions. For a function f(x), we have examined several algebraic methods for finding its indefinite integral (antiderivative) F (x) = f(x)
More informationDA Freedman Notes on the MLE Fall 2003
DA Freedman Notes on the MLE Fall 2003 The object here is to provide a sketch of the theory of the MLE. Rigorous presentations can be found in the references cited below. Calculus. Let f be a smooth, scalar
More informationNonparametric Regression Härdle, Müller, Sperlich, Werwarz, 1995, Nonparametric and Semiparametric Models, An Introduction
Härdle, Müller, Sperlich, Werwarz, 1995, Nonparametric and Semiparametric Models, An Introduction Tine Buch-Kromann Univariate Kernel Regression The relationship between two variables, X and Y where m(
More informationError distribution function for parametrically truncated and censored data
Error distribution function for parametrically truncated and censored data Géraldine LAURENT Jointly with Cédric HEUCHENNE QuantOM, HEC-ULg Management School - University of Liège Friday, 14 September
More informationOPTIMAL POINTWISE ADAPTIVE METHODS IN NONPARAMETRIC ESTIMATION 1
The Annals of Statistics 1997, Vol. 25, No. 6, 2512 2546 OPTIMAL POINTWISE ADAPTIVE METHODS IN NONPARAMETRIC ESTIMATION 1 By O. V. Lepski and V. G. Spokoiny Humboldt University and Weierstrass Institute
More informationStatistica Sinica Preprint No: SS
Statistica Sinica Preprint No: SS-017-0013 Title A Bootstrap Method for Constructing Pointwise and Uniform Confidence Bands for Conditional Quantile Functions Manuscript ID SS-017-0013 URL http://wwwstatsinicaedutw/statistica/
More informationEstimation of the functional Weibull-tail coefficient
1/ 29 Estimation of the functional Weibull-tail coefficient Stéphane Girard Inria Grenoble Rhône-Alpes & LJK, France http://mistis.inrialpes.fr/people/girard/ June 2016 joint work with Laurent Gardes,
More informationDefect Detection using Nonparametric Regression
Defect Detection using Nonparametric Regression Siana Halim Industrial Engineering Department-Petra Christian University Siwalankerto 121-131 Surabaya- Indonesia halim@petra.ac.id Abstract: To compare
More informationSpline Density Estimation and Inference with Model-Based Penalities
Spline Density Estimation and Inference with Model-Based Penalities December 7, 016 Abstract In this paper we propose model-based penalties for smoothing spline density estimation and inference. These
More informationNonparametric regression with rescaled time series errors
Nonparametric regression with rescaled time series errors José E. Figueroa-López Michael Levine Abstract We consider a heteroscedastic nonparametric regression model with an autoregressive error process
More informationMath 152. Rumbos Fall Solutions to Assignment #12
Math 52. umbos Fall 2009 Solutions to Assignment #2. Suppose that you observe n iid Bernoulli(p) random variables, denoted by X, X 2,..., X n. Find the LT rejection region for the test of H o : p p o versus
More informationLimit theory and inference about conditional distributions
Limit theory and inference about conditional distributions Purevdorj Tuvaandorj and Victoria Zinde-Walsh McGill University, CIREQ and CIRANO purevdorj.tuvaandorj@mail.mcgill.ca victoria.zinde-walsh@mcgill.ca
More informationOn a Nonparametric Notion of Residual and its Applications
On a Nonparametric Notion of Residual and its Applications Bodhisattva Sen and Gábor Székely arxiv:1409.3886v1 [stat.me] 12 Sep 2014 Columbia University and National Science Foundation September 16, 2014
More informationUSING A BIMODAL KERNEL FOR A NONPARAMETRIC REGRESSION SPECIFICATION TEST
Statistica Sinica 25 (2015), 1145-1161 doi:http://dx.doi.org/10.5705/ss.2014.008 USING A BIMODAL KERNEL FOR A NONPARAMETRIC REGRESSION SPECIFICATION TEST Cheolyong Park 1, Tae Yoon Kim 1, Jeongcheol Ha
More informationVARIANCE REDUCTION BY SMOOTHING REVISITED BRIAN J E R S K Y (POMONA)
PROBABILITY AND MATHEMATICAL STATISTICS Vol. 33, Fasc. 1 2013), pp. 79 92 VARIANCE REDUCTION BY SMOOTHING REVISITED BY ANDRZEJ S. KO Z E K SYDNEY) AND BRIAN J E R S K Y POMONA) Abstract. Smoothing is a
More informationExercises. (a) Prove that m(t) =
Exercises 1. Lack of memory. Verify that the exponential distribution has the lack of memory property, that is, if T is exponentially distributed with parameter λ > then so is T t given that T > t for
More informationAsymptotics for posterior hazards
Asymptotics for posterior hazards Pierpaolo De Blasi University of Turin 10th August 2007, BNR Workshop, Isaac Newton Intitute, Cambridge, UK Joint work with Giovanni Peccati (Université Paris VI) and
More informationKernel Methods. Lecture 4: Maximum Mean Discrepancy Thanks to Karsten Borgwardt, Malte Rasch, Bernhard Schölkopf, Jiayuan Huang, Arthur Gretton
Kernel Methods Lecture 4: Maximum Mean Discrepancy Thanks to Karsten Borgwardt, Malte Rasch, Bernhard Schölkopf, Jiayuan Huang, Arthur Gretton Alexander J. Smola Statistical Machine Learning Program Canberra,
More informationMath 162: Calculus IIA
Math 62: Calculus IIA Final Exam ANSWERS December 9, 26 Part A. (5 points) Evaluate the integral x 4 x 2 dx Substitute x 2 cos θ: x 8 cos dx θ ( 2 sin θ) dθ 4 x 2 2 sin θ 8 cos θ dθ 8 cos 2 θ cos θ dθ
More informationQuantile Regression for Residual Life and Empirical Likelihood
Quantile Regression for Residual Life and Empirical Likelihood Mai Zhou email: mai@ms.uky.edu Department of Statistics, University of Kentucky, Lexington, KY 40506-0027, USA Jong-Hyeon Jeong email: jeong@nsabp.pitt.edu
More information