I N S T I T U T D E S T A T I S T I Q U E B I O S T A T I S T I Q U E E T S C I E N C E S A C T U A R I E L L E S (I S B A)

Size: px
Start display at page:

Download "I N S T I T U T D E S T A T I S T I Q U E B I O S T A T I S T I Q U E E T S C I E N C E S A C T U A R I E L L E S (I S B A)"

Transcription

1 I N S T I T U T D E S T A T I S T I Q U E B I O S T A T I S T I Q U E E T S C I E N C E S A C T U A R I E L L E S (I S B A) UNIVERSITÉ CATHOLIQUE DE LOUVAIN D I S C U S S I O N P A P E R 2011/2 ESTIMATION OF THE ERROR DENSITY IN A SEMIPARAMETRIC TRANSFORMATION MODEL SAMB, R., HEUCHENNE, C. and I. VAN KEILEGOM

2 Estimation of the Error Density in a Semiparametric Transformation Model Rawane Sam Université catholique de Louvain Cédric Heuchenne University of Liège and Université catholique de Louvain Ingrid Van Keilegom Université catholique de Louvain Septemer 27, 2011 Astract Consider the semiparametric transformation model Λ θo (Y ) = m(x) + ε, where θ o is an unknown finite dimensional parameter, the functions Λ θo and m are smooth, ε is independent of X, and E(ε) = 0. We propose a kernel-type estimator of the density of the error ε, and prove its asymptotic normality. The estimated errors, which lie at the asis of this estimator, are otained from a profile likelihood estimator of θ o and a nonparametric kernel estimator of m. The practical performance of the proposed density estimator is evaluated in a simulation study. Key Words: Density estimation; Kernel smoothing; Nonparametric regression; Profile likelihood; Transformation model. R. Sam acknowledges financial support from IAP research network P6/0 of the Belgian Government (Belgian Science Policy). C. Heuchenne acknowledges financial support from IAP research network P6/0 of the Belgian Government (Belgian Science Policy), and from the contract Projet d Actions de Recherche Concertées (ARC) 11/16-09 of the Communauté française de Belgique, granted y the Académie universitaire Louvain. I. Van Keilegom acknowledges financial support from IAP research network P6/0 of the Belgian Government (Belgian Science Policy), from the European Research Council under the European Community s Seventh Framework Programme (FP7/ ) / ERC Grant agreement No , and from the contract Projet d Actions de Recherche Concertées (ARC) 11/16-09 of the Communauté française de Belgique, granted y the Académie universitaire Louvain. 1

3 1 Introduction Let (X 1, Y 1 ),..., (X n, Y n ) e independent replicates of the random vector (X, Y ), where Y is a univariate dependent variale and X is a one-dimensional covariate. We assume that Y and X are related via the semiparametric transformation model Λ θo (Y ) = m(x) + ε, (1.1) where ε is independent of X and has mean zero. We assume that {Λ θ : θ Θ} (with Θ R p compact) is a parametric family of strictly increasing functions defined on an unounded suset D in R, while m is the unknown regression function, elonging to an infinite dimensional parameter set M. We assume that M is a space of functions endowed with the norm M =. We denote θ o Θ and m M for the true unknown finite and infinite dimensional parameters. Define the regression function m θ (x) = E[Λ θ (Y ) X = x], for each θ Θ, and let ε θ = ε(θ) = Λ θ (Y ) m θ (X). In this paper, we are interested in the estimation of the proaility density function (p.d.f.) f ε of the residual term ε = Λ θo (Y ) m(x). To this end, we first otain the estimators θ and m θ of the parameter θ o and the function m θ, and second, form the semiparametric regression residuals ε i ( θ) = Λ θ(y i ) m θ(x i ). To estimate θ o we use a profile likelihood (PL) approach, developed in Linton, Sperlich and Van Keilegom (2008), whereas m θ is estimated y means of a Nadaraya-Watson-type estimator (Nadaraya, 1964, Watson, 1964). To our knowledge, the estimation of the density of ε in model (1.1) has not yet een investigated in the statistical literature. This estimation may e very useful in various regression prolems. Indeed, taking transformations of the data may induce normality and error variance homogeneity in the transformed model. So the estimation of the error density in the transformed model may e used for testing these hypotheses. Taking transformations of the data has een an important part of statistical practice for many years. A major contriution to this methodology was made y Box and Cox (1964), who proposed a parametric power family of transformations that includes the logarithm and the identity. They suggested that the power transformation, when applied to the dependent variale in a linear regression model, might induce normality and homoscedasticity. Lots of effort has een devoted to the investigation of the Box-Cox transformation since its introduction. See, for example, Amemiya (1985), Horowitz (1998), Chen, Lockhart and Stephens (2002), Shin (2008), and Fitzenerger, Wilke and Zhang (2010). Other dependent variale transformations have een suggested, for example, the Zellner and Revankar (1969) transform and the Bickel and Doksum 2

4 (1981) transform. The transformation methodology has een quite successful and a large literature exists on this topic for parametric models. See Carroll and Ruppert (1988) and Sakia (1992) and references therein. The estimation of (functionals of) the error distriution and density under simplified versions of model (1.1) has received considerale attention in the statistical literature in recent years. Consider e.g. model (1.1) ut with Λ θo id, i.e. the response is not transformed. Under this model, Escanciano and Jacho- Chavez (2010) considered the estimation of the (marginal) density of the response Y via the estimation of the error density. Akritas and Van Keilegom (2001) estimated the cumulative distriution function of the regression error in a heteroscedastic model with univariate covariates. The estimator they proposed is ased on nonparametrically estimated regression residuals. The weak convergence of their estimator was proved. The results otained y Akritas and Van Keilegom (2001) have een generalized y Neumeyer and Van Keilegom (2010) to the case of multivariate covariates. Müller, Schick and Wefelmeyer (2004) investigated linear functionals of the error distriution in nonparametric regression. Cheng (2005) estalished the asymptotic normality of an estimator of the error density ased on estimated residuals. The estimator he proposed is constructed y splitting the sample into two parts: the first part is used for the estimation of the residuals, while the second part of the sample is used for the construction of the error density estimator. Efromovich (2005) proposed an adaptive estimator of the error density, ased on a density estimator proposed y Pinsker (1980). Finally, Sam (2010) also considered the estimation of the error density, ut his approach is more closely related to the one in Akritas and Van Keilegom (2001). In order to achieve the ojective of this paper, namely the estimation of the error density under model (1.1), we first need to estimate the transformation parameter θ o. To this end, we make use of the results in Linton, Sperlich and Van Keilegom (2008). In the latter paper, the authors first discuss the nonparametric identification of model (1.1), and second, estimate the transformation parameter θ o under the considered model. For the estimation of this parameter, they propose two approaches. The first approach uses a semiparametric profile likelihood (PL) estimator, while the second is ased on a mean squared distance from independence-estimator (MD) using the estimated distriutions of X, ε and (X, ε). Linton, Sperlich and Van Keilegom (2008) derived the asymptotic distriutions of their estimators under certain regularity conditions, and proved that oth estimators of θ o are root-n consistent. The authors also showed that, in practice, the performance of the PL method is etter than that of the MD approach. For this reason, the PL method will e considered in this paper for the estimation of θ o. The rest of the paper is organized as follows. Section 2 presents our estimator of the error density and groups some notations and technical assumptions. Section descries the asymptotic results of the paper.

5 A simulation study is given in Section 4, while Section 5 is devoted to some general conclusions. Finally, the proofs of the asymptotic results are collected in Section 6. 2 Definitions and assumptions 2.1 Construction of the estimators The approach proposed here for the estimation of f ε is ased on a two-steps procedure. In a first step, we estimate the finite dimensional parameter θ o. This parameter is estimated y the profile likelihood (PL) method, developed in Linton, Sperlich and Van Keilegom (2008). The asic idea of this method is to replace all unknown expressions in the likelihood function y their nonparametric kernel estimates. Under model (1.1), we have P (Y y X) = P (Λ θo (Y ) Λ θo (y) X) = P (ε θo Λ θo (y) m θo (X) X) = F ε (Λ θo (y) m θo (X)). Here, F ε (t) = P(ε t), and so f Y X (y x) = f ε (Λ θo (y) m θo (x)) Λ θ o (y), where f ε and f Y X are the densities of ε, and of Y given X, respectively. Then, the log likelihood function is where f εθ {log f εθ (Λ θ (Y i ) m θ (X i )) + log Λ θ(y i )}, θ Θ, is the density function of ε θ. Now, let ( ) n j=1 Λ Xj x θ(y j )K 1 m θ (x) = n j=1 K 1 ( Xj x h h ) (2.1) e the Nadaraya-Watson estimator of m θ (x), and let f εθ (t) = 1 ng ( ) εi (θ) t K 2. (2.2) g where ε i (θ) = Λ θ (Y i ) m θ (X i ). Here, K 1 and K 2 are kernel functions and h and g are andwidth sequences. Then, the PL estimator of θ o is defined y [ θ = arg max log f ] εθ (Λ θ (Y i ) m θ (X i )) + log Λ θ(y i ). (2.) θ Θ Recall that m θ (X i ) converges to m θ (X i ) at a slower rate for those X i which are close to the oundary of the support X of the covariate X. That is why we assume implicitly that the proposed estimator (2.) of θ o 4

6 trims the oservations X i outside a suset X 0 of X. Note that we keep the root-n consistency of θ proved in Linton, Sperlich and Van Keilegom (2008) y trimming the covariates outside X 0. But in this case, the resulting asymptotic variance is different to the one otained in the latter paper. In a second step, we use the aove estimator θ to uild the estimated residuals ε i ( θ) = Λ θ(y i ) m θ(x i ). Then, our proposed estimator f ε (t) of f ε (t) is defined y ( ) f ε (t) = 1 ε i ( θ) t K, (2.4) n where K is a kernel function and is a andwidth sequence, not necessarily the same as the kernel K 2 and the andwidth g used in (2.2). Oserve that this estimator is a feasile estimator in the sense that it does not depend on any unknown quantity, as is desirale in practice. This contrasts with the unfeasile ideal kernel estimator f ε (t) = 1 n ( ) εi t K, (2.5) which depends in particular on the unknown regression errors ε i = ε i (θ o ) = Λ θo (Y i ) m(x i ). It is however intuitively clear that f ε (t) and f ε (t) will e very close for n large enough, as will e illustrated y the results given in Section. 2.2 Notations When there is no amiguity, we use ε and m to indicate ε θo and m θo. Moreover, N (θ o ) represents a neighorhood of θ o. For the kernel K j (j = 1, 2, ), let µ(k j ) = v 2 K j (v)dv and let K (p) j e the pth derivative of K j. For any function ϕ θ (y), denote ϕ θ (y) = ϕ θ (y)/ θ = ( ϕ θ (y)/ θ 1,..., ϕ θ (y)/ θ p ) t and ϕ θ (y) = ϕ θ(y)/ y. Also, let A = (A t A) 1/2 e the Euclidean norm of any vector A. For any functions m, r, f, ϕ and q, and any θ Θ, let s = ( m, r, f, ϕ, q), s θ = (m θ, ṁ θ, f εθ, f ε θ, f εθ ), ε i (θ, m) = Λ θ (Y i ) m(x i ), and define { G n (θ, s) = n 1 n 1 f{ε i (θ, m)} G(θ, s) = E[G n (θ, s)] and G(θ o, s θo ) = θ G(θ, s θ) θ=θo. [ ϕ{ε i (θ, m)}{ Λ ] θ (Y i ) r(x i )} + q{ε i (θ, m)} + Λ } θ (Y i) Λ θ (Y, i) 2. Technical assumptions The assumptions we need for the asymptotic results are listed elow for convenient reference. 5

7 (A1) The function K j (j = 1, 2, ) is symmetric, has compact support, v k K j (v)dv = 0 for k = 1,..., q j 1 and v q j K j (v)dv 0 for some q j 4, K j is twice continuously differentiale, and K (1) (v)dv = 0. (A2) The andwidth sequences h, g and satisfy nh 2q 1 = o(1), ng 2q 2 = o(1) (where q 1 and q 2 are defined in (A1)), (n 5 ) 1 = O(1), n h 2 (log h 1 ) 2 and ng 6 (log g 1 ) 2. (A) (i) The support X of the covariate X is a compact suset of R, and X 0 is a suset with non empty interior, whose closure is in the interior of X. (ii) The density f X is ounded away from zero and infinity on X, and has continuous second order partial derivatives on X. (A4) The function m θ (x) is twice continuously differentiale with respect to θ on X N (θ 0 ), and the functions m θ (x) and ṁ θ (x) are q 1 times continuously differentiale with respect to x on X N (θ 0 ). All these derivatives are ounded, uniformly in (x, θ) X N (θ o ). (A5) The error ε = Λ θo (Y ) m(x) has finite fourth moment and is independent of X. (A6) The distriution F εθ (t) is q + 1 (respectively three) times continuously differentiale with respect to t (respectively θ), and sup θ,t k+l t k θ l θl p p F εθ (t) < for all k and l such that 0 k + l 2, where l = l l p and θ = (θ 1,..., θ p ) t. (A7) The transformation Λ θ (y) is three times continuously differentiale with respect to oth θ and y, and there exists a α > 0 such that E [ sup θ : θ θ α ] k+l y k θ l Λ θl p θ (Y ) p for all θ Θ, and for all k and l such that 0 k + l, where l = l l p and θ = (θ 1,..., θ p ) t. Moreover, sup x X E[ Λ 4 θ o (Y ) X = x] <. (A8) For all η > 0, there exists ɛ(η) > 0 such that < Moreover, the matrix G(θ o, s θo ) is non-singular. inf G(θ, s θ) ɛ(η) > 0. θ θ o >η (A9) (i) E(Λ θo (Y )) = 1, Λ θo (0) = 0 and the set {x X 0 : m (x) 0} has nonempty interior. 6

8 (ii) Assume that φ(x, t) = Λ θo (Λ 1 θ o (m(x) + t))f ε (t) is continuously differentiale with respect to t for all x and that for all t R and for some δ > 0. sup s: t s δ E φ (X, s) s <. (2.6) Assumptions (A1), part of (A2), (A)(ii), (A4) and (A6), (A7) and (A8) are used y Linton, Sperlich and Van Keilegom (2008) to show that the PL estimator θ of θ o is root n-consistent. The differentiaility of K j up to second order imposed in assumption (A1) is used to expand the two-steps kernel estimator f ε (t) in (2.4) around the unfeasile one f ε (t). Assumptions (A)(ii) and (A4) impose that all the functions to e estimated have ounded derivatives. The last assumption in (A2) is useful for otaining the uniform convergence of the Nadaraya-Watson estimator of m θo in (2.1) (see for instance Einmahl and Mason, 2005). This assumption is also needed in the study of the difference etween the feasile estimator f ε (t) and the unfeasile estimator f ε (t). Finally, (A9)(i) is needed for identifying the model (see Vanhems and Van Keilegom (2011)). Asymptotic results In this section we are interested in the asymptotic ehavior of the estimator f ε (t). To this end, we first investigate its asymptotic representation, which will e needed to show its asymptotic normality. Theorem.1. Assume (A1)-(A9). Then, f ε (t) f ε (t) = 1 n where R n (t) = o P ( (n) 1/2 ) for all t R. ( ) εi t K f ε (t) + R n (t), This result is important, since it shows that, provided the ias term is negligile, the estimation of θ o and m( ) has asymptotically no effect on the ehavior of the estimator f ε (t). Therefore, this estimator is asymptotically equivalent to the unfeasile estimator f ε (t), ased on the unknown true errors ε 1,..., ε n. Our next result gives the asymptotic normality of the estimator f ε (t). Theorem.2. Assume (A1)-(A9). In addition, assume that n 2q+1 = O(1). Then, ( ) ( d n f ε (t) f ε (t) N 0, f ε (t) ) K(v)dv 2, 7

9 where f ε (t) = f ε (t) + q q! f (q ) ε (t) v q K (v)dv. The proofs of Theorems.1 and.2 are given in Section 6. 4 Simulations In this section, we investigate the performance of our method for different models and different sample sizes. Consider where Λ θ is the Manly (1976) transformation e θy 1 θ, θ 0, Λ θ (y) = y, θ = 0, Λ θo (Y ) = X sin(πx) + σ e ε, (4.1) θ o [ 0.5, 1.5], X is uniformly distriuted on the interval [ 0.5, 0.5], and ε is independent of X and has a standard normal distriution ut restricted to the interval [, ]. We study three different model settings. For each of them, 0 = σ e + 2. The other parameters are chosen as follows: Model 1: 1 = 5, 2 = 2, σ e = 1.5; Model 2: 1 =.5, 2 = 1.5, σ e = 1; Model : 1 = 2.5, 2 = 1, σ e = 0.5. The parameters and the error distriution have een chosen in such a way that the transformation Λ θo (Y ) is positive, to avoid prolems when generating the variale Y. Our simulations are done for θ o = 0, 0.5 or 1. The estimator of θ o is chosen from a grid on the interval [ 0.5, 1.5] with step size We used ( the kernel K(x) = ) x ( x 1) for oth the regression function and the density estimators. The results are ased on 100 random samples of size n = 50 or n = 100, and we worked with the andwidths h = 0. n 1/5 and = g = r n, where r n = 1.06 std( ε) n 1/5, which is Silverman s (1986) rule of thum andwidth for univariate density estimation. Here std( ε) is the average of the standard deviations of ε over the 100 samples. Tale 1 shows the values of the mean, standard deviation and mean squared error of θ for the considered models, sample sizes and values of θ o. We oserve that the results for the different models are quite similar, and as expected, the results are etter for n = 100 than for n = 50. 8

10 Tale 2 shows the mean squared error (MSE) of the estimator f ε (t) of the standardized (pseudoestimated) error ε = (Λ θ(y ) m θ(x))/σ e, for sample sizes n = 50 and n = 100 and for t = 1, 0 and 1. Results for f ε (t) have also een otained, ut are not reported here. Indeed, Figure 1, displaying f ε (t), shows that, even though residuals are standardized for each simulation (with known σ e ), etter ehavior is oserved for models with smaller σ e. Moreover, we oserve that for θ o = 0 there is very little difference etween the curve of f ε and the one of the standard normal density. On the other hand for θ o = 0.5 and θ o = 1, we notice an important difference etween the two curves under Model 1 and 2, ut the difference is less important under Model. n θ o mean( θ) std( θ) MSE( θ) Model 1 Model 2 Model Model 1 Model 2 Model Model 1 Model 2 Model Tale 1: Approximation of the mean, the standard deviation and the mean squared error of θ for the three regression models. All numers are calculated ased on 100 random samples. 5 Conclusions In this paper we have studied the estimation of the density of the error in a semiparametric transformation model. The regression function in this model is unspecified (except for some smoothness assumptions), whereas the transformation (of the dependent variale in the model) is supposed to elong to a parametric family of monotone transformations. The proposed estimator is a kernel-type estimator, and we have shown its asymptotic normality. The finite sample performance of the estimator is illustrated y means of a simulation study. It would e interesting to explore various possile applications of the results in this paper. For example, one could use the results on the estimation of the error density to test hypotheses concerning e.g. the normality 9

11 of the errors, the homoscedasticity of the error variance, or the linearity of the regression function, all of which are important features in the context of transformation models. 6 Proofs Proof of Theorem.1. Write f ε (t) f ε (t) = [ f ε (t) f ε (t)] + [ f ε (t) f ε (t)], where f ε (t) = 1 n ( ) εi t K and ε i = Λ θo (Y i ) m θo (X i ), i = 1,..., n. In a completely similar way as was done for Lemma A.1 in Linton, Sperlich and Van Keilegom (2008), it can e shown that f ε (t) f ε (t) = 1 n ( ) εi t K f ε (t) + o P ((n) 1/2 ) (6.1) for all t R. Note that the remainder term in Lemma A.1 in the aove paper equals a sum of i.i.d. terms of mean zero, plus a o P (n 1/2 ) term. Hence, the remainder term in that paper is O P (n 1/2 ), whereas we write o P ((n) 1/2 ) in (6.1). Therefore, the result of the theorem follows if we prove that f ε (t) f ε (t) = o P ((n) 1/2 ). To this end, write f ε (t) f ε (t) 1 = n 2 ( ε i ( θ) ε i (θ o ))K (1) + 1 2n ( ε i ( θ) ε i (θ o )) 2 K (2) ( ) εi (θ o ) t ( ε i (θ o ) + β( ε i ( θ) ε i (θ o )) t for some β (0, 1). In what follows, we will show that each of the terms aove is o P ((n) 1/2 ). First consider the last term of (6.2). Since Λ θ (y) and m θ (x) are oth twice continuously differentiale with respect to θ, the second order Taylor expansion gives, for some θ 1 etween θ o and θ (to simplify the notations, we assume here that p = dim(θ) = 1), ), ε i ( θ) ε i (θ o ) = Λ θ(y i ) Λ θo (Y i ) ( m θ(x i ) m θo (X i ) ) = ( θ θ o )( Λ θo (Y i ) m θo (X i )) ( θ θ o ) 2 ( Λ θ1 (Y i ) m θ1 (X i )). 10

12 Therefore, since θ θ o = o P ((n) 1/2 ) y Theorem 4.1 in Linton, Sperlich and Van Keilegom (2008) (as efore, we work with a slower rate than what is shown in the latter paper, since this leads to weaker conditions on the andwidths), assumptions (A2) and (A7) imply that ( ) 1 n ( ε i ( θ) ε i (θ o )) 2 K (2) ε i (θ o ) + β( ε i ( θ) ε i (θ o )) t ( = O P (n ) 1), which is o P ((n) 1/2 ), since (n 5 ) 1 = O(1) under (A2). For the first term of (6.2), the decomposition of ε i ( θ) ε i (θ o ) given aove yields 1 n 2 ( ε i ( θ) ε i (θ o ))K (1) = ( θ θ o ) n 2 = ( θ θ o ) n 2 ( ) εi (θ o ) t ( Λ θo (Y i ) m θo (X i ))K (1) ( Λ θo (Y i ) ṁ θo (X i ))K (1) ( εi (θ o ) t ( εi t ) + o P ((n) 1/2 ) where the last equality follows from a Taylor expansion applied to K (1), the fact that m θo (x) ṁ θo (x) = O P ((nh) 1/2 (log h 1 ) 1/2 ), ) + o P ((n) 1/2 ), (6.2) uniformly in x X 0 y Lemma 6.1, and the fact that nh (log h 1 ) 1 under (A2). Further, write [ n ( ) ] E ( Λ θo (Y i ) ṁ θo (X i ))K (1) εi t [ ( )] = E Λ θo (Y i )K (1) εi t [ ( )] E [ṁ θo (X i )] E K (1) εi t = A n B n. We will only show that the first term aove is O(n 2 ) for any t R. The proof for the other term is similar. Let ϕ(x, t) = Λ θo (Λ 1 θ o (m(x)+t)) and set φ(x, t) = ϕ(x, t)f ε (t). Then, applying a Taylor expansion to φ(x, ), it follows that (for some β (0, 1)) [ ( A n = E Λ θo Λ 1 θ o (m(x i ) + ε i ) ) ( )] K (1) εi t ( ) = n φ(x, e)k (1) e t f X (x)dxde = n φ(x, t + v)k (1) (v)f X(x)dxdv [ = n φ(x, t) + v φ ] (x, t + βv) K (1) t (v)f X(x)dxdv = n 2 v φ (x, t + βv)k(1) t (v)f X(x)dxdv, 11

13 since K (1) (v)dv = 0, and this is ounded y Kn2 sup s: t s δ E φ s (X, s) = O(n2 ) y assumption (A9)(ii). Hence, Tcheychev s inequality ensures that ( θ θ o ) ( ) 2 ( Λ θo (Y i ) ṁ θo (X i ))K (1) εi t = ( θ θ o ) n 2 O P (n 2 + (n) 1/2 ) = o P ((n) 1/2 ), since n /2 y (A2). Sustituting this in (6.2), yields 1 ( ) n 2 ( ε i ( θ) ε i (θ o ))K (1) εi (θ o ) t = o P ((n) 1/2 ), for any t R. This completes the proof. Proof of Theorem.2. It follows from Theorem.1 that f ε (t) f ε (t) = [ f ε (t) E f ε (t)] + [E f ε (t) f ε (t)] + o P ((n) 1/2 ). (6.) The first term on the right hand side of (6.) is treated y Lyapounov s Central Limit Theorem (LCT) for triangular arrays (see e.g. Billingsley 1968, Theorem 7.). To this end, let f in (t) = 1 ( ) K εi t. Then, under (A1), (A2) and (A5) it can e easily shown that n E fin (t) E f in (t) Cn 2 f ε (t) K (v) dv + o ( n 2) ( n Var f ) /2 ( in (t) n 1 f ε (t) K(v)dv 2 + o ( = O((n) 1/2 ) = o(1), n 1)) /2 for some C > 0. Hence, the LCT ensures that This gives f ε (t) E f ε (t) Var f ε (t) = f ε (t) E f ε (t) Var f1n (t) ( n fε (t) E f ) ( d ε (t) N 0, f ε (t) n d N (0, 1). ) K(v)dv 2. (6.4) For the second term of (6.), straightforward calculations show that E f ε (t) f ε (t) = q q! f (q ) ε (t) v q K (v)dv + o( q ). Comining this with (6.4) and (6.), we otain the desired result. 12

14 Lemma 6.1. Assume (A1)-(A5) and (A7). Then, sup m θo (x) m θo (x) = O P ((nh) 1/2 (log h 1 ) 1/2 ), x X 0 sup m θo (x) ṁ θo (x) = O P ((nh) 1/2 (log h 1 ) 1/2 ). x X 0 Proof. We will only show the proof for m θo (x) ṁ θo (x), the proof for m θo (x) m θo (x) eing very similar. Let c n = (nh) 1/2 (log h 1 ) 1/2, and define r θo (x) = 1 nh j=1 Λ θo (Y j )K 1 ( Xj x h x X 0 ), ṙ θo (x) = E[ r θo (x)], f X (x) = E[ f X (x)], where f X (x) = (nh) 1 n j=1 K 1( Xj x h ). Then, sup m θo (x) ṁ θo (x) sup m θo (x) ṙθ o (x) x X 0 f X (x) + sup 1 ṙ θo (x) f X (x)ṁ θo (x). (6.5) x X 0 f X (x) Since E[ Λ 4 θ o (Y ) X = x] < uniformly in x X y assumption (A7), a similar proof as was given for Theorem 2 in Einmahl and Mason (2005) ensures that sup m θo (x) ṙθ o (x) f X (x) = O P (c n ). x X 0 Consider now the second term of (6.5). Since E[ ε(θ o ) X] = 0, where ε(θ o ) = d dθ (Λ θ(y ) m θ (X)) θ=θo, we have [ ( )] X x ṙ θo (x) = h 1 E {ṁ θo (X) + ε(θ o )} K 1 h [ ( )] X x = h 1 E ṁ θo (X)K 1 h = ṁ θo (x + hv)k 1 (v)f X (x + hv)dv, from which it follows that ṙ θo (x) f X (x)ṁ θo (x) = [ṁ θo (x + hv) ṁ θo (x)] K 1 (v)f X (x + hv)dv. Hence, a Taylor expansion applied to ṁ θo ( ) yields sup ṙθo (x) f X (x)ṁ θo (x) = O(h q1 ) = O (c n ), x X 0 since nh 2q 1+1 (log h 1 ) 1 = O(1) y (A2). This proves that the second term of (6.5) is O(c n ), since it can e easily shown that f X (x) is ounded away from 0 and infinity, uniformly in x X 0, using (A)(ii). 1

15 References [1] Akritas, M.G. and Van Keilegom, I. (2001). Non-parametric estimation of the residual distriution. Scand. J. Statist., 28, [2] Amemiya, T. (1985). Advanced Econometrics. Harvard University Press, Camridge. [] Bickel, P.J. and Doksum, K. (1981). An analysis of transformations revisited. J. Amer. Statist. Assoc., 76, [4] Billingsley, P. (1968). Convergence of Proaility Measures. Wiley. [5] Box, G.E.P. and Cox, D.R. (1964). An analysis of transformations. J. Roy. Statist. Soc. - Ser. B, 26, [6] Carroll, R.J. and Ruppert, D. (1988). Transformation and Weighting in Regression. Chapman and Hall, New York. [7] Chen, G., Lockhart, R.A. and Stephens, A. (2002). Box-Cox transformations in linear models: Large sample theory and tests of normality (with discussion). Canad. J. Statist, 0, [8] Cheng, F. (2005). Asymptotic distriutions of error density and distriution function estimators in nonparametric regression. J. Statist. Plann. Infer., 128, [9] Efromovich, S. (2005). Estimation of the density of the regression errors. Ann. Statist.,, [10] Einmahl, U. and Mason, D.M. (2005). Uniform in andwidth consistency of kernel-type function estimators. Ann. Statist.,, [11] Escanciano, J.C. and Jacho-Chavez, D. (2010). n-uniformly consistent density estimation in nonparametric regression models (sumitted). [12] Fitzenerger, B., Wilke, R.A. and Zhang, X. (2010). Implementing Box-Cox quantile regression. Econometric Rev., 29, [1] Horowitz, J.L. (1998). Semiparametric Methods in Economics. Springer-Verlag, New York. [14] Linton, O., Sperlich, S. and Van Keilegom, I. (2008). Estimation of a semiparametric transformation model. Ann. Statist., 6,

16 [15] Manly, B. F. (1976). Exponential data transformation. The Statistician, 25, [16] Müller, U.U., Schick, A. and Wefelmeyer, W. (2004). Estimating linear functionals of the error distriution in nonparametric regression. J. Statist. Plann. Infer., 119, [17] Nadaraya, E.A. (1964). On a regression estimate. Teor. Verojatnost. i Primenen, 9, [18] Neumeyer, N. and Van Keilegom, I. (2010). Estimating the error distriution in nonparametric multiple regression with applications to model testing. J. Multiv. Anal., 101, [19] Pinsker, M.S. (1980). Optimal filtering of a square integrale signal in Gaussian white noise. Prolems Inform. Transmission, 16, [20] Sakia, R.M. (1992). The Box-Cox transformation technique: a review. The Statistician, 41, [21] Sam, R. (2010). Contriution à l estimation nonparamétrique de la densité des erreurs de régression. Doctoral Thesis. Avalaile at http : //arxiv.org/p S cache/arxiv/pdf/1011/ v1.pdf. [22] Shin, Y. (2008). Semiparametric estimation of the Box-Cox transformation model. Econometrics J., 11, [2] Silverman, B.W. (1986). Density Estimation for Statistics and Data Analysis. Monographs on Statistics and Applied Proaility, Chapman and Hall, London. [24] Vanhems, A. and Van Keilegom, I. (2011). Semiparametric transformation model with endogeneity: a control function approach (sumitted). [25] Watson, G.S. (1964). Smooth regression analysis. Sankhya - Ser. A, 26, [26] Zellner, A. and Revankar, N.S. (1969). Generalized production functions. Rev. Economic Studies, 6,

17 n θ o t Mean Squared Error of f ε (t) Model 1 Model 2 Model Tale 2: Mean squared error of f ε (t) for three regression models. All numers are calculated ased on 100 random samples. 16

18 Model 1 Model 2 Model Densities Densities Densities Model 1 Model 2 Model Densities Densities Densities Model 1 Model 2 Model Densities Densities Densities Figure 1: Curves of the pointwise average of f ε over 100 random samples of size n = 100 (solid curve) and of the standard normal density (dashed curve) for θ o = 0 (first row), θ o = 0.5 (second row) and θ o = 1 (third row). 17

Estimation of the Error Density in a Semiparametric Transformation Model

Estimation of the Error Density in a Semiparametric Transformation Model Estimation of the Error Density in a Semiparametric Transformation Model Benjamin Colling Université catholique de Louvain Cédric Heuchenne University of Liège and Université catholique de Louvain Rawane

More information

DEPARTMENT MATHEMATIK ARBEITSBEREICH MATHEMATISCHE STATISTIK UND STOCHASTISCHE PROZESSE

DEPARTMENT MATHEMATIK ARBEITSBEREICH MATHEMATISCHE STATISTIK UND STOCHASTISCHE PROZESSE Estimating the error distribution in nonparametric multiple regression with applications to model testing Natalie Neumeyer & Ingrid Van Keilegom Preprint No. 2008-01 July 2008 DEPARTMENT MATHEMATIK ARBEITSBEREICH

More information

Estimation of a semiparametric transformation model in the presence of endogeneity

Estimation of a semiparametric transformation model in the presence of endogeneity TSE 654 May 2016 Estimation of a semiparametric transformation model in the presence of endogeneity Anne Vanhems and Ingrid Van Keilegom Estimation of a semiparametric transformation model in the presence

More information

Nonparametric Estimation of Smooth Conditional Distributions

Nonparametric Estimation of Smooth Conditional Distributions Nonparametric Estimation of Smooth Conditional Distriutions Bruce E. Hansen University of Wisconsin www.ssc.wisc.edu/~hansen May 24 Astract This paper considers nonparametric estimation of smooth conditional

More information

Estimating a Finite Population Mean under Random Non-Response in Two Stage Cluster Sampling with Replacement

Estimating a Finite Population Mean under Random Non-Response in Two Stage Cluster Sampling with Replacement Open Journal of Statistics, 07, 7, 834-848 http://www.scirp.org/journal/ojs ISS Online: 6-798 ISS Print: 6-78X Estimating a Finite Population ean under Random on-response in Two Stage Cluster Sampling

More information

D I S C U S S I O N P A P E R

D I S C U S S I O N P A P E R I N S T I T U T D E S T A T I S T I Q U E B I O S T A T I S T I Q U E E T S C I E N C E S A C T U A R I E L L E S ( I S B A ) UNIVERSITÉ CATHOLIQUE DE LOUVAIN D I S C U S S I O N P A P E R 2014/06 Adaptive

More information

Nonparametric Econometrics

Nonparametric Econometrics Applied Microeconometrics with Stata Nonparametric Econometrics Spring Term 2011 1 / 37 Contents Introduction The histogram estimator The kernel density estimator Nonparametric regression estimators Semi-

More information

Generalized Seasonal Tapered Block Bootstrap

Generalized Seasonal Tapered Block Bootstrap Generalized Seasonal Tapered Block Bootstrap Anna E. Dudek 1, Efstathios Paparoditis 2, Dimitris N. Politis 3 Astract In this paper a new lock ootstrap method for periodic times series called Generalized

More information

EVALUATIONS OF EXPECTED GENERALIZED ORDER STATISTICS IN VARIOUS SCALE UNITS

EVALUATIONS OF EXPECTED GENERALIZED ORDER STATISTICS IN VARIOUS SCALE UNITS APPLICATIONES MATHEMATICAE 9,3 (), pp. 85 95 Erhard Cramer (Oldenurg) Udo Kamps (Oldenurg) Tomasz Rychlik (Toruń) EVALUATIONS OF EXPECTED GENERALIZED ORDER STATISTICS IN VARIOUS SCALE UNITS Astract. We

More information

On a Nonparametric Notion of Residual and its Applications

On a Nonparametric Notion of Residual and its Applications On a Nonparametric Notion of Residual and its Applications Bodhisattva Sen and Gábor Székely arxiv:1409.3886v1 [stat.me] 12 Sep 2014 Columbia University and National Science Foundation September 16, 2014

More information

Single Index Quantile Regression for Heteroscedastic Data

Single Index Quantile Regression for Heteroscedastic Data Single Index Quantile Regression for Heteroscedastic Data E. Christou M. G. Akritas Department of Statistics The Pennsylvania State University JSM, 2015 E. Christou, M. G. Akritas (PSU) SIQR JSM, 2015

More information

HIGH-DIMENSIONAL GRAPHS AND VARIABLE SELECTION WITH THE LASSO

HIGH-DIMENSIONAL GRAPHS AND VARIABLE SELECTION WITH THE LASSO The Annals of Statistics 2006, Vol. 34, No. 3, 1436 1462 DOI: 10.1214/009053606000000281 Institute of Mathematical Statistics, 2006 HIGH-DIMENSIONAL GRAPHS AND VARIABLE SELECTION WITH THE LASSO BY NICOLAI

More information

Goodness-of-fit tests for the cure rate in a mixture cure model

Goodness-of-fit tests for the cure rate in a mixture cure model Biometrika (217), 13, 1, pp. 1 7 Printed in Great Britain Advance Access publication on 31 July 216 Goodness-of-fit tests for the cure rate in a mixture cure model BY U.U. MÜLLER Department of Statistics,

More information

Density estimators for the convolution of discrete and continuous random variables

Density estimators for the convolution of discrete and continuous random variables Density estimators for the convolution of discrete and continuous random variables Ursula U Müller Texas A&M University Anton Schick Binghamton University Wolfgang Wefelmeyer Universität zu Köln Abstract

More information

FinQuiz Notes

FinQuiz Notes Reading 9 A time series is any series of data that varies over time e.g. the quarterly sales for a company during the past five years or daily returns of a security. When assumptions of the regression

More information

Single Index Quantile Regression for Heteroscedastic Data

Single Index Quantile Regression for Heteroscedastic Data Single Index Quantile Regression for Heteroscedastic Data E. Christou M. G. Akritas Department of Statistics The Pennsylvania State University SMAC, November 6, 2015 E. Christou, M. G. Akritas (PSU) SIQR

More information

Estimation in semiparametric models with missing data

Estimation in semiparametric models with missing data Ann Inst Stat Math (2013) 65:785 805 DOI 10.1007/s10463-012-0393-6 Estimation in semiparametric models with missing data Song Xi Chen Ingrid Van Keilegom Received: 7 May 2012 / Revised: 22 October 2012

More information

Transformation and Smoothing in Sample Survey Data

Transformation and Smoothing in Sample Survey Data Scandinavian Journal of Statistics, Vol. 37: 496 513, 2010 doi: 10.1111/j.1467-9469.2010.00691.x Published by Blackwell Publishing Ltd. Transformation and Smoothing in Sample Survey Data YANYUAN MA Department

More information

Minimax Rate of Convergence for an Estimator of the Functional Component in a Semiparametric Multivariate Partially Linear Model.

Minimax Rate of Convergence for an Estimator of the Functional Component in a Semiparametric Multivariate Partially Linear Model. Minimax Rate of Convergence for an Estimator of the Functional Component in a Semiparametric Multivariate Partially Linear Model By Michael Levine Purdue University Technical Report #14-03 Department of

More information

Estimation of the Bivariate and Marginal Distributions with Censored Data

Estimation of the Bivariate and Marginal Distributions with Censored Data Estimation of the Bivariate and Marginal Distributions with Censored Data Michael Akritas and Ingrid Van Keilegom Penn State University and Eindhoven University of Technology May 22, 2 Abstract Two new

More information

High Dimensional Empirical Likelihood for Generalized Estimating Equations with Dependent Data

High Dimensional Empirical Likelihood for Generalized Estimating Equations with Dependent Data High Dimensional Empirical Likelihood for Generalized Estimating Equations with Dependent Data Song Xi CHEN Guanghua School of Management and Center for Statistical Science, Peking University Department

More information

Large Sample Properties of Estimators in the Classical Linear Regression Model

Large Sample Properties of Estimators in the Classical Linear Regression Model Large Sample Properties of Estimators in the Classical Linear Regression Model 7 October 004 A. Statement of the classical linear regression model The classical linear regression model can be written in

More information

Smoothing the Nelson-Aalen Estimtor Biostat 277 presentation Chi-hong Tseng

Smoothing the Nelson-Aalen Estimtor Biostat 277 presentation Chi-hong Tseng Smoothing the Nelson-Aalen Estimtor Biostat 277 presentation Chi-hong seng Reference: 1. Andersen, Borgan, Gill, and Keiding (1993). Statistical Model Based on Counting Processes, Springer-Verlag, p.229-255

More information

Nonparametric Modal Regression

Nonparametric Modal Regression Nonparametric Modal Regression Summary In this article, we propose a new nonparametric modal regression model, which aims to estimate the mode of the conditional density of Y given predictors X. The nonparametric

More information

ON THE COMPARISON OF BOUNDARY AND INTERIOR SUPPORT POINTS OF A RESPONSE SURFACE UNDER OPTIMALITY CRITERIA. Cross River State, Nigeria

ON THE COMPARISON OF BOUNDARY AND INTERIOR SUPPORT POINTS OF A RESPONSE SURFACE UNDER OPTIMALITY CRITERIA. Cross River State, Nigeria ON THE COMPARISON OF BOUNDARY AND INTERIOR SUPPORT POINTS OF A RESPONSE SURFACE UNDER OPTIMALITY CRITERIA Thomas Adidaume Uge and Stephen Seastian Akpan, Department Of Mathematics/Statistics And Computer

More information

I N S T I T U T D E S T A T I S T I Q U E B I O S T A T I S T I Q U E E T S C I E N C E S A C T U A R I E L L E S (I S B A)

I N S T I T U T D E S T A T I S T I Q U E B I O S T A T I S T I Q U E E T S C I E N C E S A C T U A R I E L L E S (I S B A) I N S T I T U T D E S T A T I S T I Q U E B I O S T A T I S T I Q U E E T S C I E N C E S A C T U A R I E L L E S I S B A UNIVERSITÉ CATHOLIQUE DE LOUVAIN D I S C U S S I O N P A P E R 20/4 SEMIPARAMETRIC

More information

ISSN Asymptotic Confidence Bands for Density and Regression Functions in the Gaussian Case

ISSN Asymptotic Confidence Bands for Density and Regression Functions in the Gaussian Case Journal Afrika Statistika Journal Afrika Statistika Vol 5, N,, page 79 87 ISSN 85-35 Asymptotic Confidence Bs for Density egression Functions in the Gaussian Case Nahima Nemouchi Zaher Mohdeb Department

More information

I N S T I T U T D E S T A T I S T I Q U E

I N S T I T U T D E S T A T I S T I Q U E I N S T I T U T D E S T A T I S T I Q U E UNIVERSITÉ CATHOLIQUE DE LOUVAIN D I S C U S S I O N P A P E R 0610 ESTIMATION OF A SEMIPARAMETRIC TRANSFORMATION MODEL O. LINTON, S. SPERLICH and I. VAN KEILEGOM

More information

STRONG NORMALITY AND GENERALIZED COPELAND ERDŐS NUMBERS

STRONG NORMALITY AND GENERALIZED COPELAND ERDŐS NUMBERS #A INTEGERS 6 (206) STRONG NORMALITY AND GENERALIZED COPELAND ERDŐS NUMBERS Elliot Catt School of Mathematical and Physical Sciences, The University of Newcastle, Callaghan, New South Wales, Australia

More information

Minimizing a convex separable exponential function subject to linear equality constraint and bounded variables

Minimizing a convex separable exponential function subject to linear equality constraint and bounded variables Minimizing a convex separale exponential function suect to linear equality constraint and ounded variales Stefan M. Stefanov Department of Mathematics Neofit Rilski South-Western University 2700 Blagoevgrad

More information

Asymptotic distributions of nonparametric regression estimators for longitudinal or functional data

Asymptotic distributions of nonparametric regression estimators for longitudinal or functional data Journal of Multivariate Analysis 98 (2007) 40 56 www.elsevier.com/locate/jmva Asymptotic distriutions of nonparametric regression estimators for longitudinal or functional data Fang Yao Department of Statistics,

More information

Econ 582 Nonparametric Regression

Econ 582 Nonparametric Regression Econ 582 Nonparametric Regression Eric Zivot May 28, 2013 Nonparametric Regression Sofarwehaveonlyconsideredlinearregressionmodels = x 0 β + [ x ]=0 [ x = x] =x 0 β = [ x = x] [ x = x] x = β The assume

More information

Discussion of the paper Inference for Semiparametric Models: Some Questions and an Answer by Bickel and Kwon

Discussion of the paper Inference for Semiparametric Models: Some Questions and an Answer by Bickel and Kwon Discussion of the paper Inference for Semiparametric Models: Some Questions and an Answer by Bickel and Kwon Jianqing Fan Department of Statistics Chinese University of Hong Kong AND Department of Statistics

More information

Density estimation Nonparametric conditional mean estimation Semiparametric conditional mean estimation. Nonparametrics. Gabriel Montes-Rojas

Density estimation Nonparametric conditional mean estimation Semiparametric conditional mean estimation. Nonparametrics. Gabriel Montes-Rojas 0 0 5 Motivation: Regression discontinuity (Angrist&Pischke) Outcome.5 1 1.5 A. Linear E[Y 0i X i] 0.2.4.6.8 1 X Outcome.5 1 1.5 B. Nonlinear E[Y 0i X i] i 0.2.4.6.8 1 X utcome.5 1 1.5 C. Nonlinearity

More information

UNIVERSITY OF CALIFORNIA Spring Economics 241A Econometrics

UNIVERSITY OF CALIFORNIA Spring Economics 241A Econometrics DEPARTMENT OF ECONOMICS R. Smith, J. Powell UNIVERSITY OF CALIFORNIA Spring 2006 Economics 241A Econometrics This course will cover nonlinear statistical models for the analysis of cross-sectional and

More information

Generated Covariates in Nonparametric Estimation: A Short Review.

Generated Covariates in Nonparametric Estimation: A Short Review. Generated Covariates in Nonparametric Estimation: A Short Review. Enno Mammen, Christoph Rothe, and Melanie Schienle Abstract In many applications, covariates are not observed but have to be estimated

More information

Additive Isotonic Regression

Additive Isotonic Regression Additive Isotonic Regression Enno Mammen and Kyusang Yu 11. July 2006 INTRODUCTION: We have i.i.d. random vectors (Y 1, X 1 ),..., (Y n, X n ) with X i = (X1 i,..., X d i ) and we consider the additive

More information

Spiking problem in monotone regression : penalized residual sum of squares

Spiking problem in monotone regression : penalized residual sum of squares Spiking prolem in monotone regression : penalized residual sum of squares Jayanta Kumar Pal 12 SAMSI, NC 27606, U.S.A. Astract We consider the estimation of a monotone regression at its end-point, where

More information

arxiv: v1 [cs.gt] 4 May 2015

arxiv: v1 [cs.gt] 4 May 2015 Econometrics for Learning Agents DENIS NEKIPELOV, University of Virginia, denis@virginia.edu VASILIS SYRGKANIS, Microsoft Research, vasy@microsoft.com EVA TARDOS, Cornell University, eva.tardos@cornell.edu

More information

Bootstrap of residual processes in regression: to smooth or not to smooth?

Bootstrap of residual processes in regression: to smooth or not to smooth? Bootstrap of residual processes in regression: to smooth or not to smooth? arxiv:1712.02685v1 [math.st] 7 Dec 2017 Natalie Neumeyer Ingrid Van Keilegom December 8, 2017 Abstract In this paper we consider

More information

FIDUCIAL INFERENCE: AN APPROACH BASED ON BOOTSTRAP TECHNIQUES

FIDUCIAL INFERENCE: AN APPROACH BASED ON BOOTSTRAP TECHNIQUES U.P.B. Sci. Bull., Series A, Vol. 69, No. 1, 2007 ISSN 1223-7027 FIDUCIAL INFERENCE: AN APPROACH BASED ON BOOTSTRAP TECHNIQUES H.-D. HEIE 1, C-tin TÂRCOLEA 2, Adina I. TARCOLEA 3, M. DEMETRESCU 4 În prima

More information

Introduction. Linear Regression. coefficient estimates for the wage equation: E(Y X) = X 1 β X d β d = X β

Introduction. Linear Regression. coefficient estimates for the wage equation: E(Y X) = X 1 β X d β d = X β Introduction - Introduction -2 Introduction Linear Regression E(Y X) = X β +...+X d β d = X β Example: Wage equation Y = log wages, X = schooling (measured in years), labor market experience (measured

More information

A New Test in Parametric Linear Models with Nonparametric Autoregressive Errors

A New Test in Parametric Linear Models with Nonparametric Autoregressive Errors A New Test in Parametric Linear Models with Nonparametric Autoregressive Errors By Jiti Gao 1 and Maxwell King The University of Western Australia and Monash University Abstract: This paper considers a

More information

Computation of an efficient and robust estimator in a semiparametric mixture model

Computation of an efficient and robust estimator in a semiparametric mixture model Journal of Statistical Computation and Simulation ISSN: 0094-9655 (Print) 1563-5163 (Online) Journal homepage: http://www.tandfonline.com/loi/gscs20 Computation of an efficient and robust estimator in

More information

Nonparametric Methods

Nonparametric Methods Nonparametric Methods Michael R. Roberts Department of Finance The Wharton School University of Pennsylvania July 28, 2009 Michael R. Roberts Nonparametric Methods 1/42 Overview Great for data analysis

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 24 Paper 153 A Note on Empirical Likelihood Inference of Residual Life Regression Ying Qing Chen Yichuan

More information

M- and Z- theorems; GMM and Empirical Likelihood Wellner; 5/13/98, 1/26/07, 5/08/09, 6/14/2010

M- and Z- theorems; GMM and Empirical Likelihood Wellner; 5/13/98, 1/26/07, 5/08/09, 6/14/2010 M- and Z- theorems; GMM and Empirical Likelihood Wellner; 5/13/98, 1/26/07, 5/08/09, 6/14/2010 Z-theorems: Notation and Context Suppose that Θ R k, and that Ψ n : Θ R k, random maps Ψ : Θ R k, deterministic

More information

A converse Gaussian Poincare-type inequality for convex functions

A converse Gaussian Poincare-type inequality for convex functions Statistics & Proaility Letters 44 999 28 290 www.elsevier.nl/locate/stapro A converse Gaussian Poincare-type inequality for convex functions S.G. Bokov a;, C. Houdre ; ;2 a Department of Mathematics, Syktyvkar

More information

Testing near or at the Boundary of the Parameter Space (Job Market Paper)

Testing near or at the Boundary of the Parameter Space (Job Market Paper) Testing near or at the Boundary of the Parameter Space (Jo Market Paper) Philipp Ketz Brown University Novemer 7, 24 Statistical inference aout a scalar parameter is often performed using the two-sided

More information

Simple Examples. Let s look at a few simple examples of OI analysis.

Simple Examples. Let s look at a few simple examples of OI analysis. Simple Examples Let s look at a few simple examples of OI analysis. Example 1: Consider a scalar prolem. We have one oservation y which is located at the analysis point. We also have a ackground estimate

More information

NONPARAMETRIC ENDOGENOUS POST-STRATIFICATION ESTIMATION

NONPARAMETRIC ENDOGENOUS POST-STRATIFICATION ESTIMATION Statistica Sinica 2011): Preprint 1 NONPARAMETRIC ENDOGENOUS POST-STRATIFICATION ESTIMATION Mark Dahlke 1, F. Jay Breidt 1, Jean D. Opsomer 1 and Ingrid Van Keilegom 2 1 Colorado State University and 2

More information

Bayesian inference with reliability methods without knowing the maximum of the likelihood function

Bayesian inference with reliability methods without knowing the maximum of the likelihood function Bayesian inference with reliaility methods without knowing the maximum of the likelihood function Wolfgang Betz a,, James L. Beck, Iason Papaioannou a, Daniel Strau a a Engineering Risk Analysis Group,

More information

INTRINSIC PALINDROMES

INTRINSIC PALINDROMES Antonio J. Di Scala Politecnico di Torino, Dipartamento di Matematica, Corso Duca degli Aruzzi, 24-10129 Torino, Italy e-mail: antonio.discala@polito.it Martín Somra Université de Lyon 1, Laoratoire de

More information

Nonparametric Regression. Changliang Zou

Nonparametric Regression. Changliang Zou Nonparametric Regression Institute of Statistics, Nankai University Email: nk.chlzou@gmail.com Smoothing parameter selection An overall measure of how well m h (x) performs in estimating m(x) over x (0,

More information

TIGHT BOUNDS FOR THE FIRST ORDER MARCUM Q-FUNCTION

TIGHT BOUNDS FOR THE FIRST ORDER MARCUM Q-FUNCTION TIGHT BOUNDS FOR THE FIRST ORDER MARCUM Q-FUNCTION Jiangping Wang and Dapeng Wu Department of Electrical and Computer Engineering University of Florida, Gainesville, FL 3611 Correspondence author: Prof.

More information

Oberwolfach Preprints

Oberwolfach Preprints Oberwolfach Preprints OWP 202-07 LÁSZLÓ GYÖFI, HAO WALK Strongly Consistent Density Estimation of egression esidual Mathematisches Forschungsinstitut Oberwolfach ggmbh Oberwolfach Preprints (OWP) ISSN

More information

DISCUSSION PAPER 2016/43

DISCUSSION PAPER 2016/43 I N S T I T U T D E S T A T I S T I Q U E B I O S T A T I S T I Q U E E T S C I E N C E S A C T U A R I E L L E S ( I S B A ) DISCUSSION PAPER 2016/43 Bounds on Kendall s Tau for Zero-Inflated Continuous

More information

Bayesian estimation of bandwidths for a nonparametric regression model with a flexible error density

Bayesian estimation of bandwidths for a nonparametric regression model with a flexible error density ISSN 1440-771X Australia Department of Econometrics and Business Statistics http://www.buseco.monash.edu.au/depts/ebs/pubs/wpapers/ Bayesian estimation of bandwidths for a nonparametric regression model

More information

arxiv: v1 [math.st] 4 Apr 2008

arxiv: v1 [math.st] 4 Apr 2008 The Annals of Statistics 2008, Vol. 36, No. 2, 686 718 DOI: 10.1214/009053607000000848 c Institute of Mathematical Statistics, 2008 arxiv:0804.0719v1 [math.st] 4 Apr 2008 ESTIMATION OF A SEMIPARAMETRIC

More information

41903: Introduction to Nonparametrics

41903: Introduction to Nonparametrics 41903: Notes 5 Introduction Nonparametrics fundamentally about fitting flexible models: want model that is flexible enough to accommodate important patterns but not so flexible it overspecializes to specific

More information

Supplement to Quantile-Based Nonparametric Inference for First-Price Auctions

Supplement to Quantile-Based Nonparametric Inference for First-Price Auctions Supplement to Quantile-Based Nonparametric Inference for First-Price Auctions Vadim Marmer University of British Columbia Artyom Shneyerov CIRANO, CIREQ, and Concordia University August 30, 2010 Abstract

More information

Fixed-b Inference for Testing Structural Change in a Time Series Regression

Fixed-b Inference for Testing Structural Change in a Time Series Regression Fixed- Inference for esting Structural Change in a ime Series Regression Cheol-Keun Cho Michigan State University imothy J. Vogelsang Michigan State University August 29, 24 Astract his paper addresses

More information

Expansion formula using properties of dot product (analogous to FOIL in algebra): u v 2 u v u v u u 2u v v v u 2 2u v v 2

Expansion formula using properties of dot product (analogous to FOIL in algebra): u v 2 u v u v u u 2u v v v u 2 2u v v 2 Least squares: Mathematical theory Below we provide the "vector space" formulation, and solution, of the least squares prolem. While not strictly necessary until we ring in the machinery of matrix algera,

More information

On Universality of Blow-up Profile for L 2 critical nonlinear Schrödinger Equation

On Universality of Blow-up Profile for L 2 critical nonlinear Schrödinger Equation On Universality of Blow-up Profile for L critical nonlinear Schrödinger Equation Frank Merle,, Pierre Raphael Université de Cergy Pontoise Institut Universitaire de France Astract We consider finite time

More information

SEMIPARAMETRIC ESTIMATION OF CONDITIONAL HETEROSCEDASTICITY VIA SINGLE-INDEX MODELING

SEMIPARAMETRIC ESTIMATION OF CONDITIONAL HETEROSCEDASTICITY VIA SINGLE-INDEX MODELING Statistica Sinica 3 (013), 135-155 doi:http://dx.doi.org/10.5705/ss.01.075 SEMIPARAMERIC ESIMAION OF CONDIIONAL HEEROSCEDASICIY VIA SINGLE-INDEX MODELING Liping Zhu, Yuexiao Dong and Runze Li Shanghai

More information

Asymptotic inference for a nonstationary double ar(1) model

Asymptotic inference for a nonstationary double ar(1) model Asymptotic inference for a nonstationary double ar() model By SHIQING LING and DONG LI Department of Mathematics, Hong Kong University of Science and Technology, Hong Kong maling@ust.hk malidong@ust.hk

More information

Critical value of the total debt in view of the debts. durations

Critical value of the total debt in view of the debts. durations Critical value of the total det in view of the dets durations I.A. Molotov, N.A. Ryaova N.V.Pushov Institute of Terrestrial Magnetism, the Ionosphere and Radio Wave Propagation, Russian Academy of Sciences,

More information

Econ 273B Advanced Econometrics Spring

Econ 273B Advanced Econometrics Spring Econ 273B Advanced Econometrics Spring 2005-6 Aprajit Mahajan email: amahajan@stanford.edu Landau 233 OH: Th 3-5 or by appt. This is a graduate level course in econometrics. The rst part of the course

More information

13 Endogeneity and Nonparametric IV

13 Endogeneity and Nonparametric IV 13 Endogeneity and Nonparametric IV 13.1 Nonparametric Endogeneity A nonparametric IV equation is Y i = g (X i ) + e i (1) E (e i j i ) = 0 In this model, some elements of X i are potentially endogenous,

More information

Estimation of a quadratic regression functional using the sinc kernel

Estimation of a quadratic regression functional using the sinc kernel Estimation of a quadratic regression functional using the sinc kernel Nicolai Bissantz Hajo Holzmann Institute for Mathematical Stochastics, Georg-August-University Göttingen, Maschmühlenweg 8 10, D-37073

More information

Statistica Sinica Preprint No: SS

Statistica Sinica Preprint No: SS Statistica Sinica Preprint No: SS-017-0013 Title A Bootstrap Method for Constructing Pointwise and Uniform Confidence Bands for Conditional Quantile Functions Manuscript ID SS-017-0013 URL http://wwwstatsinicaedutw/statistica/

More information

Bahadur representations for bootstrap quantiles 1

Bahadur representations for bootstrap quantiles 1 Bahadur representations for bootstrap quantiles 1 Yijun Zuo Department of Statistics and Probability, Michigan State University East Lansing, MI 48824, USA zuo@msu.edu 1 Research partially supported by

More information

TESTING FOR THE EQUALITY OF k REGRESSION CURVES

TESTING FOR THE EQUALITY OF k REGRESSION CURVES Statistica Sinica 17(2007, 1115-1137 TESTNG FOR THE EQUALTY OF k REGRESSON CURVES Juan Carlos Pardo-Fernández, ngrid Van Keilegom and Wenceslao González-Manteiga Universidade de Vigo, Université catholique

More information

Efficiency of Profile/Partial Likelihood in the Cox Model

Efficiency of Profile/Partial Likelihood in the Cox Model Efficiency of Profile/Partial Likelihood in the Cox Model Yuichi Hirose School of Mathematics, Statistics and Operations Research, Victoria University of Wellington, New Zealand Summary. This paper shows

More information

arxiv:math/ v1 [math.st] 1 Aug 2006

arxiv:math/ v1 [math.st] 1 Aug 2006 The Annals of Statistics 2006, Vol. 34, No. 3, 1436 1462 DOI: 10.1214/009053606000000281 c Institute of Mathematical Statistics, 2006 arxiv:math/0608017v1 [math.st] 1 Aug 2006 HIGH-DIMENSIONAL GRAPHS AND

More information

A note on L convergence of Neumann series approximation in missing data problems

A note on L convergence of Neumann series approximation in missing data problems A note on L convergence of Neumann series approximation in missing data problems Hua Yun Chen Division of Epidemiology & Biostatistics School of Public Health University of Illinois at Chicago 1603 West

More information

MINIMAX OPTIMAL DESIGNS IN NONLINEAR REGRESSION MODELS

MINIMAX OPTIMAL DESIGNS IN NONLINEAR REGRESSION MODELS Statistica Sinica 8(998, 49-64 MINIMAX OPTIMAL DESIGNS IN NONLINEAR REGRESSION MODELS Holger Dette and Michael Sahm Ruhr-Universität Bochum Astract: We consider the maximum variance optimality criterion

More information

Supplement to: Guidelines for constructing a confidence interval for the intra-class correlation coefficient (ICC)

Supplement to: Guidelines for constructing a confidence interval for the intra-class correlation coefficient (ICC) Supplement to: Guidelines for constructing a confidence interval for the intra-class correlation coefficient (ICC) Authors: Alexei C. Ionan, Mei-Yin C. Polley, Lisa M. McShane, Kevin K. Doin Section Page

More information

Chapter 2 Inference on Mean Residual Life-Overview

Chapter 2 Inference on Mean Residual Life-Overview Chapter 2 Inference on Mean Residual Life-Overview Statistical inference based on the remaining lifetimes would be intuitively more appealing than the popular hazard function defined as the risk of immediate

More information

A732: Exercise #7 Maximum Likelihood

A732: Exercise #7 Maximum Likelihood A732: Exercise #7 Maximum Likelihood Due: 29 Novemer 2007 Analytic computation of some one-dimensional maximum likelihood estimators (a) Including the normalization, the exponential distriution function

More information

The Capacity Region of 2-Receiver Multiple-Input Broadcast Packet Erasure Channels with Channel Output Feedback

The Capacity Region of 2-Receiver Multiple-Input Broadcast Packet Erasure Channels with Channel Output Feedback IEEE TRANSACTIONS ON INFORMATION THEORY, ONLINE PREPRINT 2014 1 The Capacity Region of 2-Receiver Multiple-Input Broadcast Packet Erasure Channels with Channel Output Feedack Chih-Chun Wang, Memer, IEEE,

More information

Characterization of the Burst Aggregation Process in Optical Burst Switching.

Characterization of the Burst Aggregation Process in Optical Burst Switching. See discussions, stats, and author profiles for this pulication at: http://www.researchgate.net/pulication/221198381 Characterization of the Burst Aggregation Process in Optical Burst Switching. CONFERENCE

More information

On the Optimal Performance in Asymmetric Gaussian Wireless Sensor Networks With Fading

On the Optimal Performance in Asymmetric Gaussian Wireless Sensor Networks With Fading 436 IEEE TRANSACTIONS ON SIGNA ROCESSING, O. 58, NO. 4, ARI 00 [9] B. Chen and. K. Willett, On the optimality of the likelihood-ratio test for local sensor decision rules in the presence of nonideal channels,

More information

On the Robust Modal Local Polynomial Regression

On the Robust Modal Local Polynomial Regression International Journal of Statistical Sciences ISSN 683 5603 Vol. 9(Special Issue), 2009, pp 27-23 c 2009 Dept. of Statistics, Univ. of Rajshahi, Bangladesh On the Robust Modal Local Polynomial Regression

More information

arxiv: v4 [math.st] 29 Aug 2017

arxiv: v4 [math.st] 29 Aug 2017 A Critical Value Function Approach, with an Application to Persistent Time-Series Marcelo J. Moreira, and Rafael Mourão arxiv:66.3496v4 [math.st] 29 Aug 27 Escola de Pós-Graduação em Economia e Finanças

More information

1 Hoeffding s Inequality

1 Hoeffding s Inequality Proailistic Method: Hoeffding s Inequality and Differential Privacy Lecturer: Huert Chan Date: 27 May 22 Hoeffding s Inequality. Approximate Counting y Random Sampling Suppose there is a ag containing

More information

The properties of L p -GMM estimators

The properties of L p -GMM estimators The properties of L p -GMM estimators Robert de Jong and Chirok Han Michigan State University February 2000 Abstract This paper considers Generalized Method of Moment-type estimators for which a criterion

More information

IN this paper, we consider the estimation of the frequency

IN this paper, we consider the estimation of the frequency Iterative Frequency Estimation y Interpolation on Fourier Coefficients Elias Aoutanios, MIEEE, Bernard Mulgrew, MIEEE Astract The estimation of the frequency of a complex exponential is a prolem that is

More information

Merging and splitting endowments in object assignment problems

Merging and splitting endowments in object assignment problems Merging and splitting endowments in oject assignment prolems Nanyang Bu, Siwei Chen, and William Thomson April 26, 2012 1 Introduction We consider a group of agents, each endowed with a set of indivisile

More information

Sharp estimates of bounded solutions to some semilinear second order dissipative equations

Sharp estimates of bounded solutions to some semilinear second order dissipative equations Sharp estimates of ounded solutions to some semilinear second order dissipative equations Cyrine Fitouri & Alain Haraux Astract. Let H, V e two real Hilert spaces such that V H with continuous and dense

More information

Statistical Properties of Numerical Derivatives

Statistical Properties of Numerical Derivatives Statistical Properties of Numerical Derivatives Han Hong, Aprajit Mahajan, and Denis Nekipelov Stanford University and UC Berkeley November 2010 1 / 63 Motivation Introduction Many models have objective

More information

GAUSSIAN PROCESS REGRESSION

GAUSSIAN PROCESS REGRESSION GAUSSIAN PROCESS REGRESSION CSE 515T Spring 2015 1. BACKGROUND The kernel trick again... The Kernel Trick Consider again the linear regression model: y(x) = φ(x) w + ε, with prior p(w) = N (w; 0, Σ). The

More information

Issues on quantile autoregression

Issues on quantile autoregression Issues on quantile autoregression Jianqing Fan and Yingying Fan We congratulate Koenker and Xiao on their interesting and important contribution to the quantile autoregression (QAR). The paper provides

More information

A PRACTICAL WAY FOR ESTIMATING TAIL DEPENDENCE FUNCTIONS

A PRACTICAL WAY FOR ESTIMATING TAIL DEPENDENCE FUNCTIONS Statistica Sinica 20 2010, 365-378 A PRACTICAL WAY FOR ESTIMATING TAIL DEPENDENCE FUNCTIONS Liang Peng Georgia Institute of Technology Abstract: Estimating tail dependence functions is important for applications

More information

University of Pavia. M Estimators. Eduardo Rossi

University of Pavia. M Estimators. Eduardo Rossi University of Pavia M Estimators Eduardo Rossi Criterion Function A basic unifying notion is that most econometric estimators are defined as the minimizers of certain functions constructed from the sample

More information

Likelihood ratio confidence bands in nonparametric regression with censored data

Likelihood ratio confidence bands in nonparametric regression with censored data Likelihood ratio confidence bands in nonparametric regression with censored data Gang Li University of California at Los Angeles Department of Biostatistics Ingrid Van Keilegom Eindhoven University of

More information

Section 8.5. z(t) = be ix(t). (8.5.1) Figure A pendulum. ż = ibẋe ix (8.5.2) (8.5.3) = ( bẋ 2 cos(x) bẍ sin(x)) + i( bẋ 2 sin(x) + bẍ cos(x)).

Section 8.5. z(t) = be ix(t). (8.5.1) Figure A pendulum. ż = ibẋe ix (8.5.2) (8.5.3) = ( bẋ 2 cos(x) bẍ sin(x)) + i( bẋ 2 sin(x) + bẍ cos(x)). Difference Equations to Differential Equations Section 8.5 Applications: Pendulums Mass-Spring Systems In this section we will investigate two applications of our work in Section 8.4. First, we will consider

More information

Optimization of the Determinant of the Vandermonde Matrix and Related Matrices

Optimization of the Determinant of the Vandermonde Matrix and Related Matrices Methodol Comput Appl Proa 018) 0:1417 148 https://doiorg/101007/s11009-017-9595-y Optimization of the Determinant of the Vandermonde Matrix and Related Matrices Karl Lundengård 1 Jonas Östererg 1 Sergei

More information

Semiparametric modeling and estimation of the dispersion function in regression

Semiparametric modeling and estimation of the dispersion function in regression Semiparametric modeling and estimation of the dispersion function in regression Ingrid Van Keilegom Lan Wang September 4, 2008 Abstract Modeling heteroscedasticity in semiparametric regression can improve

More information

Log-mean linear regression models for binary responses with an application to multimorbidity

Log-mean linear regression models for binary responses with an application to multimorbidity Log-mean linear regression models for inary responses with an application to multimoridity arxiv:1410.0580v3 [stat.me] 16 May 2016 Monia Lupparelli and Alerto Roverato March 30, 2016 Astract In regression

More information