Empirical likelihood for average derivatives of hazard regression functions

Size: px
Start display at page:

Download "Empirical likelihood for average derivatives of hazard regression functions"

Transcription

1 Metrika ( :93 2 DOI 0.007/s Empirical likelihood for average derivatives of hazard regression functions Xuewen Lu Jie Sun Yongcheng Qi Received: 26 May 2006 / Published online: 9 February 2007 Springer-Verlag 2007 Abstract In this paper, we propose an empirical likelihood ratio method for the inference about average derivatives in semiparametric hazard regression models for competing risks data. Empirical loglikelihood ratio for the vector of the average derivatives of a hazard regression function is defined and shown to be asymptotically chi-squared with degrees of freedom equal to the dimension of covariate vector. Monte Carlo simulation studies are presented to compare the empirical likelihood ratio method with the normal-approximation-based method. Keywords Average derivative Competing risks data Empirical likelihood Nonparametric hazard regression Single-index model Introduction In survival analysis or censored data modelling, the relationship between a survival time and a set of covariates is often modelled through hazard functions. Typically, there are two types of models: the multiplicative model and the additive model (Cox 972; Aalen 980. In these models, the effects of covariates are modelled linearly through some explicit risk functions. X. Lu (B J. Sun Department of Mathematics and Statistics, University of Calgary, Calgary, AB T2N N4, Canada lux@math.ucalgary.ca Y. Qi Department of Mathematics and Statistics, University of Minnesota-Duluth, Duluth, MN 5582, USA

2 94 X. Lu et al. When the functional form of the risk function is not obvious, it requires nonparametric or semiparametric modelling for the effects of covariates (Sasieni 992; Dabrowska 987; Hastie and Tibshirani 990; Fan et al. 997; Huang 999. But the assumption of proportional hazards is essential for these models. On the other hand, when there are many covariates, the existing nonparametric methods may suffer the curse of dimensionality. In the framework of complete data, many models have been recently developed in studying nonparametric estimation of the regression function and its derivatives, while the latter is of special interest to biostatisticians, econometricians and other applied researchers. One of these models is called a singleindex model, in which the nonparametric regression function is assumed to be a nonparametric regression of the response on some linear combination of covariates (Härdle and Stoker 989; Powell et al. 989; Ichimura 993; Horowitz and Härdle 996. A nice survey on the theory of single-index models is given by Geenens and Delecroix (2005. In the models for complete data, interest centers on the mean regression functions. With censored data, however, it is often more convenient and informative to study hazard rates instead of means. There is not much research on the estimation of average derivatives of hazard functions and single-index models in survival analysis. Retaining the assumption of proportional hazards, Nielsen et al. (998 and Lu et al. (2006 investigated single-index models for censored data. Without this assumption, Lu and Burke (2005 proposed a synthetic data method for the estimation of average derivatives of mean regression functions when the censoring mechanism does not depend on the covariates. G rgens (2004 proposed kernel-based estimators for the average derivatives of risk-specific hazard functions for competing risks data. G rgens method can handle censored data where the censoring mechanism may depend on the covariates, and it has appealing features of single-index models as well as capabilities of dealing with censoring. But G rgens method requires the calculation of the covariance matrix and the finite sample properties may not be very good for small samples since the inference procedure is based on the normal approximation. In this paper, we will apply empirical likelihood methods to infer the average derivatives of hazard functions for competing risks data and conduct some finite sample Monte Carlo simulations to compare the performance of the empirical likelihood method and the normal-approximation-based method. Usually, the empirical loglikelihood ratio statistic has a limiting chi-squared distribution. It has been used for tests and constructing confidence regions (or intervals for a variety of problems, see, e.g., Thomas and Grunkemeier (975, Owen (988, 990, Qin and Lawless (994. An appealing feature of the empirical likelihood approach is that it produces confidence regions whose shape and orientation are determined entirely by the data. It has many advantages over some classical and modern methods such as the normal-approximation-based method and the bootstrap method. In particular, it does not impose prior constraints on the region shape, it does not require the construction of a pivotal quantity and the calculation of any asymptotic covariance matrices, the region is

3 Empirical likelihood for average derivatives 95 range-preserving and transformation-respecting, and it has better small sample performance than the normal-approximation-based method (Hall and La Scala 990. For the inference of the average derivatives of hazard functions, we define an empirical likelihood function based on the estimation procedure by G rgens (2004. This procedure involves the nonparametric estimation of the conditional subdistribution functions of the length of failure time and their derivatives. Hence, the data used to estimate the average derivatives are dependent. As a consequence, the empirical loglikelihood ratio does not have a standard chisquared limiting distribution. Instead, we show in Theorem 2 that the adjusted empirical loglikelihood ratio is distributed approximately as chi-squared. The plan of the paper is as follows. In Sect. 2, we introduce the empirical likelihood method for the inference about the average derivatives of hazard functions and present our main results. In Sect. 3, we report Monte Carlo simulation results and demonstrate the advantages numerically. Some concluding remarks are given in Sect. 4. All the proofs are presented in the Appendix. 2 Methodology and main results First, we introduce the average derivatives method of G rgens (2004 for hazard functions and propose the empirical likelihood method for the average derivatives. In a competing risks problem, there is a possibility of several kinds of failure. A special case of this is random censoring, where there are two risks, and the second risk can be thought of as censoring. We use the notation of G rgens (2004. Let Y represent the length of failure time, S be an indicator of the type of failure, and X be a q-vector of explanatory variables. Define the following conditional distribution functions F (y x = P(Y y, S = s X = x, F 2 (y x = P(Y y X = x. Let ξ(x denote the density of X with respect to Lebesgue measure. Define the following joint subdistributions of Y and X by A (y, x = P(Y y, S = s X = xξ(x, A 2 (y, x = P(Y y X = xξ(x. Then the conditional cumulative hazard function of Y given X is by definition H(y x = y 0 F (dv x F 2 (v x. (

4 96 X. Lu et al. It can be written as H(y x = y 0 A (dv, x A 2 (v, x. The following assumptions were used to derive the estimators of the average derivatives and prove their asymptotic normality by G rgens (2004. A. There are a function H and a vector β such that H(y x = H(y x τ β for all y and x, where a τ denotes the transpose of a column vector a. A2. The sequence {(Y i, S i, Xi τ τ n is a random sample. A3. For k N + (the set of positive integers, the following conditions are satisfied.. The q-vector X is absolutely continuous. 2. ξ is bounded. j 3. x A (dy, exist and are bounded and continuous for j =,...,+ k. 4. xa j 2 exist and are bounded and continuous for j =,...,+ k. A4. The function w : R + q R is a weight function and satisfies the following conditions.. The quantity C defined in the following is finite and nonzero. C = w(y, xa 2 (y, x 2 2 H(dy x τ βdx. 2. w(y, x is bounded. 3. x w and x 2 w exist and are bounded. A5. For k N +, the kernel function K : R q R satisfies the following conditions.. K is a bounded kernel with support [, ] q and the order of K is at least k. That is, K(x dx = and x j K(x dx = 0forj =, 2,..., k. 2. x K exists and is bounded and continuous on R q. If the hazard function is of index form, then x H(dy x = 2 H(dy x τ ββ.let W be another weight function. Then the weighted average derivative of H is defined as β = W(y, x x H(dy x dx. In model (, β is intimately related to β,asβ = γβ, where γ = W(y, x 2 H(dy x τ βdx is a nonzero scalar. Due to this relationship, one can estimate β through estimating β. Similar to the density weighted average derivative estimator of Powell et al. (989, G rgens (2004 considered the approach as follows.

5 Empirical likelihood for average derivatives 97 To estimate β and x H(dy x and to avoid random denominators in the estimation formula for x H(dy x, choose W(y, x = w(y, xa 2 (y, x 2, where w is the weight function satisfying Assumption A4. It follows that β = w(y, xa 2 (y, x x A (dy, x dx w(y, x x A 2 (y, xa (dy, x dx. Define K b (x = b q K(b x and the following estimators of A (y, x and A 2 (y, x: A n (y, x = n A 2n (y, x = n K b (x X i I(Y i yi(s i = s, K b (x X i I(Y i y. (2 Then, G rgens estimator of β is defined as βn = w(y, xa 2n (y, x x A n (dy, x dx w(y, x x A 2n (y, xa n (dy, x dx = n 2 x K b (x X i K b (x X j w(y i, xi(y j Y i I(S i = s dx n 2 j= j= K b (x X i x K b (x X j w(y i, xi(y j Y i I(S i = s dx. Under appropriate conditions, βn is a uniformly consistent and asymptotically normal estimator of β, as stated below. Theorem [Theorem (iii, G rgens (2004] Suppose Assumptions A A5 hold. If nb 2q + 2 and nb 2k 0, then βn β n and Φ(Y i, S i, X i = o p (n /2 n /2 (β n β D N(0, Σ,

6 98 X. Lu et al. where D " denotes the convergence in distribution, and Σ is the covariance matrix of Φ(Y, S, X with Φ(y, s, x = 2 + w(v, xi(y v x A (dv, x 2w(y, x x A 2 (y, xi(s = s x w(v, xi(y va (dv, x x w(y, xa 2 (y, xi(s = s 2β. (3 One has from (3 that EΦ(Y, S, X = 0 and Σ = EΦ(Y, S, XΦ(Y, S, X τ. To make inference about β, one needs to estimate the covariance matrix Σ of βn.lett = (Y, S, Xτ τ, t = (y, s, x τ τ and t i = (y i, s i, x τ i τ for i n. Define ρ 0 (t i, t j = x K b (x x i K b (x x j w(y i, xi(y j y i I(s i = s dx K b (x x i x K b (x x j w(y i, xi(y j y i I(s i = s dx and r n (t = n Then (ρ 0 (t, T j + ρ 0 (T j, t. (4 j= β n = n 2 ρ 0 (T i, T j = n j= r n (T i, 2 and Σ can be consistently estimated by Σ n = n = 4 n ( rn (T i 2βn ( rn (T i 2βn τ ( ( rn (T i βn rn (T i τ β 2 2 n. Therefore, a large sample ( α-level confidence region for the true parameter value β 0 of β based on the above normal approximation is given by R NA,α = { β : n(β n β τ Σ n (β n β χ 2 q (α,

7 Empirical likelihood for average derivatives 99 where χq 2 (α is the ( αth quantile of the chi-squared distribution with q degrees of freedom. In (3, for t = (y, s, x τ τ,letη(t = η(y, s, x = (/2{Φ(y, s, x + 2β. Then { η(t = (/2 2 + w(v, xi(y v x A (dv, x 2w(y, x x A 2 (y, xi(s = s x w(v, xi(y va (dv, x x w(y, xa 2 (y, xi(s = s. Since Eη(T i = β, i =,..., n, testing whether β0 is the true parameter value of β is equivalent to testing whether Eη(T i = β0, i =,..., n. Todothis, we apply Owen (990 empirical likelihood method. Let p = (p,..., p n be a probability vector satisfying n p i = and p i 0fori =,..., n. LetF p be a distribution function which assigns probability p i at point η(t i. Denote β (F p = p i η(t i. Then the empirical likelihood, evaluated at the true parameter value β 0,is defined by { n L(β 0 = sup p i : p i =, β (F p = β0. Since β (F p depends on unknown functions A (y, x and A 2 (y, x, we need to replace them by their kernel estimators A n (y, x and A 2n (y, x given by (2. Hence, an estimated empirical likelihood, evaluated at the true parameter value β 0 of β, is defined by L(β 0 = sup { n p i : p i =, βn (F p = β0, where β n (F p = n p i η n (T i, η n (T i = r n (T i /2. For simplicity, write W ni = η n (T i β 0. Then, by the method of Lagrange multipliers, we can easily get p i = n { + λτ W ni, i =,..., n,

8 00 X. Lu et al. where λ = (λ,..., λ q τ is the solution of n W ni + λ τ W ni = 0. (5 Note that n p i, subject to n p i =, attains its maximum n n at p i = n. So we define the empirical likelihood ratio at β 0 by R ( β0 n = (np i = n { + λ τ W ni, and the corresponding empirical loglikelihood ratio statistic is defined as L ( β 0 = 2 log R(β 0 = 2 log { + λ τ W ni. (6 The following theorem gives the asymptotic distribution of the adjusted empirical loglikelihood ratio (/4L(β 0. Theorem 2 Let β 0 be the true value of β. Then, under the assumptions stated in Theorem, we have (/4L(β 0 D χ 2 q, where χq 2 is a standard chi-squared random variable with q degrees of freedom. It is well known that the empirical loglikelihood ratio statistics for the mean of i.i.d. random vectors have a limiting chi-squared distribution. This is due to the fact that the limit of the covariance matrix for the sample mean is in agreement with the limit of the sample covariance. In this paper, η(t i β0 are i.i.d. random vectors, but the unknown functions A (y, x and A 2 (y, x have to be estimated. Instead W ni s are used to define the empirical likelihood ratio. Although η(t i β0 and W ni are so close that both sequences have the same limit covariance matrix Σ, the covariance matrix for the limit distribution of the sample mean n n W ni = βn β 0 is different (see Theorem. This disagreement explains why we need the adjusted empirical loglikelihood ratio. Theorem 2 can be used to construct the empirical likelihood ratio confidence region of β0 as follows, R EL,α ={β R q : (/4L(β χ 2 q (α, since P(β0 R EL,α = P((/4L(β0 χ q 2 (α α, n. In some applications, we often need to make inferences about a particular index coefficient or a linear combination of β0. This can be done by an empirical

9 Empirical likelihood for average derivatives 0 likelihood confidence interval for a linear combination θ = a τ β. The following theorem states the result. Theorem 3 Under the conditions of Theorem 2, for any parameter θ 0 = a τ β0, which is a linear combination of elements in β0, where a is a constant q-vector, we have (/4L a (θ 0 D χ 2, where L a (θ 0 is defined by (6 with W ni = W ni,a a τ η n (T i θ 0. A large sample ( α-level confidence interval for θ 0 based on the above approximation is given by R a,α ={θ a : (/4L a (θ a χ 2 (α. 3 Simulation studies We conduct several Monte Carlo simulations to compare the performance of the empirical likelihood method and the normal-approximation-based method in three censored hazard regression models. In all simulations, the weight function is simply w(y, x =. The kernel function is the product kernel K(x, x 2 = K u (x K u (x 2 for bivariate covariates and K u (x for univariate covariates, where K u is the univariate 4th order (k = 4 kernel function suggested by G rgens (2004: K u (v = (05/64( 5v 2 + 7v 4 3v 6 I( v. The simulations are implemented using codes in R, and a function called el.test in the R library emplik written by Mai Zhou, which is adapted from Owen s SPlus code. Example Exponential regression. Consider the following data generating process for a single-index exponential hazard regression model with two covariates (q = 2, which is a competing risks model with two risks, the second risk is viewed as the censoring. Assume that the conditional hazard function of the survival time T is given by h(t X = x = exp(u, which is a proportional hazards model, where u = x τ β 0 is the single-index with the true parameter vector β 0 = ( 2/2, 2/2 τ, X = (X, X 2 τ, X and X 2 are independent, X N(0, and X 2 χ3 2. The conditional cumulative hazard function is H(t X = x = t exp(u.

10 02 X. Lu et al. Table Coverage probabilities (% in comparisons of empirical likelihood (EL and normal approximation (NA at ( different censoring proportions (cp, sample sizes (n and bandwidths Var(X (b = c σ, where σ =, τ Var(X 2 = (, 6 τ is the standard deviation of (X, X 2 τ, c is some constant factor. The nominal level is set up at 95%. The conditional survival function is exponential NA EL NA EL NA EL NA EL NA EL cp n b = 0.6σ b = 0.7σ b = 0.8σ b = 0.9σ b =.0σ 0% b = 0.7σ b = 0.8σ b = 0.9σ b =.0σ b =.σ 20% b =.2σ b =.3σ b =.4σ b =.5σ b =.6σ 50% The distribution of the censoring variable C is exponential with its hazard function specified by h C (s X = x = exp(ν + x τ β C, where β C = (, 0 τ, ν is a constant, taking values 0.8, 0., 2 so that the censoring proportion cp is 0, 20 and 50% respectively. The sample size n has been chosen to be 30, 60 and 20, respectively. The focus of doing this is on the effect of the degree of censoring and the size of samples. For this model specification, the q-vector of the true average derivatives β0 is a function of the censoring proportion cp but proportional to β 0. Finally, the observations will be {(Y i, δ i, X i n, where Y i = min(t i, C i, δ i = I[T i C i ]=I[S i = ], [S i = ] indicates that the failure is caused by the first risk, which is of interest. The nominal confidence level ( α has been set at 95%. To assess the sensitivity of the estimates and the coverage probabilities to bandwidth, simulations are conducted for different bandwidth values. Let σ = ( Var(X, Var(X 2 τ = (, 6 τ, b = c σ, σ is the standard deviation of (X, X 2 τ and c is some constant factor. For a range of b values, we report the simulation results using 000 replicates in Table. Example 2 Log-logistic regression. In this example, we assume that the survival time T follows a log-logistic distribution with a conditional hazard function h(t X = x = exp(βτ 0 x + t exp(β τ 0 x,

11 Empirical likelihood for average derivatives 03 Table 2 Coverage probabilities (% in comparisons of empirical likelihood (EL and normal approximation (NA at ( different censoring proportions (cp, sample sizes (n and bandwidths Var(X (b = c σ, where σ =, τ Var(X 2 = (, 6 τ is the standard deviation of (X, X 2 τ, c is some constant factor. The nominal level is set up at 95%. The conditional survival function is log-logistic NA EL NA EL NA EL NA EL NA EL cp n b =.0σ b =.σ b =.2σ b =.3σ b =.4σ 0% b =.0σ b =.σ b =.2σ b =.3σ b =.4σ 20% b =.0σ b =.σ b =.2σ b =.3σ b =.4σ 50% which is not a proportional hazards regression model, where β 0 = (/ 3, 2/ 3 τ. The settings for all other random variables and parameters are the same as that in Example except for β C = ( 2/2, 2/2 τ and ν, which takes values 3, 2, 0.2 so that the censoring proportion cp is kept at 0, 20 and 50%, respectively. Table 2 summarizes the results. It is seen that the bandwidth in this example is less variable with the censoring proportion than that in Example. Example 3 Exponential regression with univariate covariate. As suggested by a referee, it is interesting to compare the average length of confidence intervals based on two methods. To do that, we use a model analogous to the one in Example except that the covariate is univariate covariate (q =. Namely, assume that h(t X = x = exp(u, where u = (x 3β 0 with β 0 =, X χ 2 3 and h C (s X = x = exp(ν + xβ C, where β C =, ν is a constant controlling the censoring proportions. We have set up the censoring proportion at cp = 50% (ν = 3 and sample size at n = 20. At the 95% nominal confidence level, 000 replicates were simulated for a range of b values. Table 3 presents the results, from which we have the following observations: the coverage probability for both methods decreases with the bandwidth increasing, and it crosses the nominal level where the bandwidth is considered the optimal; the average length of confidence intervals for both methods has a similar pattern; the empirical likelihood method gives more accurate coverage probabilities than the normalapproximation-based method and slightly wider confidence intervals. Based on these results, we conduct an additional simulation study subject to the same coverage probability. Table 4 provides the results. It is found that the empirical

12 04 X. Lu et al. Table 3 Coverage probabilities (% and the average length (numbers in parentheses of confidence intervals in comparisons of empirical likelihood (EL and normal approximation (NA at different bandwidths (b = c σ, where σ = Var(X = 6 is the standard deviation of X and c is some constant factor. The nominal level is set up at 95%, the censoring proportion at cp = 50% and the sample size at n = 20 respectively. The true average derivative β0 = NA EL NA EL NA EL NA EL b = 0.7σ b = 0.8σ b = 0.9σ b =.0σ (0.233 (0.33 (0.009 (0.086 ( (0.099 ( (0.089 b =.σ b =.2σ b =.3σ b =.4σ ( ( ( ( ( ( ( ( Table 4 The average length (numbers in parentheses of confidence intervals in comparisons of empirical likelihood (EL and normal approximation (NA subject to the same coverage probability (% (numbers in the first row. The censoring proportion cp = 50% and the sample size n = 20. The true average derivative β0 = NA EL NA EL NA EL b = 0.7σ b =.0σ b =.3σ (0.2 (0.206 ( (0.074 (0.054 ( likelihood method has slightly shorter average length than the normal-approximation-based method when they are compared with each other at the same coverage probability, and the difference becomes more prominent when the bandwidth deviates downward from the optimal bandwidth. From the simulation results shown in Tables, 2, 3 and 4, we draw the following conclusions. The coverage probability varies with the bandwidth for both the empirical likelihood method and the normal-approximation-based method, but the empirical likelihood method outperforms the normal-approximation-based method in terms of the coverage probability, the average length of confidence intervals and the sensitivity to the bandwidth. Particularly, the former is less sensitive to the change of the bandwidth, because it inclines to give higher coverage than the latter using the same bandwidth. Subject to the same coverage probability, the former also tends toward giving slightly narrower confidence intervals than the latter. In summary, the empirical likelihood method appears to be superior to the normal-approximation-based method for the inference about the average derivatives of hazard functions. We have noticed that, in all cases, if the bandwidth is appropriately chosen, the coverage probability is in agreement with the nominal level. However, the coverage accuracy for both methods is very sensitive to the bandwidth. It is apparent that the bandwidth at which the best coverage occurs depends on the

13 Empirical likelihood for average derivatives 05 censoring proportion and the sample size. From our simulation results, we see that the range of b is between 0.7σ and.4σ, where σ is the standard deviations of covariates. Hence, we can select the bandwidth as b = cσ, c is a constant depending on the sample size, the censoring proportion and underlying distributions. For example, an empirical formula for c used for the model in Example can be fitted as c = ( cpn /7 and the resulting bandwidth satisfies the conditions in Theorem with q = 2 and k = 4. Although the criterion for the bandwidth selection has not been developed for the estimation of censored average derivatives, our Monte Carlo studies suggest that the reasonable small-sample performance is obtained by setting b in the range of 0.7 to.4 standard deviations of the covariates. Obviously, a formal criterion is desirable in applying any of these two methods for statistical inferences and it needs a full investigation. 4 Conclusion remarks We have proposed a semiparametric method for the inference about the average derivatives of hazard functions by using empirical likelihood methods. Our method applies to general hazard regression models without other assumptions on the model structure except for a weak index assumption. It does not need additional assumptions rather than those for the normal-approximation-based method. The finite sample behavior of our method appears comparable with that of the latter in general and less sensitive to the change of the bandwidth. Moreover, the confidence region based on empirical likelihood does not have predetermined symmetry and does not require the calculation of any covariance matrices. The factor /4 in the adjusted empirical loglikelihood ratio can be viewed as the cost of estimating the unknown functions in constructing empirical likelihood. Therefore, our method is a useful and competitive alternative to the normal-approximation-based method. Appendix: Proof of Theorem 2 To prove Theorem 2, we need the following lemmas. Lemma Under the assumptions of Theorem 2, we have (a max i n W ni =o p (n /2. (b λ =O p (n /2. Lemma 2 Under the assumptions of Theorem 2, we have (a Σ n P Σ, where P denotes the convergence in probability.

14 06 X. Lu et al. (b Set Σ n = (η n (T i βn n (η n(t i βn τ, Σ = E[(η(Y, S, X β 0 (η(y, S, X β 0 τ ]. (c P Then, Σ = (/4Σ and Σ n Σ. n n W ni Wni τ P Σ. Proof of Lemma. Part (b can be proved using the similar arguments as those used in the proof of (2.4 of Owen (990. Next, we prove (a. Set d (t = E{ρ 0 (t, T and d 2 (t = E{ρ 0 (T, t. Then it is easily seen that max ρ 0(T i, T i =o p (n /2 and max d j(t i β0 =o p(n /2 for j =, 2. i n i n Since for each i n W ni = η n (T i β 0 and = ρ 0 (T i, T i + ρ 0 (T i, T i n 2 = n n { ρ0 (T i, T i β 0 + n 2n n j =i j =i it suffices to show + n n n n j =i ρ 0 (T i, T j + ρ 0 (T j, T i 2 β 0 { ρ0 (T i, T j β0 + ρ 0(T j, T i β0, j =i { ρ0 (T i, T j β 0 = n { ρ0 (T j, T i β 0 = n max i n ρ 0 (T i, T j d (T i + d (T i β0 j =i ρ 0 (T j, T i d 2 (T i + d 2 (T i β0, j =i ρ 0 (T i, T j d (T i n = o p(n /2 (7 j =i and max i n ρ 0 (T j, T i d 2 (T i n = o p(n /2. (8 j =i

15 Empirical likelihood for average derivatives 07 We prove (7 only. (8 can be proved using some similar arguments. Set ρ 0 (T i, T j = ρ 02 (T i, T j = x K b (x X i K b (x X j w(y i, xi(y j Y i I(S i = s dx, K b (x X i x K b (x X j w(y i, xi(y j Y i I(S i = s dx. Then ρ 0 (T i, T j = ρ 0 (T i, T j ρ 02 (T i, T j. Therefore, proving (7 is equivalent to showing the following two equations: max i n ρ 0 (T i, T j d 0 (T i n = o p(n /2 (9 j =i and max i n ρ 02 (T i, T j d 02 (T i n = o p(n /2, j =i where d 0j (t = Eρ 0j (t, T for j =, 2. We prove the first equation only since the second one can be proved in the same way. For any t i = (y i, s i, x τ i τ,let ρ n (t i = (n j =i ρ 0(t i, T j d 0 (t i. Note that ρ n (t i is a q-vector. Without loss of generality, we consider the first element of ρ n (t i,say,ρ n (t i.let P n (t i = P( ρ n (t i > C for some constant C > 0, σn 2 = Var{ρ n(t i and K ( (x be the first element of xk(x. Then σn 2 = (n 2 (n b2(2q + { ( ( Var K ( x xi x Xj K b b (n b2(2q + { ( ( E K ( x xi x Xj K b b w(y i, xi(y j y i I(s i = s dx w(y i, xi(y j y i I(s i = s dx 2.

16 08 X. Lu et al. By using Cauchy-Schwarz inequality and changing variables, { E K ( ( x xi K ( x Xj b b [{ { ( ( E K ( x xi x Xj K b b { [ ] x X j ] I b dx [ { ( ( 2 q b q E K ( x xi x Xj K b b 2 q b {w 3q 2 (y i, x i A 2 (y i, x i 2 w(y i, xi(y j y i I(s i = s dx w(y i, xi(y j y i I(s i = s 2 dx ] 2 w(y i, xi(y j y i I(s i = s dx K (2 (uk 2 (v dudv + O(b. Therefore, we have σ 2 n 2 q { (n b q+2 w 2 (y i, x i A 2 (y i, x i K (2 (uk 2 (v dudv + O(b. By letting K max ( = max x{ K ( (x, K max = max x { K(x, wmax = max y,x { w(y, x, and K n = wmaxk max ( K max/{(n b 2q + and applying Bernstein s inequality (Serfling 980, p. 95, we obtain [ C 2 ] P n (t i 2 exp 2σn 2 + (2/3K nc [ ] C(n b 2q + 2 exp 2C b q C + (2/3wmaxK max ( K max 2 exp[ C 2 C(n b 2q + ], where C and C 2 are two constants independent of t i. For any ε>0, taking C = εn /2, and noticing that nb 2q + 2 implies that there exists a sufficiently large number >0 such that (n 3/2 b 2q + > log n and C 2 ε >, therefore, we get

17 Empirical likelihood for average derivatives 09 [ ] P max ρ n(t i εn /2 i n = P [ ρ n (T i εn /2] E {P n (T i T i 2q + 2n exp { C 2 C(n b 2n exp { C 2 ε(n 3/2 2q + b 2n exp { (C 2 ε log n = o(, which implies (9. The proof of Lemma is complete. Proof of Lemma 2. In fact, Lemma 2(a is a direct consequence of Theorem in G rgens (2004. Noticing that Σ n = (/4Σ n, we see that Lemma 2(a implies Lemma 2(b. Set S n = n n W ni Wni τ. To prove Lemma 2(c, we only need to show that S n = Σ n + o p (, since Lemma 2(b holds. For any a R q, we have a τ (S n Σ n a = [2(a τ η n (T i { a τ ( βn n ( β 0 + a τ β0 2 (a τ βn 2 ] = n {2a τ (η n (T i η(t i a τ (βn β 0 + n {a τ (β n β 0 2 2(a τ β 0 {aτ (β n β 0 = R n + R n2 + R n3 + R n4. {2a τ η(t i a τ (βn β 0 Notice that sup i η n (T i η(t i =sup i η n (T i E{η n (T i T i + sup i E{η n (T i T i η(t i =o p (n /2 +o(b = o p (n /2 as shown in (7. By the assumptions given in Theorem, we obtain R n 2 a 2 sup η n (T i η(t i βn β 0 =o p(, i ( R n2 2 a 2 βn β 0 n η(t i = O p (n /2, R n3 a 2 βn β 0 2 = O p (n, R n4 2 a τ β0 a β n β 0 =O p(n /2. Therefore, a τ (S n Σ n a = o p (. Hence, S n = Σ n + o p ( and Lemma 2(c is proved. Thus, the proof of Lemma 2 is complete.

18 0 X. Lu et al. Proof of Theorem 2. The Taylor s expansion of L(β0 in (4 with respect to λ τ W ni gives L(β 0 = 2 { λ τ W ni (/2(λ τ W ni 2 + R n, (0 where R n satisfies the following inequality in light of Lemmas and 2(c for some constant C > 0, R n C (λ τ W ni 3 C λ 3 max i n W ni W ni 2 = o p (. Using Lemmas, 2(c and some similar arguments as above we obtain From (5 we have 0 = λ τ W ni + λ τ W ni = (λ τ W ni 3 + λ τ W ni = o p (. ( (λ τ W ni which together with ( yields Again by (5 we have Since 0 = (λ τ W ni = W ni + λ τ W ni = = (λ τ W ni 2 + (λ τ W ni 3 + λ τ W ni, (λ τ W ni 2 + o p (. (2 [ W ni λ τ W ni + (λτ W ni 2 ] + λ τ W ni W ni (W ni Wni τ λ + n W ni(λ τ W ni 2 + λ τ W ni C λ 2 max i n W ni n W ni (λ τ W ni 2 + λ τ W ni. W ni 2 = o p (n /2

19 Empirical likelihood for average derivatives from Lemmas and 2(c, we conclude ( ( λ = W ni Wni τ W ni + n ( W ni Wni τ n W ni (λ τ W ni 2 + λ τ W ni ( = W ni Wni τ W ni + o p (n /2. (3 Consequently, by (0, (2 and (3, we obtain L(β 0 = = = λ τ W ni Wni τ λ + o p( ( ( n /2 n /2 τ ( W ni ( = Σ /2 n /2 n W ni τ Σ ( W ni Wni τ n /2 W ni + o p ( ( W ni + o p ( n /2 τ ( ( W ni Σ /2 Σ Σ/2 Σ /2 n /2 W ni +o p (. Noticing that Σ = (/4Σ, we obtain Σ /2 Σ Σ/2 = 4I q, where I q denotes a q-dimensional identity matrix. Hence, L ( τ ( β0 = 4 (Σ /2 n /2 W ni Σ /2 n /2 W ni + o p (. By Theorem, Σ /2 n /2 n W ni proved. D N(0, Iq, Therefore, Theorem 2 is Acknowledgments The authors thank the editor and an anonymous referee for their valuable and constructive comments. Lu s research was partly supported by the NSERC Discovery Grant of Canada and Qi s research was supported by NSF grant DMS of USA. References Aalen OO (980 A model for nonparametric regression analysis of counting processes. In: Klonecki N, Kosek A, Rosiński J (eds Lecture notes in statistics, vol 2, Springer, New York, pp 25 Cox DR (972 Regression models and life-tables (with discussion. J R Statist Soc Ser B 34: Dabrowska DM(987 Non-parametric regression with censored survival time data. Scand J Statist 4:8 92 Fan J, Gijbels I, King M (997 Local likelihood and local partial likelihood in hazard regression. Ann Statist 25:66 690

20 2 X. Lu et al. Geenens G, Delecroix M (2005 A survey about single-index models theory. Discussion paper #0508, Intitut de Statistique, Université Catholique de Louvain, Belgium. Available from G rgens T (2004 Average derivatives for hazard functions. Econom Theory 20: Hall P, La Scala B (990 Methodology and algorithms of empirical likelihood. Int Statist Rev 58:09 27 Härdle W, Stoker TM (989 Investigating smooth multiple regression by the method of average derivative. J Am Statist Assoc 84: Hastie T, Tibshirani R (990 Generalized additive models. Chapman and Hall, London Horowitz JL, Härdle W (996 Direct semiparametric estimation of single-index models with discrete covariates. J Am Statist Assoc 9: Huang J (999 Efficient estimation of the partly linear additive Cox model. Ann Statist 27: Ichimura H (993 Semiparametric least squares (SLS and weighted SLS estimation of single-index models. J Econom 58:7 20 Lu X, Burke MD (2005 Censored multiple regression by the method of average derivatives. J Multivariate Anal 95: Lu X, Chen G, Song X-K, Singh RS (2006 A class of partially linear single-index survival models. Can J Statist 34:97 2 Nielsen JP, Linton OB, Bickel PJ (998 On a semiparametric survival model with flexible covariate effect. Ann Statist 26:25 24 Owen A (988 Empirical likelihood ratio confidence intervals for single functional. Biometrika 75: Owen A (990 Empirical likelihood ratio confidence regions. Ann Statist 8:90 20 Powell JL, Stock JH, Stoker TM (989 Semiparametric estimation of index coefficients. Econometrica 57: Qin J, Lawless JF (994 Empirical likelihood and general estimating equations. Ann Statist 22: Sasieni P (992 Non-orthogonal projections and their application to calculating the information in a partly linear Cox model. Scand J Statist 9: Serfling RJ (980 Approximation theorems of mathematical statistics. Wiley, New York Thomas DR, Grunkemeier GL (975 Confidence interval estimation of survival probabilities for censored data. J Am Statist Assoc 70:866 87

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 24 Paper 153 A Note on Empirical Likelihood Inference of Residual Life Regression Ying Qing Chen Yichuan

More information

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky Empirical likelihood with right censored data were studied by Thomas and Grunkmier (1975), Li (1995),

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

FULL LIKELIHOOD INFERENCES IN THE COX MODEL October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIAN-JIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach

More information

SMOOTHED BLOCK EMPIRICAL LIKELIHOOD FOR QUANTILES OF WEAKLY DEPENDENT PROCESSES

SMOOTHED BLOCK EMPIRICAL LIKELIHOOD FOR QUANTILES OF WEAKLY DEPENDENT PROCESSES Statistica Sinica 19 (2009), 71-81 SMOOTHED BLOCK EMPIRICAL LIKELIHOOD FOR QUANTILES OF WEAKLY DEPENDENT PROCESSES Song Xi Chen 1,2 and Chiu Min Wong 3 1 Iowa State University, 2 Peking University and

More information

11 Survival Analysis and Empirical Likelihood

11 Survival Analysis and Empirical Likelihood 11 Survival Analysis and Empirical Likelihood The first paper of empirical likelihood is actually about confidence intervals with the Kaplan-Meier estimator (Thomas and Grunkmeier 1979), i.e. deals with

More information

Discussion of the paper Inference for Semiparametric Models: Some Questions and an Answer by Bickel and Kwon

Discussion of the paper Inference for Semiparametric Models: Some Questions and an Answer by Bickel and Kwon Discussion of the paper Inference for Semiparametric Models: Some Questions and an Answer by Bickel and Kwon Jianqing Fan Department of Statistics Chinese University of Hong Kong AND Department of Statistics

More information

Size and Shape of Confidence Regions from Extended Empirical Likelihood Tests

Size and Shape of Confidence Regions from Extended Empirical Likelihood Tests Biometrika (2014),,, pp. 1 13 C 2014 Biometrika Trust Printed in Great Britain Size and Shape of Confidence Regions from Extended Empirical Likelihood Tests BY M. ZHOU Department of Statistics, University

More information

A Bootstrap Test for Conditional Symmetry

A Bootstrap Test for Conditional Symmetry ANNALS OF ECONOMICS AND FINANCE 6, 51 61 005) A Bootstrap Test for Conditional Symmetry Liangjun Su Guanghua School of Management, Peking University E-mail: lsu@gsm.pku.edu.cn and Sainan Jin Guanghua School

More information

A note on profile likelihood for exponential tilt mixture models

A note on profile likelihood for exponential tilt mixture models Biometrika (2009), 96, 1,pp. 229 236 C 2009 Biometrika Trust Printed in Great Britain doi: 10.1093/biomet/asn059 Advance Access publication 22 January 2009 A note on profile likelihood for exponential

More information

Nonparametric Econometrics

Nonparametric Econometrics Applied Microeconometrics with Stata Nonparametric Econometrics Spring Term 2011 1 / 37 Contents Introduction The histogram estimator The kernel density estimator Nonparametric regression estimators Semi-

More information

Likelihood ratio confidence bands in nonparametric regression with censored data

Likelihood ratio confidence bands in nonparametric regression with censored data Likelihood ratio confidence bands in nonparametric regression with censored data Gang Li University of California at Los Angeles Department of Biostatistics Ingrid Van Keilegom Eindhoven University of

More information

A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints

A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints Noname manuscript No. (will be inserted by the editor) A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints Mai Zhou Yifan Yang Received: date / Accepted: date Abstract In this note

More information

Jackknife Euclidean Likelihood-Based Inference for Spearman s Rho

Jackknife Euclidean Likelihood-Based Inference for Spearman s Rho Jackknife Euclidean Likelihood-Based Inference for Spearman s Rho M. de Carvalho and F. J. Marques Abstract We discuss jackknife Euclidean likelihood-based inference methods, with a special focus on the

More information

Adjusted Empirical Likelihood for Long-memory Time Series Models

Adjusted Empirical Likelihood for Long-memory Time Series Models Adjusted Empirical Likelihood for Long-memory Time Series Models arxiv:1604.06170v1 [stat.me] 21 Apr 2016 Ramadha D. Piyadi Gamage, Wei Ning and Arjun K. Gupta Department of Mathematics and Statistics

More information

Quantile Regression for Residual Life and Empirical Likelihood

Quantile Regression for Residual Life and Empirical Likelihood Quantile Regression for Residual Life and Empirical Likelihood Mai Zhou email: mai@ms.uky.edu Department of Statistics, University of Kentucky, Lexington, KY 40506-0027, USA Jong-Hyeon Jeong email: jeong@nsabp.pitt.edu

More information

UNIVERSITÄT POTSDAM Institut für Mathematik

UNIVERSITÄT POTSDAM Institut für Mathematik UNIVERSITÄT POTSDAM Institut für Mathematik Testing the Acceleration Function in Life Time Models Hannelore Liero Matthias Liero Mathematische Statistik und Wahrscheinlichkeitstheorie Universität Potsdam

More information

EMPIRICAL ENVELOPE MLE AND LR TESTS. Mai Zhou University of Kentucky

EMPIRICAL ENVELOPE MLE AND LR TESTS. Mai Zhou University of Kentucky EMPIRICAL ENVELOPE MLE AND LR TESTS Mai Zhou University of Kentucky Summary We study in this paper some nonparametric inference problems where the nonparametric maximum likelihood estimator (NPMLE) are

More information

High Dimensional Empirical Likelihood for Generalized Estimating Equations with Dependent Data

High Dimensional Empirical Likelihood for Generalized Estimating Equations with Dependent Data High Dimensional Empirical Likelihood for Generalized Estimating Equations with Dependent Data Song Xi CHEN Guanghua School of Management and Center for Statistical Science, Peking University Department

More information

Power and Sample Size Calculations with the Additive Hazards Model

Power and Sample Size Calculations with the Additive Hazards Model Journal of Data Science 10(2012), 143-155 Power and Sample Size Calculations with the Additive Hazards Model Ling Chen, Chengjie Xiong, J. Philip Miller and Feng Gao Washington University School of Medicine

More information

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA Kasun Rathnayake ; A/Prof Jun Ma Department of Statistics Faculty of Science and Engineering Macquarie University

More information

ASSESSING A VECTOR PARAMETER

ASSESSING A VECTOR PARAMETER SUMMARY ASSESSING A VECTOR PARAMETER By D.A.S. Fraser and N. Reid Department of Statistics, University of Toronto St. George Street, Toronto, Canada M5S 3G3 dfraser@utstat.toronto.edu Some key words. Ancillary;

More information

Efficiency of Profile/Partial Likelihood in the Cox Model

Efficiency of Profile/Partial Likelihood in the Cox Model Efficiency of Profile/Partial Likelihood in the Cox Model Yuichi Hirose School of Mathematics, Statistics and Operations Research, Victoria University of Wellington, New Zealand Summary. This paper shows

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH

FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH Jian-Jian Ren 1 and Mai Zhou 2 University of Central Florida and University of Kentucky Abstract: For the regression parameter

More information

Generalized Neyman Pearson optimality of empirical likelihood for testing parameter hypotheses

Generalized Neyman Pearson optimality of empirical likelihood for testing parameter hypotheses Ann Inst Stat Math (2009) 61:773 787 DOI 10.1007/s10463-008-0172-6 Generalized Neyman Pearson optimality of empirical likelihood for testing parameter hypotheses Taisuke Otsu Received: 1 June 2007 / Revised:

More information

Bahadur representations for bootstrap quantiles 1

Bahadur representations for bootstrap quantiles 1 Bahadur representations for bootstrap quantiles 1 Yijun Zuo Department of Statistics and Probability, Michigan State University East Lansing, MI 48824, USA zuo@msu.edu 1 Research partially supported by

More information

Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion

Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion Glenn Heller and Jing Qin Department of Epidemiology and Biostatistics Memorial

More information

Jackknife Empirical Likelihood for the Variance in the Linear Regression Model

Jackknife Empirical Likelihood for the Variance in the Linear Regression Model Georgia State University ScholarWorks @ Georgia State University Mathematics Theses Department of Mathematics and Statistics Summer 7-25-2013 Jackknife Empirical Likelihood for the Variance in the Linear

More information

Empirical Likelihood Ratio Confidence Interval Estimation of Best Linear Combinations of Biomarkers

Empirical Likelihood Ratio Confidence Interval Estimation of Best Linear Combinations of Biomarkers Empirical Likelihood Ratio Confidence Interval Estimation of Best Linear Combinations of Biomarkers Xiwei Chen a, Albert Vexler a,, Marianthi Markatou a a Department of Biostatistics, University at Buffalo,

More information

Empirical Likelihood in Survival Analysis

Empirical Likelihood in Survival Analysis Empirical Likelihood in Survival Analysis Gang Li 1, Runze Li 2, and Mai Zhou 3 1 Department of Biostatistics, University of California, Los Angeles, CA 90095 vli@ucla.edu 2 Department of Statistics, The

More information

A GENERALIZED ADDITIVE REGRESSION MODEL FOR SURVIVAL TIMES 1. By Thomas H. Scheike University of Copenhagen

A GENERALIZED ADDITIVE REGRESSION MODEL FOR SURVIVAL TIMES 1. By Thomas H. Scheike University of Copenhagen The Annals of Statistics 21, Vol. 29, No. 5, 1344 136 A GENERALIZED ADDITIVE REGRESSION MODEL FOR SURVIVAL TIMES 1 By Thomas H. Scheike University of Copenhagen We present a non-parametric survival model

More information

TESTING SERIAL CORRELATION IN SEMIPARAMETRIC VARYING COEFFICIENT PARTIALLY LINEAR ERRORS-IN-VARIABLES MODEL

TESTING SERIAL CORRELATION IN SEMIPARAMETRIC VARYING COEFFICIENT PARTIALLY LINEAR ERRORS-IN-VARIABLES MODEL Jrl Syst Sci & Complexity (2009) 22: 483 494 TESTIG SERIAL CORRELATIO I SEMIPARAMETRIC VARYIG COEFFICIET PARTIALLY LIEAR ERRORS-I-VARIABLES MODEL Xuemei HU Feng LIU Zhizhong WAG Received: 19 September

More information

Extending the two-sample empirical likelihood method

Extending the two-sample empirical likelihood method Extending the two-sample empirical likelihood method Jānis Valeinis University of Latvia Edmunds Cers University of Latvia August 28, 2011 Abstract In this paper we establish the empirical likelihood method

More information

Chapter 2 Inference on Mean Residual Life-Overview

Chapter 2 Inference on Mean Residual Life-Overview Chapter 2 Inference on Mean Residual Life-Overview Statistical inference based on the remaining lifetimes would be intuitively more appealing than the popular hazard function defined as the risk of immediate

More information

Product-limit estimators of the survival function with left or right censored data

Product-limit estimators of the survival function with left or right censored data Product-limit estimators of the survival function with left or right censored data 1 CREST-ENSAI Campus de Ker-Lann Rue Blaise Pascal - BP 37203 35172 Bruz cedex, France (e-mail: patilea@ensai.fr) 2 Institut

More information

A review of some semiparametric regression models with application to scoring

A review of some semiparametric regression models with application to scoring A review of some semiparametric regression models with application to scoring Jean-Loïc Berthet 1 and Valentin Patilea 2 1 ENSAI Campus de Ker-Lann Rue Blaise Pascal - BP 37203 35172 Bruz cedex, France

More information

TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST

TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST Econometrics Working Paper EWP0402 ISSN 1485-6441 Department of Economics TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST Lauren Bin Dong & David E. A. Giles Department

More information

Working Paper No Maximum score type estimators

Working Paper No Maximum score type estimators Warsaw School of Economics Institute of Econometrics Department of Applied Econometrics Department of Applied Econometrics Working Papers Warsaw School of Economics Al. iepodleglosci 64 02-554 Warszawa,

More information

Computation of an efficient and robust estimator in a semiparametric mixture model

Computation of an efficient and robust estimator in a semiparametric mixture model Journal of Statistical Computation and Simulation ISSN: 0094-9655 (Print) 1563-5163 (Online) Journal homepage: http://www.tandfonline.com/loi/gscs20 Computation of an efficient and robust estimator in

More information

1 Glivenko-Cantelli type theorems

1 Glivenko-Cantelli type theorems STA79 Lecture Spring Semester Glivenko-Cantelli type theorems Given i.i.d. observations X,..., X n with unknown distribution function F (t, consider the empirical (sample CDF ˆF n (t = I [Xi t]. n Then

More information

Tolerance limits for a ratio of normal random variables

Tolerance limits for a ratio of normal random variables Tolerance limits for a ratio of normal random variables Lanju Zhang 1, Thomas Mathew 2, Harry Yang 1, K. Krishnamoorthy 3 and Iksung Cho 1 1 Department of Biostatistics MedImmune, Inc. One MedImmune Way,

More information

Jackknife Empirical Likelihood Test for Equality of Two High Dimensional Means

Jackknife Empirical Likelihood Test for Equality of Two High Dimensional Means Jackknife Empirical Likelihood est for Equality of wo High Dimensional Means Ruodu Wang, Liang Peng and Yongcheng Qi 2 Abstract It has been a long history to test the equality of two multivariate means.

More information

BARTLETT IDENTITIES AND LARGE DEVIATIONS IN LIKELIHOOD THEORY 1. By Per Aslak Mykland University of Chicago

BARTLETT IDENTITIES AND LARGE DEVIATIONS IN LIKELIHOOD THEORY 1. By Per Aslak Mykland University of Chicago The Annals of Statistics 1999, Vol. 27, No. 3, 1105 1117 BARTLETT IDENTITIES AND LARGE DEVIATIONS IN LIKELIHOOD THEORY 1 By Per Aslak Mykland University of Chicago The connection between large and small

More information

On robust and efficient estimation of the center of. Symmetry.

On robust and efficient estimation of the center of. Symmetry. On robust and efficient estimation of the center of symmetry Howard D. Bondell Department of Statistics, North Carolina State University Raleigh, NC 27695-8203, U.S.A (email: bondell@stat.ncsu.edu) Abstract

More information

Inference on reliability in two-parameter exponential stress strength model

Inference on reliability in two-parameter exponential stress strength model Metrika DOI 10.1007/s00184-006-0074-7 Inference on reliability in two-parameter exponential stress strength model K. Krishnamoorthy Shubhabrata Mukherjee Huizhen Guo Received: 19 January 2005 Springer-Verlag

More information

Smooth nonparametric estimation of a quantile function under right censoring using beta kernels

Smooth nonparametric estimation of a quantile function under right censoring using beta kernels Smooth nonparametric estimation of a quantile function under right censoring using beta kernels Chanseok Park 1 Department of Mathematical Sciences, Clemson University, Clemson, SC 29634 Short Title: Smooth

More information

A PRACTICAL WAY FOR ESTIMATING TAIL DEPENDENCE FUNCTIONS

A PRACTICAL WAY FOR ESTIMATING TAIL DEPENDENCE FUNCTIONS Statistica Sinica 20 2010, 365-378 A PRACTICAL WAY FOR ESTIMATING TAIL DEPENDENCE FUNCTIONS Liang Peng Georgia Institute of Technology Abstract: Estimating tail dependence functions is important for applications

More information

11. Bootstrap Methods

11. Bootstrap Methods 11. Bootstrap Methods c A. Colin Cameron & Pravin K. Trivedi 2006 These transparencies were prepared in 20043. They can be used as an adjunct to Chapter 11 of our subsequent book Microeconometrics: Methods

More information

Simple Estimators for Semiparametric Multinomial Choice Models

Simple Estimators for Semiparametric Multinomial Choice Models Simple Estimators for Semiparametric Multinomial Choice Models James L. Powell and Paul A. Ruud University of California, Berkeley March 2008 Preliminary and Incomplete Comments Welcome Abstract This paper

More information

Empirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm

Empirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm Empirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm Mai Zhou 1 University of Kentucky, Lexington, KY 40506 USA Summary. Empirical likelihood ratio method (Thomas and Grunkmier

More information

Can we do statistical inference in a non-asymptotic way? 1

Can we do statistical inference in a non-asymptotic way? 1 Can we do statistical inference in a non-asymptotic way? 1 Guang Cheng 2 Statistics@Purdue www.science.purdue.edu/bigdata/ ONR Review Meeting@Duke Oct 11, 2017 1 Acknowledge NSF, ONR and Simons Foundation.

More information

Estimation of the Bivariate and Marginal Distributions with Censored Data

Estimation of the Bivariate and Marginal Distributions with Censored Data Estimation of the Bivariate and Marginal Distributions with Censored Data Michael Akritas and Ingrid Van Keilegom Penn State University and Eindhoven University of Technology May 22, 2 Abstract Two new

More information

Full likelihood inferences in the Cox model: an empirical likelihood approach

Full likelihood inferences in the Cox model: an empirical likelihood approach Ann Inst Stat Math 2011) 63:1005 1018 DOI 10.1007/s10463-010-0272-y Full likelihood inferences in the Cox model: an empirical likelihood approach Jian-Jian Ren Mai Zhou Received: 22 September 2008 / Revised:

More information

Improving linear quantile regression for

Improving linear quantile regression for Improving linear quantile regression for replicated data arxiv:1901.0369v1 [stat.ap] 16 Jan 2019 Kaushik Jana 1 and Debasis Sengupta 2 1 Imperial College London, UK 2 Indian Statistical Institute, Kolkata,

More information

Smooth simultaneous confidence bands for cumulative distribution functions

Smooth simultaneous confidence bands for cumulative distribution functions Journal of Nonparametric Statistics, 2013 Vol. 25, No. 2, 395 407, http://dx.doi.org/10.1080/10485252.2012.759219 Smooth simultaneous confidence bands for cumulative distribution functions Jiangyan Wang

More information

Econ 273B Advanced Econometrics Spring

Econ 273B Advanced Econometrics Spring Econ 273B Advanced Econometrics Spring 2005-6 Aprajit Mahajan email: amahajan@stanford.edu Landau 233 OH: Th 3-5 or by appt. This is a graduate level course in econometrics. The rst part of the course

More information

Issues on quantile autoregression

Issues on quantile autoregression Issues on quantile autoregression Jianqing Fan and Yingying Fan We congratulate Koenker and Xiao on their interesting and important contribution to the quantile autoregression (QAR). The paper provides

More information

and Comparison with NPMLE

and Comparison with NPMLE NONPARAMETRIC BAYES ESTIMATOR OF SURVIVAL FUNCTIONS FOR DOUBLY/INTERVAL CENSORED DATA and Comparison with NPMLE Mai Zhou Department of Statistics, University of Kentucky, Lexington, KY 40506 USA http://ms.uky.edu/

More information

Asymptotic inference for a nonstationary double ar(1) model

Asymptotic inference for a nonstationary double ar(1) model Asymptotic inference for a nonstationary double ar() model By SHIQING LING and DONG LI Department of Mathematics, Hong Kong University of Science and Technology, Hong Kong maling@ust.hk malidong@ust.hk

More information

arxiv: v1 [stat.co] 26 May 2009

arxiv: v1 [stat.co] 26 May 2009 MAXIMUM LIKELIHOOD ESTIMATION FOR MARKOV CHAINS arxiv:0905.4131v1 [stat.co] 6 May 009 IULIANA TEODORESCU Abstract. A new approach for optimal estimation of Markov chains with sparse transition matrices

More information

Introduction to Empirical Processes and Semiparametric Inference Lecture 25: Semiparametric Models

Introduction to Empirical Processes and Semiparametric Inference Lecture 25: Semiparametric Models Introduction to Empirical Processes and Semiparametric Inference Lecture 25: Semiparametric Models Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and Operations

More information

Frailty Models and Copulas: Similarities and Differences

Frailty Models and Copulas: Similarities and Differences Frailty Models and Copulas: Similarities and Differences KLARA GOETHALS, PAUL JANSSEN & LUC DUCHATEAU Department of Physiology and Biometrics, Ghent University, Belgium; Center for Statistics, Hasselt

More information

Research Article Optimal Portfolio Estimation for Dependent Financial Returns with Generalized Empirical Likelihood

Research Article Optimal Portfolio Estimation for Dependent Financial Returns with Generalized Empirical Likelihood Advances in Decision Sciences Volume 2012, Article ID 973173, 8 pages doi:10.1155/2012/973173 Research Article Optimal Portfolio Estimation for Dependent Financial Returns with Generalized Empirical Likelihood

More information

Statistical Inference on Constant Stress Accelerated Life Tests Under Generalized Gamma Lifetime Distributions

Statistical Inference on Constant Stress Accelerated Life Tests Under Generalized Gamma Lifetime Distributions Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session CPS040) p.4828 Statistical Inference on Constant Stress Accelerated Life Tests Under Generalized Gamma Lifetime Distributions

More information

Goodness-of-fit tests for the cure rate in a mixture cure model

Goodness-of-fit tests for the cure rate in a mixture cure model Biometrika (217), 13, 1, pp. 1 7 Printed in Great Britain Advance Access publication on 31 July 216 Goodness-of-fit tests for the cure rate in a mixture cure model BY U.U. MÜLLER Department of Statistics,

More information

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data Malaysian Journal of Mathematical Sciences 11(3): 33 315 (217) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES Journal homepage: http://einspem.upm.edu.my/journal Approximation of Survival Function by Taylor

More information

Bickel Rosenblatt test

Bickel Rosenblatt test University of Latvia 28.05.2011. A classical Let X 1,..., X n be i.i.d. random variables with a continuous probability density function f. Consider a simple hypothesis H 0 : f = f 0 with a significance

More information

Identification and Estimation of Partially Linear Censored Regression Models with Unknown Heteroscedasticity

Identification and Estimation of Partially Linear Censored Regression Models with Unknown Heteroscedasticity Identification and Estimation of Partially Linear Censored Regression Models with Unknown Heteroscedasticity Zhengyu Zhang School of Economics Shanghai University of Finance and Economics zy.zhang@mail.shufe.edu.cn

More information

Transformation and Smoothing in Sample Survey Data

Transformation and Smoothing in Sample Survey Data Scandinavian Journal of Statistics, Vol. 37: 496 513, 2010 doi: 10.1111/j.1467-9469.2010.00691.x Published by Blackwell Publishing Ltd. Transformation and Smoothing in Sample Survey Data YANYUAN MA Department

More information

STAT 6350 Analysis of Lifetime Data. Failure-time Regression Analysis

STAT 6350 Analysis of Lifetime Data. Failure-time Regression Analysis STAT 6350 Analysis of Lifetime Data Failure-time Regression Analysis Explanatory Variables for Failure Times Usually explanatory variables explain/predict why some units fail quickly and some units survive

More information

Estimation of the Conditional Variance in Paired Experiments

Estimation of the Conditional Variance in Paired Experiments Estimation of the Conditional Variance in Paired Experiments Alberto Abadie & Guido W. Imbens Harvard University and BER June 008 Abstract In paired randomized experiments units are grouped in pairs, often

More information

Empirical likelihood based modal regression

Empirical likelihood based modal regression Stat Papers (2015) 56:411 430 DOI 10.1007/s00362-014-0588-4 REGULAR ARTICLE Empirical likelihood based modal regression Weihua Zhao Riquan Zhang Yukun Liu Jicai Liu Received: 24 December 2012 / Revised:

More information

Semiparametric Trending Panel Data Models with Cross-Sectional Dependence

Semiparametric Trending Panel Data Models with Cross-Sectional Dependence he University of Adelaide School of Economics Research Paper No. 200-0 May 200 Semiparametric rending Panel Data Models with Cross-Sectional Dependence Jia Chen, Jiti Gao, and Degui Li Semiparametric rending

More information

Interval Estimation for AR(1) and GARCH(1,1) Models

Interval Estimation for AR(1) and GARCH(1,1) Models for AR(1) and GARCH(1,1) Models School of Mathematics Georgia Institute of Technology 2010 This talk is based on the following papers: Ngai Hang Chan, Deyuan Li and (2010). Toward a Unified of Autoregressions.

More information

Birkbeck Working Papers in Economics & Finance

Birkbeck Working Papers in Economics & Finance ISSN 1745-8587 Birkbeck Working Papers in Economics & Finance Department of Economics, Mathematics and Statistics BWPEF 1809 A Note on Specification Testing in Some Structural Regression Models Walter

More information

COMPUTATION OF THE EMPIRICAL LIKELIHOOD RATIO FROM CENSORED DATA. Kun Chen and Mai Zhou 1 Bayer Pharmaceuticals and University of Kentucky

COMPUTATION OF THE EMPIRICAL LIKELIHOOD RATIO FROM CENSORED DATA. Kun Chen and Mai Zhou 1 Bayer Pharmaceuticals and University of Kentucky COMPUTATION OF THE EMPIRICAL LIKELIHOOD RATIO FROM CENSORED DATA Kun Chen and Mai Zhou 1 Bayer Pharmaceuticals and University of Kentucky Summary The empirical likelihood ratio method is a general nonparametric

More information

SEMIPARAMETRIC ESTIMATION OF CONDITIONAL HETEROSCEDASTICITY VIA SINGLE-INDEX MODELING

SEMIPARAMETRIC ESTIMATION OF CONDITIONAL HETEROSCEDASTICITY VIA SINGLE-INDEX MODELING Statistica Sinica 3 (013), 135-155 doi:http://dx.doi.org/10.5705/ss.01.075 SEMIPARAMERIC ESIMAION OF CONDIIONAL HEEROSCEDASICIY VIA SINGLE-INDEX MODELING Liping Zhu, Yuexiao Dong and Runze Li Shanghai

More information

Department of Statistics, School of Mathematical Sciences, Ferdowsi University of Mashhad, Iran.

Department of Statistics, School of Mathematical Sciences, Ferdowsi University of Mashhad, Iran. JIRSS (2012) Vol. 11, No. 2, pp 191-202 A Goodness of Fit Test For Exponentiality Based on Lin-Wong Information M. Abbasnejad, N. R. Arghami, M. Tavakoli Department of Statistics, School of Mathematical

More information

Statistical Analysis of Competing Risks With Missing Causes of Failure

Statistical Analysis of Competing Risks With Missing Causes of Failure Proceedings 59th ISI World Statistics Congress, 25-3 August 213, Hong Kong (Session STS9) p.1223 Statistical Analysis of Competing Risks With Missing Causes of Failure Isha Dewan 1,3 and Uttara V. Naik-Nimbalkar

More information

Exact Inference for the Two-Parameter Exponential Distribution Under Type-II Hybrid Censoring

Exact Inference for the Two-Parameter Exponential Distribution Under Type-II Hybrid Censoring Exact Inference for the Two-Parameter Exponential Distribution Under Type-II Hybrid Censoring A. Ganguly, S. Mitra, D. Samanta, D. Kundu,2 Abstract Epstein [9] introduced the Type-I hybrid censoring scheme

More information

Tests of independence for censored bivariate failure time data

Tests of independence for censored bivariate failure time data Tests of independence for censored bivariate failure time data Abstract Bivariate failure time data is widely used in survival analysis, for example, in twins study. This article presents a class of χ

More information

Nonparametric Inference via Bootstrapping the Debiased Estimator

Nonparametric Inference via Bootstrapping the Debiased Estimator Nonparametric Inference via Bootstrapping the Debiased Estimator Yen-Chi Chen Department of Statistics, University of Washington ICSA-Canada Chapter Symposium 2017 1 / 21 Problem Setup Let X 1,, X n be

More information

Spring 2017 Econ 574 Roger Koenker. Lecture 14 GEE-GMM

Spring 2017 Econ 574 Roger Koenker. Lecture 14 GEE-GMM University of Illinois Department of Economics Spring 2017 Econ 574 Roger Koenker Lecture 14 GEE-GMM Throughout the course we have emphasized methods of estimation and inference based on the principle

More information

Stat 710: Mathematical Statistics Lecture 31

Stat 710: Mathematical Statistics Lecture 31 Stat 710: Mathematical Statistics Lecture 31 Jun Shao Department of Statistics University of Wisconsin Madison, WI 53706, USA Jun Shao (UW-Madison) Stat 710, Lecture 31 April 13, 2009 1 / 13 Lecture 31:

More information

The Central Limit Theorem Under Random Truncation

The Central Limit Theorem Under Random Truncation The Central Limit Theorem Under Random Truncation WINFRIED STUTE and JANE-LING WANG Mathematical Institute, University of Giessen, Arndtstr., D-3539 Giessen, Germany. winfried.stute@math.uni-giessen.de

More information

Density estimators for the convolution of discrete and continuous random variables

Density estimators for the convolution of discrete and continuous random variables Density estimators for the convolution of discrete and continuous random variables Ursula U Müller Texas A&M University Anton Schick Binghamton University Wolfgang Wefelmeyer Universität zu Köln Abstract

More information

K-ANTITHETIC VARIATES IN MONTE CARLO SIMULATION ISSN k-antithetic Variates in Monte Carlo Simulation Abdelaziz Nasroallah, pp.

K-ANTITHETIC VARIATES IN MONTE CARLO SIMULATION ISSN k-antithetic Variates in Monte Carlo Simulation Abdelaziz Nasroallah, pp. K-ANTITHETIC VARIATES IN MONTE CARLO SIMULATION ABDELAZIZ NASROALLAH Abstract. Standard Monte Carlo simulation needs prohibitive time to achieve reasonable estimations. for untractable integrals (i.e.

More information

DEPARTMENT MATHEMATIK ARBEITSBEREICH MATHEMATISCHE STATISTIK UND STOCHASTISCHE PROZESSE

DEPARTMENT MATHEMATIK ARBEITSBEREICH MATHEMATISCHE STATISTIK UND STOCHASTISCHE PROZESSE Estimating the error distribution in nonparametric multiple regression with applications to model testing Natalie Neumeyer & Ingrid Van Keilegom Preprint No. 2008-01 July 2008 DEPARTMENT MATHEMATIK ARBEITSBEREICH

More information

The EM Algorithm for the Finite Mixture of Exponential Distribution Models

The EM Algorithm for the Finite Mixture of Exponential Distribution Models Int. J. Contemp. Math. Sciences, Vol. 9, 2014, no. 2, 57-64 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ijcms.2014.312133 The EM Algorithm for the Finite Mixture of Exponential Distribution

More information

A Goodness-of-fit Test for Copulas

A Goodness-of-fit Test for Copulas A Goodness-of-fit Test for Copulas Artem Prokhorov August 2008 Abstract A new goodness-of-fit test for copulas is proposed. It is based on restrictions on certain elements of the information matrix and

More information

A Simulation Study on Confidence Interval Procedures of Some Mean Cumulative Function Estimators

A Simulation Study on Confidence Interval Procedures of Some Mean Cumulative Function Estimators Statistics Preprints Statistics -00 A Simulation Study on Confidence Interval Procedures of Some Mean Cumulative Function Estimators Jianying Zuo Iowa State University, jiyizu@iastate.edu William Q. Meeker

More information

Empirical Likelihood Inference for Two-Sample Problems

Empirical Likelihood Inference for Two-Sample Problems Empirical Likelihood Inference for Two-Sample Problems by Ying Yan A thesis presented to the University of Waterloo in fulfillment of the thesis requirement for the degree of Master of Mathematics in Statistics

More information

ICSA Applied Statistics Symposium 1. Balanced adjusted empirical likelihood

ICSA Applied Statistics Symposium 1. Balanced adjusted empirical likelihood ICSA Applied Statistics Symposium 1 Balanced adjusted empirical likelihood Art B. Owen Stanford University Sarah Emerson Oregon State University ICSA Applied Statistics Symposium 2 Empirical likelihood

More information

Testing Goodness-of-Fit for Exponential Distribution Based on Cumulative Residual Entropy

Testing Goodness-of-Fit for Exponential Distribution Based on Cumulative Residual Entropy This article was downloaded by: [Ferdowsi University] On: 16 April 212, At: 4:53 Publisher: Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 172954 Registered office: Mortimer

More information

COMPARISON OF FIVE TESTS FOR THE COMMON MEAN OF SEVERAL MULTIVARIATE NORMAL POPULATIONS

COMPARISON OF FIVE TESTS FOR THE COMMON MEAN OF SEVERAL MULTIVARIATE NORMAL POPULATIONS Communications in Statistics - Simulation and Computation 33 (2004) 431-446 COMPARISON OF FIVE TESTS FOR THE COMMON MEAN OF SEVERAL MULTIVARIATE NORMAL POPULATIONS K. Krishnamoorthy and Yong Lu Department

More information

Spatially Smoothed Kernel Density Estimation via Generalized Empirical Likelihood

Spatially Smoothed Kernel Density Estimation via Generalized Empirical Likelihood Spatially Smoothed Kernel Density Estimation via Generalized Empirical Likelihood Kuangyu Wen & Ximing Wu Texas A&M University Info-Metrics Institute Conference: Recent Innovations in Info-Metrics October

More information

Marcia Gumpertz and Sastry G. Pantula Department of Statistics North Carolina State University Raleigh, NC

Marcia Gumpertz and Sastry G. Pantula Department of Statistics North Carolina State University Raleigh, NC A Simple Approach to Inference in Random Coefficient Models March 8, 1988 Marcia Gumpertz and Sastry G. Pantula Department of Statistics North Carolina State University Raleigh, NC 27695-8203 Key Words

More information

large number of i.i.d. observations from P. For concreteness, suppose

large number of i.i.d. observations from P. For concreteness, suppose 1 Subsampling Suppose X i, i = 1,..., n is an i.i.d. sequence of random variables with distribution P. Let θ(p ) be some real-valued parameter of interest, and let ˆθ n = ˆθ n (X 1,..., X n ) be some estimate

More information

Inferences for the Ratio: Fieller s Interval, Log Ratio, and Large Sample Based Confidence Intervals

Inferences for the Ratio: Fieller s Interval, Log Ratio, and Large Sample Based Confidence Intervals Inferences for the Ratio: Fieller s Interval, Log Ratio, and Large Sample Based Confidence Intervals Michael Sherman Department of Statistics, 3143 TAMU, Texas A&M University, College Station, Texas 77843,

More information

Motivation Scale Mixutres of Normals Finite Gaussian Mixtures Skew-Normal Models. Mixture Models. Econ 690. Purdue University

Motivation Scale Mixutres of Normals Finite Gaussian Mixtures Skew-Normal Models. Mixture Models. Econ 690. Purdue University Econ 690 Purdue University In virtually all of the previous lectures, our models have made use of normality assumptions. From a computational point of view, the reason for this assumption is clear: combined

More information

Statistical Inference and Methods

Statistical Inference and Methods Department of Mathematics Imperial College London d.stephens@imperial.ac.uk http://stats.ma.ic.ac.uk/ das01/ 31st January 2006 Part VI Session 6: Filtering and Time to Event Data Session 6: Filtering and

More information