arxiv: v2 [stat.me] 7 Oct 2017

Size: px

Start display at page:

Download "arxiv: v2 [stat.me] 7 Oct 2017"

Phillip Little
6 years ago
Views:

1 Weighted empirical likelihood for quantile regression with nonignorable missing covariates arxiv: v2 [stat.me] 7 Oct 2017 Xiaohui Yuan, Xiaogang Dong School of Basic Science, Changchun University of Technology, Changchun , China Abstract In this paper, we propose an empirical likelihood-based weighted estimator of regression parameter in quantile regression model with nonignorable missing covariates. The proposed estimator is computationally simple and achieves semiparametric efficiency if the probability of missingness on the fully observed variables is correctly specified. The efficiency gain of the proposed estimator over the complete-case-analysis estimator is quantified theoretically and illustrated via simulation and a real data application. Keywords: Complete-case-analysis estimator, Empirical likelihood, Nonignorable missing covariates, Quantile regression 1. Introduction Quantile regression, as introduced by Koenker and Bassett (1978), is robust against outliers and can describe the entire conditional distribution of the response variable given the covariates. Due to these advantages, quantile regression became appealing in econometrics, statistics, and biostatistics. The book by Koenker (2005) contains a comprehensive account of overview and discussions in quantile regression. Let Y denote the outcome variable, Z be a vector of covariates which is always observed, and X be a vector of covariates which may not be observed for all subjects. The quantile regression model assumes that the τ-th Corresponding author. addresses: yuanxh@ccut.edu.cn (Xiaohui Yuan), dongxiaogang@ccut.edu.cn (Xiaogang Dong) Preprint submitted to October 10, 2017

2 conditional quantile of Y given X and Z: Q τ (Y X, Z, ) = 0 + X T 1 + Z T 2 = W T, (1) where W = (1, X T, Z T ) T and = (0, 1 T, 2 T ) T is interior to parameter space Θ, Θ is a compact subset of R p. We are interested in the inference about based on a random sample of incomplete data (Y i, X T i, Z T i, δ i ), i = 1,, n, where all the Z i s and Y i s are observed, and δ i = 0 if X i is missing, otherwise δ i = 1. The most commonly used method for handling missing covariate data is the complete-case analysis (CCA), with only the remaining complete data used to perform a regression-based or likelihood-based analysis. The CCA esitmator of is given by ˆ C = arg min Θ 1 n δ i ρ τ (Y i W T i ), (2) where ρ τ (u) = u{τ I(u < 0)} is the quantile loss function and I( ) is the indicator function. In statistic literature, there are three missing data categories (Little and Rubin, 2002). The first case is missing completely at random (MCAR), i.e., data missing mechanism is independent of any observable or unobservable quantities. The second case is missing at random (MAR), i.e., data missing mechanism depends on the observed variables. The third case is not missing at random (NMAR) or nonignorable, i.e., data missing mechanism depends on their own values. When X i s are not MCAR, the CCA estimator can be biased. Consistent and efficient estimators have been proposed in the statistical literature for the quantile regression model when the covariates data are MAR. See for example, Wei et al. (2012) developed an iterative imputation procedure for estimating the conditional quantile in the presence of missing covariates. Sherwood et al. (2013) proposed an inverse probability weighted (IPW) approach to correct for the bias from longitudinal dropouts. Chen et al. (2015) examined the problem of estimation in a quantile regression model and developed three nonparametric methods when observations are missing at ran- 2

3 dom under independent and nonidentically distributed errors. Liu and Yuan (2016) proposed a weighted quantile regression model with weights chosen by empirical likelihood. This approach efficiently incorporates the incomplete data into the data analysis by combining the complete data unbiased estimating equations and incomplete data unbiased estimating equations. However, it may not be an easy task to extend these methods to deal with NMAR missing data mechanisms, because these methods are biased under the NMAR assumption. NMAR is the most difficult problem in the missing data literature. Following Little and Zhang (2011) and Bartlett et al. (2014), we make the following not missing at random (NMAR) assumption: Y δ X, Z. (3) The NMAR assumption (3) implies that, missingness in a covariate depends on the value of that covariate, but is conditionally independent of outcome. The CCA estimator is valid but inefficient under the assumption (3) because it fails to draw on the observed information contained in the incomplete cases. In the context of mean regression model, Bartlett et al. (2014) proposed an augmented CCA estimator to improve upon the efficiency of CCA estimator by modeling an additional model for the probability of missingness on the fully observed variables, i.e. P (δ = 1 Y, Z). The estimating function used in Bartlett et al. (2014) utilizes all the observed data by drawing on the information available from both complete and incomplete cases and thus improves upon the efficiency of CCA estimator. Note that under NMAR assumption (3), P (δ = 1 Y, X, Z) = P (δ = 1 X, Z), whose feasible estimators are not available, since the observations of X are missing on some subjects. Thanks to the NMAR assumption (3), there is no need to estimate P (δ = 1 X, Z) under the assumption (3). Recently, Xie and Zhang (2017) proposed an empirical likelihood approach for estimating the regression parameters in mean regression model with missing covariates under NMAR assumption (3). They showed that the empirical likelihood estimator can improve estimation efficiency if P (δ = 1 Y, Z) is correctly specified. In this paper, we put forward an empirical likelihood-based weighted (ELW) estimator for estimating quantile regression model with nonignorable missing covariates under NMAR assumption (3). To fully utilize the information contained in the incomplete data, we incorporate the unbiased estimating equations of incomplete observations into empirical likelihood and 3

4 obtain the empirical likelihood-based weights to adjust the CCA estimator defined in (2). The proposed ELW estimator is computationally simple as the CCA estimator and achieves semiparametric efficiency if P (δ = 1 Y, Z) is correctly specified. Empirical likelihood is an effective approach to improving efficiency. For a comprehensive review of the empirical likelihood method, one can refer to Qin and Lawless (1994), Owen (2001), Lopez et al. (2009) among others. For applications of empirical likelihood in missing-data problems, one can refer to Wang and Rao (2002), Qin et al. (2009), Liu and Yuan (2012), Liu et al. (2013), Zhong and Qin (2017) among others. The rest of this paper is organized as follows. In section 2, we introduce the empirical likelihood-based weighted estimator for quantile regression model. In section 3, we show that the ELW estimator is asymptotically equivalent to the profile empirical likelihood estimator and thus achieves semiparametric efficiency. Numerical studies are reported in sections 4-5. Proofs of the main theorems needed are given in the Appendix. 2. The empirical likelihood-based weighted estimation In this section, we propose the ELW estimator of under the assumption (3). Under the assumption (3), we only need to estimate the probability of X being observed given Y and Z, i.e. P (δ = 1 Y, Z). Following Bartlett et al. (2014) and Xie and Zhang (2017), we assume that P (δ = 1 Y, Z) is described by the probability model: P (δ = 1 Y, Z) = π(y, Z, γ ), (4) where γ is a q 1 unknown vector parameter. It is natural to estimate γ by the binomial likelihood estimator ˆγ which maximizes the binomial loglikelihood L B (γ) = [δ i log{π(y i, Z i, γ)} + (1 δ i ) log{1 π(y i, Z i, γ)}]. Let m(y i, Z i,, α) be a working model of E{δ i φ(x i, Z i, Y i, ) Z i, Y i } with φ(x i, Z i, Y i, ) = W i {I(Y i W T i < 0) τ}. In the following, we proposed 4

5 the ELW estimator of. Define δ i π(y i, Z i, γ) π(y i, Z i, γ) U B (δ i, Z i, Y i, γ) =, π(y i, Z i, γ){1 π(y i, Z i, γ)} γ g 1 (δ i, X i, Z i, Y i, θ) = [δ i π(y i, Z i, γ)]m(z i, Y i,, α), ( ) g1 (δ g(δ i, X i, Z i, Y i, θ) = i, X i, Z i, Y i, θ). U B (δ i, Z i, Y i, γ) Let p i represent the probability weight allocated to g(δ i, X i, Z i, Y i, ˆθ), where ˆθ = (ˆα T, ˆ T C, ˆγT ) T and ˆα is a consistent estimator for some α. If π(y, z, γ) is correctly specified, one can show that E{g(δ i, X i, Z i, Y i, θ )} = 0, where θ = (α T, T, ˆγ T ) T. Then, we maximize the empirical likelihood function n p i subject to the constraints: p i 0, p i = 1, p i g(δ i, X i, Z i, Y i, ˆθ) = 0. By using the Lagrange multiplier method, we get ˆp i = 1 1 n 1 + ˆλ T g(δ i, X i, Z i, Y i, ˆθ), where ˆλ is the Lagrange multiplier that satisfies 1 n g(δ i, X i, Z i, Y i, ˆθ) 1 + ˆλ = 0. T g(δ i, X i, Z i, Y i, ˆθ) The ELW estimator of is given by ˆ ELW = arg min Θ ˆp i δ i ρ τ (Y i W T i ). (5) Define λ(θ) = arg max λ log{1 + λ T g(δ i, X i, Z i, Y i, θ)}. (6) 5

6 From (5), it is easily seen ˆλ = λ(ˆθ). For fixed θ = ˆθ, solving (6) is a wellbehaved optimization problem since the objective function is globally concave and can be solved by a simple Newton-Raphson numerical procedure. Let F i ( ) and f i ( ) denote respectively the conditional distribution and density functions of Y i given (X i, Z i ). Denote F = E { } δ i f i (0)W i Wi T, S φ = E { δ i φ(x i, Z i, Y i, )φ T (X i, Z i, Y i, ) }, D 1 = E { [δ i π(y i, Z i, γ )] 2 m(z i, Y i,, α )m T (Z i, Y i,, α ) }, D 2 = E { [δ i π(y i, Z i, γ )]m(z i, Y i,, α )U T B(δ i, Z i, Y i, γ ) }, D 3 = E { δ i [δ i π(y i, Z i, γ )]φ(x i, Z i, Y i, )m T (Z i, Y i,, α ) }, D 4 = E { δ i φ(x i, Z i, Y i, )U T B(δ i, Z i, Y i, γ ) }, S B = E { U B (δ i, Z i, Y i, γ )U T B(δ i, Z i, Y i, γ ) }. The following regularity conditions help us in doing asymptotic analysis: C1 The τ-th conditional quantile of Y i given W i is Q τ (Y i W i, ) = Wi T and W i has a bounded support. C2 Y δ X, Z. C3 F, S φ, S B are positive definite. C4 F i ( ) is absolutely continuous and f i ( ) is uniformly bounded away from 0 and at 0. C5 (a) P (δ = 1 Y, Z) = π(y, Z, γ ) (b) inf (Y,Z) π(y, Z, γ ) c 0 for some c 0 > 0. (c) For all (Y i, Z i ), π(y i, Z i, γ) admits all third partial derivatives 3 π(y i,z i,γ) γ k γ l γ m for all γ in a neighborhood of the true value γ, 3 π(y i,z i,γ) γ k γ l γ m and π(y i, Z i, γ)/ γ 2 are bounded by an integrable function for all γ in this neighborhood. C6 For all (Y i, Z i ), m(y i, Z i,, α) admits all second partial derivatives 2 m(y i,z i,,α) i j and 2 m(y i,z i,,α) α i α j ( T, α T ) T. m(y i, Z i,, α) 2, 2 m(y i,z i,,α) i j for all and α in a neighborhood of and 2 m(y i,z i,,α) α i α j are bounded by an integrable function for all and α in this neighborhood. The asymptotic distribution of ˆ C is given by the following theorem. Theorem 2.1. Under conditions C1-C4, n 1/2 ( ˆ C ) n, where Σ C = F 1 S φf 1. 6 d N(0, Σ C ) as

7 The asymptotic distribution of ˆ ELW is given by the following theorem. Theorem 2.2. Under conditions C1-C6, n 1/2 ( ˆ ELW d ) N(0, Σ ELW ) as n, where Σ ELW = F 1 ( ) Sφ V 1 V2 1 V1 T F 1 = Σ C F 1 V 1V2 1 V1 T F 1, V 1 = D 3 D 4 S 1 B DT 2 and V 2 = D 1 D 2 S 1 B DT 2. For two matrices A and B, we write A B if B A is a nonnegativedefinite matrix. Corollary 2.3. If both F and V 2 are positive definite, we have Σ ELW Σ C, and the equality holds if and only if V 1 = 0. Corollary 2.3 reveals that ˆ ELW is at least as efficient as ˆ C for any working regression function m(y i, Z i,, α), whether or not it correctly identifies the optimal regression function E{φ(X i, Z i, Y i, ) Z i, Y i, δ i = 1}. Although ˆ ELW can be obtained easily, it is difficult to estimate the limiting covariance matrix analytically. We apply the resampling method in Liu and Yuan (2016) to the inference about. 3. Simulation studies In this section, we investigate the performance of the proposed estimator ˆ ELW and several other estimators based on Monte-Carlo simulations. The simulated data are generated by the procedure of Bartlett et al. (2014), in which the non-missing indicator δ is distributed with P (δ = 1) = 0.5, and (X, Z, Y ) is generated from a trivariate normal distribution conditional on δ: (X, Z, Y ) T δ N((δ, 0, ηδ) T, Ψ), where Ψ = (σ ab ), a, b = x, z, y, η = (σ xy σ zz σ xz σ zy )υ 1 and υ 1 = (σ xx σ zz σ 2 xz) 1. It is easy to verify that the assumption δ Y (X, Z) is satisfied in this setup. Conditional on Z and Y, the probability of P (δ = 1 Z, Y ) is a logistic regression with P (δ = 1 Z, Y ) = exp(γ 0 + γ 1 Z + γ 2 Y ) 1 + exp(γ 0 + γ 1 Z + γ 2 Y ) 7

8 where γ 0 = 0.5η 2 σ zz υ 2, γ 1 = ησ zy υ 2, γ 2 = ησ zz υ 2 and υ 2 = (σ zz σ yy σ 2 zy) 1. The conditional quantile model of interest is specified as Q τ (Y X, Z) = X + 2 Z, with 0 = Φ 1 (τ, σ 2 ), 1 = (σ xy σ zz σ xz σ zy )υ 1, 2 = (σ zy σ xx σ xz σ xy )υ 1, σ 2 = σ yy (σ 2 xzσ zz 2σ 2 xzσ zy + σ 2 zyσ xx )υ 1. We set σ xx = σ yy = σ zz = 1, σ xz = σ xy = σ zy = 0.5 and generate 1000 Monte Carlo data sets of sample sizes n = 100 and 300. Five estimators are considered: 1. ˆideal : the quantile regression estimator with the full observations. This is the ideal case, but it is not feasible in practice. Nevertheless, we used it as a benchmark for comparison; 2. ˆC : the CCA estimator defined in equation (2); 3. ˆIP W MAR : the IPW estimator assuming MAR, introduced in Sherwood et al. (2013); 4. ˆELW MAR : the ELW estimator assuming MAR, proposed by Liu and Yuan (2016); 5. ˆELW : the ELW estimator defined in equation (5). The empirical bias and the root-mean-squared errors (RMSEs) of the proposed estimators with sample sizes of 100 and 300 are reported in Table 1. The results can be summarized as follows: the CCA estimator ˆ C and the ELW estimator ˆ ELW are unbiased as expected. While ˆ IP W MAR and ˆ ELW MAR for 0 are clearly biased. ˆELW performs better than ˆ C in terms of RMSE in most cases, which agrees with our theory. ˆC and ˆ ELW are improved in terms of RMSE as the sample size n goes up from 100 to Data analysis In this section, we apply the proposed method to the data on alcohol consumption, age, body mass index and systolic blood pressure from the NHANES. We model the population quantile of SBP (systolic blood pressure) as a function of the following four covariates: BMI (body mass index), Alcohol (log{alcohol consumption per day+1}), Age ({age 50}/10) and Age 2 ({age 50} 2 /100). In our analysis, there are 7104 observations in the data set, where the dependent variable SBP and the covariates BMI and Age have complete data, 8

9 the covariate Alcohol are missing 53.29%. It is a priori plausible that missingness in Alcohol is primarily dependent on the value of itself (i.e. MNAR), and that missingness in Alcohol is independent of SBP conditional on Alcohol, BMI, Age, and Age 2. Consequently, CCA is expected to give valid inferences, while the MAR assumption likely does not hold. For i = 1,, n = 7104, let Y i denote the ith observation of Y =SBP, Z i denote the ith observation of Z=(BMI, Age, Age 2 ) T and X i denote the ith observation of X =Alcohol. Then, we consider the following model for the τth conditional quantile of Y i given W i = (1, X i, Z T i ) T : Q τ (Y i X i, Z i, ) = 0 + X i 1 + Z T i 2, i = 1,, n, where = ( 0, 1, T 2 ) T and 2 = ( 21, 22, 23 ) T. We consider two estimators ˆ C and ˆ ELW. For the ELW method, the probability of whether the Alcohol is observed is modeled by π(y, Z, γ) = {1 + exp( γ 0 Y γ 1 Z T γ 2 )} 1. In Figure 1, we plot the estimated regression coefficients, ˆ C and ˆ ELW for 1, 21, 22 and 23, at quantile levels τ = 0.1, 0.2,, 0.9. We see that the CCA and ELW methods produce similar estimated regression coefficients. In Figure 2, we plot the standard errors of ˆ C and ˆ ELW for 1, 21, 22 and 23 at various quantile levels. The standard error of ˆ ELW is smaller than that of ˆ C in most cases. 5. Conclusions In this paper, we develop weighted empirical likelihood approach for estimating the conditional quantile functions in linear models with nonignorable missing covariates. By incorporating the unbiased estimating equations of incomplete data into empirical likelihood, the ELW estimator can achieve semiparametric efficiency if the probability of missingness is correctly specified. We will extend the proposed methods to other regression models, which will be investigated in the future work. Acknowledgements Xiaohui Yuan was partly supported by the NSFC (No , , ). Xiaogang Dong was partly supported by the NSFC (No ). 9

10 6. Appendix In the section, we list a preliminary lemma which has been used in the proofs of the main results in section 2. Lemma 6.1. Under conditions C1-C5, we have ˆλ = λ(ˆθ) = n 1 S 1 g where λ(θ) is defined in (6). [ Ug (θ ) + G γ S 1 B U(γ ) ] + o p (n 1/2 ), The proof of Lemma 6.1 By Lemma A.2 in Liu and Yuan (2016), we have { 1 ˆθ)} 1 ˆλ = g(δ i, X i, Z i, Y i, n ˆθ)g T (δ i, X i, Z i, Y i, n 1 U g (ˆθ) + o p (n 1/2 ), where U g (θ) = n g(δ i, X i, Z i, Y i, θ). By a Taylor expansion, n 1 U g (ˆθ) = n 1 U g (θ ) + n 1 U g( θ) α T (ˆα α ) + n 1 U g( θ) ( ˆ T C ) + n 1 U g( θ) (ˆγ γ ), γ T where θ is a point on the segment connecting ˆθ and θ. By the law of large numbers, we have 1 n ( g(δ i, X i, Z i, Y i, ˆθ)g T (δ i, X i, Z i, Y i, ˆθ) p D1 D 2 n 1 U g( θ) γ T n 1 U g( θ) α T D T 2 S B { } ( ) p g(δi, X i, Z i, Y i, θ ) D2 E = = G γ T S γ, B p 0, n 1 U g( θ) T p 0. ) = S g, By the asymptotic properties of maximum likelihood estimate, we have ˆγ γ = n 1 S 1 B U(γ ) + o p (n 1/2 ), (8) (7) 10

11 where U(γ ) = n U B(δ i, Z i, Y i, γ ). Thus by (7) and (8), { } ˆλ = Sg 1 n 1 U g (θ ) + n 1 U g( θ) (ˆγ γ ) + o γ T p (n 1/2 ) [ = n 1 Sg 1 Ug (θ ) + G γ S 1 B U(γ ) ] + o p (n 1/2 ). The desired result follows. The proof of Theorem 2.1 The proof is similar to the proof of Theorem 4.1 in Koenker (2005, page 120). The proof of Theorem 2.2 For i = 1,, n, let A i (η) = ρ τ (ε i W T i η/ n) ρ τ (ε i ), where ε i = Y i Wi T. The function A(η) = n nˆp iδ i A i (η) is convex and is minimized at ˆη = n( ˆ ELW ). Following Knight s identity (Knight,1998) ρ(u v) ρ(u) = v[i(u < 0) τ] + we can write A(η) = A 1 (η) + A 2 (η), where v 0 [I(u s) I(u 0)]ds, A 1 (η) = n 1/2 η T nˆp i δ i φ(x i, Z i, Y i, ), (9) A 2 (η) = w T i η/ nˆp i δ i 0 n {I(ε i s) I(ε i 0)}ds. (10) We first give the asymptotic expression of (9). Applying a Taylor expansion, we get nˆp i δ i A 1i (η) = η T n 1/2 δ i φ(x i, Z i, Y i, ) η T n 1 δ i φ(x i, Z i, Y i, )g T (δ i, X i, Z i, Y i, θ )n 1/2ˆλ + op (1). 11

12 By the law of large numbers, we have n 1 δ i φ(x i, Z i, Y i, )g T (δ i, X i, Z i, Y i, θ p ) F g = ( ) D 3 D 4. (11) By Lemma 6.1, nˆp i δ i A 1i (η) = η T n 1/2 δ i φ(x i, Z i, Y i, ) η T F g n 1/2ˆλ + op (1) = η T n 1/2 { U φ (θ ) F g S 1 g [ Ug (θ ) + G γ S 1 B U(γ ) ]} + o p (1), where U φ (θ ) = n δ iφ(x i, Z i, Y i, ). Next, we give the asymptotic expression of (10). reveals that nˆp i δ i A 2i (η) = δ i A 2i (η) A Taylor expansion A 2i (η)δ i g T (δ i, X i, Z i, Y i, θ )ˆλ + o p (1). Moreover, similar to the proof of Theorem 4.1 in Koenker(2005), one can show that δ i A 2i (η) = 1 2 ηt F η + o p (1). Thus, we only need to show that n A 2i(η)δ i g T (δ i, X i, Z i, Y i, θ )ˆλ is asymptotically negligible. By Lemma 6.1 and Lemma D.2 in Kitamura et al. (2004), we have ˆλ = O p (n 1/2 ) and max 1 i n { g(δ i, X i, Z i, Y i, θ ) } = o p (n 1/2 ). Then, A 2i (η)δ i g T (δ i, X i, Z i, Y i, θ )ˆλ max { g(δ i, X i, Z i, Y i, θ ) } ˆλ δ i A 2i (η) = o p (1). 1 i n 12

13 d By the asymptotic expressions of (9) and (10), we conclude that A(η) A 0 (η), where A 0 (η) = η T n { [ 1/2 U φ (θ ) F g Sg 1 Ug (θ ) + G γ S 1 B U(γ ) ]} ηt F η. Then, it follows that n( ˆELW ) = ˆη d arg min η A 0 (η), where arg min A 0 (η) = F { [ 1 η n 1/2 U φ (θ ) F g Sg 1 Ug (θ ) + G γ S 1 B U(γ ) ]}. Furthermore, by simple algebra, one can verify that and ( ) ( D D3 D 1 D 2 4 D2 T S B ) 1 = ( V 1 V2 1 D 4 S 1 B V 1V2 1 D 2 S 1 B ) = U g (θ ) + G γ S 1 B U(γ ) ( n [g 1(δ i, X i, Z i, Y i, θ ) D 2 S 1 B U B(δ i, Z i, Y i, γ )] 0 ). Therefore, Let [ F g Sg 1 Ug (θ ) + G γ S 1 = ( D 3 D 4 ) ( D 1 D 2 = V 1 V 1 2 D T 2 B U(γ ) ] S B ) 1 [ Ug (θ ) + G γ S 1 B U(γ ) ] [ g1 (δ i, X i, Z i, Y i, θ ) D 2 S 1 B U B(δ i, Z i, Y i, γ ) ]. h 1i = δ i φ(x i, Z i, Y i, ), h 2i = g 1 (δ i, X i, Z i, Y i, θ ) D 2 S 1 B U B(δ i, Z i, Y i, γ ). 13

14 One can write arg min η A 0 (η) as F 1 n 1/2 n { } h1i V 1 V2 1 h 2i. It is easily verified that V ar (h 1i ) = E(h 1i h T 1i) = S φ, E(h 1i h T 2i) = E(h 1i g1 T (δ i, X i, Z i, Y i, θ )) E(h 1i UB(δ T i, Z i, Y i, γ ))S 1 B DT 2 = D 3 D 4 S 1 B DT 2, and V ar (h 2i ) = V ar(g 1 (δ i, X i, Z i, Y i, θ )) + D 2 S 1 B V ar(u B(δ i, Z i, Y i, γ ))S 1 B DT 2 E[g 1 (δ i, X i, Z i, Y i, θ )UB(δ T i, Z i, Y i, γ )]S 1 B DT 2 D 2 S 1 B E[U B(δ i, Z i, Y i, γ )g1 T (δ i, X i, Z i, Y i, θ )] = D 1 D 2 S 1 B DT 2. Thus, V ar ( ) h 1i V 1 V2 1 h 2i = V ar(h 1i ) V 1 V2 1 E(h 2i h T 1i) E(h 1i h T 2i)V2 1 V1 T + V 1 V2 1 V ar(h 2i )V2 1 V1 T = S φ V 1 V2 1 (D 3 D 4 S 1 B DT 2 ) T (D 3 D 4 S 1 B DT 2 )V2 1 V1 T +V 1 V2 1 (D 1 D 2 S 1 B DT 2 )V2 1 V1 T = S φ V 1 V 1 2 V T 1. The desired result follows by the central limit theorem. The proof of Theorem?? According to the proof of Theorem 1 of Lopez et al.(2009), it can be shown that n 1/2 ( ˆEL ˆγ EL γ where Σ EL = ( S T 1 S 1 2 S 1 ) 1, S1 = ( D1 D Let E 22 = 2 D T 2 S B defined in (11) We know that ( S2 1 = ) d N (0, Σ EL ), F 0 0 D 2 0 S B ), then we write S 2 = E E 1 and S 2 = S φ D 3 D 4 D3 T D 1 D 2 D4 T D2 T S B. ( ) Sφ F g Fg T, where F E g is F g E F g E22 1 E 1 22 F T g E E E 1 22 F T g E 1 ), 14

15 with E 11.2 = S φ F g E 1 22 F T g = S φ V 1 V 1 2 V T 1 D 4 S 1 B DT 4. Note that S T 1 S 1 2 S 1 can be written as = = S1 T S 1 2 S 1 ( F E11.2F 1 F E11.2F 1 g E22 1 D2 S B (D2 T ( F E11.2F 1 F E 1 D4 T E11.2F 1 S B + D4 T E 1 S B )E 1 22 F T g E F (D T 2 S B ){E E 1 22 F T g E F g E D D 4 ) ( H11 H = 12 H 21 H 22 ), ) ( D2 22 } S B ) where H 11 = F E11.2F 1, H 12 = F E11.2D 1 4, H 21 = H12 T and H 22 = S B + D4 T E11.2D 1 4. Therefore, we have (S T 1 S 1 2 S 1 ) 1 = = ( ) 1 H11 H 12 H 21 H 22 ( H H11 1 H 12 H H 21 H11 1 H 1 H22.1H 1 21 H11 1 H H 12 H ), where H 22.1 = H 22 H 21 H 1 11 H 12 = S B. By direct calculation, it follows that H H11 1 H 12 H22.1H 1 21 H11 1 E 11.2F 1 D 4S 1 = F 1 = F 1 S φf 1 = F 1 S φf 1 = Σ ELW, + F 1 F 1 F 1 V 1V 1 V 1V 1 B DT 4 F 1 2 V1 T F 1 2 V1 T F 1 F 1 D 4S 1 B DT 4 F 1 + F 1 D 4S 1 B DT 4 F 1 and H11 1 H 12 H = F 1 result follows. E 11.2F 1 F E D 4 S 1 B = F 1 D 4S 1. The desired B Reference References Bartlett J W, Carpenter J R, Tilling K, et al Improving upon the efficiency of complete case analysis when covariates are MNAR[J]. Biostatistics, 15(4):

16 Chen X, Wan A T K, Zhou Y Efficient quantile regression analysis with missing observations[j]. Journal of the American Statistical Association, 110(510): Kitamura Y, Tripathi G, Ahn H Empirical likelihood-based inference in conditional moment restriction Models[J]. Econometrica, 72(6): Knight K Limiting distributions for L 1 regression estimators under general conditions[j]. Annals of Statistics, 26: Koenker, R. and Bassett, G Regression quantiles. Econometrica,46, Koenker R Quantile regression[m]. Cambridge university press. Little RJA, Rubin DB 2002 Statistical analysis with missing data, 2nd ed, Wiley, Hoboken, NJ. Little, R.J., Zhang, N Subsample ignorable likelihood for regression analysis with missing data. J R Stat Soc Ser C 60: Liu, T. and Yuan, X Combining quasi and empirical likelihoods in generalized linear models with missing responses. Journal of Multivariate Analysis 111, Liu, T., Yuan, X., Li, Z. and Li, Y Empirical and weighted conditional likelihoods for matched case-control studies with missing covariates. Journal of Multivariate Analysis 119, Liu T, Yuan X Weighted quantile regression with missing covariates using empirical likelihood[j]. Statistics A Journal of Theoretical and Applied Statistics, 50(1): Lopez EMM, Keilegom IV, Veraverbeke N (2009) Empirical likelihood for non-smooth criterion functions. Scand J Stat 36: Owen, A. B Empirical Likelihood. Chapman and Hall, New York. Qin J, Lawless J 1994 Empirical likelihood and general estimating equations. Ann Stat 22(1):

17 Qin, J., Zhang, B. and Leung, D. H Empirical likelihood in missing data problems[j]. Journal of the American Statistical Association, 104: Sherwood B, Wang L, Zhou X H Weighted quantile regression for analyzing health care cost data with missing covariates.[j]. Statistics in Medicine, 32(28): Wang, Q., and Rao, J. N. K Empirical likelihood-based inference under imputation for missing response data[j]. Annals of Statistics, 30: Wei Y, Ma Y, Carroll R J Multiple imputation in quantile regression[j]. Biometrika, 99(2): Xie, Y., Zhang, H Empirical likelihood in nonignorable covariatemissing data problems. Int. J. Biostat 13(1): Zhong G, Qin J Empirical likelihood method for non-ignorable missing data problems. Lifetime Data Anal 23(1):

18 Table 1: Empirical bias and RMSE (in parentheses) based on 1000 simulations with n = 100, 300. τ n Estimator ˆideal (0.1185) (0.1095) (0.1272) ˆ C (0.2403) (0.1851) (0.1853) ˆ IP W MAR (0.3065) (0.2128) (0.2077) ˆ ELW MAR (0.3021) (0.2144) (0.1945) ˆ ELW (0.2446) (0.1875) (0.1752) 300 ˆideal (0.0685) (0.0617) (0.0676) ˆ C (0.1332) (0.1031) (0.1016) ˆ IP W MAR (0.2188) (0.1167) (0.1150) ˆ ELW MAR (0.2079) (0.1116) (0.0968) ˆ ELW (0.1252) (0.1002) (0.0873) ˆideal (0.1128) (0.0997) (0.1187) ˆ C (0.2347) (0.1781) (0.1765) ˆ IP W MAR (0.2761) (0.1851) (0.1850) ˆ ELW MAR (0.2794) (0.1814) (0.1649) ˆ ELW (0.2326) (0.1685) (0.1617) 300 ˆideal (0.0648) (0.0608) (0.0679) ˆ C (0.1274) (0.0979) (0.0973) ˆ IP W MAR (0.2040) (0.1036) (0.1003) ˆ ELW MAR (0.2007) (0.1002) (0.0891) ˆ ELW (0.1238) (0.0954) (0.0889) ˆideal (0.1216) (0.1147) (0.1212) ˆ C (0.2471) (0.1864) (0.1791) ˆ IP W MAR (0.2839) (0.1845) (0.1790) ˆ ELW MAR (0.2901) (0.1868) (0.1694) ˆ ELW (0.2498) (0.1870) (0.1795) 300 ˆideal (0.0706) (0.0630) (0.0708) ˆ C (0.1371) (0.1008) (0.1028) ˆ IP W MAR (0.2007) (0.1046) (0.1033) ˆ ELW MAR (0.1983) (0.1006) (0.0935) ˆ ELW (0.1325) (0.0969) (0.0964) 18

19 Figure 1: The estimated regression coefficients, ˆ C (-.) and ˆ ELW ( ) at various quantile levels. 19

20 Figure 2: The standard errors of ˆ C (-.) and ˆ ELW levels. ( ) at various quantile 20

Modification and Improvement of Empirical Likelihood for Missing Response Problem

UW Biostatistics Working Paper Series 12-30-2010 Modification and Improvement of Empirical Likelihood for Missing Response Problem Kwun Chuen Gary Chan University of Washington - Seattle Campus, kcgchan@u.washington.edu