Nuisance parameter elimination for proportional likelihood ratio models with nonignorable missingness and random truncation

Similar documents
A note on profile likelihood for exponential tilt mixture models

Modification and Improvement of Empirical Likelihood for Missing Response Problem

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

STAT331. Cox s Proportional Hazards Model

Survival Analysis for Case-Cohort Studies

Sample size calculations for logistic and Poisson regression models

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates

Introduction An approximated EM algorithm Simulation studies Discussion

Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion

Cox s proportional hazards model and Cox s partial likelihood

Semiparametric Regression

7 Sensitivity Analysis

A unified framework for studying parameter identifiability and estimation in biased sampling designs

Tests of independence for censored bivariate failure time data

Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design

Classification. Chapter Introduction. 6.2 The Bayes classifier

Introduction to mtm: An R Package for Marginalized Transition Models

Power and Sample Size Calculations with the Additive Hazards Model

PQL Estimation Biases in Generalized Linear Mixed Models

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing

arxiv: v2 [stat.me] 8 Jun 2016

Multivariate Regression

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations

ON SMALL SAMPLE PROPERTIES OF PERMUTATION TESTS: INDEPENDENCE BETWEEN TWO SAMPLES

Charles E. McCulloch Biometrics Unit and Statistics Center Cornell University

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

On robust and efficient estimation of the center of. Symmetry.

Estimation of Conditional Kendall s Tau for Bivariate Interval Censored Data

A weighted simulation-based estimator for incomplete longitudinal data models

General Regression Model

Estimation in Generalized Linear Models with Heterogeneous Random Effects. Woncheol Jang Johan Lim. May 19, 2004

Modeling the scale parameter ϕ A note on modeling correlation of binary responses Using marginal odds ratios to model association for binary responses

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data

Chapter 1 Statistical Inference

GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS

Multivariate Survival Analysis

6 Pattern Mixture Models

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness

Analysis of Matched Case Control Data in Presence of Nonignorable Missing Exposure

Miscellanea A note on multiple imputation under complex sampling

Minimum distance estimation for the logistic regression model

The consequences of misspecifying the random effects distribution when fitting generalized linear mixed models

Semiparametric Generalized Linear Models

MONTE CARLO ANALYSIS OF CHANGE POINT ESTIMATORS

Testing Homogeneity Of A Large Data Set By Bootstrapping

Some methods for handling missing values in outcome variables. Roderick J. Little

Semiparametric Mixed Effects Models with Flexible Random Effects Distribution

Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach

Outline of GLMs. Definitions

Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs

Testing for Ordered Failure Rates under General Progressive Censoring

Published online: 10 Apr 2012.

Part [1.0] Measures of Classification Accuracy for the Prediction of Survival Times

Regression analysis of biased case control data

Stat 5101 Lecture Notes

TESTS FOR EQUIVALENCE BASED ON ODDS RATIO FOR MATCHED-PAIR DESIGN

Missing Covariate Data in Matched Case-Control Studies

Chapter 12 - Lecture 2 Inferences about regression coefficient

TESTS FOR LOCATION WITH K SAMPLES UNDER THE KOZIOL-GREEN MODEL OF RANDOM CENSORSHIP Key Words: Ke Wu Department of Mathematics University of Mississip

BIOS 2083 Linear Models c Abdus S. Wahed

A Sampling of IMPACT Research:

Chapter 2 Inference on Mean Residual Life-Overview

Simple and Fast Overidentified Rank Estimation for Right-Censored Length-Biased Data and Backward Recurrence Time

Lecture 5: LDA and Logistic Regression

and Comparison with NPMLE

University of California, Berkeley

FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH

Biostatistics Workshop Longitudinal Data Analysis. Session 4 GARRETT FITZMAURICE

Introduction to Statistical Analysis

Likelihood-based inference with missing data under missing-at-random

Integrated likelihoods in survival models for highlystratified

GEE for Longitudinal Data - Chapter 8

An Approximate Test for Homogeneity of Correlated Correlation Coefficients

Latent Variable Model for Weight Gain Prevention Data with Informative Intermittent Missingness

Semiparametric Inferential Procedures for Comparing Multivariate ROC Curves with Interaction Terms

,..., θ(2),..., θ(n)

Gauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA

Inferences on a Normal Covariance Matrix and Generalized Variance with Monotone Missing Data

Figure 36: Respiratory infection versus time for the first 49 children.

Interval Estimation for Parameters of a Bivariate Time Varying Covariate Model

Review of Econometrics

Statistical Practice

β j = coefficient of x j in the model; β = ( β1, β2,

LOCAL LINEAR REGRESSION FOR GENERALIZED LINEAR MODELS WITH MISSING DATA

Robust covariance estimator for small-sample adjustment in the generalized estimating equations: A simulation study

Spring 2012 Math 541B Exam 1

TECHNICAL REPORT # 59 MAY Interim sample size recalculation for linear and logistic regression models: a comprehensive Monte-Carlo study

Quantile Regression for Residual Life and Empirical Likelihood

Bayesian inference for sample surveys. Roderick Little Module 2: Bayesian models for simple random samples

Asymptotic equivalence of paired Hotelling test and conditional logistic regression

Discussion of the paper Inference for Semiparametric Models: Some Questions and an Answer by Bickel and Kwon

1. Introduction This paper focuses on two applications that are closely related mathematically, matched-pair studies and studies with errors-in-covari

BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY

On the Breslow estimator

TECHNICAL REPORT Fixed effects models for longitudinal binary data with drop-outs missing at random

Analysis of variance, multivariate (MANOVA)

Generalized Multivariate Rank Type Test Statistics via Spatial U-Quantiles

Reconstruction of individual patient data for meta analysis via Bayesian approach

Least Absolute Value vs. Least Squares Estimation and Inference Procedures in Regression Models with Asymmetric Error Distributions

The Logit Model: Estimation, Testing and Interpretation

Transcription:

Biometrika Advance Access published October 24, 202 Biometrika (202), pp. 8 C 202 Biometrika rust Printed in Great Britain doi: 0.093/biomet/ass056 Nuisance parameter elimination for proportional likelihood ratio models with nonignorable missingness and random truncation BY KWUN CHUEN GARY CHAN Department of Biostatistics, University of Washington, Seattle, Washington 9895, U.S.A. kcgchan@u.washington.edu SUMMARY We show that the proportional likelihood ratio model proposed recently by Luo & sai (202) enjoys model-invariant properties under certain forms of nonignorable missing mechanisms and randomly doubletruncated data, so that target parameters in the population can be estimated consistently from those biased samples. We also construct an alternative estimator for the target parameters by maximizing a pseudolikelihood that eliminates a functional nuisance parameter in the model. he corresponding estimating equation has a U-statistic structure. As an added advantage of the proposed method, a simple score-type test is developed to test a null hypothesis on the regression coefficients. Simulations show that the proposed estimator has a small-sample efficiency similar to that of the nonparametric likelihood estimator and performs well for certain nonignorable missing data problems. Some key words: Double truncation; Nonignorable missingness; Pairwise pseudolikelihood; U-statistic.. INRODUCION Regression modelling is popular in data analysis. However, challenges arise when the response has a finite range with a substantial portion of data taking boundary values, when its distribution is multimodal and asymmetric, or when the relationship between response and covariates is nonlinear. o handle regression modelling with an unknown response distribution and other such practical concerns, Luo & sai (202) proposed a semiparametric proportional likelihood ratio model, which is a parsimonious alternative to existing nonparametric models. he semiparametric model relates a response Y and a p covariate vector X in the following way: f (y x) = xy) g(y) xy) dg(y), () where β is a Euclidean parameter of interest and G(y) is an unspecified baseline distribution function with density function g(y) with respect to some dominating measure. For arbitrary y and y 2 within the support of G, wehave log f (y 2 x) f (y x) = log g(y 2) g(y ) + β x(y 2 y ). (2) If X is one-dimensional and the response Y is binary, then β = log f ( x + )/f (0 x + ), f ( x)/f (0 x) which is exactly the log odds ratio. herefore, β is a generalization of the log odds ratio for an arbitrary response. For vector-valued X, it follows from (2) that the jth component of β is the loglikelihood ratio that the response increases by one unit given that the jth component of x increases by one unit, while all

2 K. C. G. CHAN the other components of x are held fixed. here is no intercept term in this model, because an intercept would be contained in G. Luo & sai (202) discussed extensively the connection of model () with generalized linear models (Nelder & Wedderburn, 972), density ratio models (Qin & Zhang, 997), biased sampling models (Gilbert et al., 999), single-index models (Ichimura, 993) and exponential tilt regression models (Rathouz & Gao, 2009). Huang & Rathouz (202) explicitly modelled the mean function for a proportional likelihood ratio model with discrete covariates. We will show that model () is a robust extension of the generalized linear model for certain missing data problems and doubly truncated data. For model (), Luo & sai (202) proposed a nonparametric maximum likelihood method that estimates β and G simultaneously by an iterative algorithm, as usual likelihood arguments do not lead to elimination of the nuisance function G(y). We present an alternative method for estimating β, which eliminates the unknown function G based on conditioning on the rank of responses (Kalbfleisch, 978) in a pairwise fashion (Liang & Qin, 2000). 2. INVARIANCE PROPERIES UNDER MISSINGNESS AND RUNCAION Missing data are common in practice for reasons that are often beyond the control of investigators. Suppose that we fail to observe response Y for some observations and covariates X for some possibly different observations. Let R Y and R X be the indicators that Y and X are being observed, and let R = R X R Y be the indicator of both being observed. We refer to an observation with R = as a complete case; for such an observation, the conditional density is f (y i x i, r i = ) = pr(r i = y i, x i ) f (y i x i ) pr(ri = y, x i ) f (y x i ) dy. (3) As seen from (3), inference for β from complete case observations often requires correct specification of the conditional probability of R given (Y, X). However, when pr(r = Y, X) depends on the unobserved data, the missing mechanism is nonignorable (Rubin, 976). When pr(r Y, X) is arbitrary, inference based on (3) requires untestable assumptions on the missing data mechanism. We assume the following factorization of the missing data model: pr(r = Y = y, X = x) = h (x)h 2 (y) (4) for arbitrary functions h ( ) and h 2 ( ). his assumption will be satisfied for the following examples, which include cases where missingness is nonignorable. Example. Suppose that response Y is subject to missingness but covariate X is completely observed; that is, R X. (a) Suppose that the missingness depends only on unobserved Y,pr(R Y = Y, X) = π a (Y ); then (4) is satisfied with h (x) = and h 2 (y) = π a (y). (b) Suppose that the missingness depends only on observed X,pr(R Y = Y, X) = π b (X); then (4)is satisfied with h (x) = π b (x) and h 2 (y) =. Example 2. Suppose that response Y is completely observed but covariate X is subject to missingness; that is, R Y. (a) Suppose that the missingness depends only on observed Y,pr(R X = Y, X) = π 2a (Y ); then (4)is satisfied with h (x) = and h 2 (y) = π 2a (y). (b) Suppose that the missingness depends only on unobserved X,pr(R X = Y, X) = π 2b (X); then (4) is satisfied with h (x) = π 2b (x) and h 2 (y) =. Example 3. Here both the response Y and the covariate X may be missing. (a) Suppose that the missing data mechanism for both Y and X depends only on response Y, i.e. pr(r X =, R Y = Y, X) = π 3a (Y ); then (4) is satisfied with h (x) = and h 2 (y) = π 3a (y).

Miscellanea 3 (b) Suppose that the missing data mechanism for both Y and X depends only on covariate X, i.e., pr(r X =, R Y = Y, X) = π 3b (X); then (4) is satisfied with h (x) = π 3b (X) and h 2 (y) =. (c) Suppose that the missing data mechanism for Y depends only on Y and the missing data mechanism for X depends only on X, i.e., pr(r Y = Y, X) = π 3cy (Y ) and pr(r X = Y, X) = π 3cx (X). Suppose further that R X and R Y are conditionally independent given (Y, X). hen (4) issatisfied with h (x) = π 3cx (x) and h 2 (y) = π 3cy (y). Except for Examples (b) and 2(a), all the cases have nonignorable missing data. We now state an invariance property for the proportional likelihood ratio model. Property. If (4) is satisfied, then f (y x, r = ) = xy) dg (y) xy) dg (y) where G (y) = y h 2(y ) dg(y ) and β is the same as in (). Property can be shown by combining (), (3) and (4), and it generalizes the model-invariant property of the logistic regression model under case-control sampling (Prentice & Pyke, 979). Suppose that a binary response Y follows a logistic regression model logit{pr(y = X = x)}=α + β x. his model is a special case of model () where the baseline distribution follows a Bernoulli distribution with success probability exp(α)/{ + exp(α)}. Case-control sampling is a special case of Example 2(a) with pr(r X = Y = y) = π y (y = 0, ). It is well known that case-control data also follow a logistic regression model with the same β coefficients as in the population model but a different intercept α = α + log(π /π 0 ). he second invariance property is for randomly truncated data, which are often collected in medical studies. Suppose that the response Y is observed only when Y 2, where and 2 are random variables indicating the left and right truncation times. When Y is a time-to-event response, left truncation is often observed in cross-sectional sampling, where prevalent subjects have to be surviving at the recruitment time to be eligible to enter the study. Right truncation often occurs in retrospective sampling, where subjects are only observed if the failure event happens before the sampling time. Let f, 2 (t, t 2 ) be the joint distribution of (, 2 ), and let f S (y x) be the sampling conditional density for the randomly truncated data. Property 2. If (, 2 ) is independent of (Y, X), then f S (y x) = xy) dg 2 (y) xy) dg 2 (y) where G 2 (y) = y y f, 2 (t, t 2 ) dt 2 dt dg(y ) and β is the same as in (). A derivation of Property 2 is given in the Appendix. If the population follows a proportional likelihood ratio model, randomly truncated data will also follow such a model, with the same target parameters as in the population model. Randomly truncated data can be used directly for estimating β, even though the sample is biased. Property 2 holds under an independent truncation condition and is not true in general when the truncation distribution is covariate-dependent. We will discuss covariate-dependent truncation in 5. 3. PAIRWISE CONDIIONING AND ESIMAION In this section, we propose an alternative method for estimating β in model (), which eliminates the functional nuisance parameter G(y). By the invariance properties and 2 discussed in 2, the method also estimates the target parameters β in certain nonignorable missing data problems and for randomly truncated responses.

4 K. C. G. CHAN Suppose that we observe (y i, x i )(i =,...,n), which are independent and identically distributed as the random variables (Y, X) where the conditional density function of Y given X follows model (). Let y = (y,...,y n ) and x = (x,...,x n ), and let y ( ) = (y (),...,y (n) ) be the order statistics for (y,...,y n ). Kalbfleisch (978) showed that n i= f (y x, y ( ) ) = f (y i x i ) n j σ l= f (y j l x jl ), where j = ( j,..., j n ) and σ is the set of permutations of the integers {,...,n}. For model (), we get f (y x, y ( ) ) = n i= x i y i ) j σ n l= x j l y jl ), (5) which contains only the target parameters β but not the nuisance parameter G. he conditional likelihood (5) provides a legitimate way of constructing likelihood inference for β, but the computational burden is of order n!. By applying (5) to a parametric generalized linear model with missing data, Liang & Qin (2000) proposed the pairwise pseudolikelihood i<k f (y i x i ) f (y k x k ) f (y i x i ) f (y k x k ) + f (y i x k ) f (y k x i ), which applies the conditioning argument of Kalbfleisch (978) to every pair of observations. he computational burden is of order n 2. Upon applying this argument to model (), we have the pairwise likelihood L PL (β) i<k + exp{ β (x i x k )(y i y k )}, which is again free of the functional parameter G. We denote by ˆβ PL the value of β that maximizes L PL (β), which is equivalent to the solution of the pseudo-score equation 2 n(n ) (xi xk )(y exp{ β (x i x k )(y i y k )} i y k ) = 0. (6) + exp{ β (x i x k )(y i y k )} i<k he factor 2/{n(n )} is the reciprocal of the number of terms in the summation. Before we discuss the statistical properties of ˆβ PL and a score test based on (6), let us introduce some notation to simplify the discussion. Let ψ ik (β) = (xi xk )(y exp{ β (x i x k )(y i y k )} i y k ) + exp{ β (x i x k )(y i y k )}, and denote the score function by s PL (β) = 2/{n(n )} i<k ψ ik(β). he score function is a secondorder U-statistic (Serfling, 980). It follows from Proposition in Liang & Qin (2000) that n /2 ( ˆβ PL β 0 ) converges weakly to a multivariate normal distribution with covariance matrix V = 2, where = E{ ψ 2 (β 0 )/ β} and 2 = 4E{ψ 2 (β 0 )ψ3 (β 0)}. o estimate the asymptotic variance, we could use a sandwich-type estimator ˆV = ˆ ˆ 2 ˆ with ˆ = 2 ψ ik ( ˆβ PL ) n(n ) β i<k

Miscellanea 5 and ˆ 2 = 4 n = 4 n n n n n i= i= n ψ ij ( ˆβ PL ) s PL( ˆβ PL ) n 2 ψ ij ( ˆβ PL ), j=, j = i j=, j = i 2 where a 2 = aa for a column vector a. his estimator follows from the variance estimator of U-statistics proposed by Sen (960). An added advantage of the proposed method is that a simple score-type test can be constructed to test a null hypothesis H 0 : β = β 0 based on the pseudo-score function (6). Under the null hypothesis, s PL (β 0 ) is a zero-mean U-statistic. Suppose that E{ψ 2 (β 0 ) 2 } < and E{ψ 2 (β 0 )ψ 3 (β 0 ) } is nonsingular; it then follows from Serfling (980, p. 92) that n /2 s PL (β 0 ) converges weakly to a normal distribution with mean zero and covariance matrix 2. Under the null hypothesis, we could estimate 2 by 2 2 (β 0 ) = 4 n n ψ ij (β 0 ) n n s PL(β 0 ), and the test statistic has a limiting χ 2 p distribution. i= j=, j = i = ns PL (β 0 ) 2 (β 0 ) s PL (β 0 ) (7) 4. SIMULAIONS We conducted simulation studies to evaluate the finite-sample performance of the proposed estimator. First, we compared the performance of the proposed estimator ˆβ PL with the nonparametric likelihood estimator ˆβ NL of Luo & sai (202) under the same simulation setting as in that paper. Next, we compared the performance of the proposed estimator and the least-squares estimator for a Gaussian-distributed response with nonignorable missing data and doubly truncated data. We also evaluated the empirical ype I error and power for the score test (7). We generated 000 independent datasets in each scenario. A continuous covariate X 2 was generated from a normal distribution with mean zero and standard deviation 0 5. For a given X 2, X was generated from a Bernoulli distribution with probability of success being exp( X 2 )/{ + exp( X 2 )}. he finite-dimensional parameters are (β,β 2 ) = (, ), and we considered two baseline densities g(y) = dg(y). In scenario, g(y) = (2/π) /2 (0 5) exp{ 2(y /4) 2 }; the response is a positive continuous random variable. In scenario 2, g(y) = ( + y)3 y exp( 3)/(4y!); the response is a nonnegative integer-valued random variable. he simulation results for both scenarios are shown in able. he proposed estimator had a slightly higher sampling variability than did the maximum likelihood estimator and a similar small-sample bias. We conclude that even though the proposed estimator does not maximize a full likelihood, its performance is very close to that of the nonparametric maximum likelihood estimator for small samples. We also compared the computation times for simulation studies in scenario. For n = 50, 00 and 200, it took on average 0 2, 0 9 and 3 4 seconds, respectively, to compute the proposed estimator, and 2, 6 and 84 7 seconds to compute the maximum likelihood estimator. hus, using the proposed estimator led to a substantial gain in computational efficiency; we will discuss this further in 5. We then considered a scenario in which there were missing data and where the missing mechanism was nonignorable. We considered a Gaussian response and compared ˆβ PL with the least-squares estimator ˆβ LS. Covariate data followed the same distribution as in the previous simulation, and g(y) was taken to be the standard normal density function. he response Y was observed with probability

6 K. C. G. CHAN able. Comparison of ˆβ PL and ˆβ NL ˆβ NL ˆβ NL ˆβ PL ˆβ PL ˆβ PL ˆβ PL Bias 0 3 SSE 0 3 Bias 0 3 SSE 0 3 SEE 0 3 95% CP β β 2 β β 2 β β 2 β β 2 β β 2 β β 2 Scenario n = 50 45 77 059 08 70 207 090 048 984 98 95 94 n = 00 56 38 696 604 66 48 705 66 66 66 95 96 n = 200 40 4 464 434 44 44 468 437 457 427 96 94 Scenario 2 n = 50 25 36 366 370 00 384 376 342 333 93 93 n = 00 68 68 238 227 56 6 253 240 235 238 95 94 n = 200 40 36 64 59 34 28 74 66 67 63 94 95 SSE, sampling standard error; SEE, average of standard error estimates; CP, coverage probability. able 2. Comparison of ˆβ PL and ˆβ LS Bias 0 3 SSE 0 3 SEE 0 3 90% CP 95% CP β β 2 β β 2 β β 2 β β 2 β β 2 (a) Nonignorable missingness n = 00 ˆβ PL 46 70 45 377 430 367 90 89 96 95 ˆβ LS 284 276 248 230 233 26 63 72 72 83 n = 200 ˆβ PL 53 42 39 28 294 255 88 89 94 95 ˆβ LS 29 269 80 60 62 82 43 56 57 7 (b) Double truncation n = 00 ˆβ PL 02 09 623 60 60 585 89 88 94 93 ˆβ LS 423 47 264 252 235 266 38 50 50 65 n = 200 ˆβ PL 6 65 387 397 378 376 90 89 95 93 ˆβ LS 428 425 84 78 69 85 6 25 23 38 SSE, sampling standard error; SEE, average of standard error estimates; CP, coverage probability. able 3. Percentage of hypotheses rejected by score test (7) (β,β 2 ) (0, 0) (, ) (2, 2) n = 50 6 3 86 n = 00 5 59 00 n = 200 5 89 00 pr(r = Y = y, X = x) = exp( 2y)/{ + exp( 2y)}. Missingness was nonignorable and the leastsquares estimator was inconsistent. he results are shown in able 2(a). We further considered a scenario for doubly truncated data. We generated from a normal distribution with mean and variance and let 2 = + 2. Data are observed only when Y 2. he results are shown in able 2(b). In both scenarios, we observed that the least-squares estimator was biased and its confidence interval had a much lower coverage probability than the nominal value. he proposed estimator was more variable, but performed well in the sense of having a small bias and a coverage probability close to the nominal value. able 3 summarizes results from the score test (7) for testing H 0 : (β,β 2 ) = (0, 0). A 5% significance level was chosen. he null hypothesis was rejected if exceeded a critical value based on the χ 2 2 distribution. Data were generated following scenario with a varying β. he results showed that the test had an empirical ype I error that was close to the nominal value even for small samples. Under alternative hypotheses, the power of the tests increased quickly with sample size.

Miscellanea 7 5. DISCUSSION In this paper, we have discussed invariance properties and an alternative method for estimating the target parameters β for a proportional likelihood ratio model that was proposed recently by Luo & sai (202). he invariance properties are useful in situations where the missing data mechanism is probably nonignorable and where the analysts may not have enough knowledge to model the missing data mechanism or the truncation distribution. Such unknown quantities become part of the functional nuisance parameter. Any biased sampling (Gilbertetal., 999)intheform f S (y x) = b(y, x) f (y x) b(y, x) xy) dg(y), where the data are doubly truncated with covariate-dependent truncation probability, fits into this general situation. In this case, a pairwise pseudolikelihood can be constructed when b(y, x) is known and does not follow a factorization as in (4): L PL (β) i<k + R(y i, y k, x i, x k ) exp{ β (x i x k )(y i y k )} where R(y i, y k, x i, x k ) = b(y i, x k )b(y k, x i ) b(y i, x i )b(y k, x k ). he proposed pseudolikelihood method has a lower computational burden than the maximum likelihood estimator. As can be seen from equation (4) in Luo & sai (202), the computation is dominated by the last term in that expression. he computation of the constraint maximization involves a profiling estimator (Luo & sai, 202, equation 5), so the dominating term will involve summation over three indices. When both Y and X are continuous the computational burden will be O(n 3 ), while the estimator proposed in this paper has a computational burden of O(n 2 ). ACKNOWLEDGEMEN he author thanks the editor, an associate editor, two reviewers and Drs Patrick Heagerty, Ali Shojaie and Mary Lou hompson for suggestions which have greatly improved this paper. APPENDIX Derivation of Property 2 Conditioning on = t and 2 = t 2, the sampling density for the randomly truncated data is f (y x) if t y t 2 and 0 otherwise. Averaging over the joint distribution of truncation times (, 2 ),the joint probability of Y = y and y 2 is y f (y x) f, 2 (t, t 2 ) dt 2 dt, and the probability pr( Y 2 X = x) is y f (y x) f, 2 (t, t 2 ) dt 2 dt dy. For randomly truncated data, f S (y x) = f (y x, Y 2 ), and therefore y f (y x) f, 2 (t, t 2 ) dt 2 dt f S (y x) = f (y x) f, 2 (t, t 2 ) dt 2 dt dy y = xy) y f, 2 (t, t 2 ) dg(y) xy) y f, 2 (t, t 2 ) dg(y) = xy) dg 2 (y) xy) dg 2 (y). REFERENCES GILBER, P.B., LELE, S.R.& VARDI, Y.(999). Maximum likelihood estimation in semiparametric selection bias models with application to AIDS vaccine trials. Biometrika 86, 27 43. HUANG,A.& RAHOUZ,P.J.(202). Proportional likelihood ratio models for mean regression. Biometrika 99, 223 9.

8 K. C. G. CHAN ICHIMURA, H.(993). Semiparametric least squares (SLS) and weighted SLS estimation of single-index models. J. Economet. 58, 7 20. KALBFLEISCH,J.D.(978). Likelihood methods and nonparametric tests. J. Am. Statist. Assoc. 73, 67 70. LIANG, K.-Y.& QIN, J.(2000). Regression analysis under non-standard situations: a pairwise pseudolikelihood approach. J. R. Statist. Soc. B 62, 773 86. LUO,X.& SAI,W.Y.(202). A proportional likelihood ratio model. Biometrika, 99, 2 22. NELDER, J.A.& WEDDERBURN, R.W.M.(972). Generalized linear models. J. R. Statist. Soc. A 35, 370 84. PRENICE, R.L.& PYKE, R.(979). Logistic disease incidence models and case-control studies. Biometrika 66, 403. QIN, J.& ZHANG, B.(997). A goodness-of-fit test for logistic regression models based on case-control data. Biometrika 84, 609 8. RAHOUZ, P.J.& GAO, L.P.(2009). Generalized linear models with unspecified reference distribution. Biostatistics 0, 205 8. RUBIN,D.B.(976). Inference and missing data. Biometrika 63, 58 92. SEN, P.K.(960). On some convergence properties of U-statistics. Calcutta Statist. Assoc. Bull. 0, 8. SERFLING,R.J.(980). Approximation heorems of Mathematical Statistics. New York: Wiley. [Received February 202. Revised August 202]