Efficiency Comparisons in Multivariate Multiple Regression with Missing Outcomes
Journal of Multivariate Analysis 61 (1997), article no. MV

Andrea Rotnitzky,* Harvard School of Public Health; Christina A. Holcroft,† University of Massachusetts Lowell; and James M. Robins,‡ Harvard School of Public Health

We consider a follow-up study in which an outcome variable is to be measured at fixed time points and covariate values are measured prior to the start of follow-up. We assume that the conditional mean of the outcome given the covariates is a linear function of the covariates and is indexed by occasion-specific regression parameters. In this paper we study the asymptotic properties of several frequently used estimators of the regression parameters, namely the ordinary least squares (OLS), the generalized least squares (GLS), and the generalized estimating equation (GEE) estimators, when the complete vector of outcomes is not always observed, the missing-data patterns are monotone, and the data are missing completely at random (MCAR) in the sense defined by Rubin [11]. We show that when the covariance of the outcome given the covariates is constant, then, as opposed to the no-missing-data case, (a) the GLS estimator is more efficient than the OLS estimator, (b) the GLS estimator is inefficient, and (c) the semiparametric efficient estimator in a model that imposes linear restrictions only on the conditional mean of the last occasion regression can be less efficient than the efficient estimator in a model that imposes linear restrictions on the conditional means of all the outcomes. We provide formulae and calculations of the asymptotic relative efficiencies of the considered estimators in three important cases: (1) for the estimators of the occasion-specific means, (2) for estimators of occasion-specific mean differences, and (3) for estimators of occasion-specific dose-response model parameters. © 1997 Academic Press

Received March 21, 1996; revised November 11.
AMS subject classification: primary 62J05; secondary 62B99.
Key words and phrases: generalized estimating equations, generalized least squares, missing data, repeated measures, semiparametric efficient.
* Supported in part by the National Institutes of Health under Grant 1-R29-GM A1.
† Supported by the National Institutes of Health under Grants 532 CA and 1-R29-GM A1.
‡ Supported in part by the National Institutes of Health under Grants 2 P30 ES00002, R01-AI32475, R01-ES03405, and K04-ES X.
Copyright © 1997 by Academic Press. All rights of reproduction in any form reserved.

1. INTRODUCTION

Many randomized and nonrandomized follow-up studies are designed so that outcomes Y_it, t=1,...,T, corresponding to the ith subject are to be measured at prespecified time points and a vector of covariates X_i is to be measured at baseline. In randomized studies, X_i may record a treatment arm indicator as well as pretreatment variables such as age, sex, and race. Often the conditional mean of the outcome Y_it given X_i is assumed to be linear in X_i, that is, E(Y_it | X_i) = β_t' X_i, and the goal of the study is to make inferences about the unknown regression parameters β_t. For example, if X_i represents dose levels of a drug administered at baseline, investigators are often interested in estimating the parameter β_t indexing an occasion-specific linear dose-response model.

Often a subset of the outcome vector Y_i = (Y_i1, ..., Y_iT)' is missing for some subjects. In this paper we assume that the outcomes are missing completely at random (MCAR) in the sense defined by Rubin [11] and that the nonresponse patterns are monotone, that is, once a subject misses a cycle of the study he or she misses also all subsequent cycles. Monotone patterns of MCAR data arise, for example, in randomized studies with staggered entry and a fixed termination calendar time. Monotone MCAR data also arise if subjects drop out of the study for reasons unrelated to Y_i.

Extensive literature exists on the estimation of the parameters β = (β_1', ..., β_T')' in the absence of missing data. When the covariance of Y_i given X_i, Σ(X_i), is known, the generalized least squares (GLS) estimator β̂_G of β is best linear unbiased [7, p. 301]. Chamberlain [3] showed that the asymptotic variance of β̂_G attains the semiparametric variance bound for regular estimators of β in the semiparametric model defined solely by the linear model restrictions on the marginal means.
When Σ(X_i) is unknown, β̂_G is unfeasible because it depends on the unknown covariance function. Carroll and Ruppert [2] showed that when Σ(X_i) is a smooth function of X_i, the two-stage generalized least squares estimator β̃_G, which uses a nonparametric estimator of Σ(X_i), has the same asymptotic distribution as β̂_G. The generalized estimating equations (GEE) estimator β̂_GEE proposed by Liang and Zeger [5] is a generalized least squares estimator of β that uses an estimate of Σ(X_i) from a, possibly misspecified, parametric model for the covariance function. When the parametric model for the covariance function is correctly specified, the GEE estimator is asymptotically equivalent to β̂_G and β̃_G.
When the true covariance function does not depend on X_i, i.e., Σ(X_i) = Σ for all i, then β̂_G is exactly equal to β̂_OLS = (β̂_{1,OLS}', ..., β̂_{T,OLS}')', where β̂_{t,OLS} is the ordinary least squares (OLS) estimator of the coefficient in the linear regression of the tth outcome Y_it on the covariates X_i [4]. Thus, when the covariance function is constant, the ordinary least squares estimator of β_t is semiparametric efficient in a model that imposes solely linear restrictions on the conditional means of the outcomes Y_it given X_i, t=1,...,T. The estimator β̂_{t,OLS} is also semiparametric efficient in the model defined by the linear restriction on the tth mean only, i.e., E(Y_it | X_i) = β_t' X_i, but without restrictions imposed on the conditional means of the remaining outcomes, i.e., E(Y_ij | X_i), j ≠ t, is unspecified [9]. Thus, with full data, when Σ(X_i) is not a function of X_i, knowledge that the means of the remaining outcomes are linear in X_i does not asymptotically add information about the regression parameter β_t corresponding to the tth outcome. Furthermore, since β̂_{t,OLS} is also the semiparametric efficient estimator of β_t when the outcomes Y_ij, j ≠ t, are not recorded, we conclude that only the outcome Y_it conveys information about β_t when no Y_it are missing and Σ(X_i) is constant.

With monotone MCAR outcomes the estimators β̂_G, β̃_G, β̂_GEE, and β̂_OLS are consistent for estimating β, but they may be less efficient than the semiparametric efficient estimator β̂_EFF in the model defined by the linear restrictions on the conditional means of the vector Y_i given X_i and the MCAR condition [9]. The goal of this paper is to compare and explain the asymptotic relative efficiencies of the estimators β̂_G, β̂_GEE, and β̂_OLS relative to β̂_EFF.

In Section 2 we describe the model and assumptions. In Section 3 we review well-known results about the estimation of β when the complete vector Y_i is observed for all subjects. In Section 4 we review a class of estimators introduced by Robins and Rotnitzky [9] that includes estimators that are asymptotically equivalent to β̂_G, β̂_GEE, β̂_OLS, and β̂_EFF.
In Section 5 we use a representation of the asymptotic variance of the estimators in this class that helps in interpreting the source of the differences among the asymptotic variances of the various considered estimators. Asymptotic relative efficiencies are explicitly calculated for the various estimators in three important special cases, namely, (1) when X_i = 1, (2) when X_i = (1, X_i*)' and X_i* is binary, and (3) when X_i = (1, X_i*)' and X_i* is an arbitrary explanatory variable. Section 6 contains some final remarks.

2. MODEL

With i = 1,...,n indexing subjects, let Y_it be the outcome of the ith subject at the tth follow-up cycle of the study, t = 1,...,T. Let X_i denote a p×1
vector of explanatory variables for the ith subject measured just prior to the start of follow-up. We assume that the first element of the vector X_i is the constant 1. Define R_it = 1 if Y_it is observed and R_it = 0 otherwise. We assume that the missing-data patterns are monotone, that is, R_it = 0 implies R_i(t+1) = 0. We also assume that X_i is completely observed for all subjects and that the vectors (X_i, Y_i, R_i), i = 1,...,n, are independent and identically distributed, where Y_i = (Y_i1, ..., Y_iT)' and R_i = (R_i1, ..., R_iT)'. We further assume that the missing-data process satisfies

    P(R_it = 1 | R_i(t-1) = 1, Y_i, X_i) = P(R_it = 1 | R_i(t-1) = 1, X_i)   (1)

and that

    P(R_it = 1 | R_i(t-1) = 1, X_i) > σ > 0,   (2)

for some σ > 0. Condition (1) is equivalent to the condition that the data are missing completely at random [11]. Condition (2) says that all subjects have a probability of having the full vector Y_i completely observed that is bounded away from zero. We suppose that the conditional mean of Y_it given X_i follows the linear regression model

    E(Y_it | X_i) = β_0t' X_i,   (3)

where β_0t is a p×1 vector of unknown parameters, t = 1,...,T. Throughout we refer to the semiparametric model defined by restriction (3) as the "all-linear-means" model. The goal of this article is to compare the asymptotic relative efficiencies of several commonly used estimators of β_0t when the outcomes Y_it are not always observed, the missingness patterns are monotone, and the data are missing completely at random, i.e., Eq. (1) is true.

3. ESTIMATION WITHOUT MISSING DATA

In this section we briefly review well-known results about the estimation of β_0t when Y_i is observed for all subjects. Let ε_it(β_t) = Y_it − β_t' X_i, ε_i(β) = (ε_i1(β_1), ..., ε_iT(β_T))' with β = (β_1', ..., β_T')', and let d(X_i) be a pT×T fixed matrix of functions of X_i. When Y_i is observed for all subjects, then under mild regularity conditions, the estimating equation

    Σ_{i=1}^n d(X_i) ε_i(β) = 0,   (4)
has a root that is consistent and asymptotically normal for estimating β_0. Several commonly used estimators of β_0 are solutions to Eq. (4) for some specific choice of d(X_i). When Σ(X_i), the covariance of Y_i given X_i, is known, the generalized least squares estimator β̂_G solves (4) with d*_GLS(X_i) = (I ⊗ X_i) Σ(X_i)^{-1}, where I is the T×T identity matrix and ⊗ denotes the Kronecker product. The Kronecker product of an a×b matrix T = [t_jk] and a c×d matrix S is the ac×bd matrix with block elements [t_jk S] (Seber, 1984, p. 7). When Σ(X_i) is unknown and satisfies certain smoothness conditions, Carroll and Ruppert [2] showed that the two-stage generalized least squares estimator β̃_G that solves (4) with d_GLS(X_i) = (I ⊗ X_i) Σ̂(X_i)^{-1}, where Σ̂(X_i) is a preliminary consistent nonparametric estimator of Σ(X_i), has the same asymptotic distribution as β̂_G. The GEE estimator [5], β̂_GEE, solves (4) with d_GEE(X_i) = (I ⊗ X_i) Ĉ(X_i)^{-1}, where Ĉ(X_i) = C(X_i, α̂) and α̂ is a consistent estimator of α_0 in the model

    Σ(X_i) = C(X_i, α_0),   (5)

where α_0 is a q×1 unknown parameter vector and C(X_i, α) is, for each α, a T×T symmetric and positive definite matrix function of X_i. Liang and Zeger [5] showed that the solution to (4) that uses d_GEE(X_i) will be a consistent and asymptotically normal estimator of β_0 even when (5) is misspecified. In fact, it is standard to show that β̂_GEE will have the same asymptotic distribution as β̃_GEE solving Eq. (4) with d*_GEE(X_i) = (I ⊗ X_i) C(X_i, α*)^{-1}, where α* is the probability limit of α̂ (see, for example, [8]). Thus, when (5) is correctly specified, d*_GEE(X_i) = d*_GLS(X_i), and hence β̂_GEE and β̂_G have the same asymptotic distribution. The estimator β̂_OLS = (β̂_{1,OLS}', ..., β̂_{T,OLS}')', in which each β̂_{t,OLS} is the ordinary least squares estimator of β_0t from the regression of Y_it on X_i, is also obtained as a solution to Eq. (4). In fact, β̂_OLS solves (4) with d_OLS(X_i) = I ⊗ X_i. Robins and Rotnitzky [9] showed that the solutions to the estimating equation (4) essentially constitute all regular and asymptotically linear (RAL) estimators of β_0. That is, any RAL estimator of β_0 is asymptotically equivalent to a solution of Eq. (4) for some choice of function d(X_i).
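The full-data claim that, with a constant covariance function, the GLS and OLS solutions of Eq. (4) coincide can be checked numerically. The sketch below is not from the paper; it simulates data under the all-linear-means model with Cov(Y_i | X_i) = Σ constant and solves (4) once with d_OLS = I ⊗ X_i (per-occasion least squares) and once with d*_GLS = (I ⊗ X_i) Σ^{-1}; all names and the specific Σ are illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(0)
n, p, T = 500, 2, 3  # subjects, covariates (incl. intercept), occasions

# Simulated full data: X_i = (1, X*_i)', Y_i is T x 1 with linear means.
X = np.column_stack([np.ones(n), rng.normal(size=n)])        # n x p
B0 = rng.normal(size=(T, p))                                  # row t = true beta_0t'
Sigma = np.array([[1.0, 0.5, 0.3], [0.5, 1.0, 0.4], [0.3, 0.4, 1.0]])
eps = rng.multivariate_normal(np.zeros(T), Sigma, size=n)     # Cov(Y|X) constant
Y = X @ B0.T + eps                                            # n x T

# OLS: Eq. (4) with d_OLS = I (x) X_i, i.e. per-occasion least squares.
beta_ols = np.linalg.solve(X.T @ X, X.T @ Y)                  # p x T

# GLS: Eq. (4) with d*_GLS = (I (x) X_i) Sigma^{-1}; stacking the
# occasion blocks gives the linear system (Sigma^{-1} (x) X'X) beta = b.
Sinv = np.linalg.inv(Sigma)
A = np.kron(Sinv, X.T @ X)                                    # pT x pT
b = (X.T @ Y @ Sinv).T.reshape(-1)                            # stacked by occasion
beta_gls = np.linalg.solve(A, b).reshape(T, p).T              # p x T

# With Sigma(X_i) constant, GLS and OLS are algebraically identical.
gap = np.max(np.abs(beta_gls - beta_ols))
```

The equality holds here because every occasion-specific regression uses the same design matrix, the classical seemingly-unrelated-regressions degeneracy.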
Two estimators μ̂_1 and μ̂_2 of μ_0 are said to be asymptotically equivalent if √n(μ̂_1 − μ̂_2) converges to 0 in probability. If μ̂_1 and μ̂_2 are asymptotically equivalent, then √n(μ̂_1 − μ_0) and √n(μ̂_2 − μ_0) have the same asymptotic distribution. An estimator β̂ is said to be asymptotically linear if (β̂ − β_0) is asymptotically equivalent to a sample average of i.i.d. mean-zero, finite-variance random variables. For example, the solution β̂ to an estimating equation Σ_{i=1}^n m(Y_i, X_i, β) = 0 is, under smoothness conditions on m(Y_i, X_i, β), asymptotically linear because, using a standard Taylor series expansion,
(β̂ − β_0) can be shown to be asymptotically equivalent to the sample average of −E[∂m(Y_i, X_i, β)/∂β' |_{β=β_0}]^{-1} m(Y_i, X_i, β_0). Regularity is a technical condition that prohibits super-efficient estimators by specifying that the convergence of the estimator to its limiting distribution is locally uniform.

Chamberlain [3] showed that the asymptotic variance of β̂_G achieves the semiparametric variance bound for regular estimators of β_0 in the sense defined by Begun, Hall, Huang, and Wellner [1]. The semiparametric variance bound for β_0 in a semiparametric model is the supremum of the Cramer–Rao variance bounds for β_0 over all regular parametric submodels nested within the semiparametric model, and it is therefore a lower bound for the asymptotic variance of all regular estimators of β_0.

When Σ(X_i) is not a function of X_i, it can easily be shown that the GLS and OLS estimators are algebraically identical (see, for example, [4, p. 307]). Thus, β̂_OLS coincides with the semiparametric efficient estimator β̂_G and it is therefore locally semiparametric efficient in the "all-linear-means" model at the additional restriction that Σ(X_i) is constant. A locally semiparametric efficient estimator of a parameter β_0 in model A at an additional restriction B is an estimator that attains the semiparametric variance bound for β_0 in model A when both A and B are true and remains consistent when A is true but B is false.

Consider now the estimation of β_0T, the coefficient of the regression of the outcome Y_iT on X_i, in a model that does not impose restrictions on the conditional means E(Y_it | X_i) for t < T. Specifically, under the new model, which throughout we call the "last-mean-linear" model, data on X_i and the vector Y_i are observed, i = 1,...,n, but the model imposes only a linear restriction on the last conditional mean, i.e.,

    E(Y_iT | X_i) = β_0T' X_i.   (6)

Robins and Rotnitzky [9] showed that β̂_{T,OLS} is locally semiparametric efficient for β_0T in the "last-mean-linear" model at the additional restriction that Var(Y_iT | X_i) is not a function of X_i.
Thus, since β̂_{T,OLS} is also a locally semiparametric efficient estimator of β_0T in the "all-linear-means" model at the restriction that Σ(X_i) is constant, it follows that when Y_i is observed for all subjects and Σ(X_i) is constant, knowledge that the conditional means E(Y_it | X_i) of the preceding outcomes Ȳ_iT = (Y_i1, ..., Y_i(T-1))' are linear in X_i does not asymptotically add information about β_0T. Furthermore, since β̂_{T,OLS} is also a locally semiparametric efficient estimator of β_0T in the model (6) at the restriction that Var(Y_iT | X_i) is constant when data on Ȳ_iT are not recorded [3], it follows that when Σ(X_i) is constant and the "all-linear-means" model holds, data on Ȳ_iT do not provide information about β_0T.
4. ESTIMATION WITH MONOTONE MCAR DATA

In this section we review results about the estimation of β_0 when Y_i is not fully observed for all subjects and the missing-data process satisfies (1) and (2). Let λ_it ≡ P(R_it = 1 | R_i(t-1) = 1, X_i) and π_it ≡ Π_{l=1}^t λ_il. Suppose first that

    λ_it are known for all i and t.   (7)

Robins and Rotnitzky [9] showed that the estimating equation

    Σ_{i=1}^n U_i(d, φ, β) = 0,   (8)

where

    U_i(d, φ, β) = (R_iT/π_iT) d(X_i) ε_i(β) − Σ_{t=1}^T [(R_it − λ_it R_i(t-1))/π_it] φ_t(Ȳ_it, X_i),

with φ_t(Ȳ_it, X_i), t = 1,...,T, an arbitrary pT×1 function of Ȳ_it ≡ (Y_i1, ..., Y_i(t-1))' and X_i chosen by the investigator (and R_i0 ≡ 1), has, under mild regularity conditions, a solution β̂(d, φ) that is a consistent and asymptotically normal estimator of β_0. The asymptotic variance of β̂(d, φ) is given by

    Γ(d)^{-1} Ω(d, φ) Γ(d)'^{-1},   (9)

where Γ(d) = E[d(X_i)(I ⊗ X_i)'] and Ω(d, φ) = Var[U_i(d, φ, β_0)]. They also showed that the solutions of (8) are essentially all RAL estimators of β_0 in the "all-linear-means" model with the additional restrictions (1), (2), and (7). Furthermore, the solution of (8), β̂(d_eff, φ_eff), that uses

    d_eff(X_i) = (I ⊗ X_i) {Var[(R_iT/π_iT) ε_i − Σ_{t=1}^T ((R_it − λ_it R_i(t-1))/π_it) E(ε_i | Ȳ_it, X_i) | X_i]}^{-1}   (10)

and

    φ_eff,t(Ȳ_it, X_i) = d_eff(X_i) E(ε_i | Ȳ_it, X_i),   (11)

where ε_i ≡ ε_i(β_0), is semiparametric efficient in this model. In addition, they showed that knowledge of the nonresponse probabilities λ_it does not asymptotically provide information about β_0 since the semiparametric efficiency bound for β_0 remains unchanged if the restriction (7) is dropped.
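The simplest member of the class (8) takes φ_t ≡ 0, in which case the solution is the complete-case estimator weighted by the known observation probabilities. The sketch below, not from the paper, simulates monotone MCAR data with X_i = 1 (so β_0 is the vector of occasion means, here (1, 2, 3), an assumed value) and checks that this estimator is consistent; the hazards `lam` are assumed known, as in restriction (7).

```python
import numpy as np

rng = np.random.default_rng(1)
n, T = 200_000, 3
lam = np.array([1.0, 0.8, 0.7])     # known hazards lambda_t (MCAR: no Y-dependence)
pi = np.cumprod(lam)                # pi_t = lambda_1 * ... * lambda_t

# Outcomes with means (1, 2, 3) and correlated errors; X_i = 1.
mu = np.array([1.0, 2.0, 3.0])
Sigma = np.array([[1.0, 0.6, 0.4], [0.6, 1.0, 0.5], [0.4, 0.5, 1.0]])
Y = mu + rng.multivariate_normal(np.zeros(T), Sigma, size=n)

# Monotone missingness: R_t = R_{t-1} * Bernoulli(lambda_t), with R_0 = 1,
# so a subject who misses cycle t misses all later cycles as well.
R = np.ones((n, T), dtype=int)
for t in range(T):
    prev = R[:, t - 1] if t > 0 else np.ones(n, dtype=int)
    R[:, t] = prev * (rng.random(n) < lam[t])

# phi = 0 in (8): solve sum_i (R_iT / pi_T) (Y_i - beta) = 0. The common
# weight cancels, so the solution is the mean over subjects with R_T = 1;
# MCAR makes the complete cases a random subsample, hence consistency.
beta_hat = Y[R[:, T - 1] == 1].mean(axis=0)
err = np.max(np.abs(beta_hat - mu))
```

Nonzero choices of φ_t recover information from the partially observed subjects; the efficient choice (10)-(11) is what β̂_EFF uses.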
That is, the semiparametric variance bound for β_0 is the same in the models (a) defined by (1), (2), (3), and (7) and (b) defined by (1), (2), and (3). They further showed that all RAL estimators of β_0 in model (b) are asymptotically equivalent to the solution of (8) for some choice of d(X_i) and φ_t(Ȳ_it, X_i).

Consider now Eq. (4) restricted to the available observations, i.e.,

    Σ_{i=1}^n d_obs(X_i) ε_i^obs(β) = 0,   (12)

where ε_i^obs(β) is the vector of observed residuals for the ith subject and d_obs(X_i) is the corresponding submatrix of d(X_i). Liang and Zeger [5] showed that (12) has a solution that is consistent and asymptotically normal for estimating β_0. Thus, since this solution is a RAL estimator of β_0, it must have the same asymptotic distribution as a solution of Eq. (8) for some specific d(X_i) and φ_t(Ȳ_it, X_i). The estimators β̂_G, β̃_G, β̂_GEE, and β̂_OLS calculated from the available observations all solve Eq. (12) using the corresponding submatrices of their respective functions d*_GLS(X_i), d_GLS(X_i), d_GEE(X_i), and d_OLS(X_i) defined in Section 3. They are therefore asymptotically equivalent to the solution of Eq. (8) for specific functions d(X_i) and φ_t(Ȳ_it, X_i). Define

    d_l^C(X_i) = (I ⊗ X_i) {Var_C[(R_iT/π_iT) ε_i − Σ_{t=1}^T ((R_it − λ_it R_i(t-1))/π_it) E_{l,C}(ε_i | Ȳ_it, X_i) | X_i]}^{-1}

and

    φ_{l,t}^C(Ȳ_it, X_i) = d_l^C(X_i) E_{l,C}(ε_i | Ȳ_it, X_i),

    E_{l,C}(ε_i | Ȳ_it, X_i) = Cov_C(Y_i, Ȳ_it | X_i) Var_C(Ȳ_it | X_i)^{-1} ε̄_it,

where ε̄_it is the (t−1)×1 vector with jth element equal to Y_ij − β_0j' X_i, and C, when used as a subscript of Var and Cov, indicates that the conditional variances and covariances are calculated assuming Cov(Y_i | X_i) = C(X_i), where C(X_i) is a given T×T symmetric positive definite matrix function of X_i. In the Appendix we show

Lemma 1. Let β̂_l(C) be the solution of Eq. (8) that uses d_l^C(X_i) and φ_{l,t}^C(Ȳ_it, X_i) instead of d(X_i) and φ_t(Ȳ_it, X_i). Then, (a) β̂_G and β̃_G are asymptotically equivalent to β̂_l(Σ), where Σ(X_i) = Cov(Y_i | X_i) is the true conditional covariance of Y_i given X_i;
(b) β̂_GEE and β̃_GEE are asymptotically equivalent to β̂_l(C_{α*}), where C_{α*}(X_i) ≡ C(X_i, α*) is the "working covariance" function defined in Eq. (5) evaluated at α*. Here, α* is the probability limit of α̂ estimated from model (5); and (c) β̂_OLS is asymptotically equivalent to β̂_l(I), where I is the T×T identity matrix.

Part (a) of Lemma 1 was shown by Robins and Rotnitzky [9] and is included here for completeness. Robins and Rotnitzky [9] also showed that the asymptotic variance of β̂_l(Σ) is equal to Ω(d_l^Σ, φ_l^Σ)^{-1}. Part (b) of Lemma 1 implies that when model (5) is correctly specified, β̂_GEE has the same asymptotic distribution as β̂_G. Robins and Rotnitzky [9] showed that β̂_G has the smallest asymptotic variance in the class of estimators that are solutions to Eq. (12). They also showed that β̂_G and the semiparametric efficient estimator β̂(d_eff, φ_eff) have the same asymptotic variance if and only if E_{l,Σ}(ε_i | Ȳ_it, X_i) = E(ε_i | Ȳ_it, X_i), i.e., when the conditional expectation of ε_i is linear in Ȳ_it.

In this section we have shown that the estimators β̂_G, β̃_G, β̂_GEE, β̃_GEE, and β̂_OLS calculated from all available observations are asymptotically equivalent to solutions of Eq. (8) for specific choices of functions d(X_i) and φ_t(Ȳ_it, X_i) when the MCAR condition (1) holds and the missing-data patterns are monotone. In Section 3 we noted that, in the absence of missing data, β̂_G and β̃_G were semiparametric efficient. As argued previously, with monotone MCAR data, β̂_G is no longer efficient if the conditional means E(ε_i | Ȳ_it, X_i) are nonlinear functions of Ȳ_it. In Section 3 we further noted that when Σ(X_i) is constant, β̂_G and β̂_OLS are algebraically identical. This is no longer true with monotone MCAR data. In fact, in the next section we show that large efficiency gains can be obtained by using β̂_G instead of β̂_OLS.

Consider now the estimation of β_0T in the "last-mean-linear" model defined by restriction (6) when Y_i is not observed for all subjects and the data are MCAR and monotone. Rotnitzky and Robins [10] showed that all RAL estimators of β_0T in the model defined by (1), (2), and (6) are asymptotically equivalent to a solution β̂_T(d*, φ*) of Σ_{i=1}^n S_i(d*, φ*, β_T) = 0 for some specific p×1 functions d*(X_i) and φ*_t(Ȳ_it, X_i), t = 1,...,T.
The estimating function S_i is defined as

    S_i(d*, φ*, β_T) = (R_iT/π_iT) d*(X_i) ε_iT(β_T) − Σ_{t=1}^T [(R_it − λ_it R_i(t-1))/π_it] φ*_t(Ȳ_it, X_i).
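The augmentation term in S_i is what separates these estimators from plain weighted complete-case analysis. The sketch below, an assumed toy setup rather than anything from the paper, takes T = 2 with Y_1 always observed, X_i = 1, and E(ε_2 | Y_1) = ρY_1 known by construction; it compares, over repeated samples, the φ* = 0 estimator of the last-occasion mean with the solution of S_i = 0 using φ*(Y_1) = ρY_1, the efficient choice in this linear setting.

```python
import numpy as np

rng = np.random.default_rng(2)
reps, n, lam2, rho = 1000, 2000, 0.5, 0.9
tau = np.sqrt(1 - rho**2)          # so Var(Y_2) = 1
mu2 = 3.0                          # assumed true last-occasion mean

est_cc, est_aug = [], []
for _ in range(reps):
    Y1 = rng.normal(size=n)                              # lambda_1 = 1
    Y2 = mu2 + rho * Y1 + tau * rng.normal(size=n)
    R2 = rng.random(n) < lam2                            # MCAR at occasion 2
    Rf = R2.astype(float)
    # phi* = 0: complete-case mean (the weights cancel in the solution).
    est_cc.append(Y2[R2].mean())
    # phi*(Y_1) = rho*Y_1: solve sum_i S_i = 0 exactly for beta.
    num = np.sum(Rf * Y2 / lam2 - (Rf - lam2) / lam2 * (rho * Y1))
    den = np.sum(Rf / lam2)
    est_aug.append(num / den)

# Theory: AVar_cc = Var(Y_2)/lam2 = 2.0; AVar_aug = rho^2 + tau^2/lam2 = 1.19.
var_ratio = np.var(est_aug) / np.var(est_cc)
bias_aug = abs(np.mean(est_aug) - mu2)
```

The partially observed subjects contribute through Y_1 via the augmentation, which is why the variance drops; with ρ = 0 the two estimators coincide in distribution.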
Robins and Rotnitzky [9] also showed that the solution β̂_T(d*_eff, φ*_eff) that uses

    d*_eff(X_i) = X_i {Var[(R_iT/π_iT) ε_iT − Σ_{t=1}^T ((R_it − λ_it R_i(t-1))/π_it) E(ε_iT | Ȳ_it, X_i) | X_i]}^{-1}

and φ*_eff,t(Ȳ_it, X_i) = d*_eff(X_i) E(ε_iT | Ȳ_it, X_i) has asymptotic variance equal to Ω*^{-1} = Var[S_i(d*_eff, φ*_eff, β_0T)]^{-1}, which attains the semiparametric variance bound for estimating β_0T in the model defined by restrictions (1), (2), and (6). Since β̂_T(d_eff, φ_eff) has asymptotic variance that attains the semiparametric bound in the model that additionally assumes the linearity of the conditional means of Y_it given X_i, t < T, then if we let the inverse of the variance bound of β_0T represent the amount of information about β_0T in a given model, we have that

    {AVar[β̂_T(d_eff, φ_eff)]^{-1} − AVar[β̂_T(d*_eff, φ*_eff)]^{-1}} / AVar[β̂_T(d_eff, φ_eff)]^{-1}

represents the fraction of the information about β_0T associated with the knowledge that E(Y_it | X_i) is a linear function of X_i for all t < T, where for any estimator μ̂ of a parameter μ_0, AVar(μ̂) denotes the variance of the asymptotic distribution of √n(μ̂ − μ_0). In Section 5 we examine this fraction for the special case in which X_i = (1, X_i*)' for an arbitrary explanatory variable X_i*.

5. EFFICIENCY COMPARISONS

In this section we compare the asymptotic relative efficiencies (AREs) of the various estimators of β_0t discussed in Section 4 in the model E(Y_it | X_i) = β_{0,0,t} + β_{0,1,t} X_i*, where X_i* is a scalar random variable. We start with the case X_i* ≡ 0, which corresponds to the problem of estimating the mean β_{0,0,t} of Y_it, t = 1,...,T. We then consider the case in which X_i* is a binary variable and finally the case of an arbitrary covariate X_i*. Without loss of generality, we focus on the efficiency comparisons of the estimators of the coefficients β_{0,0,T} and β_{0,1,T} of the model for the conditional mean of the last outcome Y_iT given X_i.
5.1. Estimation of Occasion-Specific Means

Suppose that X_i consists solely of the constant 1. In this case we are interested in estimating β_{0,0,T}, the mean of the outcome measured at the last occasion. To illustrate the asymptotic behavior of the estimators of β_{0,0,T}, we consider first the simple but pedagogical case in which T = 2 and Y_i1 is observed for all subjects. The semiparametric efficient estimator β̂_2(d_eff, φ_eff) of β_{0,0,2} has asymptotic variance equal to the lower rightmost element of Ω_eff^{-1} ≡ Ω(d_eff, φ_eff)^{-1}, which can be easily calculated to be

    Var(ε_2) + [(1−λ_2)/λ_2] E[Var(ε_2 | Y_1)].   (13)

Since β̂_2(d_eff, φ_eff) is semiparametric efficient, I_MIS = AVar[β̂_2(d_eff, φ_eff)]^{-1} represents the information available for estimating β_{0,0,2} when, asymptotically, a fraction 1−λ_2 of the outcomes Y_2 are missing. Since I_FULL = Var(ε_2)^{-1} is the information for estimating β_{0,0,2} when all Y_2's are observed, then with Φ_eff = λ_2^{-1}(1−λ_2) E[Var(ε_2 | Y_1)],

    (I_FULL − I_MIS)/I_FULL = Φ_eff/[Var(ε_2) + Φ_eff]

represents the fraction of information lost due to missing Y_2's. This fraction is equal to 0 when Φ_eff = 0, which occurs when λ_2 = 1, i.e., when Y_2 is observed for all subjects, or when Var(ε_2 | Y_1) = 0, i.e., when Y_1 is a perfect predictor of Y_2.

The asymptotic variance of β̂_l(C) is given by the lower rightmost element of Γ(d_l^C)^{-1} Ω(d_l^C, φ_l^C) Γ(d_l^C)'^{-1}. It is easy to show that this element is equal to

    Var(ε_2) + [(1−λ_2)/λ_2] E[{ε_2 − E_{l,C}(ε_2 | ε_1)}²].   (14)

Formula (14) with C(X_i) = C(X_i, α*) is, in view of Lemma 1, the asymptotic variance of β̂_{GEE,2}, the GEE estimator of β_{0,0,2} that uses the "working covariance" model (5). In particular, taking C(X_i) = I, the asymptotic variance of β̂_{OLS,2} is given by

    Var(ε_2) + [(1−λ_2)/λ_2] Var(ε_2).   (15)

Notice that (15) coincides with Var(ε_2)/λ_2, which is equal to Var(ε_2), the asymptotic variance of the normalized sample mean of Y_2 had no Y_2 been missing, divided by λ_2, the fraction of subjects with Y_2 observed for large n.
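Formulas (13) and (15), and the analogous GLS variance based on the linear-projection residual variance ((18) below), can be evaluated for any concrete joint law. A minimal sketch, under an assumed model with a nonlinear conditional mean, Y_2 = Y_1² + Y_1 + e with Y_1 and e standard normal, for which the three population quantities are exactly 1, 3, and 4:

```python
import numpy as np

rng = np.random.default_rng(3)
n, lam2 = 1_000_000, 0.5
# Assumed joint law with nonlinear E(Y_2 | Y_1): Y_2 = Y_1^2 + Y_1 + e.
Y1 = rng.normal(size=n)
tau2 = 1.0
Y2 = Y1**2 + Y1 + np.sqrt(tau2) * rng.normal(size=n)

var_e2 = Y2.var()                             # Var(eps_2); exactly 2 + 1 + 1 = 4
cond = tau2                                   # E[Var(eps_2 | Y_1)] = tau^2 = 1
rho2 = np.corrcoef(Y1, Y2)[0, 1] ** 2         # squared correlation; exactly 1/4
var_lin = var_e2 * (1 - rho2)                 # linear-projection residual var; ~3

k = (1 - lam2) / lam2
avar_eff = var_e2 + k * cond                  # formula (13): -> 5
avar_gls = var_e2 + k * var_lin               # formula (18): -> 7
avar_ols = var_e2 + k * var_e2                # formula (15): -> 8 = Var(eps_2)/lam2
```

The strict ordering eff < GLS < OLS here reflects both the Y_1-Y_2 correlation (which OLS ignores) and the nonlinearity of E(Y_2 | Y_1) (which the GLS linear projection cannot capture).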
Formula (14) says that the asymptotic variance of β̂_{GEE,2} depends on the probability limit of the estimated working covariance only through E[{ε_2 − E_{l,C}(ε_2 | ε_1)}²]. In particular,

    AVar(β̂_{GEE,2}) − AVar(β̂_{OLS,2}) = [(1−λ_2)/λ_2] {E[{ε_2 − E_{l,C}(ε_2 | ε_1)}²] − E(ε_2²)}.   (16)

It follows from (16) that β̂_{OLS,2} is not necessarily less efficient than β̂_{GEE,2}, since for certain choices of working covariance C the right-hand side of (16) will be positive. For example, if the working covariance model specifies that the covariance of Y_i is constant and equal to

    Cov(Y_i) = (1  −1/2; −1/2  1)   (17)

but the true covariance of Y_i is (1  ρ_0; ρ_0  1) for some ρ_0 ≠ −1/2, then (16) is equal to (1−λ_2) λ_2^{-1} (ρ_0 + 1/4), which is positive if ρ_0 > −(0.5)². This result says that β̂_GEE is not necessarily a more efficient estimator than the ("working-independence") OLS estimator β̂_OLS. Of course, in our example, since X_i = 1, the GEE estimator that uses an unrestricted model for Cov(Y_i | X_i), that is,

    Cov(Y_i | X_i) = (α_01  α_02; α_02  α_03)

for some unknown parameters α_01, α_02, and α_03, is semiparametric efficient and feasible, and it will be preferred to GEE estimators using, possibly correct, constant-valued working covariances such as (17). The point of our example was to show that β̂_GEE can be less efficient than β̂_OLS and that working covariance models should be chosen carefully if efficiency improvements over OLS are desired.

The asymptotic variance of the generalized least squares estimator β̂_{G,2} is, by part (a) of Lemma 1, equal to (14) with C(X_i) = Cov(Y_i | X_i) and thus can be written as

    Var(ε_2) + [(1−λ_2)/λ_2] E[Var_l(ε_2 | Y_1)],   (18)

where Var_l(ε_2 | Y_1) = Var(Y_2) − [Cov(Y_1, Y_2)²/Var(Y_1)] is the residual variance from the population linear regression of Y_2 on Y_1. In the Appendix we show that E[Var_l(ε_2 | Y_1)] is equal to E[Var(ε_2 | Y_1)] if
and only if E(ε_2 | Y_1) is a linear function of Y_1. Thus β̂_G is semiparametric efficient only when E(ε_2 | Y_1) is linear in Y_1. As noted in Section 4, this has been previously observed by Robins and Rotnitzky [9].

The following argument helps in understanding why β̂_{G,2} may fail to be semiparametric efficient. For the ith subject with observed outcome Y_i1, let Ŷ_i2 be the predicted value of Y_i2 from the linear regression of Y_2 on Y_1 based on subjects with observed outcomes at both occasions. That is, letting δ̂_1 and δ̂_2 be the solution of

    Σ_{i=1}^n R_i2 (1, Y_i1)' (Y_i2 − δ_1 − δ_2 Y_i1) = 0,

we define Ŷ_i2 = δ̂_1 + δ̂_2 Y_i1, i = 1,...,n. In the Appendix we show that β̂_{G,2} has the same asymptotic distribution as the solution β̂_{IMP,2} of

    Σ_{i=1}^n [R_i2 (Y_i2 − β_2) + (1−R_i2)(Ŷ_i2 − β_2)] = 0.

The solution β̂_{IMP,2} coincides with the regression imputation estimator of β_{0,0,2} described by Little and Rubin [6, pp. 45-47]. This estimator is calculated by first imputing the missing Y_2's with their predicted values from the linear regression of Y_2 on Y_1 based on the complete data, and then averaging the observed and imputed values of Y_2. The loss of efficiency of β̂_{IMP,2}, and therefore of β̂_{G,2}, arises because the missing Y_2 are imputed from a model that assumes that E(Y_2 | Y_1) is linear in Y_1. Rotnitzky and Robins [10] showed that, when Y_1 is discrete, β̂_IMP can be made semiparametric efficient by replacing Ŷ_i2 by Ê(Y_2 | Y_i1), the nonparametric maximum likelihood estimator of E(Y_2 | Y_i1).

A comparison of formulas (13), (15), and (18) helps in understanding the efficiency differences among β̂_{EFF,2}, β̂_{G,2}, and β̂_{OLS,2}. Since E[Var(ε_2 | Y_1)] ≤ E[Var_l(ε_2 | Y_1)] ≤ E[Var(ε_2)], β̂_{OLS,2} can never be more efficient than β̂_{G,2}, which, in turn, can never be more efficient than β̂_{EFF,2}. In the Appendix we show that E[Var(ε_2)] = E[Var_l(ε_2 | Y_1)] only when Cov(Y_1, Y_2) = 0, and therefore β̂_{G,2} and β̂_{OLS,2} will have the same asymptotic variance only when Y_1 and Y_2 are uncorrelated. The greater efficiency of β̂_{G,2} relative to β̂_{OLS,2} is therefore explained because β̂_{G,2}, as opposed to β̂_{OLS,2}, exploits the correlation between Y_1 and Y_2 for the estimation of β_02 via the linear regression imputation of the missing Y_2's.
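The regression imputation estimator above can be sketched in a few lines. This is an assumed toy simulation, not the authors' calculation: T = 2, a linear E(Y_2 | Y_1) (so imputation is at its best), MCAR missingness with λ_2 = 0.5, and repeated samples to compare the sampling variance of β̂_IMP with that of the complete-case (OLS) mean.

```python
import numpy as np

rng = np.random.default_rng(4)
reps, n, lam2, mu2 = 1000, 2000, 0.5, 1.0

est_imp, est_cc = [], []
for _ in range(reps):
    Y1 = rng.normal(size=n)
    Y2 = mu2 + 0.8 * Y1 + np.sqrt(1 - 0.64) * rng.normal(size=n)  # linear E(Y2|Y1)
    R2 = rng.random(n) < lam2                                      # MCAR
    # Fit Y2 ~ Y1 on complete cases, impute the missing Y2's, average all n.
    d2, d1 = np.polyfit(Y1[R2], Y2[R2], 1)        # slope, intercept
    Y2_imp = np.where(R2, Y2, d1 + d2 * Y1)
    est_imp.append(Y2_imp.mean())
    est_cc.append(Y2[R2].mean())                  # complete-case (OLS) mean

# Theory with these settings: AVar_imp = 1 + 0.36 = 1.36 vs AVar_cc = 2.0.
var_ratio = np.var(est_imp) / np.var(est_cc)
bias_imp = abs(np.mean(est_imp) - mu2)
```

Because E(Y_2 | Y_1) really is linear here, β̂_IMP is also efficient; with a nonlinear conditional mean it would stay consistent under MCAR but lose its efficiency, which is exactly the GLS deficiency described in the text.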
However, as noted earlier, the linear regression imputation of Y_2 will only lead to efficient estimators of β_02 when E(Y_2 | Y_1) is linear in Y_1, and except for this case, β̂_{G,2} will fail to extract all the information available in Y_1 and Y_2 about β_{0,0,2}.
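As a quick numerical check of the working-covariance comparison in Eq. (16): with the working covariance (17), the induced linear predictor is E_{l,C}(ε_2 | ε_1) = −(1/2)ε_1, so the bracketed term in (16) reduces to ρ_0 + 1/4 when the true correlation is ρ_0 with unit variances. The sketch below (not from the paper) evaluates this and cross-checks one value by simulation.

```python
import numpy as np

def gee_minus_ols(rho0, lam2=0.5):
    """Right-hand side of (16) for working correlation -1/2, true corr rho0."""
    # E[(eps_2 + eps_1/2)^2] - E[eps_2^2] = 1/4 + rho0 with unit variances.
    excess = (1.0 + 0.25 + rho0) - 1.0
    return (1 - lam2) / lam2 * excess

# Positive (GEE worse than OLS) exactly when rho0 > -(0.5)^2 = -0.25.
vals = {r: gee_minus_ols(r) for r in (-0.5, -0.3, -0.25, 0.0, 0.5)}

# Monte Carlo cross-check of the excess term at rho0 = 0.5.
rng = np.random.default_rng(5)
eps = rng.multivariate_normal([0, 0], [[1, 0.5], [0.5, 1]], size=400_000)
mc_excess = np.mean((eps[:, 1] + 0.5 * eps[:, 0]) ** 2) - np.mean(eps[:, 1] ** 2)
```

So a working covariance that gets even the sign of the correlation wrong can make the GEE estimator strictly worse than working independence, as the text warns.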
Given two estimators μ̃ and μ̂ of a scalar parameter μ, the asymptotic relative efficiency of μ̃ compared to μ̂ is denoted by ARE(μ̃, μ̂) and is defined by ARE(μ̃, μ̂) = AVar(μ̂)/AVar(μ̃). With β̂_EFF ≡ β̂(d_eff, φ_eff), (13) and (18) imply that

    ARE(β̂_{OLS,2}, β̂_{EFF,2}) = 1 − (1−λ_2) {[Var(ε_2) − E[Var(ε_2 | Y_1)]]/Var(ε_2)}

and

    ARE(β̂_{G,2}, β̂_{EFF,2}) = 1 − (1−λ_2) {[Var(ε_2) − E[Var(ε_2 | Y_1)]]/Var(ε_2) − ρ²}/{1 − (1−λ_2)ρ²},

where ρ = Corr(Y_1, Y_2). The efficiency loss of β̂_{G,2} relative to β̂_{EFF,2} is summarized in the term

    (1−λ_2) {[Var(ε_2) − E[Var(ε_2 | Y_1)]]/Var(ε_2) − ρ²}/{1 − (1−λ_2)ρ²}.

The factor [Var(ε_2) − E[Var(ε_2 | Y_1)]] Var(ε_2)^{-1} − ρ² can be interpreted as a measure of the degree of nonlinearity of E(Y_2 | Y_1). This factor is equal to 0 when E(Y_2 | Y_1) is linear in Y_1, and it can be as large as 1. The upper bound 1 is achieved when Y_1 and Y_2 are uncorrelated but Y_1 is a perfect predictor of Y_2, for example if Y_1 is normally distributed with zero mean and Y_2 = Y_1². The factor (1−λ_2)/[1 − (1−λ_2)ρ²] quantifies the efficiency loss as a function of the fraction of missing Y_2.

Example. To illustrate the relative efficiencies of β̂_{OLS,2} and β̂_{G,2} compared to the semiparametric efficient estimator β̂_{EFF,2}, consider Y_1 = Z_1^{7/3} and Y_2 = Z_2 with

    (Z_1, Z_2)' ~ Normal((0, 0)', σ²(1  η; η  1)).   (19)

Since Y_1 is a one-to-one transformation of Z_1, E(ε_2 | Y_1) = E(ε_2 | Z_1), and since, by normality, E(ε_2 | Z_1) is linear in Z_1, then E(ε_2 | Y_1) = a + b Y_1^{3/7} for some constants a and b. Thus, the conditional mean of Y_2 given Y_1 is a nonlinear function of Y_1. In the Appendix we show that

    Corr(Y_1, Y_2) = θ Corr(Z_1, Z_2),   (20)
where θ = E(Z_1^{10/3}) E(Z_1^{14/3})^{-1/2}. Using the average of 10,000 simulated values of Z_1^{10/3} and Z_1^{14/3}, we calculated θ ≈ 0.88. Furthermore, Var(Y_2 | Y_1) is, by Y_1 being a one-to-one transformation of Z_1, equal to Var(Y_2 | Z_1) = σ²(1 − η²), and in view of (20), Var(Y_2 | Y_1) = σ²[1 − (0.88)^{-2} ρ²]. Setting λ_2 = 0.5, the AREs of β̂_{OLS,2} and β̂_{G,2} compared to β̂_{EFF,2} reduce to

    ARE(β̂_{OLS,2}, β̂_{EFF,2}) = 1 − (0.5)(0.88)^{-2} ρ²

and

    ARE(β̂_{G,2}, β̂_{EFF,2}) = 1 − 0.5[(0.88)^{-2} − 1] ρ²/[1 − 0.5ρ²],

where ρ = Corr(Y_1, Y_2). Figure 1 plots ARE(β̂_{OLS,2}, β̂_{EFF,2}) and ARE(β̂_{G,2}, β̂_{EFF,2}) as functions of ρ for λ_2 = 0.5. The plots indicate that the efficiency of both β̂_{OLS,2} and β̂_{G,2} decreases as a function of ρ. The relatively small efficiency loss of β̂_{G,2} is due to the relatively small fraction of missing data, i.e., 1−λ_2 = 0.5, and the fact that E(Y_2 | Y_1) is well approximated by a linear function of Y_1 for values of Y_1 lying in a region of high probability. We have also calculated ARE(β̂_{G,2}, β̂_{EFF,2}) for λ_2 = 0.2 (results not presented) and obtained that the ARE reached a minimum of ...

Consider now the estimation of β_0T for T ≥ 2. The asymptotic variances of β̂_{EFF,T} and β̂_{l,T}(C) are the lower rightmost elements of Ω(d_eff, φ_eff)^{-1} and Γ(d_l^C)^{-1} Ω(d_l^C, φ_l^C) Γ(d_l^C)'^{-1}, respectively. A straightforward calculation gives

    AVar(β̂_{EFF,T}) = Var(ε_T) + Σ_{t=1}^T [(1−λ_t)/π_t] E[Var(ε_T | Ȳ_t)]   (21)

and

    AVar[β̂_{l,T}(C)] = Var(ε_T) + Σ_{t=1}^T [(1−λ_t)/π_t] E[{ε_T − E_{l,C}(ε_T | Ȳ_t)}²].   (22)

Thus, by Lemma 1, the asymptotic variances of β̂_{G,T} and β̂_{OLS,T} are

    AVar(β̂_{G,T}) = Var(ε_T) + Σ_{t=1}^T [(1−λ_t)/π_t] E[Var_l(ε_T | Ȳ_t)]   (23)

and

    AVar(β̂_{OLS,T}) = Var(ε_T) + Σ_{t=1}^T [(1−λ_t)/π_t] E[Var(ε_T)],   (24)
[Fig. 1. AREs for estimating the mean of Y_2 when Y_1 is always observed and P(Y_2 missing) = 0.5.]

where Var_l(ε_T | Ȳ_t) = Var(ε_T) − Cov(Y_T, Ȳ_t) Var(Ȳ_t)^{-1} Cov(Ȳ_t, Y_T) is the residual variance from the population linear regression of Y_T on Ȳ_t. Thus, differences in the asymptotic variances of β̂_{EFF,T}, β̂_{G,T}, and β̂_{OLS,T} are driven by differences among E[Var(ε_T | Ȳ_t)], E[Var_l(ε_T | Ȳ_t)], and E[Var(ε_T)]. Analogously to the case T = 2, E[Var(ε_T | Ȳ_t)] = E[Var_l(ε_T | Ȳ_t)] if and only if E[ε_T | Ȳ_t] is a linear function of Ȳ_t, t = 1,...,T, which is then the necessary and sufficient condition for β̂_{G,T} to be fully efficient. When Ȳ_T and ε_T are independent, then Var(ε_T | Ȳ_t) = Var(ε_T) and β̂_{OLS,T} is efficient. Analogously to the case T = 2, it can be shown that β̂_{G,T} is asymptotically equivalent to a regression imputation estimator of the Tth mean in which a missing Y_T from a subject with data observed up to time t−1 is imputed with its predicted value from the linear regression of Y_T on Ȳ_t based on subjects with complete data. Thus, the efficiency loss of β̂_{G,T} relative to β̂_{EFF,T} is due to the imputation of the
missing Y_T from, possibly misspecified, linear models for E(Y_T | Ȳ_t). Since E[Var_l(ε_T | Ȳ_t)] = Var(ε_T) holds for all t = 1,...,T if and only if Cov(ε_T, Ȳ_T) = 0, it follows that β̂_{OLS,T} and β̂_{G,T} will have the same asymptotic distribution only when ε_T and Ȳ_T are uncorrelated. Also, β̂_{OLS,T} will be fully efficient only when Var(ε_T | Ȳ_T) = Var(ε_T). Finally, as in the case T = 2, it can be shown from formula (22) and Lemma 1 that the asymptotic variance of β̂_{GEE,T} can be larger than the asymptotic variance of β̂_{OLS,T} for some misspecified working covariance models (5).

5.2. Estimation of Occasion-Specific Mean Differences

Suppose that X_i* is a binary indicator variable and consider the model E(Y_it | X_i) = β_{0,0,t} + β_{0,1,t} X_i*. In a randomized placebo-controlled follow-up trial for comparing treatment A versus placebo, for example, X_i* = 0 if subject i is assigned to the placebo arm and X_i* = 1 if subject i is assigned to the treatment A arm. Thus, β_{0,0,t} = E(Y_t | X* = 0) is the occasion-specific mean in the placebo arm and β_{0,1,t} = E(Y_t | X* = 1) − E(Y_t | X* = 0) is the occasion-specific difference between the treatment A and placebo means.

Consider now the estimation of β_0 = (β_{0,0,1}, β_{0,1,1}, ..., β_{0,0,T}, β_{0,1,T})'. Let β̂_{0,G} be the generalized least squares estimator of the vector of occasion-specific means in the placebo arm, β_{0,0} = (β_{0,0,1}, ..., β_{0,0,T})', computed from placebo-arm data only. Similarly, let β̂_{0,GEE}, β̂_{0,OLS}, and β̂_{0,EFF} be the GEE, OLS, and semiparametric efficient estimators of β_{0,0} computed from placebo-arm data only. Define analogously the estimators β̂_{1,G}, β̂_{1,GEE}, β̂_{1,OLS}, and β̂_{1,EFF} of the vector of occasion-specific means in the treatment A arm, (E(Y_1 | X* = 1), ..., E(Y_T | X* = 1))', computed from treatment A-arm data only. In the Appendix we show that the estimators β̂_G, β̂_GEE, β̂_OLS, and β̂_EFF of β_0 can be expressed respectively in terms of β̂_{j,G}, β̂_{j,GEE}, β̂_{j,OLS}, and β̂_{j,EFF}, j = 0, 1. Specifically, β̂_{G,0,t}, the generalized least squares estimator of the intercept of the tth regression, t = 1,...,T, based on data from both treatment arms, coincides with the generalized least squares estimator of the tth mean in the placebo arm, i.e.,

    β̂_{G,0,t} = β̂_{0,G,t}.
(25)

The generalized least squares estimator β̂_{G,1,t} of the slope in the tth regression, t = 1, ..., n, based on data from both treatment arms is equal to the difference between the arm-specific generalized least squares estimators of the tth occasion means, i.e.,

β̂_{G,1,t} = μ̂_{1,G,t} - μ̂_{0,G,t}.  (26)
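With complete data and an identity working covariance, the decomposition in (25) and (26) reduces to a familiar fact about least squares on a binary regressor: the fitted intercept is the group-0 sample mean and the fitted slope is the difference of the group sample means. A minimal numerical sketch of that special case (the outcome values below are arbitrary made-up numbers, not data from the paper):

```python
import numpy as np

# Made-up outcomes for one occasion t in the two arms.
y_placebo = np.array([1.0, 2.5, 0.5, 3.0])   # subjects with X* = 0
y_treat   = np.array([2.0, 4.5, 3.5])        # subjects with X* = 1

y = np.concatenate([y_placebo, y_treat])
x = np.concatenate([np.zeros(len(y_placebo)), np.ones(len(y_treat))])

# Pooled least squares fit of y on (1, x).
design = np.column_stack([np.ones_like(x), x])
intercept, slope = np.linalg.lstsq(design, y, rcond=None)[0]

# Intercept = placebo-arm mean; slope = difference of arm means,
# mirroring the structure of (25) and (26).
print(intercept, slope)
```

The exact arm-by-arm decomposition for the GLS, GEE, and efficient estimators with missing data is the content of the Appendix proof; the sketch only illustrates the complete-data OLS case.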
Relationships (25) and (26) hold also for the GEE, OLS, and semiparametric efficient estimators of β_0. Equation (25) implies that the ARE's of the GLS and OLS estimators of the occasion-specific intercepts β_{0,0,t} compared to the semiparametric efficient estimator of β_{0,0,t} are equal to the ratios of the asymptotic variances given in (23) and (24) to the asymptotic variance given in (22). It follows from (26) that AVar(β̂_{G,1,t}) = AVar(μ̂_{1,G,t}) + AVar(μ̂_{0,G,t}), and the same relationship holds for the GEE, OLS, and semiparametric efficient estimators. Furthermore, it follows from (21), (22), (23), and (24) that, for j = 0, 1,

AVar(μ̂_{j,EFF,t}) = P(X* = j)^{-1} { Var(ε_t | X* = j) + Σ_{l=1}^{t} [(1 - λ_{lj})/π_{lj}] E[Var(ε_t | Ȳ_l, X* = j)] },

AVar[μ̂_{j,l,t}(C)] = P(X* = j)^{-1} { Var(ε_t | X* = j) + Σ_{l=1}^{t} [(1 - λ_{lj})/π_{lj}] E[(ε_t - E_{l,C}(ε_t | Ȳ_l, X* = j))²] },

AVar(μ̂_{j,G,t}) = P(X* = j)^{-1} { Var(ε_t | X* = j) + Σ_{l=1}^{t} [(1 - λ_{lj})/π_{lj}] E[Var_l(ε_t | Ȳ_l, X* = j)] },  (27)

and

AVar(μ̂_{j,OLS,t}) = P(X* = j)^{-1} { Var(ε_t | X* = j) + Σ_{l=1}^{t} [(1 - λ_{lj})/π_{lj}] Var(ε_t | X* = j) },

where λ_{lj} = P(R_l = 1 | R_{l-1} = 1, X* = j) and Var_l(ε_t | Ȳ_l, X* = j) = Cov(Y_t, Ȳ_l | X* = j) Var(Ȳ_l | X* = j)^{-1} Cov(Ȳ_l, Y_t | X* = j). Thus, when (a) the nonresponse probabilities λ_{lj} do not depend on the treatment arm, i.e., λ_{lj} = λ_l; (b) the covariance of Y is the same for both treatment arms, i.e., Cov(Y | X*) = Cov(Y); and (c) Var(ε_t | Ȳ_l, X*) is not a function of X*, l = 1, ..., t,
t = 1, ..., n, then the ARE's of the GLS and OLS estimators of the occasion-specific slopes compared to the semiparametric efficient estimator remain the same as the ARE's of the respective estimators of the occasion-specific intercepts discussed earlier. Finally, as in Section 5.1, it can be shown that the GEE estimator of β_{0,1,n} can be less efficient than the OLS estimator for some misspecified working covariance models.

Example. To illustrate the dependence of the ARE's on the difference between the correlation matrices in the two treatment groups, we consider a randomized placebo-controlled study with data measured at baseline and at one follow-up point. We assume that data at baseline are always observed, i.e., λ_{1j} = 1, j = 0, 1, and that the probability that Y_2 is missing is the same in both treatment arms. We assume that Y_1 = Z_1^{7/3} and that, given X*,

(Z_1, ε_2)' | X*  ~  Normal( (0, 0)',  [ 1, η(X*); η(X*), σ² ] ).  (28)

Under (28), E(Y_1 | X*) = 0, so in this example we assume that there are no differences in the treatment means at baseline. Thus, within each treatment arm the data follow the model (19) of Example 1. However, since η(X*) is a function of X*, the covariance between Y_1 and Y_2 changes with treatment arm. A straightforward calculation shows that

AVar(β̂_{EFF,1,2}) = σ² { 1 - (1 - λ_2)(0.88)^{-2} (ρ_0² P_1 + ρ_1² P_0) } / (λ_2 P_0 P_1),  (29)

AVar(β̂_{G,1,2}) = σ² { 1 - (1 - λ_2)(ρ_0² P_1 + ρ_1² P_0) } / (λ_2 P_0 P_1),  (30)

and

AVar(β̂_{OLS,1,2}) = σ² / (λ_2 P_0 P_1),  (31)

where ρ_j = Corr(Y_1, Y_2 | X* = j) and P_j = P(X* = j), j = 0, 1. Figure 2 plots the ARE of the OLS and GLS estimators of β_{0,1,2}, the slope in the regression model for the second occasion, compared to the semiparametric efficient estimator of β_{0,1,2}, against ρ_1 for λ_2 = 0.5, ρ_0 = √0.5, and P_0 = P_1 = 0.5. Both ARE's attain their maximum at ρ_1 = 0, but these maxima are not equal to 1. The OLS estimator is substantially less efficient than the semiparametric efficient estimator when ρ_1 is large. The GLS estimator
Fig. 2. ARE's for estimating the mean difference of the outcomes Y_2 in the two groups. Here, Y_1 is always observed, P(Y_2 missing) = 0.5, Corr(Y_1, Y_2) = √0.5 in the first group, and Corr(Y_1, Y_2) varies in the second group.

performs relatively well over the whole range of ρ_1, as indicated by the theory, since E(ε_2 | Y_1) is well approximated by a linear function of Y_1 over the range of high probability values of Y_1.

5.3. Estimation of Occasion-Specific Slopes

We now consider the efficiency of different estimators of β_0 in the model

E(Y_t | X) = β_{0,0,t} + β_{0,1,t} X*,  (32)

for an arbitrary random variable X*. In what follows it will be convenient to define β_0* = (β_{0,0,1}, β_{0,0,2}, ..., β_{0,0,n}, β_{0,1,1}, ..., β_{0,1,n})'. The vector β_0* is obtained by permuting the elements of β_0 so that the first n elements of β_0*
are the time-ordered intercepts and the last n elements of β_0* are the time-ordered slopes. The semiparametric variance bound for estimating β_0* in model (32) is

Ω*_eff = E{ (I, X*I)' K_eff(X)^{-1} (I, X*I) }^{-1},  (33)

where I is the n × n identity matrix and

K_eff(X) = Var(ε | X*) + Σ_{t=1}^{n} [(1 - λ_t)/π_t] E[Var(ε | Ȳ_t, X*) | X*].

If, for t = 1, ..., n,

Var(ε | X*), E[Var(ε | Ȳ_t, X*) | X*], and λ_t do not depend on X*,  (34)

then K_eff(X) is a constant matrix K_eff and

Ω*_eff = [ K_eff^{-1}, μ_1 K_eff^{-1}; μ_1 K_eff^{-1}, μ_2 K_eff^{-1} ]^{-1},

where μ_1 = E(X*) and μ_2 = E(X*²). The semiparametric variance bound for estimating the vector of occasion-specific slopes β_{0,1} = (β_{0,1,1}, ..., β_{0,1,n})' is the n × n lower rightmost block of Ω*_eff, which, when (34) holds, is, by the formula for the inverse of a partitioned matrix, equal to

Ω_{1,eff} = [ μ_2 K_eff^{-1} - μ_1 K_eff^{-1} K_eff μ_1 K_eff^{-1} ]^{-1} = K_eff / Var(X*).

Thus, when (34) holds, the semiparametric variance bound for estimating the slope at the last occasion is given by the lower rightmost element of Ω_{1,eff}, and it is equal to

AVar(β̂_{EFF,1,n}) = [ Var(ε_n) + Σ_{t=1}^{n} [(1 - λ_t)/π_t] E[Var(ε_n | Ȳ_t, X*)] ] / Var(X*).  (35)

Consider now β̂*_G, the generalized least squares estimator of β_0*. Its asymptotic variance is given by

Ω*_l = E{ (I, X*I)' K_l(X)^{-1} (I, X*I) }^{-1},  (36)
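Under (34) the information matrix in (33) has a Kronecker structure, [1, μ_1; μ_1, μ_2] ⊗ K_eff^{-1}, so the slope block of the bound reduces to K_eff/Var(X*) purely by the partitioned-inverse formula. A quick numerical check of that linear-algebra step (Python/NumPy; the matrix K below and the moments μ_1, μ_2 are arbitrary stand-ins, not quantities from the paper):

```python
import numpy as np

# Arbitrary symmetric positive definite K (stand-in for K_eff), and
# moments mu1 = E(X*), mu2 = E(X*^2) of a non-degenerate X*.
K = np.array([[2.0, 0.5, 0.1],
              [0.5, 1.5, 0.3],
              [0.1, 0.3, 1.0]])
mu1, mu2 = 0.4, 0.4          # e.g. X* ~ Bernoulli(0.4), so E(X*^2) = E(X*)
var_x = mu2 - mu1**2

# Information matrix E{(I, X*I)' K^{-1} (I, X*I)} = [1, mu1; mu1, mu2] (x) K^{-1}.
A = np.array([[1.0, mu1], [mu1, mu2]])
info = np.kron(A, np.linalg.inv(K))

bound = np.linalg.inv(info)          # the 2n x 2n variance bound
slope_block = bound[3:, 3:]          # lower rightmost n x n block

# Partitioned-inverse identity: the slope block equals K / Var(X*).
print(np.allclose(slope_block, K / var_x))   # True
```

The check works because inv(A ⊗ B) = inv(A) ⊗ inv(B), and the lower-right entry of inv([1, μ_1; μ_1, μ_2]) is 1/(μ_2 - μ_1²) = 1/Var(X*).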
In (36), K_l(X) = Var(ε | X*) + Σ_{t=1}^{n} [(1 - λ_t)/π_t] E[Var_l(ε | Ȳ_t, X*) | X*], and Var_l(ε | Ȳ_t, X*) = Cov(ε, Ȳ_t | X*) Var(Ȳ_t | X*)^{-1} Cov(Ȳ_t, ε | X*). When λ_t and Var(Y | X*) do not depend on X*, an argument identical to the one used to derive (35) now gives

AVar(β̂_{G,1,n}) = [ Var(ε_n) + Σ_{t=1}^{n} [(1 - λ_t)/π_t] Var_l(ε_n | Ȳ_t) ] / Var(X*).  (37)

Thus, when (34) holds, β̂_{G,1,n} is semiparametric efficient if and only if E[Var_l(ε_n | Ȳ_t, X)] = E[Var(ε_n | Ȳ_t, X)], or equivalently when E(ε_n | Ȳ_t, X) is linear in Ȳ_t, as noted also by Robins and Rotnitzky [9]. When λ_t is not a function of X*, β̂_{OLS,1,n} is computed from a fraction of the outcomes Y_n that, as the sample size grows, converges to π_n. Thus, the asymptotic variance of the OLS estimator of β_{0,1,n} is equal to Var(ε_n)/[π_n Var(X*)]. A straightforward calculation shows that this variance can be rewritten as

AVar(β̂_{OLS,1,n}) = [ Var(ε_n) + Σ_{t=1}^{n} [(1 - λ_t)/π_t] Var(ε_n) ] / Var(X*).  (38)

Comparing Eqs. (37) and (38) to Eqs. (23) and (24), it follows that when (34) holds the asymptotic variances of the estimators β̂_{OLS,1,n} and β̂_{G,1,n} of the occasion-specific slopes are equal to the asymptotic variances of the corresponding estimators of the occasion-specific means divided by the variance of X*. We conclude that when (34) holds, the asymptotic relative efficiencies of β̂_{OLS,1,n} and β̂_{G,1,n} compared to β̂_{EFF,1,n} are less than or equal to those discussed in Section 5.1 for estimation of the mean of Y_n.

Consider now the estimation of β_{0,1,n} in the ``last-mean-linear'' model (6) with the additional restriction (1), where X = (1, X*)'. The semiparametric variance bound for estimating β_{0,n} in this model is given by Ω_last = Var[S(d*_eff, φ*_eff)(β_0)]^{-1}. It is straightforward to show that

Ω_last^{-1} = E{ [ K_{eff,n}(X)^{-1}, X* K_{eff,n}(X)^{-1}; X* K_{eff,n}(X)^{-1}, X*² K_{eff,n}(X)^{-1} ] },

where K_{eff,n}(X) is the lower rightmost element of the n × n matrix K_eff(X). Thus, when (34) holds,

Ω_last^{-1} = K_{eff,n}^{-1} [ 1, μ_1; μ_1, μ_2 ],
and the semiparametric variance bound for estimating β_{0,1,n} is equal to K_{eff,n}/Var(X*), which, by (35), coincides with AVar(β̂_{EFF,1,n}). This result says that when (34) holds, knowledge that the conditional means of Y_t given X* are linear functions of X* for t = 1, ..., n - 1 does not asymptotically add information about the parameter β_{0,1,n}. It is interesting to note that since (a) β̂_{OLS,1,n} is semiparametric efficient when (34) holds and data on Ȳ_{n-1} are not available, and (b) the asymptotic variance of β̂_{OLS,1,n} is larger than the asymptotic variance of β̂_{EFF,1,n} when, given X, Ȳ_{n-1} and ε_n are statistically dependent, then, as opposed to the full-data case, data on Ȳ_{n-1} provide information about β_{0,1,n} when, given X, Ȳ_{n-1} is a predictor of Y_n. When (34) is not true, the lower rightmost elements of Ω*_eff and Ω_last may not be equal. In such cases, knowledge of the linearity of the conditional means of Y_t given X does provide additional information about β_{0,1,n}. Finally, the asymptotic variance of β̂_{GEE,n} is given by (9) with d_l and φ_l defined in Lemma 1(b). The results of Section 5.1 suggest that β̂_{GEE,n} can be even less efficient than β̂_{OLS,n} for some misspecified working covariance models (5). A detailed study of which estimated covariances Ĉ(X) lead to β̂_{GEE,n} being less efficient than β̂_{OLS,n} is beyond the scope of this paper.

6. FINAL REMARKS

In this paper we have examined the relative efficiencies of various estimators of the parameters β_t indexing the occasion-specific linear models for the conditional means of Y_t given X, t = 1, ..., n, when the outcomes Y_t are MCAR and the missing data patterns are monotone. We have shown that, as opposed to the case in which the full-data vector Y is observed for all subjects, the GLS and OLS estimators can be less efficient than the semiparametric efficient estimator of β_t. We have noted that the efficiency loss of the GLS estimator of β_t is related to the degree of nonlinearity of the conditional means E(Y_t | Ȳ_{t-1}, X) as functions of Ȳ_{t-1}.
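The source of the GLS efficiency loss noted above, nonlinearity of the conditional mean of the residual, can be made concrete in a two-occasion toy example where every quantity is computable exactly. Below, Y_1 is a three-point variable and ε_2 = g(Y_1) + u with g nonlinear but uncorrelated with Y_1, so the linear projection on Y_1 recovers nothing while conditioning on Y_1 does. The distribution and numbers are hypothetical, chosen only for illustration:

```python
import numpy as np

# Discrete Y1 with P(Y1 = y) = 1/3 for y in {-1, 0, 1}.
y1 = np.array([-1.0, 0.0, 1.0])
p = np.full(3, 1.0 / 3.0)

# eps2 = g(Y1) + u, with u independent of Y1, E(u) = 0, Var(u) = tau2.
tau2 = 0.25
g = y1**2 - np.sum(p * y1**2)        # centered quadratic, so E(eps2) = 0

# Conditional variance: Var(eps2 | Y1) = tau2 for every value of Y1.
cond_var = tau2

# Linear-projection residual variance:
#   Var_l(eps2 | Y1) = Var(eps2) - Cov(Y1, eps2)^2 / Var(Y1).
var_eps2 = np.sum(p * g**2) + tau2
cov = np.sum(p * y1 * g)             # = 0 here: g is even, Y1 symmetric
var_y1 = np.sum(p * y1**2)
var_l = var_eps2 - cov**2 / var_y1

# Nonlinear E(eps2 | Y1) makes Var_l strictly exceed E[Var(eps2 | Y1)];
# this gap is exactly what drives the GLS efficiency loss.
print(var_l, cond_var)               # approximately 0.4722 and 0.25
```

Here E[Var(ε_2 | Y_1)] = 0.25 while Var_l(ε_2 | Y_1) = 2/9 + 0.25, so a GLS-type estimator, which only exploits the linear projection, gains nothing from Y_1 even though Y_1 is highly informative about ε_2.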
We also observed that, as opposed to the full-data case, the OLS estimator of β_t is inefficient, since it only uses X and the outcomes Y_t recorded at the tth occasion, and with monotone missing data the outcomes recorded prior to time t carry information about β_t. Finally, the results of Lemma 1 are valid also when model (3) is replaced by E(Y_t | X) = g_t(X, β_0), where g_t(X, β_0) is a, possibly nonlinear, function of X and β_0. When g_t(X, β_0) depends on β_0 only through the occasion-specific parameters β_{0t}, but g_t(X, β_0) is not a linear function of β_{0t}, then β̂_OLS and β̂_G are no longer equal, even when no Y_t's are missing. Thus, with full data and nonlinear conditional mean models, data on Y_j, j ≠ t, provide information about the occasion-specific parameters indexing the conditional mean of Y_t given X.
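A small arithmetical point behind variance formulas such as (24) and (38): with monotone missingness and π_t = λ_1 ⋯ λ_t, the weights satisfy 1 + Σ_{t=1}^{n} (1 - λ_t)/π_t = 1/π_n, because each term telescopes as 1/π_t - 1/π_{t-1}. The occasion-by-occasion OLS variance expression is therefore just Var(ε_n)/π_n rewritten. A numerical check with arbitrary continuation probabilities (the λ values below are made up):

```python
import numpy as np

# Arbitrary occasion-specific continuation probabilities lambda_t,
# with lambda_1 = 1 (baseline always observed, as in the example).
lam = np.array([1.0, 0.8, 0.7, 0.9])
pi = np.cumprod(lam)                 # pi_t = P(R_t = 1)

lhs = 1.0 + np.sum((1.0 - lam) / pi)
rhs = 1.0 / pi[-1]

# Telescoping identity: (1 - lam_t)/pi_t = 1/pi_t - 1/pi_{t-1}.
print(np.isclose(lhs, rhs))          # True
```

So the decomposition in (38) does not change the total, 1/π_n; it only attributes the variance inflation to the successive dropout occasions, which is what makes it directly comparable to (35) and (37).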
APPENDIX

Proof of Lemma 1. Part (a) is exactly Lemma 1 of Robins and Rotnitzky [9]. To prove part (b) we will show that the estimator β̃_GEE defined below satisfies β̃_GEE = β̂_l(C*), and then argue that since β̂_GEE is asymptotically equivalent to β̃_GEE, then β̂_GEE and β̂_l(C*) must have the same asymptotic distribution. The estimator β̃_GEE solves

Σ_{i=1}^{N} (I ⊗ X_i) C*(X_i)^{-1} Δ_i ε_i(β) = 0,  (39)

where Δ_i = diag(R_{ij}) is the n × n diagonal matrix with diagonal elements R_{ij}, j = 1, ..., n. Robins and Rotnitzky [9] showed that when Cov(Y | X) = C*(X),

(I ⊗ X_i) C*(X_i)^{-1} Δ_i ε_i = U_i(d_l^C, φ_l^C; β_0),  (40)

where d_l^C and φ_l^C are defined in Section 4. By definition, U_i(d_l^C, φ_l^C; β_0) is a linear function of ε_i. Thus, U_i(d_l^C, φ_l^C; β_0) = a(X_i, R_i) ε_i for some a(X_i, R_i). Let b(X_i, R_i) ≡ (I ⊗ X_i) C*(X_i)^{-1} Δ_i and h(X_i, R_i) ≡ a(X_i, R_i) - b(X_i, R_i). By (40), h(X_i, R_i) ε_i = 0 when Cov(Y | X) = C*(X). Thus, by the MCAR assumption (1), Cov[h(X_i, R_i) ε_i | X_i, R_i] = h(X_i, R_i) C*(X_i) h(X_i, R_i)' = 0, which, by C*(X) being a positive definite matrix, implies that h(X_i, R_i) = 0 almost everywhere. Hence, a(X_i, R_i) = b(X_i, R_i) a.e. and Eq. (40) is true even when Cov(Y | X) ≠ C*(X), which ends the proof of part (b). Part (c) follows immediately from part (b) by noting that β̂_OLS solves (39) with C*(X) = I.

Proof that E[Var_l(Y_2 | Y_1)] = E[Var(Y_2 | Y_1)] is equivalent to E(Y_2 | Y_1) being linear in Y_1. Suppose first that E(Y_2 | Y_1) is linear in Y_1; then E(Y_2 | Y_1) = E(Y_2) + Cov(Y_1, Y_2) Var(Y_1)^{-1} ε_1 and Var[E(Y_2 | Y_1)] = Cov(Y_1, Y_2)² Var(Y_1)^{-1}. Thus E[Var(Y_2 | Y_1)] = Var(Y_2) - Var[E(Y_2 | Y_1)] implies E[Var(Y_2 | Y_1)] = Var(Y_2) - Cov(Y_1, Y_2)² Var(Y_1)^{-1}, which proves that E[Var(Y_2 | Y_1)] = E[Var_l(Y_2 | Y_1)]. Suppose now that E[Var(Y_2 | Y_1)] = Var(Y_2) - Cov(Y_1, Y_2)² Var(Y_1)^{-1}; then Var[E(Y_2 | Y_1)] = Cov(Y_1, Y_2)² Var(Y_1)^{-1}. Thus, Var[E(Y_2 | Y_1)] = Var[Cov(Y_1, Y_2) Var(Y_1)^{-1} ε_1]. Now, Var[E(Y_2 | Y_1) - Cov(Y_1, Y_2) Var(Y_1)^{-1} ε_1] = Var[E(Y_2 | Y_1)] + Var[Cov(Y_1, Y_2) Var(Y_1)^{-1} ε_1] - 2 Cov[E(Y_2 | Y_1), Cov(Y_1, Y_2) Var(Y_1)^{-1} ε_1].
But Cov[E(Y_2 | Y_1), Cov(Y_1, Y_2) Var(Y_1)^{-1} ε_1] = E[Y_2 ε_1] Cov(Y_1, Y_2) Var(Y_1)^{-1} = Cov(Y_1, Y_2)² Var(Y_1)^{-1}. Thus Var[E(Y_2 | Y_1) - Cov(Y_1, Y_2) Var(Y_1)^{-1} ε_1] = 0, which proves the assertion.

Proof that Var_l(Y_2 | Y_1) = Var(Y_2) is equivalent to Cov(Y_1, Y_2) = 0. By definition, Var_l(Y_2 | Y_1) = Var(Y_2) - Cov(Y_1, Y_2)² Var(Y_1)^{-1}; thus
Var_l(Y_2 | Y_1) = Var(Y_2) if and only if Cov(Y_1, Y_2)² Var(Y_1)^{-1} = 0, which is equivalent to Cov(Y_1, Y_2) = 0.

Proof that μ̂_{G,2} and μ̂_{IMP,2} are asymptotically equivalent. Since β̂_G = β̂_l(Σ), then by the definition of β̂_{l,2}(Σ),

√N (μ̂_{G,2} - μ_0) = N^{-1/2} Σ_{i=1}^{N} { R_{i2} ε_{i2}/λ_2 - [(R_{i2} - λ_2)/λ_2] Cov(Y_1, Y_2) Var(Y_1)^{-1} ε_{i1} }.  (41)

Also, by the definition of μ̂_{IMP,2},

√N (μ̂_{IMP,2} - μ_0) = N^{-1/2} Σ_{i=1}^{N} { R_{i2} ε_{i2} + (1 - R_{i2}) [ Ȳ_{2,obs} + (Côv(Y_1, Y_2)/Vâr(Y_1)) ε̂_{i1} - μ_0 ] },

where Côv(Y_1, Y_2) and Vâr(Y_1) are the sample covariance of Y_1 and Y_2 and the sample variance of Y_1 among subjects with R_2 = 1, ε̂_{i1} = Y_{i1} - Ȳ_{1,obs}, and Ȳ_{j,obs}, j = 1, 2, is the sample average of Y_j from subjects with R_2 = 1. Now,

Σ_{i=1}^{N} [ R_{i2} ε_{i2} + (1 - R_{i2})(Ȳ_{2,obs} - μ_0) ] = (N / Σ_{i=1}^{N} R_{i2}) Σ_{i=1}^{N} R_{i2} ε_{i2}

and

Σ_{i=1}^{N} (1 - R_{i2}) (Côv(Y_1, Y_2)/Vâr(Y_1)) ε̂_{i1} = (Côv(Y_1, Y_2)/Vâr(Y_1)) { Σ_{i=1}^{N} ε_{i1} - (N / Σ_{i=1}^{N} R_{i2}) Σ_{i=1}^{N} R_{i2} ε_{i1} }.

Thus, replacing the sample moments by their probability limits,

√N (μ̂_{IMP,2} - μ_0) = N^{-1/2} Σ_{i=1}^{N} { R_{i2} ε_{i2}/λ_2 + [Cov(Y_1, Y_2)/Var(Y_1)] [ ε_{i1} - R_{i2} ε_{i1}/λ_2 ] } + o_p(1)
 = N^{-1/2} Σ_{i=1}^{N} { R_{i2} ε_{i2}/λ_2 - [Cov(Y_1, Y_2)/Var(Y_1)] [(R_{i2} - λ_2)/λ_2] ε_{i1} } + o_p(1),  (42)
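The appendix argument above establishes the asymptotic equivalence of μ̂_{G,2} and the impute-then-average estimator. A useful stepping stone is purely algebraic and holds exactly in any finite sample: filling each missing Y_2 with Ȳ_{2,obs} + b̂(Y_1 - Ȳ_{1,obs}), where b̂ is the complete-case regression slope, and then averaging gives exactly the classical regression estimator Ȳ_{2,obs} + b̂(Ȳ_1 - Ȳ_{1,obs}). A quick check on simulated data (the generating model, seed, and variable names are arbitrary illustrations):

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200
y1 = rng.normal(size=n)
y2 = 0.5 * y1 + rng.normal(size=n)
r2 = rng.random(n) < 0.6             # MCAR indicator for observing Y2

obs = r2
b_hat = np.cov(y1[obs], y2[obs])[0, 1] / np.var(y1[obs], ddof=1)
y1_obs_bar, y2_obs_bar = y1[obs].mean(), y2[obs].mean()

# Impute each missing Y2 from the complete-case regression, then average.
y2_filled = np.where(r2, y2, y2_obs_bar + b_hat * (y1 - y1_obs_bar))
mu_imp = y2_filled.mean()

# Classical regression estimator of E(Y2).
mu_reg = y2_obs_bar + b_hat * (y1.mean() - y1_obs_bar)

print(np.isclose(mu_imp, mu_reg))    # True
```

The identity follows by splitting the average of the filled-in vector over observed and missing subjects; the asymptotic expansion (42) then applies to this common value.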
The second equality in (42) follows by Slutsky's theorem. Thus, by (41) and (42) and the central limit theorem, μ̂_{IMP,2} and μ̂_{G,2} are asymptotically equivalent.

Proof of Eqs. (25) and (26). Let

M_C(ε_i) = R_{in} ε_i / π_n - Σ_{t=1}^{n} [ (R_{it} - λ_t R_{i(t-1)}) / π_t ] Cov_C(ε_i, ε̄_{it}) Var_C(ε̄_{it})^{-1} ε̄_{it},

where Cov_C and Var_C are calculated under the assumption that Cov(Y | X*) = C(X*). The generalized least squares estimator β̂_G is asymptotically equivalent to the estimator β̂_l(C) that solves

Σ_{i=1}^{N} (I ⊗ X_i) K^{-1}(X_i) M_C[ε_i(β)] = 0,  (43)

where X_i = (1, X*_i)', K(X_i) = Var[M_C(ε_i) | X*_i], and C(X_i) = Var(Y_i | X*_i). When X* is a binary variable, (43) is equivalent to

Σ_{i: X*_i = 0} (I, 0)' K_0^{-1} M_C[ε_i^{(0)}(β)] + Σ_{i: X*_i = 1} (I, I)' K_1^{-1} M_C[ε_i^{(1)}(β)] = 0,  (44)

where K_j^{-1} = K^{-1}(X* = j), ε_i^{(0)}(β) is the n × 1 vector with jth element Y_{ij} - β_{0j}, and ε_i^{(1)}(β) is the n × 1 vector with jth element Y_{ij} - β_{0j} - β_{1j}. The system (44) consists of 2n equations. Rearranging these equations so that the equations occupying odd numbered places in (44) come first, we have

Σ_{i: X*_i = 0} K_0^{-1} M_C[ε_i^{(0)}(β)] + Σ_{i: X*_i = 1} K_1^{-1} M_C[ε_i^{(1)}(β)] = 0,  (45)

Σ_{i: X*_i = 1} K_1^{-1} M_C[ε_i^{(1)}(β)] = 0.  (46)

Thus, β̂_{0j}, j = 1, ..., n, solves

Σ_{i: X*_i = 0} M_C[ε_i^{(0)}(β)] = 0,  (47)

and it is therefore equal to the generalized least squares estimator of β_{0,0} based on subjects with X* = 0. Similarly, β̂_{0j} + β̂_{1j}, j = 1, ..., n, solves

Σ_{i: X*_i = 1} M_C[ε_i^{(1)}(β)] = 0,  (48)
which is the generalized least squares estimator of the mean vector among subjects with X* = 1. Thus it follows that β̂_{1j} = (β̂_{0j} + β̂_{1j}) - β̂_{0j} is the difference between the generalized least squares estimator of the mean vector among subjects with X* = 1 and the GLS estimator of the mean vector among subjects with X* = 0. That relationships (47) and (48) hold also for the GEE, OLS, and semiparametric efficient estimators follows by an analogous argument, considering the appropriate functions M_C[ε_i(β)] in each case.

Proof of Eq. (20). E(Y_1) = 0 since (1) Y_1 = Z_1^{7/3}, (2) the function h(z) = z^{7/3} is odd, and (3) Z_1 has a symmetric distribution with zero mean. Thus, Var(Y_1) = E(Y_1²) = E(Z_1^{14/3}). Also, Cov(Y_1, Y_2) = E(Y_1 ε_2) and E(Y_1 ε_2) = E[Y_1 E(ε_2 | Y_1)]. But E(ε_2 | Y_1) = E(ε_2 | Z_1) because h(z) = z^{7/3} is a one-to-one function. Thus, E[Y_1 E(ε_2 | Y_1)] = E(Y_1 ρ Z_1) = ρ E(Z_1^{10/3}). Finally, Corr(Y_1, Y_2) ≡ Cov(Y_1, Y_2) [Var(Y_1) Var(Y_2)]^{-1/2} = ρ E(Z_1^{10/3}) / √E(Z_1^{14/3}) because Var(Y_2) = 1.

ACKNOWLEDGMENT

This work was conducted as part of Christina Holcroft's doctoral dissertation.

REFERENCES

1. Begun, J. M., Hall, W. J., Huang, W. M., and Wellner, J. A. (1983). Information and asymptotic efficiency in parametric-nonparametric models. Ann. Statist.
2. Carroll, R. J., and Ruppert, D. (1982). Robust estimation in heteroscedastic linear models. Ann. Statist.
3. Chamberlain, G. (1987). Asymptotic efficiency in estimation with conditional moment restrictions. J. Econometrics.
4. Johnson, R. A., and Wichern, D. W. (1988). Applied Multivariate Statistical Analysis, 2nd ed. Prentice-Hall, Englewood Cliffs, NJ.
5. Liang, K.-Y., and Zeger, S. L. (1986). Longitudinal data analysis using generalized linear models. Biometrika.
6. Little, R. J. A., and Rubin, D. B. (1987). Statistical Analysis with Missing Data. Wiley, New York.
7. Rao, C. R. (1973). Linear Statistical Inference and Its Applications, 2nd ed. Wiley, New York.
8. Robins, J. M., Mark, S. D., and Newey, W. K. (1992). Estimating exposure effects by modelling the expectation of exposure conditional on confounders. Biometrics.
9. Robins, J. M., and Rotnitzky, A. (1995). Semiparametric efficiency in multivariate regression models with missing data. J. Amer. Statist. Assoc.
10. Rotnitzky, A., and Robins, J. M. (1995). Semiparametric regression estimation in the presence of dependent censoring. Biometrika.
11. Rubin, D. B. (1976). Inference and missing data. Biometrika.
12. Seber, G. A. F. (1984). Multivariate Observations. Wiley, New York.
More informationDr. Shalabh. Indian Institute of Technology Kanpur
Aalyss of Varace ad Desg of Expermets-I MODULE -I LECTURE - SOME RESULTS ON LINEAR ALGEBRA, MATRIX THEORY AND DISTRIBUTIONS Dr. Shalabh Departmet t of Mathematcs t ad Statstcs t t Ida Isttute of Techology
More informationLecture 9: Tolerant Testing
Lecture 9: Tolerat Testg Dael Kae Scrbe: Sakeerth Rao Aprl 4, 07 Abstract I ths lecture we prove a quas lear lower boud o the umber of samples eeded to do tolerat testg for L dstace. Tolerat Testg We have
More informationChapter -2 Simple Random Sampling
Chapter - Smple Radom Samplg Smple radom samplg (SRS) s a method of selecto of a sample comprsg of umber of samplg uts out of the populato havg umber of samplg uts such that every samplg ut has a equal
More informationMultiple Regression. More than 2 variables! Grade on Final. Multiple Regression 11/21/2012. Exam 2 Grades. Exam 2 Re-grades
STAT 101 Dr. Kar Lock Morga 11/20/12 Exam 2 Grades Multple Regresso SECTIONS 9.2, 10.1, 10.2 Multple explaatory varables (10.1) Parttog varablty R 2, ANOVA (9.2) Codtos resdual plot (10.2) Trasformatos
More informationTHE ROYAL STATISTICAL SOCIETY 2016 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE MODULE 5
THE ROYAL STATISTICAL SOCIETY 06 EAMINATIONS SOLUTIONS HIGHER CERTIFICATE MODULE 5 The Socety s provdg these solutos to assst cadtes preparg for the examatos 07. The solutos are teded as learg ads ad should
More informationChapter -2 Simple Random Sampling
Chapter - Smple Radom Samplg Smple radom samplg (SRS) s a method of selecto of a sample comprsg of umber of samplg uts out of the populato havg umber of samplg uts such that every samplg ut has a equal
More informationbest estimate (mean) for X uncertainty or error in the measurement (systematic, random or statistical) best
Error Aalyss Preamble Wheever a measuremet s made, the result followg from that measuremet s always subject to ucertaty The ucertaty ca be reduced by makg several measuremets of the same quatty or by mprovg
More informationLecture Notes 2. The ability to manipulate matrices is critical in economics.
Lecture Notes. Revew of Matrces he ablt to mapulate matrces s crtcal ecoomcs.. Matr a rectagular arra of umbers, parameters, or varables placed rows ad colums. Matrces are assocated wth lear equatos. lemets
More informationMultiple Choice Test. Chapter Adequacy of Models for Regression
Multple Choce Test Chapter 06.0 Adequac of Models for Regresso. For a lear regresso model to be cosdered adequate, the percetage of scaled resduals that eed to be the rage [-,] s greater tha or equal to
More informationObjectives of Multiple Regression
Obectves of Multple Regresso Establsh the lear equato that best predcts values of a depedet varable Y usg more tha oe eplaator varable from a large set of potetal predctors {,,... k }. Fd that subset of
More informationMultivariate Transformation of Variables and Maximum Likelihood Estimation
Marquette Uversty Multvarate Trasformato of Varables ad Maxmum Lkelhood Estmato Dael B. Rowe, Ph.D. Assocate Professor Departmet of Mathematcs, Statstcs, ad Computer Scece Copyrght 03 by Marquette Uversty
More informationProbability and. Lecture 13: and Correlation
933 Probablty ad Statstcs for Software ad Kowledge Egeers Lecture 3: Smple Lear Regresso ad Correlato Mocha Soptkamo, Ph.D. Outle The Smple Lear Regresso Model (.) Fttg the Regresso Le (.) The Aalyss of
More informationRegresso What s a Model? 1. Ofte Descrbe Relatoshp betwee Varables 2. Types - Determstc Models (o radomess) - Probablstc Models (wth radomess) EPI 809/Sprg 2008 9 Determstc Models 1. Hypothesze
More informationSimple Linear Regression
Correlato ad Smple Lear Regresso Berl Che Departmet of Computer Scece & Iformato Egeerg Natoal Tawa Normal Uversty Referece:. W. Navd. Statstcs for Egeerg ad Scetsts. Chapter 7 (7.-7.3) & Teachg Materal
More informationComplete Convergence and Some Maximal Inequalities for Weighted Sums of Random Variables
Joural of Sceces, Islamc Republc of Ira 8(4): -6 (007) Uversty of Tehra, ISSN 06-04 http://sceces.ut.ac.r Complete Covergece ad Some Maxmal Iequaltes for Weghted Sums of Radom Varables M. Am,,* H.R. Nl
More informationA New Family of Transformations for Lifetime Data
Proceedgs of the World Cogress o Egeerg 4 Vol I, WCE 4, July - 4, 4, Lodo, U.K. A New Famly of Trasformatos for Lfetme Data Lakhaa Watthaacheewakul Abstract A famly of trasformatos s the oe of several
More informationMATH 247/Winter Notes on the adjoint and on normal operators.
MATH 47/Wter 00 Notes o the adjot ad o ormal operators I these otes, V s a fte dmesoal er product space over, wth gve er * product uv, T, S, T, are lear operators o V U, W are subspaces of V Whe we say
More informationDepartment of Agricultural Economics. PhD Qualifier Examination. August 2011
Departmet of Agrcultural Ecoomcs PhD Qualfer Examato August 0 Istructos: The exam cossts of sx questos You must aswer all questos If you eed a assumpto to complete a questo, state the assumpto clearly
More informationFaculty Research Interest Seminar Department of Biostatistics, GSPH University of Pittsburgh. Gong Tang Feb. 18, 2005
Faculty Research Iterest Semar Departmet of Bostatstcs, GSPH Uversty of Pttsburgh Gog ag Feb. 8, 25 Itroducto Joed the departmet 2. each two courses: Elemets of Stochastc Processes (Bostat 24). Aalyss
More information22 Nonparametric Methods.
22 oparametrc Methods. I parametrc models oe assumes apror that the dstrbutos have a specfc form wth oe or more ukow parameters ad oe tres to fd the best or atleast reasoably effcet procedures that aswer
More informationEcon 388 R. Butler 2016 rev Lecture 5 Multivariate 2 I. Partitioned Regression and Partial Regression Table 1: Projections everywhere
Eco 388 R. Butler 06 rev Lecture 5 Multvarate I. Parttoed Regresso ad Partal Regresso Table : Projectos everywhere P = ( ) ad M = I ( ) ad s a vector of oes assocated wth the costat term Sample Model Regresso
More informationExtreme Value Theory: An Introduction
(correcto d Extreme Value Theory: A Itroducto by Laures de Haa ad Aa Ferrera Wth ths webpage the authors ted to form the readers of errors or mstakes foud the book after publcato. We also gve extesos for
More informationRademacher Complexity. Examples
Algorthmc Foudatos of Learg Lecture 3 Rademacher Complexty. Examples Lecturer: Patrck Rebesch Verso: October 16th 018 3.1 Itroducto I the last lecture we troduced the oto of Rademacher complexty ad showed
More informationMedian as a Weighted Arithmetic Mean of All Sample Observations
Meda as a Weghted Arthmetc Mea of All Sample Observatos SK Mshra Dept. of Ecoomcs NEHU, Shllog (Ida). Itroducto: Iumerably may textbooks Statstcs explctly meto that oe of the weakesses (or propertes) of
More informationBounds on the expected entropy and KL-divergence of sampled multinomial distributions. Brandon C. Roy
Bouds o the expected etropy ad KL-dvergece of sampled multomal dstrbutos Brado C. Roy bcroy@meda.mt.edu Orgal: May 18, 2011 Revsed: Jue 6, 2011 Abstract Iformato theoretc quattes calculated from a sampled
More informationTHE EFFICIENCY OF EMPIRICAL LIKELIHOOD WITH NUISANCE PARAMETERS
Joural of Mathematcs ad Statstcs (: 5-9, 4 ISSN: 549-3644 4 Scece Publcatos do:.3844/jmssp.4.5.9 Publshed Ole ( 4 (http://www.thescpub.com/jmss.toc THE EFFICIENCY OF EMPIRICAL LIKELIHOOD WITH NUISANCE
More informationChapter 4 (Part 1): Non-Parametric Classification (Sections ) Pattern Classification 4.3) Announcements
Aoucemets No-Parametrc Desty Estmato Techques HW assged Most of ths lecture was o the blacboard. These sldes cover the same materal as preseted DHS Bometrcs CSE 90-a Lecture 7 CSE90a Fall 06 CSE90a Fall
More informationChapter 9 Jordan Block Matrices
Chapter 9 Jorda Block atrces I ths chapter we wll solve the followg problem. Gve a lear operator T fd a bass R of F such that the matrx R (T) s as smple as possble. f course smple s a matter of taste.
More informationTHE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE
THE ROYAL STATISTICAL SOCIETY 00 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER I STATISTICAL THEORY The Socety provdes these solutos to assst caddates preparg for the examatos future years ad for the
More informationSTA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #1
STA 08 Appled Lear Models: Regresso Aalyss Sprg 0 Soluto for Homework #. Let Y the dollar cost per year, X the umber of vsts per year. The the mathematcal relato betwee X ad Y s: Y 300 + X. Ths s a fuctoal
More informationChapter 3 Sampling For Proportions and Percentages
Chapter 3 Samplg For Proportos ad Percetages I may stuatos, the characterstc uder study o whch the observatos are collected are qualtatve ature For example, the resposes of customers may marketg surveys
More informationLECTURE - 4 SIMPLE RANDOM SAMPLING DR. SHALABH DEPARTMENT OF MATHEMATICS AND STATISTICS INDIAN INSTITUTE OF TECHNOLOGY KANPUR
amplg Theory MODULE II LECTURE - 4 IMPLE RADOM AMPLIG DR. HALABH DEPARTMET OF MATHEMATIC AD TATITIC IDIA ITITUTE OF TECHOLOGY KAPUR Estmato of populato mea ad populato varace Oe of the ma objectves after
More informationAnalysis of Variance with Weibull Data
Aalyss of Varace wth Webull Data Lahaa Watthaacheewaul Abstract I statstcal data aalyss by aalyss of varace, the usual basc assumptos are that the model s addtve ad the errors are radomly, depedetly, ad
More information12.2 Estimating Model parameters Assumptions: ox and y are related according to the simple linear regression model
1. Estmatg Model parameters Assumptos: ox ad y are related accordg to the smple lear regresso model (The lear regresso model s the model that says that x ad y are related a lear fasho, but the observed
More informationSTRONG CONSISTENCY FOR SIMPLE LINEAR EV MODEL WITH v/ -MIXING
Joural of tatstcs: Advaces Theory ad Alcatos Volume 5, Number, 6, Pages 3- Avalable at htt://scetfcadvaces.co. DOI: htt://d.do.org/.864/jsata_7678 TRONG CONITENCY FOR IMPLE LINEAR EV MODEL WITH v/ -MIXING
More informationSTATISTICAL INFERENCE
(STATISTICS) STATISTICAL INFERENCE COMPLEMENTARY COURSE B.Sc. MATHEMATICS III SEMESTER ( Admsso) UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION CALICUT UNIVERSITY P.O., MALAPPURAM, KERALA, INDIA -
More informationThird handout: On the Gini Index
Thrd hadout: O the dex Corrado, a tala statstca, proposed (, 9, 96) to measure absolute equalt va the mea dfferece whch s defed as ( / ) where refers to the total umber of dvduals socet. Assume that. The
More information