ESTIMATION OF A LOGISTIC REGRESSION MODEL WITH MISMEASURED OBSERVATIONS
|
|
- Jennifer Griffin
- 5 years ago
- Views:
Transcription
1 Statistica Sinica 13(2003, ESTIMATION OF A LOGISTIC REGRESSION MODEL WITH MISMEASURED OBSERVATIONS K. F. Cheng and H. M. Hsueh National Cental Univesity and National Chengchi Univesity Abstact: We conside the estimation poblem of a logistic egession model. We assume the esponse obsevations and covaiate values ae both subject to measuement eos. We discuss some paametic and semipaametic estimation methods using mismeasued obsevations with validation data and deive thei asypmtotic distibutions. Ou esults ae extentions of some well known esults in the liteatue. Compaisons of the asymptotic covaiance matices of the studied estimatos ae made, and some lowe and uppe bounds fo the asymptotic elative efficiencies ae given to show the advantages of the semipaametic method. Some simulation esults also show the method pefoms well. Key wods and phases: Kenel estimation, estimated likelihood, logistic egession, measuement eo, misclassification. 1. Intoduction Logistic egession is the most popula fom of binay egession; see Cox (1970 and Pegibon (1981. Reseaches often use logistic egession to estimate the effect of vaious pedictos on some binay outcome of inteest. Basically, the model assumes that the log of the odds of the outcome is a linea function of the pedictos. That is, suppose that the vaiables (X i, Y i follow the model P (Y i = 1 X i = x i = exp(x T i β0 1 + exp(x T i β0 } F (xt i β 0, whee the X i ae andom d-vecto pedictos and the Y i ae Benoulli esponse vaiables. Usually, the maximum likelihood (ML method is used to estimate the egession coefficients β 0. Unde some egulaity conditions, the maximum likelihood estimato of β 0 is asymptotically nomal. The ML method equies that the data consist of pecise measuements fo the binay outcomes and pedictos. Howeve, the data ae often not measued pefectly. Fo instance, Golm, Holloan and Longini (1998, 1999 mentioned that collecting infomation on exposue to infection fo estimating vaccine efficacy may be mismeasued. On the othe hand, Albet, Hunsbege and Bid (1997 and Bollinge and David (1997 gave examples showing that the binay outcome of
2 112 K. F. CHENG AND H. M. HSUEH inteest may also be misclassified. It is geneally tue that the usual analyses based on the mismeasued obsevations lead to inconsistent estimation. The topic of binay egession when pedictos X i ae measued with eo has been the subject of seveal ecent papes, see Caoll, Spiegelman, Lan, Bailey and Abbott (1984, Caoll and Wand (1991, Reilly and Pepe (1995, and Lawless, Kalbfleisch and Wild(1999, etc. When the binay esponses Y i ae subject to misclassification, Pepe(1992 and Cheng and Hsueh (1999 discussed bias coection methods in the estimation of logistic egession paametes. In this pape, we study the estimation poblem of β 0 when both obsevations Y i and pedictos X i ae measued with eo. Paametic and semipaametic methods ae discussed. We find that the poposed semipaametic estimation method is a genealization of the pseudolikelihood method of Caoll and Wand (1991 and the estimated likelihood method of Pepe (1992. Many othe estimating appoaches have been poposed in the liteatue. A method based on the mean scoe was poposed by Pepe, Reilly and Fleming (1994 and Reilly and Pepe (1995 fo the mismeasued outcome data poblem and mismeasued covaiate data poblem, espectively. Howeve, Golm et al. (1999 and Lawless et al. (1999 agued that this semipaametic method is less efficient. Robins, Rotnitzky and Zhao (1994 poposed a class of semipaametic efficient estimatos fo the model with mismeasued pedictos based on the invese pobability weighted estimating equation appoach. When both esponse vaiable and pedictos ae subject to measuement eo, the semipaametic efficient estimato has not been fomally deived yet. In this pape we only emphasize vaious imputation appoaches. The weighting methods, such as the semipaametic efficient estimation deived by Robins et al. (1994, will not be discussed futhe. We assume in this pape that the complete data set consists of a pimay sample plus a smalle validation subsample which is obtained by double sampling scheme. Extension of the selection pobabilities fo the validation data set to depend on the obseved suogate covaiates is discussed biefly in Section 3. Asymptotically, we also suppose that the validation subsample size is a faction of the majo sample size. The estimatos unde discussion will be fomally defined in Section 2. In Section 3, some asymptotic esults ae deived fo semipaametic and paametic estimatos of β 0. Compaisons of thei asymptotic covaiance matices ae given in Section 4. Futhe, finite sample popeties ae exploed though a simulation study in Section Estimation Methods Suppose the tue andom vaiables (Y, X ae subject to mismeasuement and the suogate obsevations ae epesented by (Y 0, W, hee X = (1, X 1 T and W = (1, W 1 T. In addition to the pimay sample (Y 0 i, W i, i = 1,..., n},
3 LOGISTIC REGRESSION MODEL WITH MISMEASURED OBSERVATIONS 113 a smalle validation subsample is also obseved in ode to undestand the mismeasuement stuctue. The sampling scheme is to andomly select k units fom the pimay sample and at the selected units the tue measuement devices ae used to obtain the validation data. Hee the simple andom sampling scheme is consideed and fo the sake of simplicity, the fist k units of the pimay sample ae assumed to be the selected units. Thus we have the validation subsample (Yi 0, Y i, X i, W i, i = 1,..., k}. Late in Section 3 the theoetical esults will be given fo the case that the sampling scheme depends on the value of the suogate pedictos W. Fom the pimay sample, we denote the egession function of Y 0 on W = w as π 0 (w = P (Y 0 = 1 w. Fo Y and W being assumed mutually independent given X, and Y 0 and X being mutually independent given Y, W, the above egession function can be ewitten as π 0 (w = π(w, β 0, whee π(w, β = 1 θ 0 (w}ef (X T β w} + φ 0 (we F (X T β w} 1 π(w, β. Hee the expectation is taken with espect to the conditional density fx W 0, F ( = 1 F (, and the misclassification pobability functions θ 0 (w and φ 0 (w ae defined by θ 0 (w = P (Y 0 = 0 Y = 1, w and φ 0 (w = P (Y 0 = 1 Y = 0, w. Note that the suogate W is often a fallible measuement of X and coesponding to a coase patition in the sample space. The conditional independence of Y 0 and X given Y, W would be implied if the misclassification scheme depends on X only though the value of W. It is clea that the expectation EY 0 F (W T β 0 }W T ] is not necessaily zeo. Hence it is inappopiate to apply the usual likelihood method to the pimay sample fo infeence about β 0. In the following, we discuss some consistent estimates based on diffeent imputation methods. Suppose fist that fx W 0 and θ0 (w, φ 0 (w ae known a pioi. Then fo the obsevation (yj 0, w j, j = k + 1,..., n, in the pimay sample, the coesponding log-likelihood and scoe function ae L (β y 0 j, w j=y 0 j ln π(w j, β + (1 y 0 j ln π(w j, β, S (β y 0 j, w j L (β y 0 j, w j = y0 j π(w j, β π(w j, β π(w j, β 1 θ0 (w j φ 0 (w j }EF (X T j β F (X T j βx j w j }. Since, yj 0 π(w j, β π(w j, β π(w j, β = 2yj 0 1 yj 0π(w j, β + (1 yj 0 π(w j, β, 1 θ 0 (w j φ 0 (w j = 1 θ 0 EF (Xj T (w j π(w j, β} β w j} EF (Xj T β w j}e F (Xj T β w j},
4 114 K. F. CHENG AND H. M. HSUEH we have y 0 j π(w j, β π(w j, β π(w j, β 1 θ0 (w j φ 0 (w j } = E(Y j y 0 j, w j EF (X T j β w j} EF (X T j β w j}e F (X T j β w j}. Consequently, S (β yj 0, w j = E(Y j yj 0, w j EF (Xj T β w j} EF (Xj Tβ w j}e F (Xj Tβ w j} E F (Xj T β F } (Xj T βx j w j = A0 1 (y0 j, w j A 0 2 (y0 j, w j}ef (Xj T β F (Xj T βx j w j } A 0 1 (y0 j, w jef (Xj T β w j} + A 0 2 (y0 j, w je F ], (Xj T β w j} whee A 0 1 (y0 j, w j = P (Y 0 j = y 0 j Y j = 1, w j = θ 0 (w j } (1 y 0 j 1 θ 0 (w j } y 0 j, and A 0 2 (y0 j, w j = P (Y 0 j = y 0 j Y j = 0, w j = φ 0 (w j } y 0 j 1 φ 0 (w j } (1 y 0 j. Using this esult, we easily see that the MLE ˆβ f of β 0 can be obtained by solving the likelihood equations k n S(β y i, x i + S (β yj 0, w j = 0, (1 i=1 j=k+1 whee S(β y i, x i = y i F (x T i β}x i. Unfotunately, in applications fx W 0 (, θ0 ( and φ 0 ( ae aely known and ˆβ f can not be obtained. If the subsample size k is lage enough, a simple estimate ˆβ s can be obtained by using only the validation subsample, i.e., ˆβ s satisfies ki=1 S(β y i, x i = 0. Such a simple estimate is in geneal not efficient; see Cheng and Hsueh (1999. The basic eason is that the infomation contained in the pimay sample is not popely exploited. To do this, assume thee exist paametic functions and paametes (γ 0, α 0, independent of β 0, such that θ 0 (W = θ(w ; α 0, φ 0 (W = φ(w ; α 0 and fx W 0 (X W = f(x W ; γ0 almost suely. See Caoll and Wand (1991 and Cheng and Hsueh (1999 fo elated discussions. Then one can employ the usual appoach to obtain the joint MLE ( ˆβ m, ˆα m, ˆγ m. On the othe hand, the maximum pseudolikelihood estimate (MPLE ˆβ p can be obtained by solving (1 with α 0 and γ 0 in S (β yj 0, w j being eplaced by thei MLE ˆα p and ˆγ p deived fom the validation data. In geneal, ˆβm can be shown to be moe efficient than ˆβ p asymptotically. Howeve, in solving the equations fo finite sample cases, the computational complexity gows (and numeical stability deteioates with the numbe of unknown paametes, so that ˆβ m may not have pope pefomance, see the simulation study in Section 5.
5 LOGISTIC REGRESSION MODEL WITH MISMEASURED OBSERVATIONS 115 The last appoach to be discussed is a semipaametic method. Assuming fx W 0 ( w, θ0 (w and φ 0 (w in S (β yj 0, w j of (1 ae unknown functions, a nonpaametic method is used to estimate them. Fomally, we popose the MPLE ˆβ sp solving the following pseudolikelihood equations Ŝ (β y 0, w = k n S(β y i, x i + Ŝ (β yj 0, w j = 0, i=1 j=k+1 Â1(y 0, w Â2(y 0, w}êf (XT β F (X T βx w] Â1 (y 0, wêf (XT β w} + Â2(y 0, wê F (X T β w}]. Hee, fo any integable function g(x; β, the estimated expectation Êg(X; β w} is given by Êg(X; β w} = k i=1 K h (w w i g(x i ; β}/ k i=1 K h (w w i }. Moeove Â1(y 0, w and Â2(y 0, w ae estimates of A 0 1 (y0, w and A 0 2 (y0, w, with θ 0 (w and φ 0 (w being eplaced by thei nonpaametic estimates ˆθ(w = k i=1 K h (w w i y i (1 yi 0}/ k i=1 K h (w w i y i } and ˆφ(w = k i=1 K h (w w i (1 y i yi 0}/ k i=1 K h (w w i (1 y i }. The above kenel function K(t is taken to be a density function, K h (t = h 1 K(t/h, the bandwidth h depends on n and tends to zeo as n. We emak that the poposed semipaametic estimation method genealizes the methods of Caoll and Wand (1991 and Pepe (1992. If Y = Y 0 with pobability one, i.e., only measuement eo occus in the pedicto, ˆβ sp is the MPLE of Caoll and Wand (1991. On the othe hand, if X = W with pobability one, i.e., only misclassification occus in the esponse, ˆβsp educes to the maximum estimated likelihood estimate of Pepe ( Asymptotic Distibutions The asymptotic distibutions of the estimatos poposed in the peceding section will be pesented hee fo the case d = 1. Extension to geneal d is simple and the elated esults will be given in a emak. The asymptotic popeties depend on cetain egulaity conditions. A.1 β 0 Λ, an open set in R 2. A.2 E(X 2 <. A.3 The misclassification pobability functions θ 0 (w and φ 0 (w (0, 1, and the density function fw 0 (w of W ae stictly positive in the space of W. Futhemoe, these functions and thei l-th deivatives, l = 1, 2, ae in L 4 and also satisfy a weighted Lipschitz condition. (A function η( is said to satisfy a weighted Lipschitz condition if thee exists a constant c and a bounded function ψ in L 4 such that η(x η(y < ψ(x x y fo all x, y with x y < c.
6 116 K. F. CHENG AND H. M. HSUEH A.4 The paametic functions θ(w; α,φ(w; α and f(x w; γ, and thei patial deivatives θ(w; α/ α, φ(w; α/ α and f(x w; γ/ γ, ae unifomly continuous at (α 0, γ 0 fo all x and w in the suppot of X and W. A.5 The function K( is a second-ode kenel (see Gasse and Mülle (1979. A.6 h = h n 0 with nh 2 and nh 4 0, as n. Also, as n, k = n1 + O(h 2 }, whee (0, 1]. In the following, we give the basic asymptotic esults unde the assumption that α = (α 0, α 1 T and γ = (γ 0, γ 1 T. Except fo the asymptotic nomality of ˆβ sp, whose poof is given in the Appendix, the poofs fo othe estimatos ae standad and thus ae omitted. Theoem 1. Suppose conditions (A.1 (A.6 hold and let β be any estimato discussed in Section 2. Then as n, n 1/2 ( β β 0 conveges in distibution to a nomal with mean zeo. Thei asymptotic covaiance matices ae lim Cov n n1/2 ( ˆβ f β 0 } = Σ f = I v + (1 Iv} c 1, lim Cov n n1/2 ( ˆβ s β 0 } = Σ s = (I v 1, lim Cov n n1/2 ( ˆβ m β 0 } = Σ m = Σ f + lim Cov n n1/2 ( ˆβ p β 0 } = Σ p = Σ f + lim Cov n n1/2 ( ˆβ sp β 0 } = Σ sp = Σ f + Hee I v = EF (X T β 0 F (X T β 0 X 2 } and (1 2 Σ f I m Σ f, (1 2 Σ f I p Σ f, (1 2 Σ f I sp Σ f. Iv c 1 θ 0 (W φ 0 (W } 2 EF (X T β 0 F ] 2 (X T β 0 X W } = E π 0 (W 1 π 0, (W } I p = E 1 θ0 (W φ 0 (W }EF (X T β 0 F (X T β 0 X W } π 0 (W 1 π 0 (W } ( π 0 α (W ; α 0, γ 0 T π γ 0(W ; α0, γ 0 ] E F (X T β 0 θ 2 (W ;α 0 + F (X T β 0 φ 2 (W ;α θ 0 (W 1 θ 0 (W } φ 0 (W 1 φ 0 (W } 0 E f(x W ; γ 0 } 2 E 1 θ0 (W φ 0 (W }EF (X T β 0 F (X T β 0 X W } ( π 0 α (W ; α 0, γ 0 T T π 0 (W 1 π 0 (W } π γ(w 0 ; α 0, γ 0,
7 LOGISTIC REGRESSION MODEL WITH MISMEASURED OBSERVATIONS θ 0 (W φ 0 (W } EF (X T β 0 F ] 2 (X T β 0 X W } I sp = E π 0 (W 1 π 0 (W }] 2 ( EF (X T β 0 W }θ 0 (W 1 θ 0 (W }+E F (X T β 0 W }φ 0 (W 1 φ 0 (W } } + 1 θ 0 (W φ 0 (W } 2 EF 2 (X T β 0 W } E 2 F (X T β 0 W }], I m = (1 2 (β 0,α 0,γ 0 (α 0,γ 0 T (β 0,α 0,γ 0 Σ f (β 0,α 0,γ 0 } 1 T (β 0,α 0,γ 0, 1 θ 0 (W φ 0 (W }EF (X T β 0 (β 0,α 0,γ 0 = (1 E F (X T β 0 X W } π 0 (W 1 π 0 (W } ( π 0 α (W ; α 0, γ 0 T ] π γ(w 0 ; α 0, γ 0, ] (α 0,γ 0 = E F (X T β 0 θ 2 (W ;α 0 θ 0 (W 1 θ 0 (W } + F (X T β 0 φ 2 (W ;α 0 φ 0 (W 1 φ 0 (W } 0 0 E f(x W ; γ 0 } 2 ( 1 +(1 E π 0 α (W ; α 0, γ 0 2 π 0 (W 1 π 0 (W } π γ 0(W ; α0, γ 0. In this, π α (W ; α 0, γ 0 = π0 (W α (α 0,γ 0 = E F (X T β 0 W } φ(w ; α 0 EF (X T β 0 W } θ(w ; α 0, π γ (W ; α 0, γ 0 = π0 (W γ (α 0,γ 0 = 1 θ 0 (W φ 0 (W }EF (X T β 0 f(x W ; γ 0 W }, θ(w ; α 0 θ(w ; α = α, φ(w ; α 0 φ(w ; α = α 0 α, α 0 f(x W ; γ 0 = ln f(x W ; γ γ, γ 0 whee, fo any column vecto A, A 2 = AA T. Remak 1. It can be seen that I m, I p and I sp ae nonnegative definite matices, and thus (1 2 Σ f I m Σ f, (1 2 Σ f I p Σ f, (1 2 Σ f I sp Σ f can be egaded as addi-
8 118 K. F. CHENG AND H. M. HSUEH tional vaiations due to estimating the unknown functions θ 0 (W, φ 0 (W and fx W 0 by paametic and nonpaametic methods. Remak 2. It is well known that the infomation matix I v is the vaiance of the scoe; that is, I v = Va S(β 0 Y, X} = EVa S(β 0 Y, X X}]. We see that the infomation matices Iv c and I sp also have simila expessions: Iv c = EVa S (β 0 Y 0, W W }] = Eρ(Y, Y 0 W Va S (β 0 Y, W W }], and I sp = E(ρ(Y, Y 0 W 1 ρ(y, Y 0 W 1 ρ(y, F (X T β 0 W }]Va S (β 0 Y, W W }, whee S (β Y, W = ln P (Y W /, and ρ(y, Y 0 W = Co 2 (Y, Y 0 W = 1 θ 0 (W φ 0 (W } 2 EF (X T β 0 W }E F (X T β 0 W }}/π 0 (W 1 π 0 (W }], ρ(y, F (X T β 0 W = Co 2 (Y, F (X T β 0 W = EF 2 (X T β 0 W } EF (X T β 0 W }] 2 EF (X T β 0 W }E F (X T β 0. W } Thus these infomation matices depend not only on the scoe S (β 0 Y, W but also on the squaed coelation functions. Remak 3. Fo obtaining the validation subsample, a simple andom sampling design is used. Howeve, the selection pobabilities may depend on the value of W. Fo example, suppose g(w is the selection pobability. Then the asymptotic covaiance matix of ˆβ sp becomes Σ sp = Σ + Σ I sp Σ, whee Σ 1 =E g(w EF (X T β 0 F ] (X T β 0 X 2 W } +E (1 g(w } 1 θ0 (W φ 0 (W } 2 1 g(w } Isp=E 2 g(w ( 1 θ 0 (W φ 0 (W } EF (X T β 0 F (X T β 0 X W } π 0 (W 1 π 0 (W } ] 2 EF (X T β 0 F (X T β 0 X W } π 0 (W 1 π 0 (W }] 2 EF (X T β 0 W }θ 0 (W 1 θ 0 (W }+E F (X T β 0 W }φ 0 (W 1 φ 0 (W } } +1 θ 0 (W φ 0 (W } 2 EF 2 (X T β 0 W } E 2 F (X T β 0 W }]. Othe asymptotic covaiance matices in Theoem 1 can be modified accodingly. Remak 4. It is clea that Σ sp educes to the asymptotic covaiance matix of Caoll and Wand s (1991 estimato povided Y = Y 0 with pobability one. Suppose X = W with pobability one, then Σ sp is the asymptotic covaiance matix of Pepe s (1992 estimato; see also Cheng and Hsueh (1999., ] 2
9 LOGISTIC REGRESSION MODEL WITH MISMEASURED OBSERVATIONS 119 Remak 5. The esults in Theoem 1 can be extended to d > 1 vecto pedictos, povided the function K is a pth ode kenel, p > d, and the bandwidth paamete satisfies nh 2d and nh 2p 0 as n. Remak 6. All the asymptotic covaiance matices can be estimated by moment type estimates. 4. Compaisons of Asymptotic Covaiance Matices In this section, we discuss the behavios of diffeent estimatos by compaing thei asymptotic covaiance matices. The geneal esults basically agee with ou expectation, but some of thei poofs ae not tivial. Fo any matices A and B, we wite A B if A B is a nonnegative definite matix Results unde geneal conditions Fist, fom Remak 1 of Section 3, we have Σ m Σ f, Σ p Σ f and Σ sp Σ f. Futhe, suppose we let v (α 0,γ 0 } 1 be the asymptotic covaiance matix of the MLE (ˆα p, ˆγ p which is obtained fom the validation subsample. Then one can ewite Σ p as Σ p = Σ f Σ 1 f + (β 0,α 0,γ 0 v (α 0,γ 0 } 1 T (β 0,α 0,γ 0 ]Σ f, and consequently we have Σ p Σ m =Σ f (β 0,α 0,γ 0 v (α 0,γ 0 } 1 (α 0,γ 0 T (β 0,α 0,γ 0 Σ f (β 0,α 0,γ 0 } 1] T (β 0,α 0,γ 0 Σ f. Since v (α 0,γ 0 } 1 and (α 0,γ 0 T (β 0,α 0,γ 0 Σ f (β 0,α 0,γ 0 } 1 ae espectively, the asymptotic covaiance matices of (ˆα p, ˆγ p and (ˆα m, ˆγ m, the above esults yield the following theoem. Theoem 2. Given conditions (A.1 (A.6, we have Σ s Σ f, Σ sp Σ f and Σ p Σ m Σ f Results unde special constaints Cheng and Hsueh (1999 compaed the asymptotic covaiance matices when X = W with pobability one. Hee the compaisons focus on the case when Y = Y 0 with pobability one. As a consequence of Theoem 1, if Y = Y 0 with pobability one, the matices Σ f and I sp simplify to Σ f = EF (X T β 0 F (X T β 0 X 2 } ( EF (X T β 0 +(1 E F (X T β 0 X W }] 2 ] 1, EF (X T β 0 W }E F (X T β 0 W }
10 120 K. F. CHENG AND H. M. HSUEH ( ] I sp = E ρ(y, F (X T β 0 W E S (β 0 Y, W } 2 W. Then Coollay 1 follows easily fom Σ s Σ sp = Σ f ( (1 ES (β 0 Y, W } 2 (1 2 ( E ρ(y, F (X T β 0 W ES (β 0 Y, W } 2 W ] + (1 2 ES (β 0 Y, W } 2 ES(β 0 Y, X} 2] 1 ES (β 0 Y, W } Σ 2 f. Coollay 1. Suppose conditions (A.1 (A.6 ae satisfied, that Y = Y 0 with pobability one, and ρ(y, F (X T β 0 W min(/(1, 1. Then Σ s Σ sp. Accodingly, the semipaametic estimato ˆβ sp is always bette than the simple MLE ˆβ s unde ou conditions. Simila conclusion can be deived fo the case that X = W with pobability one. Now suppose β 0 is a scala. Then unde the conditions of Theoem 1 and ρ(y, F (X T β 0 W ρ, with pobability one fo some constant ρ, the ARE of ˆβ s with espect to ˆβ sp is always smalle than e 1 (ρ, I, = 1 (1 2 II ρ /(1 }] + (1 I} 2, whee the coefficient of eliability I = ES (β 0 Y, W } 2 /ES(β 0 Y, X} 2 1. This can be used to measue the quality of suogate pedictos W. Some cuves of e 1 ae given in Figue 1 fo ρ = 0.3. The esults clealy show that if the coefficient I is lage enough, then even fo a smalle sampling faction, the semipaametic estimato ˆβ sp is still much moe efficient than the simple MLE ˆβ s. 1.0 I= I=0.3 I= Bound of ARE 0.4 I=1.0 I=0.9 I= Figue 1. Fo ρ = 0.3, cuves of e 1 (ρ, I, at vaious I.
11 LOGISTIC REGRESSION MODEL WITH MISMEASURED OBSERVATIONS 121 Simila esults can be deived fo the ARE of ˆβ sp with espect to ˆβ f. We note that unde the same conditions, a lowe bound fo the ARE of ˆβsp with espect to ˆβ f is e 2 (ρ, I, = 1 (1 2 Iρ 2 + (1 I + (1 2 Iρ. Some cuves of e 2 ae also given in Figue 2 fo ρ = 0.3. The ARE is at least 0.60 fo 0.3. Futhe, fo fixed, e 2 inceases as the coefficient of eliability I deceases I=0.7 I= Bound of ARE 0.4 I=0.1 I=0.3 I= Figue 2. Fo ρ = 0.3, cuves of e 2 (ρ, I, at vaious I. 5. Simulation Studies In ode to study the finite sample pefomance of the estimates, some empiical studies wee caied out. The logistic egession model consideed was F (x T β 0 = exp(x/1 + exp(x} so (β0 0, β0 1 = (0, 1. A linea eo model fo the pedicto X was assumed: W i = X i + v U i, fo some v. X i and U i wee pseudo independent N(0, 0.25 andom vaiables. Thus given W i = w i, X i is a N(w i /(1 + v 2, (0.25v 2 /(1 + v 2 vaiate. We also assumed that given W i = w i, the misclassification pobabilities wee independent of X i and Y i, i.e., θ 0 (w i = φ 0 (w i, fo all w i, whee θ 0 (w i = θ(w i, α 0 and φ 0 (w i = φ(w i, α 0. Fou diffeent models wee consideed. (1a W i = X i + 0.1U i and Y i = Yi 0 with pobability one. In this case, no misclassification occus, and hence the only functional fom needed fo estimating ˆβ m and ˆβ p is f(x w; γ = N(w/(1 + 4γ, γ/(1 + 4γ, γ > 0. (1b W i = X i + 0.1U i and θ 0 (w i = φ 0 (w i = 0.1. In this case, the functional foms needed fo computing ˆβ m, ˆβ p ae θ(w, α = α 0, φ(w, α = α 1, and f(x w; γ = N(w/(1 + 4γ, γ/(1 + 4γ, γ > 0.
12 122 K. F. CHENG AND H. M. HSUEH (2a W i = X i + U i and Y i = Yi 0 with pobability one. In this case, no misclassification occus, and hence the only functional fom needed fo estimating ˆβ m and ˆβ p is f(x w; γ = N(w/(1 + 4γ, γ/(1 + 4γ, γ > 0. (2b W i = X i + U i and θ 0 (w i = φ 0 (w i = exp(w i 2.5/1 + exp(w i 2.5}. In this case, the misclassification pobabilities follow a logit model with egession coefficients α 0 0 = 2.5, α0 1 = 1. The functional foms needed fo estimating ˆβ m and ˆβ p ae θ(w, α = exp(α 0 + α 1 w/1 + exp(α 0 + α 1 w}, φ(w, α = exp(α 0 + α 1 w/1 + exp(α 0 + α 1 w}, and f(x w; γ = N(w/(1 + 4γ, γ/(1 + 4γ, γ > 0. Model (1b featues small measuement eo fo both esponses and covaiates. On the othe hand, Model (2b has moe sevee measuement eo. The pimay sample size used was n = 150, and the sampling factions fo validation subsamples wee = 0.2, 0.4 and 0.6. One thousand pseudo data sets wee geneated to compute the simulated mean squaed eos. The function K( used in computing the nonpaametic egession estimates was the Epanechnikov kenel, that is, K(t = (3/4(1 t 2 I 1,1] (t; see Eubank (1988. Seveal bandwidths h = aˆσ w k 1/3 wee used in ou simulations. Hee ˆσ w is the sample standad deviation of W based on the validation data set, and a is some constant value. Such choice of bandwidth was justified by Sepanski, Knickebocke and Caoll (1994; see also Caoll and Wand (1991 and Wang and Wang (1997. Fist, we investigate the pefomance of ˆβ sp fo vaious choices of a. The pefomance of ˆβsp = ( ˆβ sp,0, ˆβ sp,1 T is measued by its simulated total mean squaed eo (TMSE, defined to be MSE( ˆβ sp,0 +MSE( ˆβ sp,1. Table 1 epots the simulation esults. It is seen that the pefomance of ˆβ sp is not sensitive to the choice of a. This agees with many findings fo semipaametic estimation using kenel egession. Hee, howeve, a bette choice is a = 1 and hence we use this value fo h in emaining simulations. Table 1. Simulated TMSE of ˆβ sp fo diffeent a W = X +.1U W = X + U θ 0 (W = 0 θ 0 (W =.1 θ 0 (W = 0 θ 0 (W = exp(w exp(W 2.5} a =.2 =.4 =.6 =.2 =.4 =.6 =.2 =.4 =.6 =.2 =.4 = Next, we compae the simulated mean squaed eo of diffeent estimates fo β = (β 0, β 1 T. The esults ae tabulated in Table 2. Obviously, ˆβ sp has the best
13 LOGISTIC REGRESSION MODEL WITH MISMEASURED OBSERVATIONS 123 oveall pefomance among the competing estimates. Moeove, the behavio of the simulated TMSE fo ˆβ sp seems insensitive to the selection of values. This is paticulaly tue when the model has only mino measuement eos. Unepoted calculations also show that the simulated standad eos (SE fo ˆβ sp,0 and ˆβ sp,1 ae stable. Unde all cases consideed, the simulated SE s ange fom to fo ˆβ sp,0 and fom to fo ˆβ sp,1. In geneal, ˆβsp has much bette pefomance than the othe estimates, especially when 0.4. Fom Table 2, we also see that the estimates ˆβ f and ˆβ p ae quite competitive and bette than the simple estimate ˆβ s. The latte statement is especially clea when = 0.2. Howeve, the simulated SE s fo ˆβ f,0, ˆβ p,0 and ˆβ s,0 ae not vey diffeent. These values ange fom to On the othe hand, the simulated SE s fo ˆβ f,1, ˆβ p,1 ae smalle than those of ˆβ s,1, paticulaly when = 0.2. The lagest simulated SE fo ˆβ f,1 and ˆβ p,1 is compaed with the coesponding value fo ˆβ s,1. Table 2. Simulated TMSE of diffeent estimatos ˆβ s ˆβm ˆβp ˆβf ˆβsp W = X +.1U θ 0 (W = 0. = = = θ 0 (W = θ 0 (W = 0.1 = = = W = X + U θ 0 (W = 0. = = = exp(w exp(W 2.5 = = = Finally, we comment on the compaison between ˆβ m and ˆβ p. In geneal, ˆβ p is bette than ˆβ m unless = 0.6. This happens because thee ae moe paametes to be estimated simultaneously in computing ˆβ m, and the validation subsample size k is not lage enough. The lagest simulated SE is fo ˆβ m,1 and fo ˆβ m,0, showing that the estimates ˆβ m ae not vey stable. The pefomance of ˆβ m and ˆβ p ae expected to become bette if k is sufficiently lage and the paametic functions θ(w, α, φ(w, α and f X W (x w; γ ae coectly modeled. Cheng and Hsueh (1999 epoted that the pefomances of ˆβ m and ˆβ p depend heavily on
14 124 K. F. CHENG AND H. M. HSUEH the coect choice of the models fo misclassification pobabilities when X = W with pobability one. Theefoe, ˆβ m and ˆβ p ae not vey obust and exta cae needs to be taken to apply the paametic methods. Acknowledgements This eseach was suppoted in pat by the National Science Council, R.O.C. The authos thank the Edito, an associate edito and efeees fo thei useful comments and suggestions which geatly impove the pesentation of this pape. Appendix Poof of Theoem 1. By Taylo s Theoem and fo n, we have 1 n( ˆβsp β = 0 2ˆl(β } 0 n 2 1 3ˆl(β } 1 } 2n 3 ( ˆβ 1 ˆl(β sp β ] 0 0 +o p (1, n whee β is some quantity such that β β 0 ˆβ sp β 0. Hee ˆl is the pseudolikelihood, and ˆl(β 0 k l1,i (β 0 } n ˆl2,j (β 0 } = +, l 1,i (β 0 ˆl 2,j (β 0 Futhe, i=1 = Y i F (X T i β0 }X i, = j=k+1 Â1(Yj 0, W j Â2(Yj 0, Wj]ÊF (XT j β0 F (Xj T β0 X j W j ] ÊF (Xj T β0 W j }Â1(Yj 0, W j + Ê F (Xj T β0 W j }Â2(Yj 0, W j. 2ˆl(β 0 2 = k 2 l 1,i (β 0 2 i=1 } + n j=k+1 2 l 1,i (β 0 2 = F (X T i β 0 F (X T i β 0 X i X T i, 2ˆl2,j (β 0 2 2ˆl2,j (β 0 2 ( Â1(Yj 0 =, W j Â2(Yj 0, Wj]ÊF (XT j β0 F (Xj T β0 X j W j ] ÊF (Xj T β0 W j }Â1(Yj 0, W j + Ê F (Xj T β0 W j }Â2(Yj 0, W j }, 2 Â1(Y j 0, W j Â2(Yj 0, Wj]Ê1 2F (XT j β0 }F (Xj T β0 F (Xj T β0 X j W j ] ÊF (Xj T β0 W j }Â1(Yj 0, W j + Ê F (Xj T β0 W j }Â2(Yj 0, W. j
15 LOGISTIC REGRESSION MODEL WITH MISMEASURED OBSERVATIONS 125 By the Law of Lage Numbes, 1 2ˆl(β } 0 n 2 = 1 2 l(β 0 } 1 n 2 + 2ˆl(β 0 } n l(β 0 }] n 2 = EF (X T β 0 F (X T β 0 }XX T ] 1 θ 0 (W φ 0 (W } 2 } +(1 E π 0 (W 1 π 0 (W } (EF (XT β 0 F (X T β 0 X W ] 2 1 +o p (1 + O p ( n h h 2 n = Σ 1 f + o p (1, n, h 0 and n 2 h. Then as n, h 0, n 2 h, } n( ˆβ sp β 0 ˆl(β = Σ f + o p (1} 0 / n + o p (1. Futhe, by Taylo s Theoem again, as n, n ˆl2,j (β 0 } n l2,j (β 0 } = + H n,j + O p ( 1 + nh nh 3 2 h + nh 3 2, i=k+1 i=k+1 whee H n,j = 1 ki=1 k h i,j, and h i,j = K ( h(w i W j 2 l 2,j (β 0 f(w j θ 0 (W j 2 l 2,j (β 0 } (1 Yi Yi 0 φ 0 (W j } + φ 0 (W j E F (Xj T β0 W j } 2 l 2,j (β 0 + EF (Xj T β0 W j } 2 l 2,j (β 0 } + EF (Xj T β0 F (Xj T β0 X j W j ] } Yi 1 Y 0 i θ 0 (W j } EF (X T j β0 W j } } ] F (Xi T β 0 EF (Xj T β 0 W j } F (Xi T β0 F (Xi T β0 X i EF (Xj T β0 F (Xj T β0 X j W j ]]. As a consequence, fo n, 1 ˆl(β 0 } n k n k l1,i (β 0 } = n k(n k n i=1 j=k+1 1 +O p ( + n 1 n 3 2 h 3 2 h n 1 2 h + h n k n l2,j (β 0 } + n k n h i,j ]
16 126 K. F. CHENG AND H. M. HSUEH Define two i.i.d. andom vectos Z i = (X i, W i, Y i, Yi 0, i = 1,..., k and Z j = (W j, Yj 0, j = k + 1,..., n and let Q n (Z i ; Zj = k l1,i (β 0 } + n k l2,j (β 0 } + n k n n n h i,j. Then k 1 (n k 1 k nj=k+1 i=1 Q n (Z i ; Zj is a genealized U-statistic. Applying the Cental Limit Theoem fo genealized U-statistics (see Sefling(1980, we have, fo n, n k n Q n (Z i ; Zj k(n k i=1 j=k+1 k = n 1 k i=1 ( d N 0, Σ 1 f + EQ n (Z i ; Z j Z i} + 1 n k (1 2 I sp, n i=k+1 EQ n (Z i ; Z j Z j } + o p (1 EQ n (Z i ; Zj Z i} l1,i (β 0 } = k n + n k n 1 θ 0 (W i φ 0 (W i }EF (X T i β0 F (X T i β0 X i W i ] π 0 (W i 1 π 0 (W i } ( Y i 1 Yi 0 θ 0 (W i } (1 Y i Yi 0 φ 0 (W i } 1 θ 0 (W i φ 0 (W i }F (Xi T β0 EF (Xi T β0 W i }] + O p (h 2, EQ n (Z i ; Zj Zj } = n k l2,j (β 0 }. n Consequently, as n, n( ˆβ sp β 0 d N (0, Σ f + (1 2 Σ f I sp Σ f. Refeences Albet, P. S., Hunsbege, S. A. and Bid, F. M. (1997. Modeling epeated measues with monotonic odinal esponses and misclassification, with applications to studying matuation. J. Ame. Statist. Assoc. 92, Bollinge, C. R. and David, M. H. (1997. Modeling discete choice with esponse eo: Food stamp paticipation. J. Ame. Statist. Assoc. 92, Caoll, R. J., Spiegelman, C. H., Lan, K. K. G., Bailey, K. T. and Abbott, R. D. (1984. On eos-in-vaiables fo binay egession models. Biometika 71, Caoll, R. J. and Wand, M. P. (1991. Semipaametic estimation in logistic measuement eo models. J. Roy. Statist. Soc. Se. B 53, Cheng, K. F. and Hsueh, H. M. (1999. Coecting bias due to misclassification in the estimation of logistic egession models. Statist. Pobab. Lett. 44,
17 LOGISTIC REGRESSION MODEL WITH MISMEASURED OBSERVATIONS 127 Cox, D. R. (1970. The Analysis of Binay Data. Chapman and Hall, London. Eubank, R. L. (1988. Spline Smoothing and Nonpaametic Regession. Macel Dekke, New Yok. Gasse, T. and Mülle, H. G. (1979. Kenel Estimation of Regession Functions in Smoothing Techniques fo Cuve Estimation. (Edited by T. Gasse and M. Rosenblatt, Lectue Notes in Mathematics 757, Spinge-Velag, Belin. Golm, G. T., Halloan, M. E. and Longini, I. M. J. (1998. Semi-paametic models fo mismeasued exposue infomation in vaccine tials. Statist. Medicine 17, Golm, G. T., Halloan, M. E. and Longini, I. M. J. (1999. Semipaametic methods fo multiple exposue mismeasuement and bivaiate outcome in HIV vaccine tials. Biometics 55, Lawless, J. F., Kalbfleisch, J. D. and Wild, C. J. (1999. Semipaametic methods fo esponseselective and missing data poblems in egession. J. Roy. Statist. Soc. Se. B 61, Pepe, M. S. (1992. Infeence using suogate outcome data and a validation sample. Biometika 79, Pepe, M. S., Reilly, M. and Fleming, T. R. (1994. Auxiliay outcome data and the mean scoe method. J. Statist. Plann. Infeence 42, Pegibon, D. (1981. Logistic egession diagnostics. Ann. Statist. 9, Reilly, M. and Pepe, M. S. (1995. A mean scoe method fo missing and auxiliay covaiate data in egession models. Biometika 82, Robins, J. M., Rotnitzky, A., Zhao, L.-P. and Lipsitz, S. (1994. Estimation of egession coefficients when some egessos ae not always obseved. J. Ame. Statist. Assoc. 89, Sepanski, J. H., Knickebocke, R. and Caoll, R. J. (1994. A semipaametic coection fo attenuation. J. Ame. Statist. Assoc. 89, Sefling, R. J. (1980. Appoximation Theoems of Mathematical Statistics. John Wiley, New Yok. Wang, C. Y. and Wang, S. (1997. Semipaametic methods in logistic egession with measuement eo. Statist. Sinica 7, Gaduate Institute of Statistics, National Cental Univesity, Chungli, Taiwan, R.O.C. kfcheng@cc.ncu.edu.tw Depatment of Statistics, National Chengchi Univesity, Taipei, Taiwan, R.O.C. hsueh@nccu.edu.tw (Received Apil 2000; accepted July 2002
Central Coverage Bayes Prediction Intervals for the Generalized Pareto Distribution
Statistics Reseach Lettes Vol. Iss., Novembe Cental Coveage Bayes Pediction Intevals fo the Genealized Paeto Distibution Gyan Pakash Depatment of Community Medicine S. N. Medical College, Aga, U. P., India
More informationEstimation of the Correlation Coefficient for a Bivariate Normal Distribution with Missing Data
Kasetsat J. (Nat. Sci. 45 : 736-74 ( Estimation of the Coelation Coefficient fo a Bivaiate Nomal Distibution with Missing Data Juthaphon Sinsomboonthong* ABSTRACT This study poposes an estimato of the
More informationPearson s Chi-Square Test Modifications for Comparison of Unweighted and Weighted Histograms and Two Weighted Histograms
Peason s Chi-Squae Test Modifications fo Compaison of Unweighted and Weighted Histogams and Two Weighted Histogams Univesity of Akueyi, Bogi, v/noduslód, IS-6 Akueyi, Iceland E-mail: nikolai@unak.is Two
More informationAlternative Tests for the Poisson Distribution
Chiang Mai J Sci 015; 4() : 774-78 http://epgsciencecmuacth/ejounal/ Contibuted Pape Altenative Tests fo the Poisson Distibution Manad Khamkong*[a] and Pachitjianut Siipanich [b] [a] Depatment of Statistics,
More informationOn the Poisson Approximation to the Negative Hypergeometric Distribution
BULLETIN of the Malaysian Mathematical Sciences Society http://mathusmmy/bulletin Bull Malays Math Sci Soc (2) 34(2) (2011), 331 336 On the Poisson Appoximation to the Negative Hypegeometic Distibution
More informationA Multivariate Normal Law for Turing s Formulae
A Multivaiate Nomal Law fo Tuing s Fomulae Zhiyi Zhang Depatment of Mathematics and Statistics Univesity of Noth Caolina at Chalotte Chalotte, NC 28223 Abstact This pape establishes a sufficient condition
More informationSurveillance Points in High Dimensional Spaces
Société de Calcul Mathématique SA Tools fo decision help since 995 Suveillance Points in High Dimensional Spaces by Benad Beauzamy Januay 06 Abstact Let us conside any compute softwae, elying upon a lage
More informationTESTING THE VALIDITY OF THE EXPONENTIAL MODEL BASED ON TYPE II CENSORED DATA USING TRANSFORMED SAMPLE DATA
STATISTICA, anno LXXVI, n. 3, 2016 TESTING THE VALIDITY OF THE EXPONENTIAL MODEL BASED ON TYPE II CENSORED DATA USING TRANSFORMED SAMPLE DATA Hadi Alizadeh Noughabi 1 Depatment of Statistics, Univesity
More informationJournal of Inequalities in Pure and Applied Mathematics
Jounal of Inequalities in Pue and Applied Mathematics COEFFICIENT INEQUALITY FOR A FUNCTION WHOSE DERIVATIVE HAS A POSITIVE REAL PART S. ABRAMOVICH, M. KLARIČIĆ BAKULA AND S. BANIĆ Depatment of Mathematics
More informationThe Substring Search Problem
The Substing Seach Poblem One algoithm which is used in a vaiety of applications is the family of substing seach algoithms. These algoithms allow a use to detemine if, given two chaacte stings, one is
More information6 Matrix Concentration Bounds
6 Matix Concentation Bounds Concentation bounds ae inequalities that bound pobabilities of deviations by a andom vaiable fom some value, often its mean. Infomally, they show the pobability that a andom
More information3.1 Random variables
3 Chapte III Random Vaiables 3 Random vaiables A sample space S may be difficult to descibe if the elements of S ae not numbes discuss how we can use a ule by which an element s of S may be associated
More informationLET a random variable x follows the two - parameter
INTERNATIONAL JOURNAL OF MATHEMATICS AND SCIENTIFIC COMPUTING ISSN: 2231-5330, VOL. 5, NO. 1, 2015 19 Shinkage Bayesian Appoach in Item - Failue Gamma Data In Pesence of Pio Point Guess Value Gyan Pakash
More informationON INDEPENDENT SETS IN PURELY ATOMIC PROBABILITY SPACES WITH GEOMETRIC DISTRIBUTION. 1. Introduction. 1 r r. r k for every set E A, E \ {0},
ON INDEPENDENT SETS IN PURELY ATOMIC PROBABILITY SPACES WITH GEOMETRIC DISTRIBUTION E. J. IONASCU and A. A. STANCU Abstact. We ae inteested in constucting concete independent events in puely atomic pobability
More informationLecture 28: Convergence of Random Variables and Related Theorems
EE50: Pobability Foundations fo Electical Enginees July-Novembe 205 Lectue 28: Convegence of Random Vaiables and Related Theoems Lectue:. Kishna Jagannathan Scibe: Gopal, Sudhasan, Ajay, Swamy, Kolla An
More informationGoodness-of-fit for composite hypotheses.
Section 11 Goodness-of-fit fo composite hypotheses. Example. Let us conside a Matlab example. Let us geneate 50 obsevations fom N(1, 2): X=nomnd(1,2,50,1); Then, unning a chi-squaed goodness-of-fit test
More informationResearch Article On Alzer and Qiu s Conjecture for Complete Elliptic Integral and Inverse Hyperbolic Tangent Function
Abstact and Applied Analysis Volume 011, Aticle ID 697547, 7 pages doi:10.1155/011/697547 Reseach Aticle On Alze and Qiu s Conjectue fo Complete Elliptic Integal and Invese Hypebolic Tangent Function Yu-Ming
More informationKOEBE DOMAINS FOR THE CLASSES OF FUNCTIONS WITH RANGES INCLUDED IN GIVEN SETS
Jounal of Applied Analysis Vol. 14, No. 1 2008), pp. 43 52 KOEBE DOMAINS FOR THE CLASSES OF FUNCTIONS WITH RANGES INCLUDED IN GIVEN SETS L. KOCZAN and P. ZAPRAWA Received Mach 12, 2007 and, in evised fom,
More informationA New Method of Estimation of Size-Biased Generalized Logarithmic Series Distribution
The Open Statistics and Pobability Jounal, 9,, - A New Method of Estimation of Size-Bied Genealized Logaithmic Seies Distibution Open Access Khushid Ahmad Mi * Depatment of Statistics, Govt Degee College
More informationIntroduction to Mathematical Statistics Robert V. Hogg Joeseph McKean Allen T. Craig Seventh Edition
Intoduction to Mathematical Statistics Robet V. Hogg Joeseph McKean Allen T. Caig Seventh Edition Peason Education Limited Edinbugh Gate Halow Essex CM2 2JE England and Associated Companies thoughout the
More informationUnobserved Correlation in Ascending Auctions: Example And Extensions
Unobseved Coelation in Ascending Auctions: Example And Extensions Daniel Quint Univesity of Wisconsin Novembe 2009 Intoduction In pivate-value ascending auctions, the winning bidde s willingness to pay
More informationBayesian Analysis of Topp-Leone Distribution under Different Loss Functions and Different Priors
J. tat. Appl. Po. Lett. 3, No. 3, 9-8 (6) 9 http://dx.doi.og/.8576/jsapl/33 Bayesian Analysis of Topp-Leone Distibution unde Diffeent Loss Functions and Diffeent Pios Hummaa ultan * and. P. Ahmad Depatment
More informationMultiple Experts with Binary Features
Multiple Expets with Binay Featues Ye Jin & Lingen Zhang Decembe 9, 2010 1 Intoduction Ou intuition fo the poect comes fom the pape Supevised Leaning fom Multiple Expets: Whom to tust when eveyone lies
More informationNew problems in universal algebraic geometry illustrated by boolean equations
New poblems in univesal algebaic geomety illustated by boolean equations axiv:1611.00152v2 [math.ra] 25 Nov 2016 Atem N. Shevlyakov Novembe 28, 2016 Abstact We discuss new poblems in univesal algebaic
More informationCSCE 478/878 Lecture 4: Experimental Design and Analysis. Stephen Scott. 3 Building a tree on the training set Introduction. Outline.
In Homewok, you ae (supposedly) Choosing a data set 2 Extacting a test set of size > 3 3 Building a tee on the taining set 4 Testing on the test set 5 Repoting the accuacy (Adapted fom Ethem Alpaydin and
More informationMultiple Criteria Secretary Problem: A New Approach
J. Stat. Appl. Po. 3, o., 9-38 (04 9 Jounal of Statistics Applications & Pobability An Intenational Jounal http://dx.doi.og/0.785/jsap/0303 Multiple Citeia Secetay Poblem: A ew Appoach Alaka Padhye, and
More informationHypothesis Test and Confidence Interval for the Negative Binomial Distribution via Coincidence: A Case for Rare Events
Intenational Jounal of Contempoay Mathematical Sciences Vol. 12, 2017, no. 5, 243-253 HIKARI Ltd, www.m-hikai.com https://doi.og/10.12988/ijcms.2017.7728 Hypothesis Test and Confidence Inteval fo the Negative
More information6 PROBABILITY GENERATING FUNCTIONS
6 PROBABILITY GENERATING FUNCTIONS Cetain deivations pesented in this couse have been somewhat heavy on algeba. Fo example, detemining the expectation of the Binomial distibution (page 5.1 tuned out to
More informationSome technical details on confidence. intervals for LIFT measures in data. mining
Some technical details on confidence intevals fo LIFT measues in data mining Wenxin Jiang and Yu Zhao June 2, 2014 Abstact A LIFT measue, such as the esponse ate, lift, o the pecentage of captued esponse,
More informationWeb-based Supplementary Materials for. Controlling False Discoveries in Multidimensional Directional Decisions, with
Web-based Supplementay Mateials fo Contolling False Discoveies in Multidimensional Diectional Decisions, with Applications to Gene Expession Data on Odeed Categoies Wenge Guo Biostatistics Banch, National
More informationOn the ratio of maximum and minimum degree in maximal intersecting families
On the atio of maximum and minimum degee in maximal intesecting families Zoltán Lóánt Nagy Lale Özkahya Balázs Patkós Máté Vize Mach 6, 013 Abstact To study how balanced o unbalanced a maximal intesecting
More informationq i i=1 p i ln p i Another measure, which proves a useful benchmark in our analysis, is the chi squared divergence of p, q, which is defined by
CSISZÁR f DIVERGENCE, OSTROWSKI S INEQUALITY AND MUTUAL INFORMATION S. S. DRAGOMIR, V. GLUŠČEVIĆ, AND C. E. M. PEARCE Abstact. The Ostowski integal inequality fo an absolutely continuous function is used
More informationDetermining solar characteristics using planetary data
Detemining sola chaacteistics using planetay data Intoduction The Sun is a G-type main sequence sta at the cente of the Sola System aound which the planets, including ou Eath, obit. In this investigation
More informationSPATIAL BOOTSTRAP WITH INCREASING OBSERVATIONS IN A FIXED DOMAIN
Statistica Sinica 1(00), 7- SPATIAL BOOTSTRAP WITH INCREASING OBSERVATIONS IN A FIXED DOMAIN Ji Meng Loh and Michael L. Stein Columbia Univesity and Univesity of Chicago Abstact: Vaious methods, such as
More informationWeighted least-squares estimators of parametric functions of the regression coefficients under a general linear model
Ann Inst Stat Math (2010) 62:929 941 DOI 10.1007/s10463-008-0199-8 Weighted least-squaes estimatos of paametic functions of the egession coefficients unde a geneal linea model Yongge Tian Received: 9 Januay
More informationarxiv: v2 [physics.data-an] 15 Jul 2015
Limitation of the Least Squae Method in the Evaluation of Dimension of Factal Bownian Motions BINGQIANG QIAO,, SIMING LIU, OUDUN ZENG, XIANG LI, and BENZONG DAI Depatment of Physics, Yunnan Univesity,
More informationGENLOG Multinomial Loglinear and Logit Models
GENLOG Multinomial Loglinea and Logit Models Notation This chapte descibes the algoithms used to calculate maximum-likelihood estimates fo the multinomial loglinea model and the multinomial logit model.
More informationLocalization of Eigenvalues in Small Specified Regions of Complex Plane by State Feedback Matrix
Jounal of Sciences, Islamic Republic of Ian (): - () Univesity of Tehan, ISSN - http://sciencesutaci Localization of Eigenvalues in Small Specified Regions of Complex Plane by State Feedback Matix H Ahsani
More informationStanford University CS259Q: Quantum Computing Handout 8 Luca Trevisan October 18, 2012
Stanfod Univesity CS59Q: Quantum Computing Handout 8 Luca Tevisan Octobe 8, 0 Lectue 8 In which we use the quantum Fouie tansfom to solve the peiod-finding poblem. The Peiod Finding Poblem Let f : {0,...,
More informationHydroelastic Analysis of a 1900 TEU Container Ship Using Finite Element and Boundary Element Methods
TEAM 2007, Sept. 10-13, 2007,Yokohama, Japan Hydoelastic Analysis of a 1900 TEU Containe Ship Using Finite Element and Bounday Element Methods Ahmet Egin 1)*, Levent Kaydıhan 2) and Bahadı Uğulu 3) 1)
More informationQIP Course 10: Quantum Factorization Algorithm (Part 3)
QIP Couse 10: Quantum Factoization Algoithm (Pat 3 Ryutaoh Matsumoto Nagoya Univesity, Japan Send you comments to yutaoh.matsumoto@nagoya-u.jp Septembe 2018 @ Tokyo Tech. Matsumoto (Nagoya U. QIP Couse
More informationQuasi-Randomness and the Distribution of Copies of a Fixed Graph
Quasi-Randomness and the Distibution of Copies of a Fixed Gaph Asaf Shapia Abstact We show that if a gaph G has the popety that all subsets of vetices of size n/4 contain the coect numbe of tiangles one
More informationA pathway to matrix-variate gamma and normal densities
Linea Algeba and its Applications 396 005 317 38 www.elsevie.com/locate/laa A pathway to matix-vaiate gamma and nomal densities A.M. Mathai Depatment of Mathematics and Statistics, McGill Univesity, 805
More informationConstruction and Analysis of Boolean Functions of 2t + 1 Variables with Maximum Algebraic Immunity
Constuction and Analysis of Boolean Functions of 2t + 1 Vaiables with Maximum Algebaic Immunity Na Li and Wen-Feng Qi Depatment of Applied Mathematics, Zhengzhou Infomation Engineeing Univesity, Zhengzhou,
More information4/18/2005. Statistical Learning Theory
Statistical Leaning Theoy Statistical Leaning Theoy A model of supevised leaning consists of: a Envionment - Supplying a vecto x with a fixed but unknown pdf F x (x b Teache. It povides a desied esponse
More informationPsychometric Methods: Theory into Practice Larry R. Price
ERRATA Psychometic Methods: Theoy into Pactice Lay R. Pice Eos wee made in Equations 3.5a and 3.5b, Figue 3., equations and text on pages 76 80, and Table 9.1. Vesions of the elevant pages that include
More informationarxiv: v1 [math.co] 4 May 2017
On The Numbe Of Unlabeled Bipatite Gaphs Abdullah Atmaca and A Yavuz Ouç axiv:7050800v [mathco] 4 May 207 Abstact This pape solves a poblem that was stated by M A Haison in 973 [] This poblem, that has
More informationKR- 21 FOR FORMULA SCORED TESTS WITH. Robert L. Linn, Robert F. Boldt, Ronald L. Flaugher, and Donald A. Rock
RB-66-4D ~ E S [ B A U R L C L Ii E TI KR- 21 FOR FORMULA SCORED TESTS WITH OMITS SCORED AS WRONG Robet L. Linn, Robet F. Boldt, Ronald L. Flaughe, and Donald A. Rock N This Bulletin is a daft fo inteoffice
More informationSIMPLE LINEAR REGRESSION ANALYSIS FOR INCOMPLETE LONGITUDINAL DATA. Juthaphorn Saekhoo
SIMPLE LINEAR REGRESSION ANALYSIS FOR INCOMPLETE LONGITUDINAL DATA Juthaphon Saekhoo A Dissetation Submitted in Patial Fulfillment of the Requiements fo the Degee of Docto of Philosophy (Statistics) School
More informationChapter 3 Optical Systems with Annular Pupils
Chapte 3 Optical Systems with Annula Pupils 3 INTRODUCTION In this chapte, we discuss the imaging popeties of a system with an annula pupil in a manne simila to those fo a system with a cicula pupil The
More informationA Multivariate Normal Law for Turing s Formulae
A Multivaiate Nomal Law fo Tuing s Fomulae Zhiyi Zhang Depatment of Mathematics and Statistics Univesity of Noth Caolina at Chalotte Chalotte, NC 28223 Abstact This pape establishes a sufficient condition
More informationSOME GENERAL NUMERICAL RADIUS INEQUALITIES FOR THE OFF-DIAGONAL PARTS OF 2 2 OPERATOR MATRICES
italian jounal of pue and applied mathematics n. 35 015 (433 44) 433 SOME GENERAL NUMERICAL RADIUS INEQUALITIES FOR THE OFF-DIAGONAL PARTS OF OPERATOR MATRICES Watheq Bani-Domi Depatment of Mathematics
More informationMoment-free numerical approximation of highly oscillatory integrals with stationary points
Moment-fee numeical appoximation of highly oscillatoy integals with stationay points Sheehan Olve Abstact We pesent a method fo the numeical quadatue of highly oscillatoy integals with stationay points.
More informationConcomitants of Multivariate Order Statistics With Application to Judgment Poststratification
JASA jasa v.00/1/09 Pn:0/06/006; 16:6 F:jasatm05511.tex; (R p. 1 Concomitants of Multivaiate Ode Statistics With Application to Judgment Poststatification Xinlei WANG, Lynne STOKES, Johan LIM, and Min
More informationGradient-based Neural Network for Online Solution of Lyapunov Matrix Equation with Li Activation Function
Intenational Confeence on Infomation echnology and Management Innovation (ICIMI 05) Gadient-based Neual Netwok fo Online Solution of Lyapunov Matix Equation with Li Activation unction Shiheng Wang, Shidong
More informationASTR415: Problem Set #6
ASTR45: Poblem Set #6 Cuan D. Muhlbege Univesity of Mayland (Dated: May 7, 27) Using existing implementations of the leapfog and Runge-Kutta methods fo solving coupled odinay diffeential equations, seveal
More informationOn the ratio of maximum and minimum degree in maximal intersecting families
On the atio of maximum and minimum degee in maximal intesecting families Zoltán Lóánt Nagy Lale Özkahya Balázs Patkós Máté Vize Septembe 5, 011 Abstact To study how balanced o unbalanced a maximal intesecting
More informationA Comparison and Contrast of Some Methods for Sample Quartiles
A Compaison and Contast of Some Methods fo Sample Quatiles Anwa H. Joade and aja M. Latif King Fahd Univesity of Petoleum & Mineals ABSTACT A emainde epesentation of the sample size n = 4m ( =, 1, 2, 3)
More informationAbsorption Rate into a Small Sphere for a Diffusing Particle Confined in a Large Sphere
Applied Mathematics, 06, 7, 709-70 Published Online Apil 06 in SciRes. http://www.scip.og/jounal/am http://dx.doi.og/0.46/am.06.77065 Absoption Rate into a Small Sphee fo a Diffusing Paticle Confined in
More informationFractional Tikhonov regularization for linear discrete ill-posed problems
BIT manuscipt No. (will be inseted by the edito) Factional Tikhonov egulaization fo linea discete ill-posed poblems Michiel E. Hochstenbach Lotha Reichel Received: date / Accepted: date Abstact Tikhonov
More informationA generalization of the Bernstein polynomials
A genealization of the Benstein polynomials Halil Ouç and Geoge M Phillips Mathematical Institute, Univesity of St Andews, Noth Haugh, St Andews, Fife KY16 9SS, Scotland Dedicated to Philip J Davis This
More informationNuclear Medicine Physics 02 Oct. 2007
Nuclea Medicine Physics Oct. 7 Counting Statistics and Eo Popagation Nuclea Medicine Physics Lectues Imaging Reseach Laboatoy, Radiology Dept. Lay MacDonald 1//7 Statistics (Summaized in One Slide) Type
More informationChaos and bifurcation of discontinuous dynamical systems with piecewise constant arguments
Malaya Jounal of Matematik ()(22) 4 8 Chaos and bifucation of discontinuous dynamical systems with piecewise constant aguments A.M.A. El-Sayed, a, and S. M. Salman b a Faculty of Science, Aleandia Univesity,
More informationResearch Design - - Topic 17 Multiple Regression & Multiple Correlation: Two Predictors 2009 R.C. Gardner, Ph.D.
Reseach Design - - Topic 7 Multiple Regession & Multiple Coelation: Two Pedictos 009 R.C. Gadne, Ph.D. Geneal Rationale and Basic Aithmetic fo two pedictos Patial and semipatial coelation Regession coefficients
More informationDirected Regression. Benjamin Van Roy Stanford University Stanford, CA Abstract
Diected Regession Yi-hao Kao Stanfod Univesity Stanfod, CA 94305 yihaoao@stanfod.edu Benjamin Van Roy Stanfod Univesity Stanfod, CA 94305 bv@stanfod.edu Xiang Yan Stanfod Univesity Stanfod, CA 94305 xyan@stanfod.edu
More informationRelating Branching Program Size and. Formula Size over the Full Binary Basis. FB Informatik, LS II, Univ. Dortmund, Dortmund, Germany
Relating Banching Pogam Size and omula Size ove the ull Binay Basis Matin Saueho y Ingo Wegene y Ralph Wechne z y B Infomatik, LS II, Univ. Dotmund, 44 Dotmund, Gemany z ankfut, Gemany sauehof/wegene@ls.cs.uni-dotmund.de
More information10/04/18. P [P(x)] 1 negl(n).
Mastemath, Sping 208 Into to Lattice lgs & Cypto Lectue 0 0/04/8 Lectues: D. Dadush, L. Ducas Scibe: K. de Boe Intoduction In this lectue, we will teat two main pats. Duing the fist pat we continue the
More informationarxiv: v1 [math.na] 8 Feb 2013
A mixed method fo Diichlet poblems with adial basis functions axiv:1302.2079v1 [math.na] 8 Feb 2013 Nobet Heue Thanh Tan Abstact We pesent a simple discetization by adial basis functions fo the Poisson
More informationMULTILAYER PERCEPTRONS
Last updated: Nov 26, 2012 MULTILAYER PERCEPTRONS Outline 2 Combining Linea Classifies Leaning Paametes Outline 3 Combining Linea Classifies Leaning Paametes Implementing Logical Relations 4 AND and OR
More informationA Power Method for Computing Square Roots of Complex Matrices
JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS 13, 39345 1997 ARTICLE NO. AY975517 A Powe Method fo Computing Squae Roots of Complex Matices Mohammed A. Hasan Depatment of Electical Engineeing, Coloado
More informationClosed-form Formulas for Ergodic Capacity of MIMO Rayleigh Fading Channels
Closed-fom Fomulas fo Egodic Capacity of MIMO Rayleigh Fading Channels Hyundong Shin and Jae Hong Lee School of Electical Engineeing Seoul National Univesity Shillim-dong, Gwanak-gu, Seoul 151-742, Koea
More informationLikelihood vs. Information in Aligning Biopolymer Sequences. UCSD Technical Report CS Timothy L. Bailey
Likelihood vs. Infomation in Aligning Biopolyme Sequences UCSD Technical Repot CS93-318 Timothy L. Bailey Depatment of Compute Science and Engineeing Univesity of Califonia, San Diego 1 Febuay, 1993 ABSTRACT:
More informationMEASURES OF BLOCK DESIGN EFFICIENCY RECOVERING INTERBLOCK INFORMATION
MEASURES OF BLOCK DESIGN EFFICIENCY RECOVERING INTERBLOCK INFORMATION Walte T. Fedee 337 Waen Hall, Biometics Unit Conell Univesity Ithaca, NY 4853 and Tey P. Speed Division of Mathematics & Statistics,
More informationOn the Quasi-inverse of a Non-square Matrix: An Infinite Solution
Applied Mathematical Sciences, Vol 11, 2017, no 27, 1337-1351 HIKARI Ltd, wwwm-hikaicom https://doiog/1012988/ams20177273 On the Quasi-invese of a Non-squae Matix: An Infinite Solution Ruben D Codeo J
More informationTemporal-Difference Learning
.997 Decision-Making in Lage-Scale Systems Mach 17 MIT, Sping 004 Handout #17 Lectue Note 13 1 Tempoal-Diffeence Leaning We now conside the poblem of computing an appopiate paamete, so that, given an appoximation
More informationIdentification of the degradation of railway ballast under a concrete sleeper
Identification of the degadation of ailway ballast unde a concete sleepe Qin Hu 1) and Heung Fai Lam ) 1), ) Depatment of Civil and Achitectual Engineeing, City Univesity of Hong Kong, Hong Kong SAR, China.
More informationA Bijective Approach to the Permutational Power of a Priority Queue
A Bijective Appoach to the Pemutational Powe of a Pioity Queue Ia M. Gessel Kuang-Yeh Wang Depatment of Mathematics Bandeis Univesity Waltham, MA 02254-9110 Abstact A pioity queue tansfoms an input pemutation
More informationOn decompositions of complete multipartite graphs into the union of two even cycles
On decompositions of complete multipatite gaphs into the union of two even cycles A. Su, J. Buchanan, R. C. Bunge, S. I. El-Zanati, E. Pelttai, G. Rasmuson, E. Spaks, S. Tagais Depatment of Mathematics
More informationONE-POINT CODES USING PLACES OF HIGHER DEGREE
ONE-POINT CODES USING PLACES OF HIGHER DEGREE GRETCHEN L. MATTHEWS AND TODD W. MICHEL DEPARTMENT OF MATHEMATICAL SCIENCES CLEMSON UNIVERSITY CLEMSON, SC 29634-0975 U.S.A. E-MAIL: GMATTHE@CLEMSON.EDU, TMICHEL@CLEMSON.EDU
More informationRegularization. Stephen Scott and Vinod Variyam. Introduction. Outline. Machine. Learning. Problems. Measuring. Performance.
leaning can geneally be distilled to an optimization poblem Choose a classifie (function, hypothesis) fom a set of functions that minimizes an objective function Clealy we want pat of this function to
More informationNOTE. Some New Bounds for Cover-Free Families
Jounal of Combinatoial Theoy, Seies A 90, 224234 (2000) doi:10.1006jcta.1999.3036, available online at http:.idealibay.com on NOTE Some Ne Bounds fo Cove-Fee Families D. R. Stinson 1 and R. Wei Depatment
More informationworking pages for Paul Richards class notes; do not copy or circulate without permission from PGR 2004/11/3 10:50
woking pages fo Paul Richads class notes; do not copy o ciculate without pemission fom PGR 2004/11/3 10:50 CHAPTER7 Solid angle, 3D integals, Gauss s Theoem, and a Delta Function We define the solid angle,
More informationSOME SOLVABILITY THEOREMS FOR NONLINEAR EQUATIONS
Fixed Point Theoy, Volume 5, No. 1, 2004, 71-80 http://www.math.ubbcluj.o/ nodeacj/sfptcj.htm SOME SOLVABILITY THEOREMS FOR NONLINEAR EQUATIONS G. ISAC 1 AND C. AVRAMESCU 2 1 Depatment of Mathematics Royal
More informationA THREE CRITICAL POINTS THEOREM AND ITS APPLICATIONS TO THE ORDINARY DIRICHLET PROBLEM
A THREE CRITICAL POINTS THEOREM AND ITS APPLICATIONS TO THE ORDINARY DIRICHLET PROBLEM DIEGO AVERNA AND GABRIELE BONANNO Abstact. The aim of this pape is twofold. On one hand we establish a thee citical
More informationMATH 415, WEEK 3: Parameter-Dependence and Bifurcations
MATH 415, WEEK 3: Paamete-Dependence and Bifucations 1 A Note on Paamete Dependence We should pause to make a bief note about the ole played in the study of dynamical systems by the system s paametes.
More informationFunctions Defined on Fuzzy Real Numbers According to Zadeh s Extension
Intenational Mathematical Foum, 3, 2008, no. 16, 763-776 Functions Defined on Fuzzy Real Numbes Accoding to Zadeh s Extension Oma A. AbuAaqob, Nabil T. Shawagfeh and Oma A. AbuGhneim 1 Mathematics Depatment,
More informationA Simple Nonparametric Approach to Estimating the Distribution of Random Coefficients in Structural Models
A Simple Nonpaametic Appoach to Estimating the Distibution of Random Coefficients in Stuctual Models Jeemy T. Fox Rice Univesity & NBER Kyoo il Kim Michigan State Univesity May 2016 Chenyu Yang Univesity
More informationPower and sample size calculations for longitudinal studies estimating a main effect of a time-varying exposure
Stat Methods Med Res OnlineFist, published on June 14, 2010 as doi:10.1177/0962280210371563 Statistical Methods in Medical Reseach 2010; 00: 1 17 Powe and sample size calculations fo longitudinal studies
More informationCompactly Supported Radial Basis Functions
Chapte 4 Compactly Suppoted Radial Basis Functions As we saw ealie, compactly suppoted functions Φ that ae tuly stictly conditionally positive definite of ode m > do not exist The compact suppot automatically
More informationLifting Private Information Retrieval from Two to any Number of Messages
Lifting Pivate Infomation Retieval fom Two to any umbe of Messages Rafael G.L. D Oliveia, Salim El Rouayheb ECE, Rutges Univesity, Piscataway, J Emails: d746@scaletmail.utges.edu, salim.elouayheb@utges.edu
More informationBounds on the performance of back-to-front airplane boarding policies
Bounds on the pefomance of bac-to-font aiplane boading policies Eitan Bachmat Michael Elin Abstact We povide bounds on the pefomance of bac-to-font aiplane boading policies. In paticula, we show that no
More informationChem 453/544 Fall /08/03. Exam #1 Solutions
Chem 453/544 Fall 3 /8/3 Exam # Solutions. ( points) Use the genealized compessibility diagam povided on the last page to estimate ove what ange of pessues A at oom tempeatue confoms to the ideal gas law
More informationEmpirical Prediction of Fitting Densities in Industrial Workrooms for Ray Tracing. 1 Introduction. 2 Ray Tracing using DRAYCUB
Empiical Pediction of Fitting Densities in Industial Wokooms fo Ray Tacing Katina Scheebnyj, Muay Hodgson Univesity of Bitish Columbia, SOEH-MECH, Acoustics and Noise Reseach Goup, 226 East Mall, Vancouve,
More informationA Comparative Study of Exponential Time between Events Charts
Quality Technology & Quantitative Management Vol. 3, No. 3, pp. 347-359, 26 QTQM ICAQM 26 A Compaative Study of Exponential Time between Events Chats J. Y. Liu 1, M. Xie 1, T. N. Goh 1 and P. R. Shama
More informationChapter 5 Linear Equations: Basic Theory and Practice
Chapte 5 inea Equations: Basic Theoy and actice In this chapte and the next, we ae inteested in the linea algebaic equation AX = b, (5-1) whee A is an m n matix, X is an n 1 vecto to be solved fo, and
More informationFUSE Fusion Utility Sequence Estimator
FUSE Fusion Utility Sequence Estimato Belu V. Dasaathy Dynetics, Inc. P. O. Box 5500 Huntsville, AL 3584-5500 belu.d@dynetics.com Sean D. Townsend Dynetics, Inc. P. O. Box 5500 Huntsville, AL 3584-5500
More informationFailure Probability of 2-within-Consecutive-(2, 2)-out-of-(n, m): F System for Special Values of m
Jounal of Mathematics and Statistics 5 (): 0-4, 009 ISSN 549-3644 009 Science Publications Failue Pobability of -within-consecutive-(, )-out-of-(n, m): F System fo Special Values of m E.M.E.. Sayed Depatment
More informationELASTIC ANALYSIS OF CIRCULAR SANDWICH PLATES WITH FGM FACE-SHEETS
THE 9 TH INTERNATIONAL CONFERENCE ON COMPOSITE MATERIALS ELASTIC ANALYSIS OF CIRCULAR SANDWICH PLATES WITH FGM FACE-SHEETS R. Sbulati *, S. R. Atashipou Depatment of Civil, Chemical and Envionmental Engineeing,
More informationDo Managers Do Good With Other People s Money? Online Appendix
Do Manages Do Good With Othe People s Money? Online Appendix Ing-Haw Cheng Haison Hong Kelly Shue Abstact This is the Online Appendix fo Cheng, Hong and Shue 2013) containing details of the model. Datmouth
More informationTight Upper Bounds for the Expected Loss of Lexicographic Heuristics in Binary Multi-attribute Choice
Tight Uppe Bounds fo the Expected Loss of Lexicogaphic Heuistics in Binay Multi-attibute Choice Juan A. Caasco, Manel Baucells Except fo fomatting details and the coection of some eata, this vesion matches
More information