Semiparametric Estimation of Treatment Effect in a Pretest-Posttest Study

Size: px
Start display at page:

Download "Semiparametric Estimation of Treatment Effect in a Pretest-Posttest Study"

Transcription

1 Semparametrc Estmaton of Treatment Effect n a Pretest-Posttest Study Mare Davdan Department of Statstcs North Carolna State Unversty Based on: Avalable on my web page Leon, S., Tsats, A.A., and Davdan, M. (2003) Semparametrc estmaton of treatment effect n a pretest-posttest study. Bometrcs 59, Davdan, M., Tsats, A.A., and Leon, S. (2005) Semparametrc estmaton of treatment effect n a pretest-posttest study wth mssng data (wth Dscusson). Statstcal Scence 20, davdan (Jont work wth A.A. Tsats and S. Leon) Greenberg Lecture II: Pretest-Posttest Study 1 Greenberg Lecture II: Pretest-Posttest Study 2 Outlne of popular methods 2. Influence functons 3. Robns, Rotntzky, and Zhao (1994) 4. Estmaton when full data are avalable 5. Estmaton when posttest response s mssng at random (MAR) 6. Full data nfluence functons, revsted 7. Smulaton evdence 8. Applcaton ACTG Dscusson The pretest-posttest study: Ubqutous n research n medcne, publc health, socal scence, etc... Subjects are randomzed to two treatments ( treatment and control ) Response s measured at baselne ( pretest ) and at a pre-specfed follow-up tme ( posttest ) Focus of nference: Dfference n change of mean response from baselne to follow-up between treatment and control Greenberg Lecture II: Pretest-Posttest Study 3 Greenberg Lecture II: Pretest-Posttest Study 4 For example: AIDS Clncal Trals Group patents randomzed to ZDV, ZDV+ddI, ZDV+zalctabne, ddi wth equal probablty (1/4) Prmary analyss (tme-to-event endpont): ZDV nferor to other three (no dfferences) Two groups: ZDV alone ( control ) and other three ( treatment ) Secondary analyses : Compare change n CD4 count (mmunologc status) from () baselne to 20±5 weeks and () baselne to 96±5 weeks between control and treatment Formally: Defne Y 1 Y 2 Z baselne (pretest) response (e.g., count) follow-up (posttest) response (e.g., 20±5 week CD4 count) = 0 f control, = 1 f treatment, P (Z = 1) = By randomzaton, reasonable to assume E(Y 1 Z = 0) = E(Y 1 Z = 1) = E(Y 1) = µ 1 Effect of nterest: β, where {E(Y 2 Z = 1) E(Y 1 Z = 1)} {E(Y 2 Z = 0) E(Y 1 Z = 0)} = {E(Y 2 Z = 1) µ 1} {E(Y 2 Z = 0) µ 1)} = E(Y 2 Z = 1) E(Y 2 Z = 0) = µ (1) 2 µ (0) 2 = β Greenberg Lecture II: Pretest-Posttest Study 5 Greenberg Lecture II: Pretest-Posttest Study 6

2 Basc data: (Y 1, Y 2, Z ), = 1,..., n, d n 1 = Z = I(Z = 1), n 0 = (1 Z ) = I(Z = 0) Popular estmators for β: Two-sample t-test estmator β 2samp = n 1 1 Z Y 2 n 1 0 Pared t-test estmator ( change scores ) β par = D 1 D 0, D c = n 1 c (1 Z )Y 2 I(Z = c)(y 2 Y 1), c = 0, 1 Popular estmators for β: ANCOVA Adjust drectly by fttng the model E(Y 2 Y 1, Z) = α 0 + α 1Y 1 + βz ANCOVA II Include nteracton and estmate β as coeffcent of Z Z n regresson of Y 2 Y 2 on Y 1 Y 1, Z Z, and (Y 1 Y 1)(Z Z) GEE (Y 1, Y 2) T s multvarate response wth mean (µ 1, µ (0) 2 + βz) T and (2 2) unstructured covarance matrx Assume lnear relatonshp between Y 2 and Y 1 Greenberg Lecture II: Pretest-Posttest Study 7 Greenberg Lecture II: Pretest-Posttest Study 8 ACTG 175: Y 2 = CD4 at 20±5 weeks vs. Y 1 = (control and treatment groups) follow-up CD4 at 20+/-5 weeks follow-up CD4 at 20+/-5 weeks Addtonal data: Baselne and ntermedate covarates X 1 X 2 Baselne (pre-treatment) characterstcs Characterstcs observed after pretest but before posttest, ncludng ntermedate responses In ACTG 175: X 1 ncludes weght, age, gender, Karnofsky score, pror ARV therapy, CD8 count, sexual preference,... X 2 ncludes off treatment ndcator, ntermedate CD4, CD8 Greenberg Lecture II: Pretest-Posttest Study 9 Greenberg Lecture II: Pretest-Posttest Study 10 Addtonal estmators: Incorporate nonlnearty, covarates Fancer regresson models, e.g. E(Y 2 Y 1, Z) = α 0 + α 1Y 1 + α 2Y βz Adjustment for baselne covarates, e.g., Both E(Y 2 X 1, Y 1, Z) = α 0 + α 1Y 1 + α 2X 1 + βz Intutvely, adjustment for ntermedate covarates buys nothng wthout some assumptons (formally exhbted shortly... ) and could be dangerous Whch estmator? Stll confuson among practtoners Is normalty requred? What f the relatonshp sn t lnear? What f a model for E(Y 2 X 1, Y 1, Z) s wrong? Further complcaton: Mssng posttest response Y 2 In ACTG 175, no mssng CD4 for any subject at 20±5 weeks but 797 (37%) of subjects were mssng CD4 at 96±5 weeks (almost entrely due to dropout from study) Common n practce complete case analyss, whch yelds possbly based nference on β unless Y 2 s mssng completely at random Also common Last Observaton Carred Forward, also based Greenberg Lecture II: Pretest-Posttest Study 11 Greenberg Lecture II: Pretest-Posttest Study 12

3 Introducton and revew Introducton and revew Mssng at random (MAR) assumpton: Posttest mssngness assocated wth (X 1, Y 1, X 2, Z) but not Y 2 Often reasonable, but s an assumpton If R = 1 f Y 2 s observed and R = 0 f Y 2 s mssng, the assumpton s P (R = 1 X 1, Y 1, X 2, Y 2, Z) = P (R = 1 X 1, Y 1, X 2, Z) Mssngness s assocated only wth nformaton that s always observed Full data: If no mssngness, observe (X 1, Y 1, X 2, Y 2, Z) Ordnarly: Models for (X 1, Y 1, X 2, Y 2, Z) may nvolve assumptons If Y 2 not mssng, wdespread belef that normalty of (Y 1, Y 2) s requred for valdty of popular estmators When Y 2 s MAR, maxmum lkelhood, mputaton approaches requre assumptons on aspects of the jont dstrbuton of (X 1, Y 1, X 2, Y 2, Z) Consequences? Greenberg Lecture II: Pretest-Posttest Study 13 Greenberg Lecture II: Pretest-Posttest Study 14 Introducton and revew Introducton and revew Semparametrc models: May contan parametrc and nonparametrc components Nonparametrc components unable or unwllng to make specfc assumptons on aspects of (X 1, Y 1, X 2, Y 2, Z) Here: Consder a semparametrc model for (X 1, Y 1, X 2, Y 2, Z) No assumptons on jont dstrbuton of (X 1, Y 1, X 2, Y 2, Z) beyond ndependence of (X 1, Y 1) and Z nduced by randomzaton (nonparametrc) Interested n the functonal of ths dstrbuton β = µ (1) 2 µ (0) 2 = E(Y 2 Z = 1) E(Y 2 Z = 0) Where we are gong: Under ths semparametrc model Fnd a class of consstent and asymptotcally normal (CAN ) estmators for β when full data are avalable and dentfy the best (effcent ) estmator n the class As a by-product, show that popular estmators are potentally neffcent members of ths class can do better! When Y 2 s MAR, fnd a class of CAN estmators for β and dentfy the best In both cases, translate the theory nto practcal technques What we wll explot: Theory n a landmark paper by Robns, Rotntzky, and Zhao (1994) Greenberg Lecture II: Pretest-Posttest Study 15 Greenberg Lecture II: Pretest-Posttest Study Influence functons 2. Influence functons Defnton: For functonal β (scalar) n a parametrc or semparametrc model, an estmator β based on d random vectors W, = 1,..., n, s asymptotcally lnear f n 1/2 ( β β 0) = n 1/2 ϕ(w ) + o p(1) for some ϕ(w ) β 0 = true value of β, E{ϕ(W )} = 0, E{ϕ 2 (W )} < ϕ(w ) s called the nfluence functon of β If β s also regular (not pathologcal ), β s CAN wth asymptotc varance E{ϕ 2 (W )} Effcent nfluence functon ϕ eff (W ) has smallest varance and corresponds to the effcent, regular asymptotcally lnear (RAL ) estmator Result: By the Central Lmt Theorem, f β s RAL wth nfluence functon ϕ(w ), then approxmately ( ) β N β 0, n 1 E{ϕ 2 (W )} As we wll see, ϕ(w ) may be vewed as dependng on β and other parameters θ, say So can estmate E{ϕ 2 (W )} by n 1 ϕ 2 (W, β, θ), say Often called a sandwch estmator for (approxmate) varance Greenberg Lecture II: Pretest-Posttest Study 17 Greenberg Lecture II: Pretest-Posttest Study 18

4 2. Influence functons Example: It may be shown drectly by manpulatng the expresson for n 1/2 ( β 2samp β) and usng n 0/n 1, n 1/n as n that β 2samp has nfluence functon of the form Z(Y 2 µ (1) 2 ) (1 Z)(Y2 µ(0) 2 ) 1 Z(Y2 µ(0) = [depends on W = (X 1, Y 1, X 2, Y 2, Z), β, µ (0) 2 ] 2 β) (1 Z)(Y2 µ(0) 2 ) 1 Why s ths useful? There s a correspondence between CAN, RAL estmators and nfluence functons By dentfyng nfluence functons, one can deduce estmators 2. Influence functons General prncple: From the defnton, β β 0 n 1 ϕ(w) = To deduce estmators, solve ϕ(w) = 0 for β For example: Influence functon for β 2samp { } Z (Y 2 µ (0) 2 β) (1 Z)(Y2 µ(0) 2 0 = ) 1 Substtutng µ (0) 2 = n 1 0 (1 Z)Y2 and solvng for β yelds β = n 1 1 Z Y 2 n 1 0 (1 Z )Y 2 In general, closed form may not be possble (ϕ(w ) depends on β 0) Greenberg Lecture II: Pretest-Posttest Study 19 Greenberg Lecture II: Pretest-Posttest Study Robns, Rotntzky, and Zhao (1994) 3. Robns, Rotntzky, and Zhao (1994) What dd RRZ do? Derved asymptotc theory based on nfluence functons for nference on functonals n general semparametrc models where some components of the full data are possbly MAR Observed data: Data observed when some components of the full data are potentally mssng What dd RRZ do, more specfcally? For the functonal of nterest, dstngushed between Full-data nfluence functons correspond to RAL estmators calculable f full data were avalable; functons of the full data Observed-data nfluence functons correspond to RAL estmators calculable from the observed data under MAR; functons of the observed data RRZ characterzed the class of all observed-data nfluence functons for a general semparametrc model, ncludng the effcent one, and showed that observed-data nfluence functons may be expressed n terms of full-data nfluence functons Greenberg Lecture II: Pretest-Posttest Study 21 Greenberg Lecture II: Pretest-Posttest Study Robns, Rotntzky, and Zhao (1994) 3. Robns, Rotntzky, and Zhao (1994) The man result: Generc full data D = (O, M), semparametrc model for D, functonal β O Part of D that s always observed (never mssng ) M R Part of D that may be mssng = 1 f M s observed, = 0 f M s mssng Observed data are (O, R, RM) Let ϕ F (D) be a full data nfluence functon Let π(o) = P (R = 1 D) = P (R = 1 O) > ɛ (MAR assumpton ) All observed-data nfluence functons have form Rϕ F (D) π(o) R π(o) g(o), π(o) g(o) square-ntegrable Result: Strategy for dervng estmators for a semparametrc model 1. Characterze the class of full-data nfluence functons (whch correspond to full-data estmators) 2. Characterze the observed data under the partcular MAR mechansm and the class of observed-data nfluence functons 3. Identfy observed-data estmators wth nfluence functons n ths class Our approach: Follow these steps for the semparametrc pretestposttest model Jont dstrbuton of (X 1, Y 1, X 2, Y 2, Z) unspecfed except (X 1, Y 1) ndependent of Z Greenberg Lecture II: Pretest-Posttest Study 23 Greenberg Lecture II: Pretest-Posttest Study 24

5 4. Estmaton wth full data 4. Estmaton wth full data Full-data nfluence functons: Can show (later) under the semparametrc pretest-posttest model that all full-data nfluence functons are of the form ( Z(Y2 µ (1) 2 ) (Z ) h (1) (X1, Y1) ) ( ) (1 Z)(Y2 µ (0) 2 ) 1 (Z ) + 1 h(0) (X1, Y1), for arbtrary h (c) (X 1, Y 1), c = 0, 1 wth var{h (c) (X 1, Y 1)} < Dfference of nfluence functons for estmators for µ (1) 2 and µ (0) 2 Full-data estmators may depend on X 1 (but not X 2) Effcent full-data nfluence functon: Correspondng to effcent full-data estmator; takes h (c) (X 1, Y 1) = E(Y 2 X 1, Y 1, Z = c) µ (c) 2, c = 0, 1 Popular estmators: Influence functons of β 2samp, β par, ANCOVA, ANCOVA II, and GEE have h (c) (X 1, Y 1) = η c(y 1 µ 1), c = 0, 1, for constants η c E.g., η c = 0, c = 0, 1 for β 2samp; η c = {(1 )} 1 for β par Popular estmators are n the class = are CAN even f (Y 1, Y 2) are not normal or the relatonshp between Y 1 and Y 2 s not lnear ANCOVA estmators ncorporatng baselne covarates are also n the class, e.g., E(Y 1 X 1, Y 1, Z) = α 0 + α 1Y 1 + α 2X 1 + βz Popular estmators are potentally neffcent among class of RAL estmators for semparametrc model How to use all ths? Effcent estmator s best! Greenberg Lecture II: Pretest-Posttest Study 25 Greenberg Lecture II: Pretest-Posttest Study Estmaton wth full data Effcent estmator: Settng sum over of effcent nfluence functon = 0 and replacng by = n 1/n yelds ( X n nx ) β = n 1 1 ZY2 (Z )E(Y2 X1, b Y1, Z = 1) ( X n nx ) n 1 0 (1 Z)Y2 + (Z )E(Y2 X1, b Y1, Z = 0) Practcal use replace E(Y 2 X 1, Y 1, Z = c) by predcted values ê h(c), say, c = 0, 1, from parametrc or nonparametrc regresson modelng Can lead to substantal ncrease n precson over popular estmators Even f E(Y 2 X 1, Y 1, Z = c) are modeled ncorrectly, β s stll consstent Observed data: (X 1, Y 1, X 2, Z) are never mssng, Y 2 may be mssng for some subjects R = 1 f Y 2 observed, R = 0 f Y 2 mssng Observed data are (X 1, Y 1, X 2, Z, R, RY 2) MAR assumpton P (R = 1 X 1, Y 1, X 2, Y 2, Z) = P (R = 1 X 1, Y 1, X 2, Z) = π(x 1, Y 1, X 2, Z) ɛ > 0 π(x 1, Y 1, X 2, Z) = Zπ (1) (X 1, Y 1, X 2) + (1 Z)π (0) (X 1, Y 1, X 2), π (c) (X 1, Y 1, X 2) = π(x 1, Y 1, X 2, c), c = 0, 1 Greenberg Lecture II: Pretest-Posttest Study 27 Greenberg Lecture II: Pretest-Posttest Study 28 Recall: Generc form of observed-data nfluence functons Rϕ F (D) π(o) R π(o) g(o) π(o) For smplcty: Focus on nfluence functons for estmators for µ (1) 2 Those for estmators for µ (0) 2 smlar Influence functons for estmators for β: take the dfference Full-data nfluence functons for estmators for µ (1) 2 : Recall Z(Y 2 µ (1) 2 ) (Z ) h (1) (X 1, Y 1), var{h (1) (X 1, Y 1)} < Thus: Observed-data nfluence functons for estmators for µ (1) 2 have form R{Z(Y 2 µ (1) 2 ) (Z )h(1) (X 1, Y 1)} R π(x1, Y1, X2, Z) g (1) (X 1, Y 1, X 2, Z) π(x 1, Y 1, X 2, Z) π(x 1, Y 1, X 2, Z) var{h (1) (X 1, Y 1)} <, var{g (1) (X 1, Y 1, X 2, Z)} < Choce of h (1) leadng to the effcent observed-data nfluence functon need not be the same as that leadng to the effcent full-data nfluence functon n general Turns out that the optmal h (1) s the same n the specal case of the pretest-posttest problem... Greenberg Lecture II: Pretest-Posttest Study 29 Greenberg Lecture II: Pretest-Posttest Study 30

6 Re-wrtng: Equvalently, observed-data nfluence functons are RZ(Y 2 µ (1) 2 ) π(x 1, Y 1, X 2, Z) ) (Z h (1) (X 1, Y 1) R π(x1, Y1, X2, Z) g (1) (X 1, Y 1, X 2, Z) π(x 1, Y 1, X 2, Z) Optmal choces leadng to the effcent observed-data nfluence functon are h eff(1) (X 1, Y 1) = E(Y 2 X 1, Y 1, Z = 1) µ (1) 2 g eff(1) (X 1, Y 1, X 2, Z) = Z{E(Y 2 X 1, Y 1, X 2, Z) µ (1) 2 } = Z{E(Y 2 X 1, Y 1, X 2, Z = 1) µ (1) 2 } Result: Wth these optmal choces, algebra yelds { µ (1) 2 = (n) 1 R Z Y 2 π (1) (X 1, Y 1, X (Z )E(Y 2 X 1, Y 1, Z = 1) 2) } {R π (1) (X 1, Y 1, X 2)}Z E(Y 2 X 1, Y 1, X 2, Z π (1) = 1) (X 1, Y 1, X 2) Smlarly for µ (0) 2 dependng on π (0), E(Y 2 X 1, Y 1, Z = 0), E(Y 2 X 1, Y 1, X 2, Z = 0) Estmator for β take the dfference Practcal use replace these quanttes by predcted values from regresson modelng (comng up) Greenberg Lecture II: Pretest-Posttest Study 31 Greenberg Lecture II: Pretest-Posttest Study 32 Complcaton 1: π (c) (X 1, Y 1, X 2) are not known, c = 0, 1 Common strategy: adopt parametrc models (e.g. logstc regresson) dependng on parameter γ (c) π (c) (X 1, Y 1, X 2; γ (c) ) Imposes an addtonal assumpton on semparametrc model for (X 1, Y 1, X 2, Y 1, Z) Substtute the MLE γ (c) for γ (c), obtan predcted values π (c) As long as ths model s correct, resultng estmators wll be CAN Greenberg Lecture II: Pretest-Posttest Study 33 Complcaton 2: Modelng E(Y 2 X 1, Y 1, Z = c), E(Y 2 X 1, Y 1, X 2, Z = c), c = 0, 1 MAR = E(Y 2 X 1, Y 1, X 2, Z) = E(Y 2 X 1, Y 1, X 2, Z, R = 1) (can base modelng/fttng on complete cases only) Obtan predcted values ê q(c), c = 0, 1 However, deally requre compatblty,.e. E(Y 2 X 1, Y 1, Z) = E{E(Y 2 X 1, Y 1, X 2, Z) X 1, Y 1, Z} and no longer vald to ft usng only complete cases Practcally go ahead and model drectly and ft usng complete cases, obtan predcted values ê h(c) Estmaton of parameters n these models does not affect (asymptotc) varance of β as long as π (c) models are correct Greenberg Lecture II: Pretest-Posttest Study 34 Estmator: Wth = n 1/n { R Z Y 2 (Z π (1) )ê h(1) β = n 1 1 n 1 0 { R (1 Z )Y 2 π (0) + (Z )ê h(0) (R π (1) π (1) (R π (0) } )Z ê q(1) Effcent f modelng done correctly; otherwse, close to optmal performance Takng ê h(c) = ê q(c) = 0 yelds the smple nverse-weghted complete case estmator (neffcent ) )(1 Z )ê q(0) π (0) Modelng E(Y 2 X 1, Y 1, Z = c), E(Y 2 X 1, Y 1, X 2, Z = c) augments ths, takng advantage of relatonshps among varables to mprove precson } Double Robustness: Stll consstent f π (c) are correctly modeled but E(Y 2 X 1, Y 1, Z = c) and E(Y 2 X 1, Y 1, X 2, Z = c) aren t E(Y 2 X 1, Y 1, Z = c) and E(Y 2 X 1, Y 1, X 2, Z = c) are correctly modeled but π (c) aren t No longer effcent If both sets of models ncorrect, nconsstent n general Standard errors: Use the sandwch formula (follows from nfluence functon) Greenberg Lecture II: Pretest-Posttest Study 35 Greenberg Lecture II: Pretest-Posttest Study 36

7 6. Full data, revsted Recap: Ths approach requres one to make an assumpton about π (c) (X 1, Y 1, X 2), c = 0, 1 No assumpton s made about E(Y 2 X 1, Y 1, X 2, Z = c), E(Y 2 X 1, Y 1, Z = c) Model s stll semparametrc... and double robustness holds Alternatve approach: Make an assumpton nstead about the E(Y 2 X 1, Y 1, X 2, Z = c), E(Y 2 X 1, Y 1, Z = c) Effcent estmator s maxmum lkelhood Don t need to even worry about π (c) (X 1, Y 1, X 2) But no double robustness property! For dscusson see: Bang and Robns (2005, Bometrcs) Greenberg Lecture II: Pretest-Posttest Study 37 How dd we get the full-data nfluence functons? One way use classcal semparametrc theory Another way Vew as a fake mssng data problem by castng the full-data problem n terms of counterfactuals Counterfactual representaton: Y (0) 2, Y (1) 2 are potental posttest responses f a subject were assgned to control or treatment We observe Y 2 = ZY (1) 2 + (1 Z)Y (0) 2 Fake full data (X 1, Y 1, X 2, Y (0) 2, Y (1) 2, Z) Fake observed data (X 1, Y 1, X 2, Z, ZY (1) 2, (1 Z)Y (0) 2 ) Apply the RRZ theory Greenberg Lecture II: Pretest-Posttest Study Smulaton evdence 7. Smulaton evdence Full-data problem: Substantal gans n effcency over popular methods, especally when there are nonlnear relatonshps among varables Parametrc and nonparametrc regresson modelng work well Vald standard errors, confdence ntervals Observed-data problem: Popular methods wth complete cases or LOCF can exhbt substantal bases Inverse-weghted complete case estmator s unbased but neffcent Substantal gans n effcency possble through modelng Vald standard errors, confdence ntervals Full data scenaro: 5000 Monte Carlo data sets = 0.5, Y 1 N (0, 1), Y 2 = µ (0) 2 +βz+β1y1+β2y 2 1 +ɛ, ɛ N (0, 1) µ (0) 2 = 0.25, (β 1, β 2) = (0.5, 0.4), β = 0.5 Estmator MC Mean MC SD Est SE MSE Rato Coverage n New-LOESS New-Quadratc ANCOVA-Interact ANCOVA Pared samp New-LOESS New-Quadratc ANCOVA-Interact ANCOVA Pared samp Greenberg Lecture II: Pretest-Posttest Study 39 Greenberg Lecture II: Pretest-Posttest Study Smulaton evdence Mssng data scenaro: 1000 Monte Carlo data sets = 0.5, Y 1 N (0, 1), ɛ N (0, 1) P (X 2 = 1 Y 1, Z) = (1 Z)/{1+exp( Y 1)}+Z/[1+exp( Y 1)} P (R = 1 Y 1, X 2, Z) = (1 Z)/{1 + exp( 0.2 2Y 1 0.1X 2 0.1X 2Y 1)} + Z/{1 + exp( 1.0Y X 2 0.1X 2Y 1)} yeldng 20% (Z = 0) and 34% (Z = 1) mssng Y 2 = 0.5Z + [ {X 2 E(X 2)}] exp(y 1) + 2.0{X 2 E(X 2)} + ɛ mples a model for E(Y 2 Y 1, Z) Overall, mples β = Smulaton evdence For n = 250 Estmator MC Mean MC SD Est SE MSE Rato Coverage Benchmark New-LOESS New-Quadratc New-Quadratc-π wrong IWCC GEE ANCOVA-Interact ANCOVA Pared samp For n = 1000 Any approach to modelng regresson (LOESS, Quadratc) acheves almost full effcency relatve to Benchmark (MSE ratos > 0.98) Popular estmators on complete cases stll horrbly based Greenberg Lecture II: Pretest-Posttest Study 41 Greenberg Lecture II: Pretest-Posttest Study 42

8 8. Applcaton ACTG Applcaton ACTG 175 Recall: Y 2 = CD4 at 20±5 weeks vs. Y 1 = (control and treatment groups) Apparent curvature follow-up CD4 at 20+/-5 weeks follow-up CD4 at 20+/-5 weeks Results: Models for E(Y 2 X 1, Y 1, Z = c), c = 0, 1 Estmator β SE Parametrc modelng (quadratc n Y 1) Nonparametrc modelng (GAM) ANCOVA Pared t Two-sample t Greenberg Lecture II: Pretest-Posttest Study 43 Greenberg Lecture II: Pretest-Posttest Study Applcaton ACTG Applcaton ACTG 175 Complete cases: Y 2 = CD4 at 96±5 weeks vs. Y 1 = (control and treatment groups) Results: Logstc regresson for π (c), c = 0, 1; parametrc regresson modelng of E(Y 2 X 1, Y 1, X 2, Z = c), E(Y 2 X 1, Y 1, Z = c) 37% mssng Y 2 follow-up CD4 at 96 ± 5 weeks follow-up CD4 at 96 ± 5 weeks Estmator β SE Parametrc modelng (quadratc n Y 1) Smple nverse-weghtng ANCOVA Pared t Greenberg Lecture II: Pretest-Posttest Study 45 Greenberg Lecture II: Pretest-Posttest Study Dscusson 9. Dscusson RRZ theory appled to a standard problem For more than you ever wanted to know: General framework for pretest-posttest analyss llumnatng how relatonshps among varables may be frutfully exploted Practcal estmators Can be extended to censored covarate nformaton Results are equally applcable to baselne covarate adjustment n comparson of two means (Y 1 s just another baselne covarate) Lots of methods for ths problem (lkelhood, mputaton combnatons thereof,... ); semparametrc theory provdes a framework for understandng commonaltes and dfferences among them Avalable June 2006! Greenberg Lecture II: Pretest-Posttest Study 47 Greenberg Lecture II: Pretest-Posttest Study 48

Semiparametric Estimation of Treatment Effect in a Pretest Posttest Study with Missing Data

Semiparametric Estimation of Treatment Effect in a Pretest Posttest Study with Missing Data Statstcal Scence 005, Vol. 0, No. 3, 61 301 DOI 10.114/08834305000000151 Insttute of Mathematcal Statstcs, 005 Semparametrc Estmaton of Treatment Effect n a Pretest Posttest Study wth Mssng Data Mare Davdan,

More information

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010 Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Survey Workng Group Semnar March 29, 2010 1 Outlne Introducton Proposed method Fractonal mputaton Approxmaton Varance estmaton Multple mputaton

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

Composite Hypotheses testing

Composite Hypotheses testing Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

Economics 130. Lecture 4 Simple Linear Regression Continued

Economics 130. Lecture 4 Simple Linear Regression Continued Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do

More information

Efficient nonresponse weighting adjustment using estimated response probability

Efficient nonresponse weighting adjustment using estimated response probability Effcent nonresponse weghtng adjustment usng estmated response probablty Jae Kwang Km Department of Appled Statstcs, Yonse Unversty, Seoul, 120-749, KOREA Key Words: Regresson estmator, Propensty score,

More information

Lecture 4 Hypothesis Testing

Lecture 4 Hypothesis Testing Lecture 4 Hypothess Testng We may wsh to test pror hypotheses about the coeffcents we estmate. We can use the estmates to test whether the data rejects our hypothess. An example mght be that we wsh to

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

# c i. INFERENCE FOR CONTRASTS (Chapter 4) It's unbiased: Recall: A contrast is a linear combination of effects with coefficients summing to zero:

# c i. INFERENCE FOR CONTRASTS (Chapter 4) It's unbiased: Recall: A contrast is a linear combination of effects with coefficients summing to zero: 1 INFERENCE FOR CONTRASTS (Chapter 4 Recall: A contrast s a lnear combnaton of effects wth coeffcents summng to zero: " where " = 0. Specfc types of contrasts of nterest nclude: Dfferences n effects Dfferences

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

Limited Dependent Variables

Limited Dependent Variables Lmted Dependent Varables. What f the left-hand sde varable s not a contnuous thng spread from mnus nfnty to plus nfnty? That s, gven a model = f (, β, ε, where a. s bounded below at zero, such as wages

More information

Properties of Least Squares

Properties of Least Squares Week 3 3.1 Smple Lnear Regresson Model 3. Propertes of Least Squares Estmators Y Y β 1 + β X + u weekly famly expendtures X weekly famly ncome For a gven level of x, the expected level of food expendtures

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

The Ordinary Least Squares (OLS) Estimator

The Ordinary Least Squares (OLS) Estimator The Ordnary Least Squares (OLS) Estmator 1 Regresson Analyss Regresson Analyss: a statstcal technque for nvestgatng and modelng the relatonshp between varables. Applcatons: Engneerng, the physcal and chemcal

More information

LOGIT ANALYSIS. A.K. VASISHT Indian Agricultural Statistics Research Institute, Library Avenue, New Delhi

LOGIT ANALYSIS. A.K. VASISHT Indian Agricultural Statistics Research Institute, Library Avenue, New Delhi LOGIT ANALYSIS A.K. VASISHT Indan Agrcultural Statstcs Research Insttute, Lbrary Avenue, New Delh-0 02 amtvassht@asr.res.n. Introducton In dummy regresson varable models, t s assumed mplctly that the dependent

More information

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction ECONOMICS 35* -- NOTE 7 ECON 35* -- NOTE 7 Interval Estmaton n the Classcal Normal Lnear Regresson Model Ths note outlnes the basc elements of nterval estmaton n the Classcal Normal Lnear Regresson Model

More information

A Robust Method for Calculating the Correlation Coefficient

A Robust Method for Calculating the Correlation Coefficient A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal

More information

Stat260: Bayesian Modeling and Inference Lecture Date: February 22, Reference Priors

Stat260: Bayesian Modeling and Inference Lecture Date: February 22, Reference Priors Stat60: Bayesan Modelng and Inference Lecture Date: February, 00 Reference Prors Lecturer: Mchael I. Jordan Scrbe: Steven Troxler and Wayne Lee In ths lecture, we assume that θ R; n hgher-dmensons, reference

More information

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U) Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of

More information

Basically, if you have a dummy dependent variable you will be estimating a probability.

Basically, if you have a dummy dependent variable you will be estimating a probability. ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy

More information

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications Durban Watson for Testng the Lack-of-Ft of Polynomal Regresson Models wthout Replcatons Ruba A. Alyaf, Maha A. Omar, Abdullah A. Al-Shha ralyaf@ksu.edu.sa, maomar@ksu.edu.sa, aalshha@ksu.edu.sa Department

More information

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution Department of Statstcs Unversty of Toronto STA35HS / HS Desgn and Analyss of Experments Term Test - Wnter - Soluton February, Last Name: Frst Name: Student Number: Instructons: Tme: hours. Ads: a non-programmable

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.

More information

Primer on High-Order Moment Estimators

Primer on High-Order Moment Estimators Prmer on Hgh-Order Moment Estmators Ton M. Whted July 2007 The Errors-n-Varables Model We wll start wth the classcal EIV for one msmeasured regressor. The general case s n Erckson and Whted Econometrc

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

Factor models with many assets: strong factors, weak factors, and the two-pass procedure

Factor models with many assets: strong factors, weak factors, and the two-pass procedure Factor models wth many assets: strong factors, weak factors, and the two-pass procedure Stanslav Anatolyev 1 Anna Mkusheva 2 1 CERGE-EI and NES 2 MIT December 2017 Stanslav Anatolyev and Anna Mkusheva

More information

Computing MLE Bias Empirically

Computing MLE Bias Empirically Computng MLE Bas Emprcally Kar Wa Lm Australan atonal Unversty January 3, 27 Abstract Ths note studes the bas arses from the MLE estmate of the rate parameter and the mean parameter of an exponental dstrbuton.

More information

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition) Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes

More information

Linear Approximation with Regularization and Moving Least Squares

Linear Approximation with Regularization and Moving Least Squares Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

Lecture 3 Stat102, Spring 2007

Lecture 3 Stat102, Spring 2007 Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture

More information

Chapter 12 Analysis of Covariance

Chapter 12 Analysis of Covariance Chapter Analyss of Covarance Any scentfc experment s performed to know somethng that s unknown about a group of treatments and to test certan hypothess about the correspondng treatment effect When varablty

More information

January Examinations 2015

January Examinations 2015 24/5 Canddates Only January Examnatons 25 DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR STUDENT CANDIDATE NO.. Department Module Code Module Ttle Exam Duraton (n words)

More information

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models Computaton of Hgher Order Moments from Two Multnomal Overdsperson Lkelhood Models BY J. T. NEWCOMER, N. K. NEERCHAL Department of Mathematcs and Statstcs, Unversty of Maryland, Baltmore County, Baltmore,

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

Non-Mixture Cure Model for Interval Censored Data: Simulation Study ABSTRACT

Non-Mixture Cure Model for Interval Censored Data: Simulation Study ABSTRACT Malaysan Journal of Mathematcal Scences 8(S): 37-44 (2014) Specal Issue: Internatonal Conference on Mathematcal Scences and Statstcs 2013 (ICMSS2013) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES Journal

More information

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Maxmum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models

More information

A New Method for Estimating Overdispersion. David Fletcher and Peter Green Department of Mathematics and Statistics

A New Method for Estimating Overdispersion. David Fletcher and Peter Green Department of Mathematics and Statistics A New Method for Estmatng Overdsperson Davd Fletcher and Peter Green Department of Mathematcs and Statstcs Byron Morgan Insttute of Mathematcs, Statstcs and Actuaral Scence Unversty of Kent, England Overvew

More information

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number

More information

Lecture 10 Support Vector Machines II

Lecture 10 Support Vector Machines II Lecture 10 Support Vector Machnes II 22 February 2016 Taylor B. Arnold Yale Statstcs STAT 365/665 1/28 Notes: Problem 3 s posted and due ths upcomng Frday There was an early bug n the fake-test data; fxed

More information

Homework Assignment 3 Due in class, Thursday October 15

Homework Assignment 3 Due in class, Thursday October 15 Homework Assgnment 3 Due n class, Thursday October 15 SDS 383C Statstcal Modelng I 1 Rdge regresson and Lasso 1. Get the Prostrate cancer data from http://statweb.stanford.edu/~tbs/elemstatlearn/ datasets/prostate.data.

More information

Methods Lunch Talk: Causal Mediation Analysis

Methods Lunch Talk: Causal Mediation Analysis Methods Lunch Talk: Causal Medaton Analyss Taeyong Park Washngton Unversty n St. Lous Aprl 9, 2015 Park (Wash U.) Methods Lunch Aprl 9, 2015 1 / 1 References Baron and Kenny. 1986. The Moderator-Medator

More information

Introduction to Regression

Introduction to Regression Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes

More information

Topic 23 - Randomized Complete Block Designs (RCBD)

Topic 23 - Randomized Complete Block Designs (RCBD) Topc 3 ANOVA (III) 3-1 Topc 3 - Randomzed Complete Block Desgns (RCBD) Defn: A Randomzed Complete Block Desgn s a varant of the completely randomzed desgn (CRD) that we recently learned. In ths desgn,

More information

9. Binary Dependent Variables

9. Binary Dependent Variables 9. Bnar Dependent Varables 9. Homogeneous models Log, prob models Inference Tax preparers 9.2 Random effects models 9.3 Fxed effects models 9.4 Margnal models and GEE Appendx 9A - Lkelhood calculatons

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

β0 + β1xi and want to estimate the unknown

β0 + β1xi and want to estimate the unknown SLR Models Estmaton Those OLS Estmates Estmators (e ante) v. estmates (e post) The Smple Lnear Regresson (SLR) Condtons -4 An Asde: The Populaton Regresson Functon B and B are Lnear Estmators (condtonal

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VIII LECTURE - 34 ANALYSIS OF VARIANCE IN RANDOM-EFFECTS MODEL AND MIXED-EFFECTS EFFECTS MODEL Dr Shalabh Department of Mathematcs and Statstcs Indan

More information

Time-Varying Coefficient Model with Linear Smoothing Function for Longitudinal Data in Clinical Trial

Time-Varying Coefficient Model with Linear Smoothing Function for Longitudinal Data in Clinical Trial Tme-Varyng Coeffcent Model wth Lnear Smoothng Functon for Longtudnal Data n Clncal Tral Masanor Ito, Toshhro Msum and Hdek Hrooka Bostatstcs Group, Data Scence Dept., Astellas Pharma Inc. Introducton In

More information

Chapter 6. Supplemental Text Material

Chapter 6. Supplemental Text Material Chapter 6. Supplemental Text Materal S6-. actor Effect Estmates are Least Squares Estmates We have gven heurstc or ntutve explanatons of how the estmates of the factor effects are obtaned n the textboo.

More information

Continuous vs. Discrete Goods

Continuous vs. Discrete Goods CE 651 Transportaton Economcs Charsma Choudhury Lecture 3-4 Analyss of Demand Contnuous vs. Dscrete Goods Contnuous Goods Dscrete Goods x auto 1 Indfference u curves 3 u u 1 x 1 0 1 bus Outlne Data Modelng

More information

Chapter 5: Hypothesis Tests, Confidence Intervals & Gauss-Markov Result

Chapter 5: Hypothesis Tests, Confidence Intervals & Gauss-Markov Result Chapter 5: Hypothess Tests, Confdence Intervals & Gauss-Markov Result 1-1 Outlne 1. The standard error of 2. Hypothess tests concernng β 1 3. Confdence ntervals for β 1 4. Regresson when X s bnary 5. Heteroskedastcty

More information

How its computed. y outcome data λ parameters hyperparameters. where P denotes the Laplace approximation. k i k k. Andrew B Lawson 2013

How its computed. y outcome data λ parameters hyperparameters. where P denotes the Laplace approximation. k i k k. Andrew B Lawson 2013 Andrew Lawson MUSC INLA INLA s a relatvely new tool that can be used to approxmate posteror dstrbutons n Bayesan models INLA stands for ntegrated Nested Laplace Approxmaton The approxmaton has been known

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

Number of cases Number of factors Number of covariates Number of levels of factor i. Value of the dependent variable for case k

Number of cases Number of factors Number of covariates Number of levels of factor i. Value of the dependent variable for case k ANOVA Model and Matrx Computatons Notaton The followng notaton s used throughout ths chapter unless otherwse stated: N F CN Y Z j w W Number of cases Number of factors Number of covarates Number of levels

More information

Econ107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10)

Econ107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10) I. Defnton and Problems Econ7 Appled Econometrcs Topc 9: Heteroskedastcty (Studenmund, Chapter ) We now relax another classcal assumpton. Ths s a problem that arses often wth cross sectons of ndvduals,

More information

Chapter 9: Statistical Inference and the Relationship between Two Variables

Chapter 9: Statistical Inference and the Relationship between Two Variables Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,

More information

Chapter 11: I = 2 samples independent samples paired samples Chapter 12: I 3 samples of equal size J one-way layout two-way layout

Chapter 11: I = 2 samples independent samples paired samples Chapter 12: I 3 samples of equal size J one-way layout two-way layout Serk Sagtov, Chalmers and GU, February 0, 018 Chapter 1. Analyss of varance Chapter 11: I = samples ndependent samples pared samples Chapter 1: I 3 samples of equal sze one-way layout two-way layout 1

More information

Linear Feature Engineering 11

Linear Feature Engineering 11 Lnear Feature Engneerng 11 2 Least-Squares 2.1 Smple least-squares Consder the followng dataset. We have a bunch of nputs x and correspondng outputs y. The partcular values n ths dataset are x y 0.23 0.19

More information

Lecture 12: Classification

Lecture 12: Classification Lecture : Classfcaton g Dscrmnant functons g The optmal Bayes classfer g Quadratc classfers g Eucldean and Mahalanobs metrcs g K Nearest Neghbor Classfers Intellgent Sensor Systems Rcardo Guterrez-Osuna

More information

For now, let us focus on a specific model of neurons. These are simplified from reality but can achieve remarkable results.

For now, let us focus on a specific model of neurons. These are simplified from reality but can achieve remarkable results. Neural Networks : Dervaton compled by Alvn Wan from Professor Jtendra Malk s lecture Ths type of computaton s called deep learnng and s the most popular method for many problems, such as computer vson

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

STAT 405 BIOSTATISTICS (Fall 2016) Handout 15 Introduction to Logistic Regression

STAT 405 BIOSTATISTICS (Fall 2016) Handout 15 Introduction to Logistic Regression STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson Ths handout covers materal found n Secton 3.7 of your text. You may also want to revew regresson technques n Chapter. In ths handout,

More information

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module 3 LOSSY IMAGE COMPRESSION SYSTEMS Verson ECE IIT, Kharagpur Lesson 6 Theory of Quantzaton Verson ECE IIT, Kharagpur Instructonal Objectves At the end of ths lesson, the students should be able to:

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

NUMERICAL DIFFERENTIATION

NUMERICAL DIFFERENTIATION NUMERICAL DIFFERENTIATION 1 Introducton Dfferentaton s a method to compute the rate at whch a dependent output y changes wth respect to the change n the ndependent nput x. Ths rate of change s called the

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

10-701/ Machine Learning, Fall 2005 Homework 3

10-701/ Machine Learning, Fall 2005 Homework 3 10-701/15-781 Machne Learnng, Fall 2005 Homework 3 Out: 10/20/05 Due: begnnng of the class 11/01/05 Instructons Contact questons-10701@autonlaborg for queston Problem 1 Regresson and Cross-valdaton [40

More information

Issues To Consider when Estimating Health Care Costs with Generalized Linear Models (GLMs): To Gamma/Log Or Not To Gamma/Log? That Is The New Question

Issues To Consider when Estimating Health Care Costs with Generalized Linear Models (GLMs): To Gamma/Log Or Not To Gamma/Log? That Is The New Question Issues To Consder when Estmatng Health Care Costs wth Generalzed Lnear Models (GLMs): To Gamma/Log Or Not To Gamma/Log? That Is The New Queston ISPOR 20th Annual Internatonal Meetng May 19, 2015 Jalpa

More information

STK4080/9080 Survival and event history analysis

STK4080/9080 Survival and event history analysis SK48/98 Survval and event hstory analyss Lecture 7: Regresson modellng Relatve rsk regresson Regresson models Assume that we have a sample of n ndvduals, and let N (t) count the observed occurrences of

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an

More information

Estimation: Part 2. Chapter GREG estimation

Estimation: Part 2. Chapter GREG estimation Chapter 9 Estmaton: Part 2 9. GREG estmaton In Chapter 8, we have seen that the regresson estmator s an effcent estmator when there s a lnear relatonshp between y and x. In ths chapter, we generalzed the

More information

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise. Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study

More information

Advances in Longitudinal Methods in the Social and Behavioral Sciences. Finite Mixtures of Nonlinear Mixed-Effects Models.

Advances in Longitudinal Methods in the Social and Behavioral Sciences. Finite Mixtures of Nonlinear Mixed-Effects Models. Advances n Longtudnal Methods n the Socal and Behavoral Scences Fnte Mxtures of Nonlnear Mxed-Effects Models Jeff Harrng Department of Measurement, Statstcs and Evaluaton The Center for Integrated Latent

More information

ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE

ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE P a g e ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE Darmud O Drscoll ¹, Donald E. Ramrez ² ¹ Head of Department of Mathematcs and Computer Studes

More information

Bayesian predictive Configural Frequency Analysis

Bayesian predictive Configural Frequency Analysis Psychologcal Test and Assessment Modelng, Volume 54, 2012 (3), 285-292 Bayesan predctve Confgural Frequency Analyss Eduardo Gutérrez-Peña 1 Abstract Confgural Frequency Analyss s a method for cell-wse

More information

8/25/17. Data Modeling. Data Modeling. Data Modeling. Patrice Koehl Department of Biological Sciences National University of Singapore

8/25/17. Data Modeling. Data Modeling. Data Modeling. Patrice Koehl Department of Biological Sciences National University of Singapore 8/5/17 Data Modelng Patrce Koehl Department of Bologcal Scences atonal Unversty of Sngapore http://www.cs.ucdavs.edu/~koehl/teachng/bl59 koehl@cs.ucdavs.edu Data Modelng Ø Data Modelng: least squares Ø

More information

Now we relax this assumption and allow that the error variance depends on the independent variables, i.e., heteroskedasticity

Now we relax this assumption and allow that the error variance depends on the independent variables, i.e., heteroskedasticity ECON 48 / WH Hong Heteroskedastcty. Consequences of Heteroskedastcty for OLS Assumpton MLR. 5: Homoskedastcty var ( u x ) = σ Now we relax ths assumpton and allow that the error varance depends on the

More information

Systems of Equations (SUR, GMM, and 3SLS)

Systems of Equations (SUR, GMM, and 3SLS) Lecture otes on Advanced Econometrcs Takash Yamano Fall Semester 4 Lecture 4: Sstems of Equatons (SUR, MM, and 3SLS) Seemngl Unrelated Regresson (SUR) Model Consder a set of lnear equatons: $ + ɛ $ + ɛ

More information

Laboratory 3: Method of Least Squares

Laboratory 3: Method of Least Squares Laboratory 3: Method of Least Squares Introducton Consder the graph of expermental data n Fgure 1. In ths experment x s the ndependent varable and y the dependent varable. Clearly they are correlated wth

More information

An Introduction to Censoring, Truncation and Sample Selection Problems

An Introduction to Censoring, Truncation and Sample Selection Problems An Introducton to Censorng, Truncaton and Sample Selecton Problems Thomas Crossley SPIDA June 2003 1 A. Introducton A.1 Basc Ideas Most of the statstcal technques we study are for estmatng (populaton)

More information

Statistics II Final Exam 26/6/18

Statistics II Final Exam 26/6/18 Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the

More information

Multinomial logit regression

Multinomial logit regression 07/0/6 Multnomal logt regresson Introducton We now turn our attenton to regresson models for the analyss of categorcal dependent varables wth more than two response categores: Y car owned (many possble

More information

Web-based Supplementary Materials for Inference for the Effect of Treatment. on Survival Probability in Randomized Trials with Noncompliance and

Web-based Supplementary Materials for Inference for the Effect of Treatment. on Survival Probability in Randomized Trials with Noncompliance and Bometrcs 000, 000 000 DOI: 000 000 0000 Web-based Supplementary Materals for Inference for the Effect of Treatment on Survval Probablty n Randomzed Trals wth Noncomplance and Admnstratve Censorng by Ne,

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data Lab : TWO-LEVEL NORMAL MODELS wth school chldren popularty data Purpose: Introduce basc two-level models for normally dstrbuted responses usng STATA. In partcular, we dscuss Random ntercept models wthout

More information

PHYS 450 Spring semester Lecture 02: Dealing with Experimental Uncertainties. Ron Reifenberger Birck Nanotechnology Center Purdue University

PHYS 450 Spring semester Lecture 02: Dealing with Experimental Uncertainties. Ron Reifenberger Birck Nanotechnology Center Purdue University PHYS 45 Sprng semester 7 Lecture : Dealng wth Expermental Uncertantes Ron Refenberger Brck anotechnology Center Purdue Unversty Lecture Introductory Comments Expermental errors (really expermental uncertantes)

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation Econ 388 R. Butler 204 revsons Lecture 4 Dummy Dependent Varables I. Lnear Probablty Model: the Regresson model wth a dummy varables as the dependent varable assumpton, mplcaton regular multple regresson

More information

Laboratory 1c: Method of Least Squares

Laboratory 1c: Method of Least Squares Lab 1c, Least Squares Laboratory 1c: Method of Least Squares Introducton Consder the graph of expermental data n Fgure 1. In ths experment x s the ndependent varable and y the dependent varable. Clearly

More information

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE)

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE) ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE) June 7, 016 15:30 Frst famly name: Name: DNI/ID: Moble: Second famly Name: GECO/GADE: Instructor: E-mal: Queston 1 A B C Blank Queston A B C Blank Queston

More information

Uncertainty as the Overlap of Alternate Conditional Distributions

Uncertainty as the Overlap of Alternate Conditional Distributions Uncertanty as the Overlap of Alternate Condtonal Dstrbutons Olena Babak and Clayton V. Deutsch Centre for Computatonal Geostatstcs Department of Cvl & Envronmental Engneerng Unversty of Alberta An mportant

More information