Mixed Models for Longitudinal Ordinal and Nominal Outcomes. Don Hedeker Department of Public Health Sciences University of Chicago

Size: px
Start display at page:

Download "Mixed Models for Longitudinal Ordinal and Nominal Outcomes. Don Hedeker Department of Public Health Sciences University of Chicago"

Transcription

1 Mixed Models for Longitudinal Ordinal and Nominal Outcomes Don Hedeker Department of Public Health Sciences University of Chicago Hedeker, D. (2008). Multilevel models for ordinal and nominal variables. In J. de Leeuw & E. Meijer (Eds.), Handbook of Multilevel Analysis. Springer, New York. Hedeker, D. (2015). Methods for multilevel ordinal data in prevention research. Prevention Science. Hedeker, D. & Gibbons, R.D. (2006). Longitudinal Data Analysis, chapters 10 & 11. Wiley. 1

2 Mixed-effects ordinal logistic regression model (Hedeker & Gibbons, 1994, 1996) i = 1,... N level-2 units (clusters or subjects) j = 1,..., n i level-1 units (subjects or repeated observations) c = 1, 2,..., C response categories Y ij = ordinal response of level-2 unit i and level-1 unit j How was your day? (asked repeatedly each day for a week) 2

3 Mixed-effects ordinal logistic regression model λ ijc = log P ijc = γ c (x (1 P ijc ) ijβ + z ijυ i ) P ijc = Pr (Y ij c υ ; γ c, β, Σ υ ) = 1 1+exp( λ ijc ) p ijc = Pr (Y ij = c υ ; γ c, β, Σ υ ) = P ijc P ijc 1 C 1 strictly increasing model thresholds γ c x ij = p 1 covariate vector z ij = r 1 design vector for random effects β = p 1 fixed regression parameters υ i = r 1 random effects for level-2 unit i N(0, Σ υ ) 3

4 Model for Latent Continuous Responses Model with p covariates for the latent response strength y ij : y ij = x ijβ + υ 0i + ε ij where υ 0i N(0, σ 2 υ), and assuming ε ij standard normal (mean 0 and σ 2 = 1) leads to mixed-effects ordinal probit regression ε ij standard logistic (mean 0 and σ 2 = π 2 /3) leads to mixed-effects ordinal logistic regression 4

5 Underlying latent variable not an essential assumption of the model useful for obtaining intra-class correlation (r) and for design effect (d) r = σ2 υ σ 2 υ + σ 2 d = σ2 υ + σ 2 σ 2 = 1/(1 r) ratio of actual variance to the variance that would be obtained by simple random sampling (holding sample size constant) 5

6 Mixed-effects model β estimates from mixed-effects (random intercepts) model are larger (in abs. value) than from fixed-effects model by approximately because d = συ 2 + σ 2 σ 2 V (y) = σ 2 υ + σ 2 in mixed-effects (random intercepts) model V (y) = σ 2 in fixed-effects model difference depends on size of random-effects variance σ 2 υ more complex for models with multiple random effects 6

7 Treatment-Related Change Across Time Data from the NIMH Schizophrenia collaborative study on treatment related changes in overall severity. IMPS item 79, Severity of Illness, was scored as: 1 = normal or borderline mentally ill 2 = mildly or moderately ill 3 = markedly ill 4 = severely or among the most extremely ill The experimental design and corresponding sample sizes: Sample size at Week Group completers PLC (n=108) % DRUG (n=329) % Drug = Chlorpromazine, Fluphenazine, or Thioridazine Main question of interest: Was there differential improvement for the drug groups relative to the control group? 7

8 Stata code: schiz ordinal logits cd "u:\stata_long\" log using schiz_ordinal_logits.log, replace infile id imps79 imps79b imps79o inter tx week sweek txswk /// using SCHIZX1.DAT.txt, clear recode imps79 imps79b imps79o (-9=.) /* only use weeks 0, 1, 3 6 for these descriptives */ keep if week == 0 week == 1 week == 3 week == 6 tabulate imps79o, generate(imps79oi) collapse (mean) p1=imps79oi1 p2=imps79oi2 p3=imps79oi3 /// p4=imps79oi4, by(tx sweek) list 8

9 tx sweek p1 p2 p3 p

10 . gen codds1 = (p2 + p3 + p4) / p1 (1 missing value generated). gen codds2 = (p3 + p4) / (p1 + p2). gen codds3 = p4 / (p1 + p2 + p3). list tx sweek codds1 codds2 codds tx sweek codds1 codds2 codds

11 . gen clogit1 = log(codds1) (1 missing value generated). gen clogit2 = log(codds2). gen clogit3 = log(codds3). list tx sweek clogit1 clogit2 clogit tx sweek clogit1 clogit2 clogit

12 * graph cumulative logits graph twoway (connected clogit1 sweek if tx==0) /// (connected clogit1 sweek if tx==1), /// legend(label(1 "placebo") label(2 "drug")) /// yscale(range(0 4)) xlabel(0(.5)2.5) graph twoway (connected clogit2 sweek if tx==0) /// (connected clogit2 sweek if tx==1), /// legend(label(1 "placebo") label(2 "drug")) /// yscale(range( )) graph twoway (connected clogit3 sweek if tx==0) /// (connected clogit3 sweek if tx==1), /// legend(label(1 "placebo") label(2 "drug")) /// yscale(range(-3-1)) 12

13 clogit sweek placebo drug clogit sweek placebo drug clogit sweek placebo drug 13

14 Under SSI, Inc > SuperMix (English) or SuperMix (English) Student Under File click on Open Spreadsheet Open C:\SuperMixEn Examples\Workshop\Binary\SCHIZX1.ss3 (or C:\SuperMixEn Student Examples\Workshop\Binary\SCHIZX1.ss3) 14

15 C:\SuperMixEn Examples\Workshop\Binary\SCHIZX1.ss3 15

16 Select Imps79O column, then Edit > Set Missing Value 16

17 Within-Subjects / Between-Subjects components Within-subjects model - level 1 (j = 1,..., n i obs) λ ijc = γ c [b 0i + b 1i W eekj ] Between-subjects model - level 2 (i = 1,..., N subjects) b 0i = β 0 + β 2 Grp i + υ 0i b 1i = β 1 + β 3 Grp i + υ 1i υ i N ID(0, Σ υ ) 17

18 Under File click on Open Existing Model Setup Open C:\SuperMixEn Examples\Workshop\Ordinal\schizo2.mum (or C:\SuperMixEn Student Examples\Workshop\Ordinal\schizo2.mum) 18

19 Note that Dependent Variable Type is ordered 19

20 Note the lack of TxDrug as an explanatory variable 20

21 Make sure Optimization Method is set to adaptive quadrature 21

22 Note: Cumulative Logit link function 22

23 Category response indicators (IMPS79O1-IMPS79O4); results of fixed-effects model (to be ignored, or for comparison purposes) 23

24 24

25 25

26 26

27 Model Fit of Observed Proportions 27

28 Stata code: schiz ordinal mataest /* random int and trend model with sweek + tx*sweek from Supermix: Population Average Estimates Standard Parameter Estimate Error z Value P Value Threshold Threshold Threshold SqrtWeek Tx*SWeek */ mata beta = ( \ ) xmat_0 = (0, 0 \ 1, 0 \ , 0 \ , 0) xmat_1 = (0, 0 \ 1, 1 \ , \ , ) xbeta_0 = xmat_0*beta xbeta_1 = xmat_1*beta cut1 = * J(4,1,1) cut2 = * J(4,1,1) cut3 = * J(4,1,1) 28

29 /* results for placebo group */ clogit1_0 = cut1 - xbeta_0 clogit2_0 = cut2 - xbeta_0 clogit3_0 = cut3 - xbeta_0 cprob1_0 = invlogit(clogit1_0) cprob2_0 = invlogit(clogit2_0) cprob3_0 = invlogit(clogit3_0) prob1_0 = cprob1_0 prob2_0 = cprob2_0 - cprob1_0 prob3_0 = cprob3_0 - cprob2_0 prob4_0 = J(4,1,1) - cprob3_0 prob1_0 prob2_0 prob3_0 prob4_0 29

30 : prob1_ : prob2_ : prob3_ : prob4_

31 /* results for drug group */ clogit1_1 = cut1 - xbeta_1 clogit2_1 = cut2 - xbeta_1 clogit3_1 = cut3 - xbeta_1 cprob1_1 = invlogit(clogit1_1) cprob2_1 = invlogit(clogit2_1) cprob3_1 = invlogit(clogit3_1) prob1_1 = cprob1_1 prob2_1 = cprob2_1 - cprob1_1 prob3_1 = cprob3_1 - cprob2_1 prob4_1 = J(4,1,1) - cprob3_1 prob1_1 prob2_1 prob3_1 prob4_1 end 31

32 : prob1_ : prob2_ : prob3_ : prob4_

33 SAS IML code: SCHZOFIT.SAS - computing marginal probabilities - ordinal model adapted from syntax at (Week 12) TITLE1 NIMH Schizophrenia Data - Estimated Marginal Probabilities ; PROC IML; /* Results from random intercept and trend model */; /* using Population Average Estimates */; x0 = { , , , }; x1 = { , , , }; beta = { , }; thresh = { , , }; 33

34 za0 = (thresh[1] - x0*beta) ; zb0 = (thresh[2] - x0*beta) ; zc0 = (thresh[3] - x0*beta) ; za1 = (thresh[1] - x1*beta) ; zb1 = (thresh[2] - x1*beta) ; zc1 = (thresh[3] - x1*beta) ; grp0a = 1 / ( 1 + EXP(- za0)); grp0b = 1 / ( 1 + EXP(- zb0)); grp0c = 1 / ( 1 + EXP(- zc0)); grp1a = 1 / ( 1 + EXP(- za1)); grp1b = 1 / ( 1 + EXP(- zb1)); grp1c = 1 / ( 1 + EXP(- zc1)); print Random intercept and trend model ; print using Population Average Estimates ; print marginal prob for group 0 - catg 1 grp0a [FORMAT=8.4]; print marginal prob for group 0 - catg 2 (grp0b-grp0a) [FORMAT=8.4]; print marginal prob for group 0 - catg 3 (grp0c-grp0b) [FORMAT=8.4]; print marginal prob for group 0 - catg 4 (1-grp0c) [FORMAT=8.4]; print marginal prob for group 1 - catg 1 grp1a [FORMAT=8.4]; print marginal prob for group 1 - catg 2 (grp1b-grp1a) [FORMAT=8.4]; print marginal prob for group 1 - catg 3 (grp1c-grp1b) [FORMAT=8.4]; print marginal prob for group 1 - catg 4 (1-grp1c) [FORMAT=8.4]; 34

35 Proportional and Non-proportional Odds Proportional Odds model log P (Y ij c) = γ c [ x 1 P (Y ij c) ijβ + z ] ijυ i with υ i N(0, Σ υ ) relationship between the explanatory variables and the cumulative logits does not depend on c effects of x variables DO NOT vary across the C 1 cumulative logits 35

36 Non-Proportional/Partial Proportional Odds model log P (Y ij c) = γ 1 P (Y ij c) 0c [ u ijγ c + x ijβ + z ] ijυ i u ij = h 1 vector for the set of h covariates for which proportional odds is not assumed effects of u variables DO vary across the C 1 cumulative logits more flexible model for ordinal response relations can be used to empirically test proportional odds assumption 36

37 Proportional Odds Assumption: covariate effects are the same across all cumulative logits Response group Absent Mild Severe total control cumulative odds = = 2.7 logit -1 1 treatment cumulative odds = = 4.6 logit group difference =.5 for both cumulative logits 37

38 Non-Proportional Odds: covariate effects vary across the cumulative logits Response group Absent Mild Severe total control cumulative odds = = 2.7 logit -1 1 treatment cumulative odds = = 7.3 logit UNEQUAL group difference across cumulative logits 38

39 Open C:\SuperMixEn Examples\Workshop\Ordinal\schizo2np.mum (or C:\SuperMixEn Student Examples\Workshop\Ordinal\schizo2np.mum) 39

40 Note that Dependent Variable Type is ordered 40

41 Two explanatory variables: SqrtWeek and Tx SWeek 41

42 Explanatory Variable Interactions - both are selected 42

43 Proportional Odds Assumption Accepted: χ 2 4 = =

44 Linear Transforms Fixed part of model: λ c = ˆγ 0c ˆβ1 SqrtWeek + ˆβ 2 Tx*SWeek + ˆγ 1c SqrtWeek + ˆγ 2c Tx*SWeek] cumulative logit variable 1 vs 2,3,4 1,2 vs 3,4 1,2,3, vs 4 SqrtWeek ˆβ1 ˆβ1 + ˆγ 12 ˆβ1 + ˆγ 13 Tx SWeek ˆβ2 ˆβ2 + ˆγ 22 ˆβ2 + ˆγ 23 H 0 : β 1 + γ 12 = 0; SqrtWeek effect is 0 on the 2nd cumulative logit z = ˆβ 1 + ˆγ 12 SE( ˆβ 1 + ˆγ 12 ) 44

45 45

46 46

47 47

48 NIMH Schiz Study: Severity of Illness (N = 437) Ordinal LR Estimates (se) - random intercept and trend model Proportional Non-Proportional Odds Model Odds Model 1 vs 2,3,4 1,2 vs 3,4 1,2,3 vs 4 Time (sqrt week) (0.190) (0.331) (0.224) (0.206) Drug by Time (0.208) (0.288) (0.229) (0.237) 2 log L Proportional Odds accepted (χ 2 4 = = 1.35) 48

49 Stata code: schizo.do log using c:\stata_examples\schizo.log, replace infile id imps79 imps79b imps79o inter tx week sweek txswk /// using clear recode imps79 imps79b imps79o (-9=.) summarize meologit imps79o sweek tx txswk id: meologit imps79o sweek tx txswk id: sweek, /// covariance(unstructured) log close this code is for proportional odds mixed model; Stata doesn t appear to have a procedure for non-proportional odds mixed model (other than treating the outcome as nominal) 49

50 SAS code: schizo.sas FILENAME SchizDat HTTP " DATA one; INFILE SchizDat; INPUT id imps79 imps79b imps79o int tx week sweek txswk ; /* get rid of observations with missing values */ IF imps79 > -9; /* random trend ORDINAL logistic regression via GLIMMIX */ PROC GLIMMIX METHOD=QUAD(QPOINTS=11) NOCLPRINT; CLASS id; MODEL imps79o(desc) = tx sweek tx*sweek / S DIST=MULTINOMIAL LINK=CUMLOGIT; RANDOM INTERCEPT sweek / SUBJECT=id TYPE=UN GCORR; RUN; GLIMMIX only allows for proportional odds mixed model; can use NLMIXED for non-proportional odds mixed model 50

51 /* ordinal NON PROPORTIONAL ODDS random trend model via NLMIXED */ PROC NLMIXED QPOINTS=11; PARMS b1_tx=-.06 b1_sweek=-.77 b1_txswk=-1.21 b2_tx=-.06 b2_sweek=-.77 b2_txswk=-1.21 b3_tx=-.06 b3_sweek=-.77 b3_txswk=-1.21 v0=3.77 c01=0 v1=1 g1=-5.9 g2=-2.8 g3=-.71; z1 = b1_tx*tx + b1_sweek*sweek + b1_txswk*tx*sweek + u0 + u1*sweek; z2 = b2_tx*tx + b2_sweek*sweek + b2_txswk*tx*sweek + u0 + u1*sweek; z3 = b3_tx*tx + b3_sweek*sweek + b3_txswk*tx*sweek + u0 + u1*sweek; IF (imps79o=1) THEN p = 1 / (1 + EXP(-(g1-z1))); ELSE IF (imps79o=2) THEN p = (1/(1 + EXP(-(g2-z2)))) - (1/(1 + EXP(-(g1-z1)))); ELSE IF (imps79o=3) THEN p = (1/(1 + EXP(-(g3-z3)))) - (1/(1 + EXP(-(g2-z2)))); ELSE IF (imps79o=4) THEN p = 1 - (1 / (1 + EXP(-(g3-z3)))); loglik = LOG(p); MODEL imps79o ~ GENERAL(loglik); RANDOM u0 u1 ~ NORMAL([0,0], [v0,c01,v1]) SUBJECT=id; ESTIMATE re corr c01/sqrt(v0*v1); RUN; 51

52 San Diego Homeless Research Project (Hough) 361 mentally ill subjects who were homeless or at very high risk of becoming homeless 2 conditions: HUD Section 8 rental certificates (yes/no) baseline and 6, 12, and 24 month follow-ups Categorical outcome: housing status streets / shelters (Y = 0) community / institutions (Y = 1) independent (Y = 2) Question: Do Section 8 certificates influence housing status across time? 52

53 Stata code: sdhouse ordinal logits.do cd "u:\stata_long\" log using sd_ordinal_logits.log, replace infile id Housing Inter Section8 Time1 Time2 Time3 Sect8T1 Sect8T2 Sect8T3 /// NonSect8 LinTime S8LinT using sdhouse.dat.txt, clear recode Housing (999=.) gen time6mo = LinTime replace time6mo = 4 if (time6mo ==3) tabulate Housing, generate(housingi) collapse (mean) p1=housingi1 p2=housingi2 p3=housingi3, by(section8 time6mo) list gen codds1 = (p2 + p3) / p1 gen codds2 = p3 / (p1 + p2) list Section8 time6mo codds1 codds2 gen clogit1 = log(codds1) gen clogit2 = log(codds2) list Section8 time6mo clogit1 clogit2 53

54 * graph cumulative logits graph twoway (connected clogit1 time6mo if Section8==0) /// (connected clogit1 time6mo if Section8==1), /// legend(label(1 "Non Section8") label(2 "Section8")) /// yscale(range(-.5 3)) xlabel(0(.5)4) graph twoway (connected clogit2 time6mo if Section8==0) /// (connected clogit2 time6mo if Section8==1), /// legend(label(1 "Non Section8") label(2 "Section8")) /// yscale(range( )) log close 54

55 clogit1: independent + community vs street clogit time6mo Non Section8 Section8 clogit2: independent vs community + street clogit time6mo Non Section8 Section8 55

56 Under SSI, Inc > SuperMix (English) or SuperMix (English) Student Under File click on Open Spreadsheet Open C:\SuperMixEn Examples\Workshop\Nominal\SDHOUSE.ss3 (or C:\SuperMixEn Student Examples\Workshop\Nominal\SDHOUSE.ss3) 56

57 C:\SuperMixEn Examples\Workshop\Nominal\SDHOUSE.ss3 57

58 Select Housing column, then Edit > Set Missing Value 58

59 Under File click on Open Existing Model Setup Open C:\SuperMixEn Examples\Workshop\Ordinal\SDO1.mum (or C:\SuperMixEn Student Examples\Workshop\Ordinal\SDO1.mum) 59

60 Note that Dependent Variable Type is ordered 60

61 All explanatory variables are indicator (dummy) variables 61

62 Housing status across time: 1289 observations within 361 subjects Ordinal Mixed Regression Model estimates and standard errors (se) Proportional Odds Non-Proportional Odds Non-street 1 Independent 2 difference term estimate se estimate se estimate se estimate se threshold threshold t1 (6 month) t2 (12 month) t3 (24 month) section 8 (yes=1) section 8 by t section 8 by t section 8 by t subject var (ICC.4) 2 log L (χ 2 7 = 52.14) bold indicates p <.05 italic indicates.05 < p <.10 1 = independent + community vs street 2 = independent vs community + street 62

63 For Non-Proportional Odds model, under File click on Open Existing Model Setup Open C:\SuperMixEn Examples\Workshop\Ordinal\SDO2.mum (or C:\SuperMixEn Student Examples\Workshop\Ordinal\SDO2.mum) 63

64 Note that Dependent Variable Type is ordered 64

65 Note Explanatory Variable Interactions is set to 7 65

66 66

67 Stata code: sdo.do log using c:\stata_examples\sdo.log, replace infile Id Housing Inter Section8 Time1 Time2 Time3 /// Sect8T1 Sect8T2 Sect8T3 NonSect8 LinTime S8LinT /// using clear recode Housing (999=.) summarize meologit Housing Section8 Time1 Time2 Time3 Sect8T1 Sect8T2 Sect8T3 Id: log close this code is for proportional odds mixed model; Stata doesn t appear to have a procedure for non-proportional odds mixed model (other than treating the outcome as nominal) 67

68 SAS code: sdo.sas FILENAME SDDat HTTP " DATA one; INFILE SDDat; INPUT Id Housing Int Section8 Time1 Time2 Time3 Sect8T1 Sect8T2 Sect8T3 NonSect8 LinTime S8LinT ; IF Housing = 999 then Housing =.; /* Random-intercept proportional odds model via GLIMMIX */ PROC GLIMMIX DATA=one METHOD=QUAD(QPOINTS=21) NOCLPRINT; CLASS id; MODEL Housing(DESC) = Time1 Time2 Time3 Section8 Sect8T1 Sect8T2 Sect8T3 / SOLUTION DIST=MULTINOMIAL LINK=CUMLOGIT; RANDOM INTERCEPT / SUBJECT=id; RUN; GLIMMIX only allows for proportional odds mixed model; can use NLMIXED for non-proportional odds mixed model 68

69 /* Random-intercept NON-proportional odds model via NLMIXED */ PROC NLMIXED DATA=one QPOINTS=21; PARMS b1_time1=1.4 b1_time2=1.8 b1_time3=2.0 b1_section8=.4 b1_sect8t1=.9 b1_sect8t2=.9 b1_sect8t3=.4 b2_time1=1.4 b2_time2=1.8 b2_time3=2.0 b2_section8=.4 b2_sect8t1=.9 b2_sect8t2=.9 b2_sect8t3=.4 g1=.2 g2=2.3 varu=1; z1 = b1_time1*time1 + b1_time2*time2 + b1_time3*time3 + b1_section8*section8 + b1_sect8t1*sect8t1 + b1_sect8t2*sect8t2 + b1_sect8t3*sect8t3+ u; z2 = b2_time1*time1 + b2_time2*time2 + b2_time3*time3 + b2_section8*section8 + b2_sect8t1*sect8t1 + b2_sect8t2*sect8t2 + b2_sect8t3*sect8t3+ u; IF (HOUSING=0) THEN p = 1 / (1 + EXP(-(g1-z1))); ELSE IF (HOUSING=1) THEN p = (1/(1 + EXP(-(g2-z2)))) - (1/(1 + EXP(-(g1-z1)))); ELSE IF (HOUSING=2) THEN p = 1 - (1 / (1 + EXP(-(g2-z2)))); LogLike = LOG(p); MODEL Housing ~ GENERAL(LogLike); RANDOM u ~ NORMAL(0,varu) SUBJECT=Id; ESTIMATE icc varu/((((atan(1)*4)**2)/3) + varu); RUN; 69

70 Mixed Multinomial Logistic Regression Model Y ij = nominal response of level-2 unit i and level-1 unit j Which member of The Polkaholics is your favorite? (asked before, during, and after a show) 70

71 Mixed-effects Multinomial Logistic Regression Model log p ijc p ij1 = u ijγ c + z ijυ ic C 1 contrasts to reference cell (c = 1) regression effects γ c vary across contrasts random-effects υ ic vary across contrasts independent correlated For example, with C = 3 c = 2, 3,... C contrast ordinal nominal c1 2 & 3 vs 1 2 vs 1 c2 3 vs 1 & 2 3 vs 1 71

72 Model in terms of the category probabilities p ijc = Pr(Y ij = c υ ic ) = exp(z ijc ) 1 + C h=2 exp(z ijh ) for c = 2, 3,..., C p ij1 = Pr(Y ij = 1 υ ic ) = C h=2 exp(z ijh ) where the multinomial logit z ijc = u ijγ c + z ijυ ic 72

73 Stata code: sdhouse nominal logits.do cd "u:\stata_long\" log using sd_nominal_logits.log, replace infile id Housing Inter Section8 Time1 Time2 Time3 Sect8T1 Sect8T2 Sect8T3 /// NonSect8 LinTime S8LinT using sdhouse.dat.txt, clear recode Housing (999=.) gen time6mo = LinTime replace time6mo = 4 if (time6mo ==3) tabulate Housing, generate(housingi) collapse (mean) p1=housingi1 p2=housingi2 p3=housingi3, by(section8 time6mo) list gen odds2 = p2 / p1 gen odds3 = p3 / p1 list Section8 time6mo odds2 odds3 gen logit2 = log(odds2) gen logit3 = log(odds3) list Section8 time6mo logit2 logit3 73

74 * graph cumulative logits graph twoway (connected logit2 time6mo if Section8==0) /// (connected logit2 time6mo if Section8==1), /// legend(label(1 "Non Section8") label(2 "Section8")) /// yscale(range(-2 2)) xlabel(0(.5)4) graph twoway (connected logit3 time6mo if Section8==0) /// (connected logit3 time6mo if Section8==1), /// legend(label(1 "Non Section8") label(2 "Section8")) /// yscale(range(-2 2)) log close 74

75 logit2: community vs street logit time6mo Non Section8 Section8 logit3: independent vs street logit time6mo Non Section8 Section8 75

76 Under File click on Open Existing Model Setup Open C:\SuperMixEn Examples\Workshop\Nominal\sdhouse.mum (or C:\SuperMixEn Student Examples\Workshop\Nominal\sdhouse.mum) 76

77 Note that Dependent Variable Type is nominal 77

78 78

79 Can select first or last category as the reference cell 79

80 Try independent random effects first 80

81 81

82 82

83 Now allow the random effects to be correlated 83

84 LR test comparing models: χ 2 1 = =

85 85

86 STATA code: sdn.do log using u:\stata_examples\sdn.log, replace infile id housing inter section8 time1 time2 time3 /// sect8t1 sect8t2 sect8t3 nonsect8 lintime s8lint /// using clear recode housing (999=.) summarize local myx section8 time1 time2 time3 sect8t1 sect8t2 sect8t3 gsem (ib(first).housing <- myx mlogit gsem (1.housing <- myx M1[id]) (2.housing <- myx M2[id]), /// cov(m1[id]*m2[id]) mlogit log close 86

87 SAS code: sdn.sas FILENAME SDDat HTTP " DATA one; INFILE SDDat; INPUT Id Housing Int Section8 Time1 Time2 Time3 Sect8T1 Sect8T2 Sect8T3 NonSect8 LinTime S8LinT ; IF Housing = 999 then Housing =.; /* Nominal random intercept logistic via GLIMMIX - independent random effects */ PROC GLIMMIX DATA=one METHOD=QUAD(QPOINTS=11) NOCLPRINT; CLASS Id Housing; MODEL Housing(REF=FIRST) = Time1 Time2 Time3 Section8 Sect8T1 Sect8T2 Sect8T3 / SOLUTION DIST=MULTINOMIAL LINK=GLOGIT; RANDOM INTERCEPT / SUBJECT=Id GROUP=Housing; RUN; GLIMMIX assumes the random effects for the logits are independent; can use NLMIXED to allow for correlated random effects 87

88 /* Nominal random intercept logistic via NLMIXED - correlated random effects */ PROC NLMIXED DATA=one QPOINTS=11; PARMS b1_0=-.49 b1_t1=1.63 b1_t2=2.37 b1_t3=1.8 b1_sect8=.43 b1_sect8t1=-.46 b1_sect8t2=-2.12 b1_sect8t3=-1.1 var1=1 b2_0=-1.7 b2_t1=1.90 b2_t2=2.97 b2_t3=2.9 b2_sect8=.54 b2_sect8t1=1.13 b2_sect8t2=-.04 b2_sect8t3=-.07 var2=1 c12=0; z1 = b1_0 + b1_t1*time1 + b1_t2*time2 + b1_t3*time3 + b1_sect8*section8 + b1_sect8t1*sect8t1 + b1_sect8t2*sect8t2 + b1_sect8t3*sect8t3 + u1; z2 = b2_0 + b2_t1*time1 + b2_t2*time2 + b2_t3*time3 + b2_sect8*section8 + b2_sect8t1*sect8t1 + b2_sect8t2*sect8t2 + b2_sect8t3*sect8t3 + u2; IF (Housing=0) THEN p = 1 / (1 + EXP(z1) + EXP(z2)); ELSE IF (Housing=1) THEN p = EXP(z1) / (1 + EXP(z1) + EXP(z2)); ELSE IF (Housing=2) THEN p = EXP(z2) / (1 + EXP(z1) + EXP(z2)); LogLike = LOG(p); MODEL Housing ~ GENERAL(LogLike); RANDOM u1 u2 ~ NORMAL([0,0],[var1,c12,var2]) SUBJECT=Id; ESTIMATE ICC1 var1 /((((ATAN(1)*4)**2)/3) + var1); ESTIMATE ICC2 var2 /((((ATAN(1)*4)**2)/3) + var2); ESTIMATE re corr c12/sqrt(var1*var2); RUN; 88

89 Summary Models for longitudinal ordinal and nominal data as developed as models for continuous and dichotomous data Proportional odds models Non and partial proportional odds models Nominal models (with reference-cell contrasts) Grouped-time survival analysis models clust.pdf SuperMix can do it all, including 3-level models, also for counts full likelihood solution using adaptive quadrature 89

Mixed Models for Longitudinal Ordinal and Nominal Outcomes

Mixed Models for Longitudinal Ordinal and Nominal Outcomes Mixed Models for Longitudinal Ordinal and Nominal Outcomes Don Hedeker Department of Public Health Sciences Biological Sciences Division University of Chicago hedeker@uchicago.edu Hedeker, D. (2008). Multilevel

More information

Mixed Models for Longitudinal Binary Outcomes. Don Hedeker Department of Public Health Sciences University of Chicago.

Mixed Models for Longitudinal Binary Outcomes. Don Hedeker Department of Public Health Sciences University of Chicago. Mixed Models for Longitudinal Binary Outcomes Don Hedeker Department of Public Health Sciences University of Chicago hedeker@uchicago.edu https://hedeker-sites.uchicago.edu/ Hedeker, D. (2005). Generalized

More information

Mixed Models for Longitudinal Ordinal and Nominal Data

Mixed Models for Longitudinal Ordinal and Nominal Data Mixed Models for Longitudinal Ordinal and Nominal Data Hedeker, D. (2008). Multilevel models for ordinal and nominal variables. In J. de Leeuw & E. Meijer (Eds.), Handbook of Multilevel Analysis. Springer,

More information

Why analyze as ordinal? Mixed Models for Longitudinal Ordinal Data Don Hedeker University of Illinois at Chicago

Why analyze as ordinal? Mixed Models for Longitudinal Ordinal Data Don Hedeker University of Illinois at Chicago Why analyze as ordinal? Mixed Models for Longitudinal Ordinal Data Don Hedeker University of Illinois at Chicago hedeker@uic.edu www.uic.edu/ hedeker/long.html Efficiency: Armstrong & Sloan (1989, Amer

More information

More Mixed-Effects Models for Ordinal & Nominal Data. Don Hedeker University of Illinois at Chicago

More Mixed-Effects Models for Ordinal & Nominal Data. Don Hedeker University of Illinois at Chicago More Mixed-Effects Models for Ordinal & Nominal Data Don Hedeker University of Illinois at Chicago This work was supported by National Institute of Mental Health Contract N44MH32056. 1 Proportional and

More information

Missing Data in Longitudinal Studies: Mixed-effects Pattern-Mixture and Selection Models

Missing Data in Longitudinal Studies: Mixed-effects Pattern-Mixture and Selection Models Missing Data in Longitudinal Studies: Mixed-effects Pattern-Mixture and Selection Models Hedeker D & Gibbons RD (1997). Application of random-effects pattern-mixture models for missing data in longitudinal

More information

Mixed-Effects Pattern-Mixture Models for Incomplete Longitudinal Data. Don Hedeker University of Illinois at Chicago

Mixed-Effects Pattern-Mixture Models for Incomplete Longitudinal Data. Don Hedeker University of Illinois at Chicago Mixed-Effects Pattern-Mixture Models for Incomplete Longitudinal Data Don Hedeker University of Illinois at Chicago This work was supported by National Institute of Mental Health Contract N44MH32056. 1

More information

Multilevel Modeling of Non-Normal Data. Don Hedeker Department of Public Health Sciences University of Chicago.

Multilevel Modeling of Non-Normal Data. Don Hedeker Department of Public Health Sciences University of Chicago. Multilevel Modeling of Non-Normal Data Don Hedeker Department of Public Health Sciences University of Chicago email: hedeker@uchicago.edu https://hedeker-sites.uchicago.edu/ Hedeker, D. (2005). Generalized

More information

Example 7b: Generalized Models for Ordinal Longitudinal Data using SAS GLIMMIX, STATA MEOLOGIT, and MPLUS (last proportional odds model only)

Example 7b: Generalized Models for Ordinal Longitudinal Data using SAS GLIMMIX, STATA MEOLOGIT, and MPLUS (last proportional odds model only) CLDP945 Example 7b page 1 Example 7b: Generalized Models for Ordinal Longitudinal Data using SAS GLIMMIX, STATA MEOLOGIT, and MPLUS (last proportional odds model only) This example comes from real data

More information

STAT 705 Generalized linear mixed models

STAT 705 Generalized linear mixed models STAT 705 Generalized linear mixed models Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Data Analysis II 1 / 24 Generalized Linear Mixed Models We have considered random

More information

Advantages of Mixed-effects Regression Models (MRM; aka multilevel, hierarchical linear, linear mixed models) 1. MRM explicitly models individual

Advantages of Mixed-effects Regression Models (MRM; aka multilevel, hierarchical linear, linear mixed models) 1. MRM explicitly models individual Advantages of Mixed-effects Regression Models (MRM; aka multilevel, hierarchical linear, linear mixed models) 1. MRM explicitly models individual change across time 2. MRM more flexible in terms of repeated

More information

Application of Item Response Theory Models for Intensive Longitudinal Data

Application of Item Response Theory Models for Intensive Longitudinal Data Application of Item Response Theory Models for Intensive Longitudinal Data Don Hedeker, Robin Mermelstein, & Brian Flay University of Illinois at Chicago hedeker@uic.edu Models for Intensive Longitudinal

More information

Contrasting Marginal and Mixed Effects Models Recall: two approaches to handling dependence in Generalized Linear Models:

Contrasting Marginal and Mixed Effects Models Recall: two approaches to handling dependence in Generalized Linear Models: Contrasting Marginal and Mixed Effects Models Recall: two approaches to handling dependence in Generalized Linear Models: Marginal models: based on the consequences of dependence on estimating model parameters.

More information

Generalized Linear Models for Non-Normal Data

Generalized Linear Models for Non-Normal Data Generalized Linear Models for Non-Normal Data Today s Class: 3 parts of a generalized model Models for binary outcomes Complications for generalized multivariate or multilevel models SPLH 861: Lecture

More information

Hierarchical Generalized Linear Models. ERSH 8990 REMS Seminar on HLM Last Lecture!

Hierarchical Generalized Linear Models. ERSH 8990 REMS Seminar on HLM Last Lecture! Hierarchical Generalized Linear Models ERSH 8990 REMS Seminar on HLM Last Lecture! Hierarchical Generalized Linear Models Introduction to generalized models Models for binary outcomes Interpreting parameter

More information

GEE for Longitudinal Data - Chapter 8

GEE for Longitudinal Data - Chapter 8 GEE for Longitudinal Data - Chapter 8 GEE: generalized estimating equations (Liang & Zeger, 1986; Zeger & Liang, 1986) extension of GLM to longitudinal data analysis using quasi-likelihood estimation method

More information

Homework Solutions Applied Logistic Regression

Homework Solutions Applied Logistic Regression Homework Solutions Applied Logistic Regression WEEK 6 Exercise 1 From the ICU data, use as the outcome variable vital status (STA) and CPR prior to ICU admission (CPR) as a covariate. (a) Demonstrate that

More information

Lecture 3.1 Basic Logistic LDA

Lecture 3.1 Basic Logistic LDA y Lecture.1 Basic Logistic LDA 0.2.4.6.8 1 Outline Quick Refresher on Ordinary Logistic Regression and Stata Women s employment example Cross-Over Trial LDA Example -100-50 0 50 100 -- Longitudinal Data

More information

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis Review Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 22 Chapter 1: background Nominal, ordinal, interval data. Distributions: Poisson, binomial,

More information

SAS Code for Data Manipulation: SPSS Code for Data Manipulation: STATA Code for Data Manipulation: Psyc 945 Example 1 page 1

SAS Code for Data Manipulation: SPSS Code for Data Manipulation: STATA Code for Data Manipulation: Psyc 945 Example 1 page 1 Psyc 945 Example page Example : Unconditional Models for Change in Number Match 3 Response Time (complete data, syntax, and output available for SAS, SPSS, and STATA electronically) These data come from

More information

Longitudinal Modeling with Logistic Regression

Longitudinal Modeling with Logistic Regression Newsom 1 Longitudinal Modeling with Logistic Regression Longitudinal designs involve repeated measurements of the same individuals over time There are two general classes of analyses that correspond to

More information

Correspondence Analysis of Longitudinal Data

Correspondence Analysis of Longitudinal Data Correspondence Analysis of Longitudinal Data Mark de Rooij* LEIDEN UNIVERSITY, LEIDEN, NETHERLANDS Peter van der G. M. Heijden UTRECHT UNIVERSITY, UTRECHT, NETHERLANDS *Corresponding author (rooijm@fsw.leidenuniv.nl)

More information

Generalized Models: Part 1

Generalized Models: Part 1 Generalized Models: Part 1 Topics: Introduction to generalized models Introduction to maximum likelihood estimation Models for binary outcomes Models for proportion outcomes Models for categorical outcomes

More information

Logistic Regression for Ordinal Responses

Logistic Regression for Ordinal Responses Logistic Regression for Ordinal Responses Edps/Psych/Soc 589 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Fall 2018 Outline Common models for ordinal

More information

SAS Syntax and Output for Data Manipulation: CLDP 944 Example 3a page 1

SAS Syntax and Output for Data Manipulation: CLDP 944 Example 3a page 1 CLDP 944 Example 3a page 1 From Between-Person to Within-Person Models for Longitudinal Data The models for this example come from Hoffman (2015) chapter 3 example 3a. We will be examining the extent to

More information

Models for Ordinal Response Data

Models for Ordinal Response Data Models for Ordinal Response Data Robin High Department of Biostatistics Center for Public Health University of Nebraska Medical Center Omaha, Nebraska Recommendations Analyze numerical data with a statistical

More information

Fitting mixed-effects models for repeated ordinal outcomes with the NLMIXED procedure

Fitting mixed-effects models for repeated ordinal outcomes with the NLMIXED procedure Behavior Research Methods, Instruments, & Computers 00, 34 (), 151-157 ARTICLES FROM THE SCIP CONFERENCE Fitting mixed-effects models for repeated ordinal outcomes with the NLMIXED procedure CHING-FAN

More information

over Time line for the means). Specifically, & covariances) just a fixed variance instead. PROC MIXED: to 1000 is default) list models with TYPE=VC */

over Time line for the means). Specifically, & covariances) just a fixed variance instead. PROC MIXED: to 1000 is default) list models with TYPE=VC */ CLP 944 Example 4 page 1 Within-Personn Fluctuation in Symptom Severity over Time These data come from a study of weekly fluctuation in psoriasis severity. There was no intervention and no real reason

More information

SAS Syntax and Output for Data Manipulation:

SAS Syntax and Output for Data Manipulation: CLP 944 Example 5 page 1 Practice with Fixed and Random Effects of Time in Modeling Within-Person Change The models for this example come from Hoffman (2015) chapter 5. We will be examining the extent

More information

Chapter 1. Modeling Basics

Chapter 1. Modeling Basics Chapter 1. Modeling Basics What is a model? Model equation and probability distribution Types of model effects Writing models in matrix form Summary 1 What is a statistical model? A model is a mathematical

More information

Introduction to Generalized Models

Introduction to Generalized Models Introduction to Generalized Models Today s topics: The big picture of generalized models Review of maximum likelihood estimation Models for binary outcomes Models for proportion outcomes Models for categorical

More information

ST3241 Categorical Data Analysis I Multicategory Logit Models. Logit Models For Nominal Responses

ST3241 Categorical Data Analysis I Multicategory Logit Models. Logit Models For Nominal Responses ST3241 Categorical Data Analysis I Multicategory Logit Models Logit Models For Nominal Responses 1 Models For Nominal Responses Y is nominal with J categories. Let {π 1,, π J } denote the response probabilities

More information

Lab 11. Multilevel Models. Description of Data

Lab 11. Multilevel Models. Description of Data Lab 11 Multilevel Models Henian Chen, M.D., Ph.D. Description of Data MULTILEVEL.TXT is clustered data for 386 women distributed across 40 groups. ID: 386 women, id from 1 to 386, individual level (level

More information

Models for binary data

Models for binary data Faculty of Health Sciences Models for binary data Analysis of repeated measurements 2015 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen 1 / 63 Program for

More information

Investigating Models with Two or Three Categories

Investigating Models with Two or Three Categories Ronald H. Heck and Lynn N. Tabata 1 Investigating Models with Two or Three Categories For the past few weeks we have been working with discriminant analysis. Let s now see what the same sort of model might

More information

2. We care about proportion for categorical variable, but average for numerical one.

2. We care about proportion for categorical variable, but average for numerical one. Probit Model 1. We apply Probit model to Bank data. The dependent variable is deny, a dummy variable equaling one if a mortgage application is denied, and equaling zero if accepted. The key regressor is

More information

8 Nominal and Ordinal Logistic Regression

8 Nominal and Ordinal Logistic Regression 8 Nominal and Ordinal Logistic Regression 8.1 Introduction If the response variable is categorical, with more then two categories, then there are two options for generalized linear models. One relies on

More information

You can specify the response in the form of a single variable or in the form of a ratio of two variables denoted events/trials.

You can specify the response in the form of a single variable or in the form of a ratio of two variables denoted events/trials. The GENMOD Procedure MODEL Statement MODEL response = < effects > < /options > ; MODEL events/trials = < effects > < /options > ; You can specify the response in the form of a single variable or in the

More information

Generalized Linear Modeling - Logistic Regression

Generalized Linear Modeling - Logistic Regression 1 Generalized Linear Modeling - Logistic Regression Binary outcomes The logit and inverse logit interpreting coefficients and odds ratios Maximum likelihood estimation Problem of separation Evaluating

More information

Interpreting and using heterogeneous choice & generalized ordered logit models

Interpreting and using heterogeneous choice & generalized ordered logit models Interpreting and using heterogeneous choice & generalized ordered logit models Richard Williams Department of Sociology University of Notre Dame July 2006 http://www.nd.edu/~rwilliam/ The gologit/gologit2

More information

SAS PROC NLMIXED Mike Patefield The University of Reading 12 May

SAS PROC NLMIXED Mike Patefield The University of Reading 12 May SAS PROC NLMIXED Mike Patefield The University of Reading 1 May 004 E-mail: w.m.patefield@reading.ac.uk non-linear mixed models maximum likelihood repeated measurements on each subject (i) response vector

More information

The Multilevel Logit Model for Binary Dependent Variables Marco R. Steenbergen

The Multilevel Logit Model for Binary Dependent Variables Marco R. Steenbergen The Multilevel Logit Model for Binary Dependent Variables Marco R. Steenbergen January 23-24, 2012 Page 1 Part I The Single Level Logit Model: A Review Motivating Example Imagine we are interested in voting

More information

Stat 642, Lecture notes for 04/12/05 96

Stat 642, Lecture notes for 04/12/05 96 Stat 642, Lecture notes for 04/12/05 96 Hosmer-Lemeshow Statistic The Hosmer-Lemeshow Statistic is another measure of lack of fit. Hosmer and Lemeshow recommend partitioning the observations into 10 equal

More information

Spring RMC Professional Development Series January 14, Generalized Linear Mixed Models (GLMMs): Concepts and some Demonstrations

Spring RMC Professional Development Series January 14, Generalized Linear Mixed Models (GLMMs): Concepts and some Demonstrations Spring RMC Professional Development Series January 14, 2016 Generalized Linear Mixed Models (GLMMs): Concepts and some Demonstrations Ann A. O Connell, Ed.D. Professor, Educational Studies (QREM) Director,

More information

Random Coefficient Model (a.k.a. multilevel model) (Adapted from UCLA Statistical Computing Seminars)

Random Coefficient Model (a.k.a. multilevel model) (Adapted from UCLA Statistical Computing Seminars) STAT:5201 Applied Statistic II Random Coefficient Model (a.k.a. multilevel model) (Adapted from UCLA Statistical Computing Seminars) School math achievement scores The data file consists of 7185 students

More information

Correlated data. Non-normal outcomes. Reminder on binary data. Non-normal data. Faculty of Health Sciences. Non-normal outcomes

Correlated data. Non-normal outcomes. Reminder on binary data. Non-normal data. Faculty of Health Sciences. Non-normal outcomes Faculty of Health Sciences Non-normal outcomes Correlated data Non-normal outcomes Lene Theil Skovgaard December 5, 2014 Generalized linear models Generalized linear mixed models Population average models

More information

Sample Size Estimation for Longitudinal Studies. Don Hedeker University of Illinois at Chicago hedeker

Sample Size Estimation for Longitudinal Studies. Don Hedeker University of Illinois at Chicago  hedeker Sample Size Estimation for Longitudinal Studies Don Hedeker University of Illinois at Chicago www.uic.edu/ hedeker Hedeker, Gibbons, & Waternaux (1999). Sample size estimation for longitudinal designs

More information

Package threg. August 10, 2015

Package threg. August 10, 2015 Package threg August 10, 2015 Title Threshold Regression Version 1.0.3 Date 2015-08-10 Author Tao Xiao Maintainer Tao Xiao Depends R (>= 2.10), survival, Formula Fit a threshold regression

More information

Comparing IRT with Other Models

Comparing IRT with Other Models Comparing IRT with Other Models Lecture #14 ICPSR Item Response Theory Workshop Lecture #14: 1of 45 Lecture Overview The final set of slides will describe a parallel between IRT and another commonly used

More information

Introduction to mtm: An R Package for Marginalized Transition Models

Introduction to mtm: An R Package for Marginalized Transition Models Introduction to mtm: An R Package for Marginalized Transition Models Bryan A. Comstock and Patrick J. Heagerty Department of Biostatistics University of Washington 1 Introduction Marginalized transition

More information

Data-analysis and Retrieval Ordinal Classification

Data-analysis and Retrieval Ordinal Classification Data-analysis and Retrieval Ordinal Classification Ad Feelders Universiteit Utrecht Data-analysis and Retrieval 1 / 30 Strongly disagree Ordinal Classification 1 2 3 4 5 0% (0) 10.5% (2) 21.1% (4) 42.1%

More information

Model Assumptions; Predicting Heterogeneity of Variance

Model Assumptions; Predicting Heterogeneity of Variance Model Assumptions; Predicting Heterogeneity of Variance Today s topics: Model assumptions Normality Constant variance Predicting heterogeneity of variance CLP 945: Lecture 6 1 Checking for Violations of

More information

Consider Table 1 (Note connection to start-stop process).

Consider Table 1 (Note connection to start-stop process). Discrete-Time Data and Models Discretized duration data are still duration data! Consider Table 1 (Note connection to start-stop process). Table 1: Example of Discrete-Time Event History Data Case Event

More information

Lecture 10: Introduction to Logistic Regression

Lecture 10: Introduction to Logistic Regression Lecture 10: Introduction to Logistic Regression Ani Manichaikul amanicha@jhsph.edu 2 May 2007 Logistic Regression Regression for a response variable that follows a binomial distribution Recall the binomial

More information

ML estimation: Random-intercepts logistic model. and z

ML estimation: Random-intercepts logistic model. and z ML estimation: Random-intercepts logistic model log p ij 1 p = x ijβ + υ i with υ i N(0, συ) 2 ij Standardizing the random effect, θ i = υ i /σ υ, yields log p ij 1 p = x ij β + σ υθ i with θ i N(0, 1)

More information

Generalized Multilevel Models for Non-Normal Outcomes

Generalized Multilevel Models for Non-Normal Outcomes Generalized Multilevel Models for Non-Normal Outcomes Topics: 3 parts of a generalized (multilevel) model Models for binary, proportion, and categorical outcomes Complications for generalized multilevel

More information

Recent Developments in Multilevel Modeling

Recent Developments in Multilevel Modeling Recent Developments in Multilevel Modeling Roberto G. Gutierrez Director of Statistics StataCorp LP 2007 North American Stata Users Group Meeting, Boston R. Gutierrez (StataCorp) Multilevel Modeling August

More information

Models for Binary Outcomes

Models for Binary Outcomes Models for Binary Outcomes Introduction The simple or binary response (for example, success or failure) analysis models the relationship between a binary response variable and one or more explanatory variables.

More information

An Application of a Mixed-Effects Location Scale Model for Analysis of Ecological Momentary Assessment (EMA) Data

An Application of a Mixed-Effects Location Scale Model for Analysis of Ecological Momentary Assessment (EMA) Data An Application of a Mixed-Effects Location Scale Model for Analysis of Ecological Momentary Assessment (EMA) Data Don Hedeker, Robin Mermelstein, & Hakan Demirtas University of Illinois at Chicago hedeker@uic.edu

More information

Simple logistic regression

Simple logistic regression Simple logistic regression Biometry 755 Spring 2009 Simple logistic regression p. 1/47 Model assumptions 1. The observed data are independent realizations of a binary response variable Y that follows a

More information

Lecture 12: Effect modification, and confounding in logistic regression

Lecture 12: Effect modification, and confounding in logistic regression Lecture 12: Effect modification, and confounding in logistic regression Ani Manichaikul amanicha@jhsph.edu 4 May 2007 Today Categorical predictor create dummy variables just like for linear regression

More information

7/28/15. Review Homework. Overview. Lecture 6: Logistic Regression Analysis

7/28/15. Review Homework. Overview. Lecture 6: Logistic Regression Analysis Lecture 6: Logistic Regression Analysis Christopher S. Hollenbeak, PhD Jane R. Schubart, PhD The Outcomes Research Toolbox Review Homework 2 Overview Logistic regression model conceptually Logistic regression

More information

POLI 7050 Spring 2008 March 5, 2008 Unordered Response Models II

POLI 7050 Spring 2008 March 5, 2008 Unordered Response Models II POLI 7050 Spring 2008 March 5, 2008 Unordered Response Models II Introduction Today we ll talk about interpreting MNL and CL models. We ll start with general issues of model fit, and then get to variable

More information

Turning a research question into a statistical question.

Turning a research question into a statistical question. Turning a research question into a statistical question. IGINAL QUESTION: Concept Concept Concept ABOUT ONE CONCEPT ABOUT RELATIONSHIPS BETWEEN CONCEPTS TYPE OF QUESTION: DESCRIBE what s going on? DECIDE

More information

Review of Panel Data Model Types Next Steps. Panel GLMs. Department of Political Science and Government Aarhus University.

Review of Panel Data Model Types Next Steps. Panel GLMs. Department of Political Science and Government Aarhus University. Panel GLMs Department of Political Science and Government Aarhus University May 12, 2015 1 Review of Panel Data 2 Model Types 3 Review and Looking Forward 1 Review of Panel Data 2 Model Types 3 Review

More information

De-mystifying random effects models

De-mystifying random effects models De-mystifying random effects models Peter J Diggle Lecture 4, Leahurst, October 2012 Linear regression input variable x factor, covariate, explanatory variable,... output variable y response, end-point,

More information

SAS Code: Joint Models for Continuous and Discrete Longitudinal Data

SAS Code: Joint Models for Continuous and Discrete Longitudinal Data CHAPTER 14 SAS Code: Joint Models for Continuous and Discrete Longitudinal Data We show how models of a mixed type can be analyzed using standard statistical software. We mainly focus on the SAS procedures

More information

Ronald Heck Week 14 1 EDEP 768E: Seminar in Categorical Data Modeling (F2012) Nov. 17, 2012

Ronald Heck Week 14 1 EDEP 768E: Seminar in Categorical Data Modeling (F2012) Nov. 17, 2012 Ronald Heck Week 14 1 From Single Level to Multilevel Categorical Models This week we develop a two-level model to examine the event probability for an ordinal response variable with three categories (persist

More information

Goodness-of-Fit Tests for the Ordinal Response Models with Misspecified Links

Goodness-of-Fit Tests for the Ordinal Response Models with Misspecified Links Communications of the Korean Statistical Society 2009, Vol 16, No 4, 697 705 Goodness-of-Fit Tests for the Ordinal Response Models with Misspecified Links Kwang Mo Jeong a, Hyun Yung Lee 1, a a Department

More information

Stat 587: Key points and formulae Week 15

Stat 587: Key points and formulae Week 15 Odds ratios to compare two proportions: Difference, p 1 p 2, has issues when applied to many populations Vit. C: P[cold Placebo] = 0.82, P[cold Vit. C] = 0.74, Estimated diff. is 8% What if a year or place

More information

Introduction to Random Effects of Time and Model Estimation

Introduction to Random Effects of Time and Model Estimation Introduction to Random Effects of Time and Model Estimation Today s Class: The Big Picture Multilevel model notation Fixed vs. random effects of time Random intercept vs. random slope models How MLM =

More information

McGill University. Faculty of Science. Department of Mathematics and Statistics. Statistics Part A Comprehensive Exam Methodology Paper

McGill University. Faculty of Science. Department of Mathematics and Statistics. Statistics Part A Comprehensive Exam Methodology Paper Student Name: ID: McGill University Faculty of Science Department of Mathematics and Statistics Statistics Part A Comprehensive Exam Methodology Paper Date: Friday, May 13, 2016 Time: 13:00 17:00 Instructions

More information

Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, and Discrete Changes 1

Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, and Discrete Changes 1 Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, Discrete Changes 1 JunXuJ.ScottLong Indiana University 2005-02-03 1 General Formula The delta method is a general

More information

An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012

An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 An Introduction to Multilevel Models PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 Today s Class Concepts in Longitudinal Modeling Between-Person vs. +Within-Person

More information

Review of Multinomial Distribution If n trials are performed: in each trial there are J > 2 possible outcomes (categories) Multicategory Logit Models

Review of Multinomial Distribution If n trials are performed: in each trial there are J > 2 possible outcomes (categories) Multicategory Logit Models Chapter 6 Multicategory Logit Models Response Y has J > 2 categories. Extensions of logistic regression for nominal and ordinal Y assume a multinomial distribution for Y. 6.1 Logit Models for Nominal Responses

More information

Linear Modelling in Stata Session 6: Further Topics in Linear Modelling

Linear Modelling in Stata Session 6: Further Topics in Linear Modelling Linear Modelling in Stata Session 6: Further Topics in Linear Modelling Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester 14/11/2017 This Week Categorical Variables Categorical

More information

Binomial Model. Lecture 10: Introduction to Logistic Regression. Logistic Regression. Binomial Distribution. n independent trials

Binomial Model. Lecture 10: Introduction to Logistic Regression. Logistic Regression. Binomial Distribution. n independent trials Lecture : Introduction to Logistic Regression Ani Manichaikul amanicha@jhsph.edu 2 May 27 Binomial Model n independent trials (e.g., coin tosses) p = probability of success on each trial (e.g., p =! =

More information

Introduction (Alex Dmitrienko, Lilly) Web-based training program

Introduction (Alex Dmitrienko, Lilly) Web-based training program Web-based training Introduction (Alex Dmitrienko, Lilly) Web-based training program http://www.amstat.org/sections/sbiop/webinarseries.html Four-part web-based training series Geert Verbeke (Katholieke

More information

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Ninth ARTNeT Capacity Building Workshop for Trade Research Trade Flows and Trade Policy Analysis Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric

More information

Multi-level Models: Idea

Multi-level Models: Idea Review of 140.656 Review Introduction to multi-level models The two-stage normal-normal model Two-stage linear models with random effects Three-stage linear models Two-stage logistic regression with random

More information

Review of CLDP 944: Multilevel Models for Longitudinal Data

Review of CLDP 944: Multilevel Models for Longitudinal Data Review of CLDP 944: Multilevel Models for Longitudinal Data Topics: Review of general MLM concepts and terminology Model comparisons and significance testing Fixed and random effects of time Significance

More information

SuperMix2 features not available in HLM 7 Contents

SuperMix2 features not available in HLM 7 Contents SuperMix2 features not available in HLM 7 Contents Spreadsheet display of.ss3 files... 2 Continuous Outcome Variables: Additional Distributions... 3 Additional Estimation Methods... 5 Count variables including

More information

Introduction to Within-Person Analysis and RM ANOVA

Introduction to Within-Person Analysis and RM ANOVA Introduction to Within-Person Analysis and RM ANOVA Today s Class: From between-person to within-person ANOVAs for longitudinal data Variance model comparisons using 2 LL CLP 944: Lecture 3 1 The Two Sides

More information

Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011)

Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011) Ron Heck, Fall 2011 1 EDEP 768E: Seminar in Multilevel Modeling rev. January 3, 2012 (see footnote) Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October

More information

(z) normal >< ª(z) logistic 1=(1 + exp( z)) P(v ij c) = (z) clog-log 1 exp( exp(z)) 1 (z) log-log exp( exp(z))

(z) normal >< ª(z) logistic 1=(1 + exp( z)) P(v ij c) = (z) clog-log 1 exp( exp(z)) 1 (z) log-log exp( exp(z)) MIXOR2: MIXOR version 2 ² i = 1:::N level-2 units ² j = 1:::n i level-1 units for level-2 unit i ² c = 1:::C categories The response model for MIXOR2 can be written as: z ijc = c h u 0 ij ³ c +w 0 ij +x0

More information

EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7

EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7 Introduction to Generalized Univariate Models: Models for Binary Outcomes EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7 EPSY 905: Intro to Generalized In This Lecture A short review

More information

ECON 594: Lecture #6

ECON 594: Lecture #6 ECON 594: Lecture #6 Thomas Lemieux Vancouver School of Economics, UBC May 2018 1 Limited dependent variables: introduction Up to now, we have been implicitly assuming that the dependent variable, y, was

More information

STAT 7030: Categorical Data Analysis

STAT 7030: Categorical Data Analysis STAT 7030: Categorical Data Analysis 5. Logistic Regression Peng Zeng Department of Mathematics and Statistics Auburn University Fall 2012 Peng Zeng (Auburn University) STAT 7030 Lecture Notes Fall 2012

More information

Fixed effects results...32

Fixed effects results...32 1 MODELS FOR CONTINUOUS OUTCOMES...7 1.1 MODELS BASED ON A SUBSET OF THE NESARC DATA...7 1.1.1 The data...7 1.1.1.1 Importing the data and defining variable types...8 1.1.1.2 Exploring the data...12 Univariate

More information

Regression Methods for Survey Data

Regression Methods for Survey Data Regression Methods for Survey Data Professor Ron Fricker! Naval Postgraduate School! Monterey, California! 3/26/13 Reading:! Lohr chapter 11! 1 Goals for this Lecture! Linear regression! Review of linear

More information

Categorical and Zero Inflated Growth Models

Categorical and Zero Inflated Growth Models Categorical and Zero Inflated Growth Models Alan C. Acock* Summer, 2009 *Alan C. Acock, Department of Human Development and Family Sciences, Oregon State University, Corvallis OR 97331 (alan.acock@oregonstate.edu).

More information

mixor: An R Package for Longitudinal and Clustered Ordinal Response Modeling

mixor: An R Package for Longitudinal and Clustered Ordinal Response Modeling mixor: An R Package for Longitudinal and Clustered Ordinal Response Modeling Kellie J. Archer Donald Hedeker Rachel Nordgren Robert D. Gibbons Virginia Commonwealth University University of Illinois Chicago

More information

Logistic Regression. Continued Psy 524 Ainsworth

Logistic Regression. Continued Psy 524 Ainsworth Logistic Regression Continued Psy 524 Ainsworth Equations Regression Equation Y e = 1 + A+ B X + B X + B X 1 1 2 2 3 3 i A+ B X + B X + B X e 1 1 2 2 3 3 Equations The linear part of the logistic regression

More information

RESEARCH ARTICLE. Likelihood-based Approach for Analysis of Longitudinal Nominal Data using Marginalized Random Effects Models

RESEARCH ARTICLE. Likelihood-based Approach for Analysis of Longitudinal Nominal Data using Marginalized Random Effects Models Journal of Applied Statistics Vol. 00, No. 00, August 2009, 1 14 RESEARCH ARTICLE Likelihood-based Approach for Analysis of Longitudinal Nominal Data using Marginalized Random Effects Models Keunbaik Lee

More information

Longitudinal Data Analysis. Michael L. Berbaum Institute for Health Research and Policy University of Illinois at Chicago

Longitudinal Data Analysis. Michael L. Berbaum Institute for Health Research and Policy University of Illinois at Chicago Longitudinal Data Analysis Michael L. Berbaum Institute for Health Research and Policy University of Illinois at Chicago Course description: Longitudinal analysis is the study of short series of observations

More information

5/3/2012 Moderator Effects with SAS 1

5/3/2012 Moderator Effects with SAS 1 5/3/2012 Moderator Effects with SAS 1 Maximum Likelihood Equal-Variances Ratio Test and Estimation of Moderator Effects On Correlation with SAS Michael Smithson Department of Psychology, The Australian

More information

Introducing Generalized Linear Models: Logistic Regression

Introducing Generalized Linear Models: Logistic Regression Ron Heck, Summer 2012 Seminars 1 Multilevel Regression Models and Their Applications Seminar Introducing Generalized Linear Models: Logistic Regression The generalized linear model (GLM) represents and

More information

Repeated ordinal measurements: a generalised estimating equation approach

Repeated ordinal measurements: a generalised estimating equation approach Repeated ordinal measurements: a generalised estimating equation approach David Clayton MRC Biostatistics Unit 5, Shaftesbury Road Cambridge CB2 2BW April 7, 1992 Abstract Cumulative logit and related

More information

multilevel modeling: concepts, applications and interpretations

multilevel modeling: concepts, applications and interpretations multilevel modeling: concepts, applications and interpretations lynne c. messer 27 october 2010 warning social and reproductive / perinatal epidemiologist concepts why context matters multilevel models

More information

Logistic Regression Models for Multinomial and Ordinal Outcomes

Logistic Regression Models for Multinomial and Ordinal Outcomes CHAPTER 8 Logistic Regression Models for Multinomial and Ordinal Outcomes 8.1 THE MULTINOMIAL LOGISTIC REGRESSION MODEL 8.1.1 Introduction to the Model and Estimation of Model Parameters In the previous

More information