A Flexible Modeling Framework for Overdispersed Hierarchical Data

Size: px
Start display at page:

Download "A Flexible Modeling Framework for Overdispersed Hierarchical Data"

Transcription

1 A Flexible Modeling Framework for Overdispersed Hierarchical Data Geert Molenberghs Interuniversity Institute for Biostatistics and statistical Bioinformatics (I-BioStat) Universiteit Hasselt & Katholieke Universiteit Leuven, Belgium & July 10, 2012 Paris, France

2 Joint Work With Modeling framework: Clarice G.B. Demétrio, Tony Vangeneugden Geert Verbeke, Afrânio Vieira Incomplete data concepts: Geert Verbeke, Michael G. Kenward, Marc Aerts, Niel Hens, An Creemers Ongoing work also with: Achmad Efendi, Mehreteab Fantahun, Aklilu Ghebretinsae, Samuel Iddi, Elasma Milanzi, Thomas Neyens, Edmund Njeri. Paris, July 10,

3 The Epilepsy Data Randomized, double-blind, parallel group multi-center study placebo (45) new anti-epileptic drug (AED; 44) 12-week run-in period & 16 weeks of follow up (some until week 27) outcome: the number of epileptic seizures experienced during the last week research question: reduction in # seizures by new therapy Let Y i now be the total number of seizures for subject i: Y i = n i i=1 Y ij where Y ij was the original (longitudinally measured) weekly outcome. Paris, July 10,

4 The basic Poisson model: Y i Poisson(λ i ) ln(λ i /n i ) = x i β The regression model is equivalent to λ i = n i exp(x i β) = exp(x i β + ln n i ) Paris, July 10,

5 Methodological Context Study Outcome type type N(µ, σ 2 ) N(µ,σ 2 ) Effect Param Effect Param Y i Mean µ Var. σ 2 Corr. Mean µ Var. µ (& θ i ) Corr. (θ i ) Effect Param Effect Param (Y i1,...,y ini ) Mean Var. µ i b i & Σ i Mean µ i Var. b i (& θ i ) Corr. b i & Σ i Corr. b i (& θ i ) Paris, July 10,

6 The Generalized Linear Model Suppose a sample Y 1,...,Y N of independent observations is available All Y i have densities f(y i θ i, φ) which belong to the exponential family: f(y θ i, φ) = exp { φ 1 [yθ i ψ(θ i )] + c(y, φ) } natural parameter θ i = x i β linear predictor Scale parameter (overdispersion parameter): φ Inverse link function: ψ (.) Mean-variance relationship: Var(Y ) = φψ [ ψ 1 (µ) ] = φv(µ) Paris, July 10,

7 Some Examples Element notation continuous binary count time to event Standard univariate exponential family Model normal Bernoulli Poisson Exponential Weibull Model f(y) 1 σ e (y µ) 2 2π 2σ 2 π y (1 π) 1 y e λ λ y ϕe ϕy ϕρy ρ 1 e ϕyρ y! Nat. param η µ ln[π/(1 π)] ln λ ϕ Mean function ψ(η) η 2 /2 ln[1 + exp(η)] λ = exp(η) ln( η) Norm. constant c(y, φ) ln(2πφ) 2 y2 2φ 0 lny! 0 (Over)dispersion φ σ Mean µ µ π λ ϕ 1 ϕ 1/ρ Γ(ρ 1 + 1) Variance φv(µ) σ 2 π(1 π) λ ϕ 2 ϕ 2/ρ [Γ(2ρ 1 + 1) Γ(ρ 1 + 1) 2 ] Paris, July 10,

8 Standard Overdispersion Models φ 1 = Var(Y i ) = φµ i Two-stage approach: Formulation: Y i λ i Poi(λ i ) E(λ i ) = µ i Implications: Var(λ i ) = σ 2 i E(Y i ) = E[E(Y i λ i )] = E(λ i ) = µ i Var(Y i ) = E[Var(Y i λ i )] + Var[E(Y i λ i )] = E(λ i ) + Var(λ i ) = µ i + σ 2 i Paris, July 10,

9 Repeated-measures version: λ i = (λ i1,...,λ ini ) E(λ i ) = µ i Var(λ i ) = Σ i E(Y i ) = µ i Var(Y i ) = M i + Σ i Paris, July 10,

10 The Standard Poisson-normal Model General formulation: f i (y ij b i, β, φ) = exp { φ 1 [y ij θ ij ψ(θ ij )] + c(y ij, φ) } η(µ ij ) = η[e(y ij b i )] = x ijβ + z ijb i b i N(0, D) Poisson formulation: Y ij Poi(λ ij ) ln(λ ij ) = x ijβ + z ijb i b i N(0,D) Paris, July 10,

11 Marginal moments: µ ij = exp x ijβ z ijdz ij Var(Y i ) = M i + M i ( e Z i DZ i J ni ) Mi Univariate version of the marginal moments: µ i = exp x i β d Var(Y i ) = µ i + µ 2 i (ed 1) Paris, July 10,

12 A Combined Model: The Poisson-(... )-normal Model Model formulation: Y ij Poi(λ ij ) λ ij = θ ij exp ( x ijβ + z ijb i ) b i N(0,D) E(θ i ) = E[(θ i1,...,θ ini ) ] = Φ i Var(θ i ) = Σ i Paris, July 10,

13 Marginal moments: µ ij = φ ij exp x ijβ z ijdz ij Var(Y i ) = M i + M i (P i J ni ) M i p i,jk = exp 1 2 z ij Dz ik σi,jk + φ ij φ ik φ ij φ ik exp 1 2 z ik Dz ij Univariate version of the marginal moments: µ i = φ i exp x iβ d Var(Y i ) = µ i + µ 2 i e d 1 + σ2 i φ 2 i e d Paris, July 10,

14 Fully Parametric Specification So far, random effects are specified only in a moment-based fashion: Mean Variance Fully parametric specification is possible Gamma specification is sensible Univariate case: the negative-binomial model Hierarchical case: the Poisson-gamma-normal model Paris, July 10,

15 The Negative-binomial Model λ Gamma(α,β) f(λ) = P(Y = y) = 1 β α Γ(α) λα 1 e λ/β 1 + β α 0 λ α 1 e λ/βλy e λ Γ(α) y! = α + y 1 α 1 β β + 1 y 1 β α Paris, July 10,

16 The Poisson-gamma-normal Model The marginal probability mass function: P(Y i = y i ) = t n i j=1 y ij + t j y ij α j + y ij + t j 1 α j 1 ( 1) tj β y ij+t j j exp n i (y ij + t j )x ijβ exp j=1 The special case of the Poisson-normal model: P(Y i = y i ) = 1 n i j=1 y ij! t ( 1) n i j=1 t j n i j=1 t j! exp n i exp n i (y ij + t j )z ij D j=1 n i j=1 (y ij + t j )x ijβ (y ij + t j )z ij D j=1 n i n i j=1 (y ij + t j )z ij j=1 (y ij + t j )z ij Paris, July 10,

17 Univariate case for Poisson-gamma-normal model: P(Y i = y i ) = + t=0 y i + t y i α + y i + t 1 α 1 β yi+t ( 1) t exp x i β(y i + t) d(y i + t) 2 Univariate case for Poisson-normal model: P(Y i = y i ) = 1 y i! + t=0 ( 1) t t! exp x i β(y i + t) d(y i + t) 2 Paris, July 10,

18 Estimation Integrate over the gamma random effects only: P(Y ij = y ij b i ) = α j + y ij 1 α j 1 κ ij = exp[x ijβ + z ijb i ] β j 1 + κ ij β j ij y κ ij β j α j y κ ij ij Can be fitted using the SAS procedure NLMIXED Model for the epilepsy data: ln(λ ij ) = (β 00 + b i ) + β 01 t ij if placebo (β 10 + b i ) + β 11 t ij if treated, b i N(0, d) Paris, July 10,

19 Parameter Estimates Poisson Negative-binomial Effect Parameter Estimate (s.e.) Estimate (s.e.) Intercept placebo β (0.0424) (0.1119) Slope placebo β (0.0043) (0.0111) Intercept treatment β (0.0383) (0.1093) Slope treatment β (0.0038) (0.0101) Negative-binomial parameter α (0.0255) Negative-binomial parameter α 2 = 1/α (0.0918) Variance of random intercepts d Poisson-normal Combined Effect Parameter Estimate (s.e.) Estimate (s.e.) Intercept placebo β (0.1677) (0.1755) Slope placebo β (0.0044) (0.0077) Intercept treatment β (0.1701) (0.1782) Slope treatment β (0.0043) (0.0074) Negative-binomial parameter α (0.2113) Negative-binomial parameter α 2 = 1/α (0.0348) Variance of random intercepts d (0.1844) (0.1850) Paris, July 10,

20 Implications for Correlation Function Smallest value Largest value Model Arm ρ time pair ρ time pair Poisson-normal placebo & & 2 Poisson-normal treatment & & 2 Combined placebo & & 2 Combined treatment & & 27 Paris, July 10,

21 Implications for Hypothesis Testing p-values Model H 0 : β 11 β 01 = 0 H 0 : β 11 /β 01 = 1 Poisson Poisson-normal negative-binomial combined Paris, July 10,

22 The General Case f i (y ij b i, ξ) = exp { φ 1 [y ij λ ij ψ(λ ij )] + c(y ij, φ) } E(Y ij θ ij, b i ) = µ c ij = θ ijκ ij θ ij G ij (ξ ij,σ 2 ij) κ ij = g(x ij ξ + z ij b i) b i N(0, D) η ij = x ijξ + z ijb i Paris, July 10,

23 Classical Conjugacy f(y θ) = exp { φ 1 [yh(θ) g(θ)] + c(y, φ) } f(θ) = exp {γ[ψh(θ) g(θ)] + c (γ, ψ)} f(y) = exp c(y, φ) + c (γ, ψ) c φ 1 + γ, φ 1 y + γψ φ 1 + γ Paris, July 10,

24 Strong Conjugacy Write: f(y κθ) = exp { φ 1 [yh(κθ) g(κθ)] + c(y, φ) } Apply the transformation theorem: f(θ γ, ψ) = κ f(κθ γ, ψ) Request the form: f(κθ) = exp {γ [ψ h(κθ) g(κθ)] + c (γ, ψ )} Leading to the marginal model: f(y κ) = exp c(y, φ) + c (γ,ψ ) + c φ 1 + γ, φ 1 y + γ ψ φ 1 + γ Paris, July 10,

25 The Weibull Case The Weibull-gamma-normal model: f(y i θ i,b i ) = n i λρθ ijyij ρ 1 j=1 e x ijβ+z ijb i e λyρ ij θ ije x ijβ+z ijb i f(θ i ) = n i j=1 1 j Γ(α j ) θα j 1 ij β α j e θ ij/β j f(b i ) = 1 2b i D 1 b i (2π) q/2 D 1/2e 1 Paris, July 10,

26 Marginal density: f(y i ) = n i (m 1,...,m ni ) j=1 ( 1) m j Γ(α j + m j + 1)β m j+1 j m j! Γ(α j ) exp (m j + 1) x ijβ (m j + 1) z ijdz ij λ m j+1 ρy (m j+1)ρ 1 ij Marginal moments: E(Y k ij) = α jb(α j k/ρ, k/ρ + 1) λ k/ρ β k/ρ j exp k ρ x ijβ + k2 2ρ 2z ijdz ij E(Y ij ) = α jb(α j 1/ρ, 1/ρ + 1) λ 1/ρ β 1/ρ j exp 1 ρ x ijβ + 1 2ρ 2z ijdz ij Paris, July 10,

27 Var(Y ij ) = α j λ 2/ρ β 2ρ j exp 2 ρ x ijβ + 1 ρ ijdz 2z ij B (α j 2/ρ, 2/ρ + 1) exp 1 ρ ijdz 2z ij α j B α j 1 ρ, 1 ρ Cov(Y ij, Y ik ) = α j α k λ 2/ρ β 1/ρ j β 1/ρ k exp 1 ρ (x ijβ + x ikβ) B α j 1 ρ, 1 ρ + 1 B α k 1 ρ, 1 ρ + 1 exp 1 2ρ ijdz 2(z ij + z ikdz ik ) exp 1 ρ ijdz 2z ik 1 Paris, July 10,

28 Identifiability can be achieved in, for example, two ways: For the second case: E(Y k ij) = δk/ρ j f(θ i ) = n i j=1 f(θ i ) = n i k λ Γ k 1 k ρ 1 α j j=1 δ je δ jθ ij Γ Weibull Exponential 1 α j Γ(αj ) θα j 1 k exp ρ ij e α jθ ij k ρ x ijβ + k2 E(Yij k ) = δk jk Γ (1 k) Γ (k) exp kx λk ij β + k2 2ρ 2z ijdz ij 2 z ij Dz ij Paris, July 10,

29 Univariate case, Weibull-exponential: f(y) = ϕρyρ 1 δ (δ + ϕy ρ ) 2 E(Y k ) = k ρ δ ϕ kρ Γ(1 k/ρ) Γ(k/ρ) Univariate case, exponential-exponential: f(y) = ϕδ (δ + ϕy) 2 E(Y k ) = k δ ϕ k Γ(1 k) Γ(k) Paris, July 10,

30 This leads to a class of Cauchy-type distributions, including the log-logistic distribution, of which there are known to be problems. Estimation using the marginal-conditional approach: f(y ij b i ) = λκ ije µ ijρy ij ρ 1 α j β j (1 + λκ ij e µ ijβ j yij) ρ α j+1 Further issues: censorship related functions non-parametric baseline hazard functions Paris, July 10,

31 The Binary Case A logit formulation: Y ij Bernoulli(π ij = θ ij κ ij ) κ ij = exp ( x ijβ + z ijb i 1 + exp ( x ij β + z ij b i ) ) No closed forms Conjugacy no strong conjugacy Marginal-conditional version (for NLMIXED implementation) f(y ij b i ) = 1 α j + β j (κ ij α j ) y ij [(1 κ ij )α j + β j ] 1 y ij Paris, July 10,

32 A probit formulation: κ ij = Φ 1 (x ijβ + z ijb i ) θ ij Beta(α,β) f ni (y i = 1) = α α + β n i Φni (X iβ; L 1 n i ) L ni = I ni Z i ( D 1 + Z iz i ) 1 Z i E(Y ij ) = Var(Y ij ) = α α + β Φ 1(x ijβ; L 1 1 ) = α α + β Φ 1( I + Dz ij z ij 1/2 x ijβ) α α + β Φ 1(x ijβ; L 1 1 ) 1 α α + β.φ 1(x ijβ; L 1 1 ) Paris, July 10,

33 Cov(Y ij, Y ik ) = α α + β 2 Φ 2 x ij x ik β, L 1 2jk Φ 1 (x ijβ; L 1 1j )Φ 1 (x ikβ; L 1 1k ) L 2jk = I 2 z ij z ik D 1 + z ij z ik (z ij z ik ) 1 (z ij z ik ) The above gives the success probabilities; all other joint probabilities can be derived in a similar fashion Paris, July 10,

34 Joint Modeling of a Poisson and Weibull Variable The combined model lends itself naturally to a joint modeling approach Example: Y is Poisson with gamma random effect θ y Z is Weibull with gamma random effect θ z The joint normal random effect is b Moments and joint marginal distribution can be derived Paris, July 10,

35 Joint distribution: P(Y = n, Z = z) = + + ( 1) m+k m=0 k=0 n!m!k! β n+m y Γ(n + m + α y) Γ(α y ) β k+1 z Γ(k α z) Γ(α z ) λ k+1 e (n+m)γ ρz ρ(k+1) 1 exp µ(k + 1) d(n + m + k + 1)2 Other joint distributions: similar Longitudinal versions: similar Estimation: using similar SAS procedure NLMIXED implementation Paris, July 10,

36 Joint Models with Longitudinal Data Missing Data f(y i,r i X i θ,ψ) Selection Models: f(y i X i,θ) f(r i X i, y o i, y m i, ψ) MCAR MAR MNAR f(r i X i, ψ) f(r i X i,y o i,ψ) f(r i X i, y o i, ym i, ψ) Pattern-mixture Models: f(y i X i, r i, θ) f(r i X i,ψ) Shared-parameter Models: f(y i X i, b i, θ) f(r i X i, b i,ψ) Paris, July 10,

37 MAR in 3 Frameworks Selection models f(r i y i, ψ) = f(r i y o i, ψ) Pattern-mixture models f(y m i y o i,r i,θ) = f(y m i y o i, θ) Shared-parameter models? Paris, July 10,

38 MAR in Selection Models: Natural Diggle and Kenward (ApStat 1994) f(r i y i,ψ) = f(r i y o i,ψ) Longitudinal data: logit [P(D i = j D i j,y ij, y i,j 1 )] = ψ 0 + ψ 1 y i,j 1 + ψ 2 y ij ψ 2 0 MNAR ψ 2 = 0 MAR ψ 1 = ψ 2 = 0 MCAR No dependence on the future (NFD): built in Paris, July 10,

39 MAR in Pattern-mixture Models: Feasible Molenberghs, Michiels, Kenward, and Diggle (Statistica Neerlandica 1998) Thijs, Molenberghs, Michiels, Verbeke, and Curran (Biostatistics 2002) f(y m i yo i,r i,θ) = f(y m i yo i,θ) For longitudinal data: ACMV: available case missing value restrictions: t 2, s < t : f(y it y i1,,y i,t 1, d i = s) = f(y it y i1,,y i,t 1,d i t) Practical implementation: doable! Likewise, non-future dependence can be built in Paris, July 10,

40 MAR in Shared-parameter Models?! Creemers, Hens, Aerts, Molenberghs, Verbeke, and Kenward (2008) Conventional f(y i X i, b i, θ) f(r i X i,b i, ψ) Extended f(y o i g i,h i, j i,l i )f(y m i y o i, g i, h i,k i, m i ) f(r i g i, j i, k i n i ) f(y o i g i,h i, j i )f(y m i y o i,g i,h i, k i )f(r i g i, j i, k i )f(b i ) db i f(y o i g i, j i )f(r i g i,j i )f(b i ) db i MAR = f(y o i g i, h i )f(y m i yo i, g i, h i )f(b i ) db i f(y o i) Paris, July 10,

41 MAR in Shared-parameter Models Creemers, Hens, Aerts, Molenberghs, Verbeke, and Kenward (2008) Extended f(y o i g i, h i, j i,l i )f(y m i y o i, g i, h i,k i, m i ) f(r i g i, j i, k i n i ) f(y o i g i,h i, j i )f(y m i yo i, g i,h i, k i )f(r i g i, j i, k i )f(b i ) db i f(y o i g i, j i )f(r i g i,j i )f(b i ) db i MAR = f(y o i g i, h i )f(y m i yo i, g i, h i )f(b i ) db i f(y o i) Sub-class MAR f(y o i j i,l i )f(y m i y o i, m i )f(r i j i, n i ) Paris, July 10,

42 Sensitivity: Every MNAR Model Has Got an MAR Counterpart Molenberghs, Beunckens, Sotto, and Kenward (JRSSB 2008) Creemers, Hens, Aerts, Molenberghs, Verbeke, and Kenward (2008) Fit an MNAR model to a set of incomplete data Change the conditional distribution of the unobserved outcomes, given the observed ones, to comply with MAR Resulting new model has exactly the same fit as the original MNAR model The missing data mechanism has changed This implies that definitively testing for MAR versus MNAR is not possible Paris, July 10,

43 MAR Counterpart to Pattern-mixture Models f(y o i, y m i, r i θ, ψ) = f(y o i r i, θ) f(r i ψ) f(y m i y o i, r i, θ) h(y o i, y m i, r i θ, ψ) = f(y o i r i, θ) f(r i ψ) f(y m i y o i, θ, ψ) Starting from PMM is natural : clear separation into: fully observable components entirely unobserved component Paris, July 10,

44 MAR Counterpart to Selection Models f(y o i,y m i,r i θ, ψ) = f(y o i, y m i θ) f(r i y o i,y m i, ψ) f(y o i,y m i,r i θ, ψ) = f(y o i r i, θ, ψ) f(r i θ, ψ) f(y m i y o i,r i, θ, ψ) h(y o i,y m i,r i θ, ψ) = f(y o i r i, θ, ψ) f(r i θ, ψ) f(y m i y o i, θ, ψ) Paris, July 10,

45 MAR Counterpart to Shared-parameter Models: Important Here! f(y i o, y i m, r i b i ) = f(y o i g i, h i, j i, l i ) f(y m i yo i, g i, h i, k i,m i ) f(r i g i, j i,k i n i ) h(y i o, y i m, r i b i ) = f(y o i g i, h i, j i, l i ) h(y m i yo i, m i) f(r i g i, j i,k i n i ) with h(y m i y o i, m i ) = g i h i k i f(y m i y o i, g i, h i,k i, m i )dg i dh i dk i Paris, July 10,

46 Common Ground for Various Settings Classical survival : time to event & censorship Joint modeling : time to event(s) & longitudinal process Incomplete data : outcome(s) & missigness process Sequential trials : outcomes & sample size Informative cluster size : outcomes & cluster size Paris, July 10,

47 Concluding Table Element notation continuous binary count time to event Standard univariate exponential family Model normal Bernoulli Poisson Exponential Weibull (y µ) 2 1 Model f(y) σ 2π e 2σ 2 π y (1 π) 1 y e λ λ y ϕe ϕy ϕρy ρ 1 e ϕyρ y! Nat. param η µ ln[π/(1 π)] ln λ ϕ Mean function ψ(η) η 2 /2 ln[1 + exp(η)] λ = exp(η) ln( η) Norm. constant c(y, φ) ln(2πφ) 2 y2 2φ 0 lny! 0 (Over)dispersion φ σ Mean µ µ π λ ϕ 1 ϕ 1/ρ Γ(ρ 1 + 1) Variance φv(µ) σ 2 π(1 π) λ ϕ 2 [ ϕ 2/ρ Γ(2ρ 1 + 1) Γ(ρ 1 + 1) 2] Exponential family with conjugate random effects Model normal-normal Beta-binomial Negative binomial Exponential-gamma Weibull-gamma Hier. model f(y θ) (y θ) 2 1 σ 2π e 2σ 2 θ y (1 θ) 1 y e θ θ y y! ϕθe ϕθy ϕθρy ρ 1 e ϕθyρ RE model f(θ) 1 e (θ µ)2 2d d 2π Marg. model f(y) θ α 1 (1 θ) β 1 B(α,β) (y µ) 2 1 σ 2 +d 2π e 2(σ 2 +d) (α + β) Γ(α) Γ(β) Γ(α+y) Γ(β+1 y) θ α 1 e θ/β β α Γ(α) ( Γ(α+y) β y!γ(α) β+1 ) y ( 1 β+1 θ α 1 e θ/β β α Γ(α) ) α ϕαβ (1+ϕβy) α+1 θ α 1 e θ/β β α Γ(α) ϕρy ρ 1 αβ (1+ϕβy ρ ) α+1 h(θ) θ ln[θ/(1 θ)] ln(θ) θ θ g(θ) 1 2 θ2 ln(1 θ) θ ln(θ)/ϕ ln(θ)/ϕ φ σ /ϕ 1/ϕ γ 1/d α + β 2 1/β ϕ(α 1) ϕ(α 1) ψ µ c(y, φ) c (γ, ψ) 1 2 φy2 1 2 ln( 2π φ 1 2 γψ2 1 2 ln( 2π γ Mean E(Y ) µ Variance Var(Y ) σ 2 + d ) ) α 1 α+β 2 β(α 1) [βϕ(α 1)] 1 [βϕ(α 1)] 1 ( 0 ln(y!) ln(ϕ) ln ϕρy ρ 1) γ+ϕ ln B(γψ + 1,γ ψγ + 1) (1 + γψ) ln γ lnγ(1 + γψ) ϕ ln(γψ) lnγ( ) γ+ϕ γ+ϕ ϕ ϕ ln(γψ) lnγ( ) γ+ϕ ϕ α αβ [ϕ(α 1)β] 1 Γ(α ρ 1 )Γ(ρ 1 +1) α+β (ϕβ) 1/ρ Γ(α) [ αβ (α+β) 2 αβ(β + 1) α[ϕ 2 (α 1) 2 (α 2)β 2 ] 1 1 2Γ(α 2ρ 1 )Γ(2ρ 1 ) ρ(ϕβ) 2/ρ Γ(α) ] Γ(α ρ 1 ) 2 Γ(ρ 1 ) 2 ργ(α) Paris, July 10,

48 References Molenberghs, G. and Verbeke, G. (2011). On the Weibull-Gamma frailty model, its infinite moments, and its connection to generalized log-logistic, logistic, Cauchy, and extreme-value distributions. Journal of Statistical Planning and Inference, 141, Molenberghs, G., Verbeke, G., and Demétrio, C. (2007). An extended random-effects approach to modeling repeated, overdispersed count data. Lifetime Data Analysis, 13, Molenberghs, G., Verbeke, G., Demétrio, C.G.B., and Vieira, A. (2010). A family of generalized linear models for repeated measures with normal and conjugate random effects. Statistical Science, 25, Vangeneugden, T., Molenberghs, G., Verbeke, G., and Demétrio, C. (2011). Marginal correlation from an extended random-effects model for repeated and overdispersed counts. Journal of Applied Statistics, 38, Paris, July 10,

The combined model: A tool for simulating correlated counts with overdispersion. George KALEMA Samuel IDDI Geert MOLENBERGHS

The combined model: A tool for simulating correlated counts with overdispersion. George KALEMA Samuel IDDI Geert MOLENBERGHS The combined model: A tool for simulating correlated counts with overdispersion George KALEMA Samuel IDDI Geert MOLENBERGHS Interuniversity Institute for Biostatistics and statistical Bioinformatics George

More information

A Joint Model with Marginal Interpretation for Longitudinal Continuous and Time-to-event Outcomes

A Joint Model with Marginal Interpretation for Longitudinal Continuous and Time-to-event Outcomes A Joint Model with Marginal Interpretation for Longitudinal Continuous and Time-to-event Outcomes Achmad Efendi 1, Geert Molenberghs 2,1, Edmund Njagi 1, Paul Dendale 3 1 I-BioStat, Katholieke Universiteit

More information

Introduction (Alex Dmitrienko, Lilly) Web-based training program

Introduction (Alex Dmitrienko, Lilly) Web-based training program Web-based training Introduction (Alex Dmitrienko, Lilly) Web-based training program http://www.amstat.org/sections/sbiop/webinarseries.html Four-part web-based training series Geert Verbeke (Katholieke

More information

Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs

Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs Michael J. Daniels and Chenguang Wang Jan. 18, 2009 First, we would like to thank Joe and Geert for a carefully

More information

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A. Linero and M. Daniels UF, UT-Austin SRC 2014, Galveston, TX 1 Background 2 Working model

More information

Hierarchical Hurdle Models for Zero-In(De)flated Count Data of Complex Designs

Hierarchical Hurdle Models for Zero-In(De)flated Count Data of Complex Designs for Zero-In(De)flated Count Data of Complex Designs Marek Molas 1, Emmanuel Lesaffre 1,2 1 Erasmus MC 2 L-Biostat Erasmus Universiteit - Rotterdam Katholieke Universiteit Leuven The Netherlands Belgium

More information

Flexible modeling tools for hierarchical and incomplete data, with applications in comet assays and clinical trials

Flexible modeling tools for hierarchical and incomplete data, with applications in comet assays and clinical trials 2013 School voor Informatietechnologie doctoraatsproefschrift Flexible modeling tools for hierarchical and incomplete data, with applications in comet assays and clinical trials Proefschrift voorgelegd

More information

A weighted simulation-based estimator for incomplete longitudinal data models

A weighted simulation-based estimator for incomplete longitudinal data models To appear in Statistics and Probability Letters, 113 (2016), 16-22. doi 10.1016/j.spl.2016.02.004 A weighted simulation-based estimator for incomplete longitudinal data models Daniel H. Li 1 and Liqun

More information

A Few Special Distributions and Their Properties

A Few Special Distributions and Their Properties A Few Special Distributions and Their Properties Econ 690 Purdue University Justin L. Tobias (Purdue) Distributional Catalog 1 / 20 Special Distributions and Their Associated Properties 1 Uniform Distribution

More information

Distributions of Functions of Random Variables. 5.1 Functions of One Random Variable

Distributions of Functions of Random Variables. 5.1 Functions of One Random Variable Distributions of Functions of Random Variables 5.1 Functions of One Random Variable 5.2 Transformations of Two Random Variables 5.3 Several Random Variables 5.4 The Moment-Generating Function Technique

More information

Surrogate marker evaluation when data are small, large, or very large Geert Molenberghs

Surrogate marker evaluation when data are small, large, or very large Geert Molenberghs Surrogate marker evaluation when data are small, large, or very large Geert Molenberghs Interuniversity Institute for Biostatistics and statistical Bioinformatics (I-BioStat) Universiteit Hasselt & KU

More information

Chapter 4: Generalized Linear Models-II

Chapter 4: Generalized Linear Models-II : Generalized Linear Models-II Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM [Acknowledgements to Tim Hanson and Haitao Chu] D. Bandyopadhyay

More information

,..., θ(2),..., θ(n)

,..., θ(2),..., θ(n) Likelihoods for Multivariate Binary Data Log-Linear Model We have 2 n 1 distinct probabilities, but we wish to consider formulations that allow more parsimonious descriptions as a function of covariates.

More information

T E C H N I C A L R E P O R T KERNEL WEIGHTED INFLUENCE MEASURES. HENS, N., AERTS, M., MOLENBERGHS, G., THIJS, H. and G. VERBEKE

T E C H N I C A L R E P O R T KERNEL WEIGHTED INFLUENCE MEASURES. HENS, N., AERTS, M., MOLENBERGHS, G., THIJS, H. and G. VERBEKE T E C H N I C A L R E P O R T 0465 KERNEL WEIGHTED INFLUENCE MEASURES HENS, N., AERTS, M., MOLENBERGHS, G., THIJS, H. and G. VERBEKE * I A P S T A T I S T I C S N E T W O R K INTERUNIVERSITY ATTRACTION

More information

Multilevel Methodology

Multilevel Methodology Multilevel Methodology Geert Molenberghs Interuniversity Institute for Biostatistics and statistical Bioinformatics Universiteit Hasselt, Belgium geert.molenberghs@uhasselt.be www.censtat.uhasselt.be Katholieke

More information

LECTURE 11: EXPONENTIAL FAMILY AND GENERALIZED LINEAR MODELS

LECTURE 11: EXPONENTIAL FAMILY AND GENERALIZED LINEAR MODELS LECTURE : EXPONENTIAL FAMILY AND GENERALIZED LINEAR MODELS HANI GOODARZI AND SINA JAFARPOUR. EXPONENTIAL FAMILY. Exponential family comprises a set of flexible distribution ranging both continuous and

More information

Shared Random Parameter Models for Informative Missing Data

Shared Random Parameter Models for Informative Missing Data Shared Random Parameter Models for Informative Missing Data Dean Follmann NIAID NIAID/NIH p. A General Set-up Longitudinal data (Y ij,x ij,r ij ) Y ij = outcome for person i on visit j R ij = 1 if observed

More information

Type I and type II error under random-effects misspecification in generalized linear mixed models Link Peer-reviewed author version

Type I and type II error under random-effects misspecification in generalized linear mixed models Link Peer-reviewed author version Type I and type II error under random-effects misspecification in generalized linear mixed models Link Peer-reviewed author version Made available by Hasselt University Library in Document Server@UHasselt

More information

7 Sensitivity Analysis

7 Sensitivity Analysis 7 Sensitivity Analysis A recurrent theme underlying methodology for analysis in the presence of missing data is the need to make assumptions that cannot be verified based on the observed data. If the assumption

More information

Joint Modeling of Longitudinal Item Response Data and Survival

Joint Modeling of Longitudinal Item Response Data and Survival Joint Modeling of Longitudinal Item Response Data and Survival Jean-Paul Fox University of Twente Department of Research Methodology, Measurement and Data Analysis Faculty of Behavioural Sciences Enschede,

More information

1 Uniform Distribution. 2 Gamma Distribution. 3 Inverse Gamma Distribution. 4 Multivariate Normal Distribution. 5 Multivariate Student-t Distribution

1 Uniform Distribution. 2 Gamma Distribution. 3 Inverse Gamma Distribution. 4 Multivariate Normal Distribution. 5 Multivariate Student-t Distribution A Few Special Distributions Their Properties Econ 675 Iowa State University November 1 006 Justin L Tobias (ISU Distributional Catalog November 1 006 1 / 0 Special Distributions Their Associated Properties

More information

Bayesian Multivariate Logistic Regression

Bayesian Multivariate Logistic Regression Bayesian Multivariate Logistic Regression Sean M. O Brien and David B. Dunson Biostatistics Branch National Institute of Environmental Health Sciences Research Triangle Park, NC 1 Goals Brief review of

More information

Modelling Dropouts by Conditional Distribution, a Copula-Based Approach

Modelling Dropouts by Conditional Distribution, a Copula-Based Approach The 8th Tartu Conference on MULTIVARIATE STATISTICS, The 6th Conference on MULTIVARIATE DISTRIBUTIONS with Fixed Marginals Modelling Dropouts by Conditional Distribution, a Copula-Based Approach Ene Käärik

More information

Linear model A linear model assumes Y X N(µ(X),σ 2 I), And IE(Y X) = µ(x) = X β, 2/52

Linear model A linear model assumes Y X N(µ(X),σ 2 I), And IE(Y X) = µ(x) = X β, 2/52 Statistics for Applications Chapter 10: Generalized Linear Models (GLMs) 1/52 Linear model A linear model assumes Y X N(µ(X),σ 2 I), And IE(Y X) = µ(x) = X β, 2/52 Components of a linear model The two

More information

A Flexible Bayesian Approach to Monotone Missing. Data in Longitudinal Studies with Nonignorable. Missingness with Application to an Acute

A Flexible Bayesian Approach to Monotone Missing. Data in Longitudinal Studies with Nonignorable. Missingness with Application to an Acute A Flexible Bayesian Approach to Monotone Missing Data in Longitudinal Studies with Nonignorable Missingness with Application to an Acute Schizophrenia Clinical Trial Antonio R. Linero, Michael J. Daniels

More information

STAT 705 Generalized linear mixed models

STAT 705 Generalized linear mixed models STAT 705 Generalized linear mixed models Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Data Analysis II 1 / 24 Generalized Linear Mixed Models We have considered random

More information

The consequences of misspecifying the random effects distribution when fitting generalized linear mixed models

The consequences of misspecifying the random effects distribution when fitting generalized linear mixed models The consequences of misspecifying the random effects distribution when fitting generalized linear mixed models John M. Neuhaus Charles E. McCulloch Division of Biostatistics University of California, San

More information

(y 1, y 2 ) = 12 y3 1e y 1 y 2 /2, y 1 > 0, y 2 > 0 0, otherwise.

(y 1, y 2 ) = 12 y3 1e y 1 y 2 /2, y 1 > 0, y 2 > 0 0, otherwise. 54 We are given the marginal pdfs of Y and Y You should note that Y gamma(4, Y exponential( E(Y = 4, V (Y = 4, E(Y =, and V (Y = 4 (a With U = Y Y, we have E(U = E(Y Y = E(Y E(Y = 4 = (b Because Y and

More information

ST3241 Categorical Data Analysis I Generalized Linear Models. Introduction and Some Examples

ST3241 Categorical Data Analysis I Generalized Linear Models. Introduction and Some Examples ST3241 Categorical Data Analysis I Generalized Linear Models Introduction and Some Examples 1 Introduction We have discussed methods for analyzing associations in two-way and three-way tables. Now we will

More information

Generalized Linear Models I

Generalized Linear Models I Statistics 203: Introduction to Regression and Analysis of Variance Generalized Linear Models I Jonathan Taylor - p. 1/16 Today s class Poisson regression. Residuals for diagnostics. Exponential families.

More information

A Nonparametric Approach Using Dirichlet Process for Hierarchical Generalized Linear Mixed Models

A Nonparametric Approach Using Dirichlet Process for Hierarchical Generalized Linear Mixed Models Journal of Data Science 8(2010), 43-59 A Nonparametric Approach Using Dirichlet Process for Hierarchical Generalized Linear Mixed Models Jing Wang Louisiana State University Abstract: In this paper, we

More information

Frailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Mela. P.

Frailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Mela. P. Frailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Melanie M. Wall, Bradley P. Carlin November 24, 2014 Outlines of the talk

More information

High-Throughput Sequencing Course

High-Throughput Sequencing Course High-Throughput Sequencing Course DESeq Model for RNA-Seq Biostatistics and Bioinformatics Summer 2017 Outline Review: Standard linear regression model (e.g., to model gene expression as function of an

More information

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis Review Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 22 Chapter 1: background Nominal, ordinal, interval data. Distributions: Poisson, binomial,

More information

CTDL-Positive Stable Frailty Model

CTDL-Positive Stable Frailty Model CTDL-Positive Stable Frailty Model M. Blagojevic 1, G. MacKenzie 2 1 Department of Mathematics, Keele University, Staffordshire ST5 5BG,UK and 2 Centre of Biostatistics, University of Limerick, Ireland

More information

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features Yangxin Huang Department of Epidemiology and Biostatistics, COPH, USF, Tampa, FL yhuang@health.usf.edu January

More information

USEFUL PROPERTIES OF THE MULTIVARIATE NORMAL*

USEFUL PROPERTIES OF THE MULTIVARIATE NORMAL* USEFUL PROPERTIES OF THE MULTIVARIATE NORMAL* 3 Conditionals and marginals For Bayesian analysis it is very useful to understand how to write joint, marginal, and conditional distributions for the multivariate

More information

Generalized Linear Models Introduction

Generalized Linear Models Introduction Generalized Linear Models Introduction Statistics 135 Autumn 2005 Copyright c 2005 by Mark E. Irwin Generalized Linear Models For many problems, standard linear regression approaches don t work. Sometimes,

More information

General Regression Model

General Regression Model Scott S. Emerson, M.D., Ph.D. Department of Biostatistics, University of Washington, Seattle, WA 98195, USA January 5, 2015 Abstract Regression analysis can be viewed as an extension of two sample statistical

More information

Generalized Linear Models. Kurt Hornik

Generalized Linear Models. Kurt Hornik Generalized Linear Models Kurt Hornik Motivation Assuming normality, the linear model y = Xβ + e has y = β + ε, ε N(0, σ 2 ) such that y N(μ, σ 2 ), E(y ) = μ = β. Various generalizations, including general

More information

Figure 36: Respiratory infection versus time for the first 49 children.

Figure 36: Respiratory infection versus time for the first 49 children. y BINARY DATA MODELS We devote an entire chapter to binary data since such data are challenging, both in terms of modeling the dependence, and parameter interpretation. We again consider mixed effects

More information

Generalized Linear Models

Generalized Linear Models Generalized Linear Models Advanced Methods for Data Analysis (36-402/36-608 Spring 2014 1 Generalized linear models 1.1 Introduction: two regressions So far we ve seen two canonical settings for regression.

More information

6 Pattern Mixture Models

6 Pattern Mixture Models 6 Pattern Mixture Models A common theme underlying the methods we have discussed so far is that interest focuses on making inference on parameters in a parametric or semiparametric model for the full data

More information

Continuous Time Survival in Latent Variable Models

Continuous Time Survival in Latent Variable Models Continuous Time Survival in Latent Variable Models Tihomir Asparouhov 1, Katherine Masyn 2, Bengt Muthen 3 Muthen & Muthen 1 University of California, Davis 2 University of California, Los Angeles 3 Abstract

More information

T E C H N I C A L R E P O R T A SANDWICH-ESTIMATOR TEST FOR MISSPECIFICATION IN MIXED-EFFECTS MODELS. LITIERE S., ALONSO A., and G.

T E C H N I C A L R E P O R T A SANDWICH-ESTIMATOR TEST FOR MISSPECIFICATION IN MIXED-EFFECTS MODELS. LITIERE S., ALONSO A., and G. T E C H N I C A L R E P O R T 0658 A SANDWICH-ESTIMATOR TEST FOR MISSPECIFICATION IN MIXED-EFFECTS MODELS LITIERE S., ALONSO A., and G. MOLENBERGHS * I A P S T A T I S T I C S N E T W O R K INTERUNIVERSITY

More information

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent Latent Variable Models for Binary Data Suppose that for a given vector of explanatory variables x, the latent variable, U, has a continuous cumulative distribution function F (u; x) and that the binary

More information

STA 216, GLM, Lecture 16. October 29, 2007

STA 216, GLM, Lecture 16. October 29, 2007 STA 216, GLM, Lecture 16 October 29, 2007 Efficient Posterior Computation in Factor Models Underlying Normal Models Generalized Latent Trait Models Formulation Genetic Epidemiology Illustration Structural

More information

STAT5044: Regression and Anova

STAT5044: Regression and Anova STAT5044: Regression and Anova Inyoung Kim 1 / 18 Outline 1 Logistic regression for Binary data 2 Poisson regression for Count data 2 / 18 GLM Let Y denote a binary response variable. Each observation

More information

Longitudinal analysis of ordinal data

Longitudinal analysis of ordinal data Longitudinal analysis of ordinal data A report on the external research project with ULg Anne-Françoise Donneau, Murielle Mauer June 30 th 2009 Generalized Estimating Equations (Liang and Zeger, 1986)

More information

Lecture 2. (See Exercise 7.22, 7.23, 7.24 in Casella & Berger)

Lecture 2. (See Exercise 7.22, 7.23, 7.24 in Casella & Berger) 8 HENRIK HULT Lecture 2 3. Some common distributions in classical and Bayesian statistics 3.1. Conjugate prior distributions. In the Bayesian setting it is important to compute posterior distributions.

More information

Stat 5101 Notes: Brand Name Distributions

Stat 5101 Notes: Brand Name Distributions Stat 5101 Notes: Brand Name Distributions Charles J. Geyer February 14, 2003 1 Discrete Uniform Distribution DiscreteUniform(n). Discrete. Rationale Equally likely outcomes. The interval 1, 2,..., n of

More information

Generalized Quasi-likelihood versus Hierarchical Likelihood Inferences in Generalized Linear Mixed Models for Count Data

Generalized Quasi-likelihood versus Hierarchical Likelihood Inferences in Generalized Linear Mixed Models for Count Data Sankhyā : The Indian Journal of Statistics 2009, Volume 71-B, Part 1, pp. 55-78 c 2009, Indian Statistical Institute Generalized Quasi-likelihood versus Hierarchical Likelihood Inferences in Generalized

More information

Probability Distributions Columns (a) through (d)

Probability Distributions Columns (a) through (d) Discrete Probability Distributions Columns (a) through (d) Probability Mass Distribution Description Notes Notation or Density Function --------------------(PMF or PDF)-------------------- (a) (b) (c)

More information

Modeling the scale parameter ϕ A note on modeling correlation of binary responses Using marginal odds ratios to model association for binary responses

Modeling the scale parameter ϕ A note on modeling correlation of binary responses Using marginal odds ratios to model association for binary responses Outline Marginal model Examples of marginal model GEE1 Augmented GEE GEE1.5 GEE2 Modeling the scale parameter ϕ A note on modeling correlation of binary responses Using marginal odds ratios to model association

More information

Contrasting Marginal and Mixed Effects Models Recall: two approaches to handling dependence in Generalized Linear Models:

Contrasting Marginal and Mixed Effects Models Recall: two approaches to handling dependence in Generalized Linear Models: Contrasting Marginal and Mixed Effects Models Recall: two approaches to handling dependence in Generalized Linear Models: Marginal models: based on the consequences of dependence on estimating model parameters.

More information

Why analyze as ordinal? Mixed Models for Longitudinal Ordinal Data Don Hedeker University of Illinois at Chicago

Why analyze as ordinal? Mixed Models for Longitudinal Ordinal Data Don Hedeker University of Illinois at Chicago Why analyze as ordinal? Mixed Models for Longitudinal Ordinal Data Don Hedeker University of Illinois at Chicago hedeker@uic.edu www.uic.edu/ hedeker/long.html Efficiency: Armstrong & Sloan (1989, Amer

More information

Regularization in Cox Frailty Models

Regularization in Cox Frailty Models Regularization in Cox Frailty Models Andreas Groll 1, Trevor Hastie 2, Gerhard Tutz 3 1 Ludwig-Maximilians-Universität Munich, Department of Mathematics, Theresienstraße 39, 80333 Munich, Germany 2 University

More information

STAT 526 Advanced Statistical Methodology

STAT 526 Advanced Statistical Methodology STAT 526 Advanced Statistical Methodology Fall 2017 Lecture Note 10 Analyzing Clustered/Repeated Categorical Data 0-0 Outline Clustered/Repeated Categorical Data Generalized Linear Mixed Models Generalized

More information

Bayesian Inference and Decision Theory

Bayesian Inference and Decision Theory Bayesian Inference and Decision Theory Instructor: Kathryn Blackmond Laskey Room 224 ENGR (703) 993-644 Office Hours: Thursday 4:00-6:00 PM, or by appointment Spring 208 Unit 3: Bayesian Inference with

More information

STA216: Generalized Linear Models. Lecture 1. Review and Introduction

STA216: Generalized Linear Models. Lecture 1. Review and Introduction STA216: Generalized Linear Models Lecture 1. Review and Introduction Let y 1,..., y n denote n independent observations on a response Treat y i as a realization of a random variable Y i In the general

More information

Repeated ordinal measurements: a generalised estimating equation approach

Repeated ordinal measurements: a generalised estimating equation approach Repeated ordinal measurements: a generalised estimating equation approach David Clayton MRC Biostatistics Unit 5, Shaftesbury Road Cambridge CB2 2BW April 7, 1992 Abstract Cumulative logit and related

More information

Marginal correlation in longitudinal binary data based on generalized linear mixed models

Marginal correlation in longitudinal binary data based on generalized linear mixed models Marginal correlation in longitudinal binary data based on generalized linear mixed models Tony Vangeneugden 1,2 Geert Molenberghs 2,3 Annouschka Laenen 2 Helena Geys 2,4 Caroline Beunckens 2 Cristina Sotto

More information

H-LIKELIHOOD ESTIMATION METHOOD FOR VARYING CLUSTERED BINARY MIXED EFFECTS MODEL

H-LIKELIHOOD ESTIMATION METHOOD FOR VARYING CLUSTERED BINARY MIXED EFFECTS MODEL H-LIKELIHOOD ESTIMATION METHOOD FOR VARYING CLUSTERED BINARY MIXED EFFECTS MODEL Intesar N. El-Saeiti Department of Statistics, Faculty of Science, University of Bengahzi-Libya. entesar.el-saeiti@uob.edu.ly

More information

Statistical Inference and Methods

Statistical Inference and Methods Department of Mathematics Imperial College London d.stephens@imperial.ac.uk http://stats.ma.ic.ac.uk/ das01/ 31st January 2006 Part VI Session 6: Filtering and Time to Event Data Session 6: Filtering and

More information

MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems

MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems Review of Basic Probability The fundamentals, random variables, probability distributions Probability mass/density functions

More information

Longitudinal and Incomplete Data

Longitudinal and Incomplete Data Longitudinal and Incomplete Data Geert Verbeke geert.verbeke@med.kuleuven.be Geert Molenberghs geert.molenberghs@uhasselt.be geert.molenberghs@med.kuleuven.be Interuniversity Institute for Biostatistics

More information

Lecture 16: Mixtures of Generalized Linear Models

Lecture 16: Mixtures of Generalized Linear Models Lecture 16: Mixtures of Generalized Linear Models October 26, 2006 Setting Outline Often, a single GLM may be insufficiently flexible to characterize the data Setting Often, a single GLM may be insufficiently

More information

Contents 1. Contents

Contents 1. Contents Contents 1 Contents 6 Distributions of Functions of Random Variables 2 6.1 Transformation of Discrete r.v.s............. 3 6.2 Method of Distribution Functions............. 6 6.3 Method of Transformations................

More information

LOGISTIC REGRESSION Joseph M. Hilbe

LOGISTIC REGRESSION Joseph M. Hilbe LOGISTIC REGRESSION Joseph M. Hilbe Arizona State University Logistic regression is the most common method used to model binary response data. When the response is binary, it typically takes the form of

More information

Missing Data in Longitudinal Studies: Mixed-effects Pattern-Mixture and Selection Models

Missing Data in Longitudinal Studies: Mixed-effects Pattern-Mixture and Selection Models Missing Data in Longitudinal Studies: Mixed-effects Pattern-Mixture and Selection Models Hedeker D & Gibbons RD (1997). Application of random-effects pattern-mixture models for missing data in longitudinal

More information

Pattern Recognition and Machine Learning. Bishop Chapter 2: Probability Distributions

Pattern Recognition and Machine Learning. Bishop Chapter 2: Probability Distributions Pattern Recognition and Machine Learning Chapter 2: Probability Distributions Cécile Amblard Alex Kläser Jakob Verbeek October 11, 27 Probability Distributions: General Density Estimation: given a finite

More information

Generalized Linear Models. Last time: Background & motivation for moving beyond linear

Generalized Linear Models. Last time: Background & motivation for moving beyond linear Generalized Linear Models Last time: Background & motivation for moving beyond linear regression - non-normal/non-linear cases, binary, categorical data Today s class: 1. Examples of count and ordered

More information

Models for Longitudinal Analysis of Binary Response Data for Identifying the Effects of Different Treatments on Insomnia

Models for Longitudinal Analysis of Binary Response Data for Identifying the Effects of Different Treatments on Insomnia Applied Mathematical Sciences, Vol. 4, 2010, no. 62, 3067-3082 Models for Longitudinal Analysis of Binary Response Data for Identifying the Effects of Different Treatments on Insomnia Z. Rezaei Ghahroodi

More information

Linear, Generalized Linear, and Mixed-Effects Models in R. Linear and Generalized Linear Models in R Topics

Linear, Generalized Linear, and Mixed-Effects Models in R. Linear and Generalized Linear Models in R Topics Linear, Generalized Linear, and Mixed-Effects Models in R John Fox McMaster University ICPSR 2018 John Fox (McMaster University) Statistical Models in R ICPSR 2018 1 / 19 Linear and Generalized Linear

More information

Machine Learning. Lecture 3: Logistic Regression. Feng Li.

Machine Learning. Lecture 3: Logistic Regression. Feng Li. Machine Learning Lecture 3: Logistic Regression Feng Li fli@sdu.edu.cn https://funglee.github.io School of Computer Science and Technology Shandong University Fall 2016 Logistic Regression Classification

More information

Generalized linear mixed models for dependent compound risk models

Generalized linear mixed models for dependent compound risk models Generalized linear mixed models for dependent compound risk models Emiliano A. Valdez joint work with H. Jeong, J. Ahn and S. Park University of Connecticut ASTIN/AFIR Colloquium 2017 Panama City, Panama

More information

Stat 579: Generalized Linear Models and Extensions

Stat 579: Generalized Linear Models and Extensions Stat 579: Generalized Linear Models and Extensions Linear Mixed Models for Longitudinal Data Yan Lu April, 2018, week 15 1 / 38 Data structure t1 t2 tn i 1st subject y 11 y 12 y 1n1 Experimental 2nd subject

More information

Sample Size and Power Considerations for Longitudinal Studies

Sample Size and Power Considerations for Longitudinal Studies Sample Size and Power Considerations for Longitudinal Studies Outline Quantities required to determine the sample size in longitudinal studies Review of type I error, type II error, and power For continuous

More information

Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach

Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach Jae-Kwang Kim Department of Statistics, Iowa State University Outline 1 Introduction 2 Observed likelihood 3 Mean Score

More information

Non-maximum likelihood estimation and statistical inference for linear and nonlinear mixed models

Non-maximum likelihood estimation and statistical inference for linear and nonlinear mixed models Optimum Design for Mixed Effects Non-Linear and generalized Linear Models Cambridge, August 9-12, 2011 Non-maximum likelihood estimation and statistical inference for linear and nonlinear mixed models

More information

Generalized Linear Models 1

Generalized Linear Models 1 Generalized Linear Models 1 STA 2101/442: Fall 2012 1 See last slide for copyright information. 1 / 24 Suggested Reading: Davison s Statistical models Exponential families of distributions Sec. 5.2 Chapter

More information

Mixtures of Negative Binomial distributions for modelling overdispersion in RNA-Seq data

Mixtures of Negative Binomial distributions for modelling overdispersion in RNA-Seq data Mixtures of Negative Binomial distributions for modelling overdispersion in RNA-Seq data Cinzia Viroli 1 joint with E. Bonafede 1, S. Robin 2 & F. Picard 3 1 Department of Statistical Sciences, University

More information

Estimation After a Group Sequential Trial

Estimation After a Group Sequential Trial DOI 10.1007/s12561-014-9112-6 Estimation After a Group Sequential Trial Elasma Milanzi Geert Molenberghs Ariel Alonso Michael G. Kenward Anastasios A. Tsiatis Marie Davidian Geert Verbeke Received: 19

More information

Stat 5101 Notes: Brand Name Distributions

Stat 5101 Notes: Brand Name Distributions Stat 5101 Notes: Brand Name Distributions Charles J. Geyer September 5, 2012 Contents 1 Discrete Uniform Distribution 2 2 General Discrete Uniform Distribution 2 3 Uniform Distribution 3 4 General Uniform

More information

E(Y ij b i ) = f(x ijβ i ), (13.1) β i = A i β + B i b i. (13.2)

E(Y ij b i ) = f(x ijβ i ), (13.1) β i = A i β + B i b i. (13.2) 1 Advanced topics 1.1 Introduction In this chapter, we conclude with brief overviews of several advanced topics. Each of these topics could realistically be the subject of an entire course! 1. Generalized

More information

Multiple Imputation in Generalized Linear Mixed Models

Multiple Imputation in Generalized Linear Mixed Models DEPARTMENT OF STATISTICS Master Thesis Multiple Imputation in Generalized Linear Mixed Models Author: Rudolf Jagdhuber Supervisor: Prof. Dr. Christian Heumann Ludwig Maximilians University Munich Winter

More information

Convergence Properties of a Sequential Regression Multiple Imputation Algorithm

Convergence Properties of a Sequential Regression Multiple Imputation Algorithm Convergence Properties of a Sequential Regression Multiple Imputation Algorithm Jian Zhu 1, Trivellore E. Raghunathan 2 Department of Biostatistics, University of Michigan, Ann Arbor Abstract A sequential

More information

An exploration of fixed and random effects selection for longitudinal binary outcomes in the presence of non-ignorable dropout

An exploration of fixed and random effects selection for longitudinal binary outcomes in the presence of non-ignorable dropout Biometrical Journal 0 (2011) 0, zzz zzz / DOI: 10.1002/ An exploration of fixed and random effects selection for longitudinal binary outcomes in the presence of non-ignorable dropout Ning Li,1, Michael

More information

Stat 5101 Lecture Notes

Stat 5101 Lecture Notes Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random

More information

Parameter Estimation

Parameter Estimation Parameter Estimation Consider a sample of observations on a random variable Y. his generates random variables: (y 1, y 2,, y ). A random sample is a sample (y 1, y 2,, y ) where the random variables y

More information

Longitudinal breast density as a marker of breast cancer risk

Longitudinal breast density as a marker of breast cancer risk Longitudinal breast density as a marker of breast cancer risk C. Armero (1), M. Rué (2), A. Forte (1), C. Forné (2), H. Perpiñán (1), M. Baré (3), and G. Gómez (4) (1) BIOstatnet and Universitat de València,

More information

Markov Chain Monte Carlo (MCMC)

Markov Chain Monte Carlo (MCMC) Markov Chain Monte Carlo (MCMC Dependent Sampling Suppose we wish to sample from a density π, and we can evaluate π as a function but have no means to directly generate a sample. Rejection sampling can

More information

Stat 535 C - Statistical Computing & Monte Carlo Methods. Arnaud Doucet.

Stat 535 C - Statistical Computing & Monte Carlo Methods. Arnaud Doucet. Stat 535 C - Statistical Computing & Monte Carlo Methods Arnaud Doucet Email: arnaud@cs.ubc.ca 1 Suggested Projects: www.cs.ubc.ca/~arnaud/projects.html First assignement on the web: capture/recapture.

More information

SB1a Applied Statistics Lectures 9-10

SB1a Applied Statistics Lectures 9-10 SB1a Applied Statistics Lectures 9-10 Dr Geoff Nicholls Week 5 MT15 - Natural or canonical) exponential families - Generalised Linear Models for data - Fitting GLM s to data MLE s Iteratively Re-weighted

More information

Generalized linear mixed models (GLMMs) for dependent compound risk models

Generalized linear mixed models (GLMMs) for dependent compound risk models Generalized linear mixed models (GLMMs) for dependent compound risk models Emiliano A. Valdez joint work with H. Jeong, J. Ahn and S. Park University of Connecticut 52nd Actuarial Research Conference Georgia

More information

McGill University. Department of Epidemiology and Biostatistics. Bayesian Analysis for the Health Sciences. Course EPIB-675.

McGill University. Department of Epidemiology and Biostatistics. Bayesian Analysis for the Health Sciences. Course EPIB-675. McGill University Department of Epidemiology and Biostatistics Bayesian Analysis for the Health Sciences Course EPIB-675 Lawrence Joseph Bayesian Analysis for the Health Sciences EPIB-675 3 credits Instructor:

More information

Lecture 3.1 Basic Logistic LDA

Lecture 3.1 Basic Logistic LDA y Lecture.1 Basic Logistic LDA 0.2.4.6.8 1 Outline Quick Refresher on Ordinary Logistic Regression and Stata Women s employment example Cross-Over Trial LDA Example -100-50 0 50 100 -- Longitudinal Data

More information

Chapter 3 Common Families of Distributions

Chapter 3 Common Families of Distributions Lecture 9 on BST 631: Statistical Theory I Kui Zhang, 9/3/8 and 9/5/8 Review for the previous lecture Definition: Several commonly used discrete distributions, including discrete uniform, hypergeometric,

More information

Web-based Supplementary Material for A Two-Part Joint. Model for the Analysis of Survival and Longitudinal Binary. Data with excess Zeros

Web-based Supplementary Material for A Two-Part Joint. Model for the Analysis of Survival and Longitudinal Binary. Data with excess Zeros Web-based Supplementary Material for A Two-Part Joint Model for the Analysis of Survival and Longitudinal Binary Data with excess Zeros Dimitris Rizopoulos, 1 Geert Verbeke, 1 Emmanuel Lesaffre 1 and Yves

More information

Advanced Quantitative Methods: limited dependent variables

Advanced Quantitative Methods: limited dependent variables Advanced Quantitative Methods: Limited Dependent Variables I University College Dublin 2 April 2013 1 2 3 4 5 Outline Model Measurement levels 1 2 3 4 5 Components Model Measurement levels Two components

More information