Supplementary materials for:

Similar documents
Paper AA Demonstrate how this methodology can be implemented on output from lmm (linear mixed model) from SAS Proc Mixed.

Using PROC GENMOD to Analyse Ratio to Placebo in Change of Dactylitis. Irmgard Hollweck / Meike Best 13.OCT.2013

You can specify the response in the form of a single variable or in the form of a ratio of two variables denoted events/trials.

Contrasting Marginal and Mixed Effects Models Recall: two approaches to handling dependence in Generalized Linear Models:

GEE for Longitudinal Data - Chapter 8

STA6938-Logistic Regression Model

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal

Lab 11. Multilevel Models. Description of Data

Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, and Discrete Changes 1

Confidence Intervals for the Odds Ratio in Logistic Regression with One Binary X

Lecture 15 (Part 2): Logistic Regression & Common Odds Ratio, (With Simulations)

Simplified marginal effects in discrete choice models

COMPLEMENTARY LOG-LOG MODEL

Using PROC GENMOD to Analyse Ratio to Placebo in Change of Dactylitis

DIAGNOSTICS FOR STRATIFIED CLINICAL TRIALS IN PROPORTIONAL ODDS MODELS

Robustness of location estimators under t- distributions: a literature review

Practical Considerations Surrounding Normality

Simple logistic regression

UNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS. Duration - 3 hours. Aids Allowed: Calculator

Biostatistics Workshop Longitudinal Data Analysis. Session 4 GARRETT FITZMAURICE

Confidence Intervals for the Interaction Odds Ratio in Logistic Regression with Two Binary X s

Categorical and Zero Inflated Growth Models

STAT 705 Generalized linear mixed models

Multilevel Methodology

Accepted Manuscript. Comparing different ways of calculating sample size for two independent means: A worked example

Introduction to Statistical Analysis

,..., θ(2),..., θ(n)

Models for binary data

Chapter 1. Modeling Basics

Department of Statistical Science FIRST YEAR EXAM - SPRING 2017

Measures of Association and Variance Estimation

Sample Size and Power Considerations for Longitudinal Studies

UNIVERSITY OF MASSACHUSETTS Department of Mathematics and Statistics Applied Statistics Friday, January 15, 2016

CHL 5225 H Crossover Trials. CHL 5225 H Crossover Trials

Tutorial 6: Tutorial on Translating between GLIMMPSE Power Analysis and Data Analysis. Acknowledgements:

Introduction to lnmle: An R Package for Marginally Specified Logistic-Normal Models for Longitudinal Binary Data

Three-Level Modeling for Factorial Experiments With Experimentally Induced Clustering

Longitudinal Modeling with Logistic Regression

Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion

Model Based Statistics in Biology. Part V. The Generalized Linear Model. Chapter 18.1 Logistic Regression (Dose - Response)

SAS PROC NLMIXED Mike Patefield The University of Reading 12 May

Confidence Intervals for the Odds Ratio in Logistic Regression with Two Binary X s

Tutorial 4: Power and Sample Size for the Two-sample t-test with Unequal Variances

Logistic Regression. Interpretation of linear regression. Other types of outcomes. 0-1 response variable: Wound infection. Usual linear regression

INVERTED KUMARASWAMY DISTRIBUTION: PROPERTIES AND ESTIMATION

Sections 4.1, 4.2, 4.3

Product Held at Accelerated Stability Conditions. José G. Ramírez, PhD Amgen Global Quality Engineering 6/6/2013

The MIANALYZE Procedure (Chapter)

Categorical data analysis Chapter 5

Analyzing Residuals in a PROC SURVEYLOGISTIC Model

Count data page 1. Count data. 1. Estimating, testing proportions

Directly and Efficiently Optimizing Prediction Error and AUC of Linear Classifiers

Mixed-effects Polynomial Regression Models chapter 5

IP WEIGHTING AND MARGINAL STRUCTURAL MODELS (CHAPTER 12) BIOS IPW and MSM

SAS/STAT 13.1 User s Guide. The MIANALYZE Procedure

The Logit Model: Estimation, Testing and Interpretation

Chapter 5: Logistic Regression-I

Statistics 203: Introduction to Regression and Analysis of Variance Course review

Multiple Regression Analysis: The Problem of Inference

Interactions between Binary & Quantitative Predictors

ST3241 Categorical Data Analysis I Logistic Regression. An Introduction and Some Examples

Odds ratio estimation in Bernoulli smoothing spline analysis-ofvariance

Using Estimating Equations for Spatially Correlated A

Gaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012

Bayesian inference for sample surveys. Roderick Little Module 2: Bayesian models for simple random samples

SUPPLEMENTARY INFORMATION

ESTIMATE PROP. IMPAIRED PRE- AND POST-INTERVENTION FOR THIN LIQUID SWALLOW TASKS. The SURVEYFREQ Procedure

The GENMOD Procedure. Overview. Getting Started. Syntax. Details. Examples. References. SAS/STAT User's Guide. Book Contents Previous Next

STAT 7030: Categorical Data Analysis

Biost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation

Economics 583: Econometric Theory I A Primer on Asymptotics

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Multi-level Models: Idea

Non-parametric Inference and Resampling

ST3241 Categorical Data Analysis I Multicategory Logit Models. Logit Models For Nominal Responses

Chapter 14 Logistic and Poisson Regressions

Impact Evaluation of Mindspark Centres

Longitudinal analysis of ordinal data

Interactions among Continuous Predictors

Biostat Methods STAT 5820/6910 Handout #5a: Misc. Issues in Logistic Regression

BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY

Comparing two independent samples

Nemours Biomedical Research Statistics Course. Li Xie Nemours Biostatistics Core October 14, 2014

Downloaded from:

Standardization methods have been used in epidemiology. Marginal Structural Models as a Tool for Standardization ORIGINAL ARTICLE

The Delta Method and Applications

Calculating Effect-Sizes. David B. Wilson, PhD George Mason University

GMM Logistic Regression with Time-Dependent Covariates and Feedback Processes in SAS TM

A Survival Analysis of GMO vs Non-GMO Corn Hybrid Persistence Using Simulated Time Dependent Covariates in SAS

ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES. Cox s regression analysis Time dependent explanatory variables

Introduction to mtm: An R Package for Marginalized Transition Models

Comparing Priors in Bayesian Logistic Regression for Sensorial Classification of Rice

Introduction (Alex Dmitrienko, Lilly) Web-based training program

Stat 587: Key points and formulae Week 15

Estimating the Marginal Odds Ratio in Observational Studies

TESTS FOR EQUIVALENCE BASED ON ODDS RATIO FOR MATCHED-PAIR DESIGN

Testing Indirect Effects for Lower Level Mediation Models in SAS PROC MIXED

2 >1. That is, a parallel study design will require

The consequences of misspecifying the random effects distribution when fitting generalized linear mixed models

Transcription:

Supplementary materials for: Tang TS, Funnell MM, Sinco B, Spencer MS, Heisler M. Peer-led, empowerment-based approach to selfmanagement efforts in diabetes (PLEASED: a randomized controlled trial in an African-American community. Ann Fam Med. 015;13:S7-S35. Doi: 10.1370/afm.1819. 1

Appendix: Using the Multivariate Delta Method to Report Logistic Regression Results with Repeated Measures By Brandy R. Sinco, MS, University of Michigan, Ann Arbor, MI Generalized Estimating Equation With Repeated Measures Note: This example uses time points, but could easily be extended to the four time points in this paper. Let i = 0 for control and 1 for treatment. Let j = 1 for pre-intervention and for post-intervention. Let k = k th subject. Let R = 0 for control and 1 for treatment. Let T = 0 for baseline and 1 for first follow-up. Let π = probability of success. Generalized Estimating Equation for a Binary Outcome. logit(π ijk = ln(π ijk /(1 - π ijk = (β 0 + β 1 R + β T + β 3 RT + ε ijk, ε ijk = error term. The β terms are assumed to be multivariate normal. Estimated Mean: ln(π ijk /(1 - π ijk = β 0 + β 1 R + β T + β 3 RT. The mean percentages for the control group are exp(β 0 /(1+ exp(β 0 at preintervention and exp(β 0 + β /(1+ exp(β 0 + β at post-intervention. The means for the treatment group are exp(β 0 + β 1 /(1+ exp(β 0 + β 1 at preintervention and exp(β 0 + β 1 + β + β 3 /(1+ exp(β 0 + β 1 + β + β 3 at postintervention. The Univariate Delta Method 1 Let Y ~ Normal(µ, σ, µ, σ 0; n = sample size. Let g(y be a differentiable function of Y with non-zero first derivative. Then, a first-order Taylor series for g(y = g(µ + g (µ(y µ. Mean of g(y g(µ and Variance of (g(y (g (µ σ. Delta Method Theorem: ( ( ( ( µ D σ ( ( µ lim n g Y g Normal 0, g'. n I.E., asymptotic distribution of g(y = Normal(g(µ, (g (µ σ. Example: Let Y ~ Normal(µ, σ. Let W = g(y = e Y. g (µ = e µ. Using the delta method, mean of W = g(µ = e µ and Variance of W = (g (µ σ = e µ σ ; Standard deviation of W = e µ σ. Asymptotic distribution of W = Normal(e µ, e µ σ. The Multivariate Delta Method Let Y be a multivariate vector of m normal variables, Y = [Y 1 Y Y m ]. Y ~ N(µ,, where is a m m covariance matrix. Let g(y be a differentiable function of Y with non-zero first derivative.

The multivariate delta method states that if lim n( Y µ D N( 0, n ( D T ( ( ( µ g( µ g ( µ lim n g Y g N 0, J Σ J. n Where J g (µ is the Jacobian matrix, evaluated at Y = µ. g1( Y g1( Y g1( Y L Y1 Y Y m g( Y g( Y g( Y J g ( µ L = Y1 Y Y m evaluated at Y = µ. M M O M gm( Y gm( Y gm( Y L Y1 Y Y m Σ, then Application of the Multivariate Delta Method to Report the Outcomes of a Generalize Estimating Equation for a Binary Variable As Percentages Instead of As Odds Ratios. (Note: Reference shows how to apply the delta method to the log transform when used with a linear mixed model. The same methodology can be used to calculate percentages from a logistic model. The β s are assumed to be multivariate normal, with covariance matrix β. Percent success of control group at time 1 = exp(β 0 /(1+ exp(β 0. g 1 (β = exp(β 0 /(1+ exp(β 0. Note that g 1 (β/ β 0 = exp(β 0 /(1+ exp(β 0.and g 1 (β/ β i = 0 if i>0. Percent success of control group at time = exp(β 0 + β /(1+ exp(β 0 + β. Let g (β = exp(β 0 + β /(1+ exp(β 0 + β. Note that g (β/ β 0 = g (β/ β = exp(β 0 + β /(1+ exp(β 0 + β and g (β/ β i = 0 for i 0 and i. Same property holds for g 3 and g 4 derivatives with respect to the β s. Percent success of treatment group at time 1 = exp(β 0 + β 1 /(1+ exp(β 0 + β 1. g 3 (β = exp(β 0 + β 1 /(1+ exp(β 0 + β 1. g 3 (β/ β 0 = g 3 (β/ β 1 = exp(β 0 + β 1 /(1+ exp(β 0 + β 1, and g 3 (β/ β i = 0 for i>1. Percent success of treatment group at time = exp(β 0 + β 1 + β + β 3 /(1+ exp(β 0 + β 1 + β + β 3. g 4 (β = exp(β 0 + β 1 + β + β 3 /(1+ exp(β 0 + β 1 + β + β 3 g 4 (β/ β i = exp(β 0 + β 1 + β + β 3 /(1+ exp(β 0 + β 1 + β + β 3, 0 i 3. 3

Jacobian Matrix for g(β. exp( β0 0 0 0 ( 1+ exp( β0 exp( β0 + β exp( β0 + β 0 0 ( 1+ exp( β0 + β ( 1+ exp( β0 + β exp( β0 + β1 exp( β0 + β1 J ( β = g 0 0 ( 1+ exp( β0 + β1 ( 1+ exp( β0 + β1 3 3 3 3 exp βi exp βi exp βi exp βi i= 0 i= 0 i= 0 i= 0 3 3 3 3 1+ exp βi 1+ exp βi 1+ exp βi 1+ exp βi i= 0 i= 0 i= 0 i= 0 W1 0 0 0 W 0 W 0 In simpler form, J g ( β =. W3 W3 0 0 W4 W4 W4 W4 Define the covariance matrix for the β s as σ σ σ σ β00 β01 β0 β03 σβ01 σβ11 σβ1 σβ13 Σ β =. σ σ σ σ β0 β1 β β3 σ σ σ σ β03 β13 β3 β33 The covariance matrix for W (estimates for each group at each time point will be W = J g (β β (J g (β T. Example SAS Code to Compute Covariance Matrix With Four Time Points (Other software could be used. The delta method can be applied with any statistical software. *** PHQ Bin, Minimally Depressed or Greater ***; ods html; ods graphics on; Proc Genmod Data=AcrossTimeLong_Ypsi Descending; Class REACHID TimepointN RandomizationN; Model PHQBin=TimepointN RandomizationN TimepointN*RandomizationN / dist=bin; Repeated Subject=REACHID / Type=UN Modelse ECOVB; ODS OUTPUT GEEModPEst=RegBinPHQ GEERCov=CovBPHQ; Run; 4

ods graphics off; ods html close; /* PHQBin: Compute standard errors for difference scores via the delta method */ ods html path="c:\temp"; Proc IML; /* Input beta vector; begin at B1 instead of B0 because SAS numbers parameters starting at 1 */ use regbinphq; read all var {'Estimate'} into B; close regbinphq; /* B5=B7=B9=B11=B13=B14=B15=0 reference levels */ B00 = B[1]; /* B00 = control group at baseline */ B10 = B[1] + B[6]; /* B10 = peer group at baseline */ B01 = B[1] + B[4]; /* B01 = control group at 3 months */ B11 = B[1] + B[4] + B[6] + B[1]; /* B11 = peer group at 3 months */ B0 = B[1] + B[3]; /* B0 = control group at 9 months */ B1 = B[1] + B[3] + B[6] + B[10]; /* B1 = peer group at 9 months */ B03 = B[1] + B[]; /* B03 = control group at 15 months */ B13 = B[1] + B[] + B[6] + B[8]; /* B1 = peer group at 15 months */ /* Convert to exp scale */ expb00=exp(b00/((1+exp(b00**; expb10=exp(b10/((1+exp(b10**; expb01=exp(b01/((1+exp(b01**; expb11=exp(b11/((1+exp(b11**; expb0=exp(b0/((1+exp(b0**; expb1=exp(b1/((1+exp(b1**; expb03=exp(b03/((1+exp(b03**; expb13=exp(b13/((1+exp(b13**; /* Input covariance matrix, 8x8 matrix */ use CovBPHQ; read all var{'prm1' 'Prm' 'Prm3' 'Prm4' 'Prm6' 'Prm8' 'Prm10' 'Prm1'} into CovB; close CovBPHQ; print CovB; /* Contruct W, 8x8 matrix */ W=j(8,8,0; W[1,1]=expB00; z={1 5}; W[,z]=expB10; z={1 4}; W[3, z]=expb01; z={1 4 5 8}; W[4,z]=expB11; z={1 3}; 5

W[5,z]=expB0; z={1 3 5 7}; W[6,z]=expB1; W[7,1:]=expB03; z={1 5 6}; W[8,z]=expB13; print W; /* V=W*CovB*T(W = cov matrix of exp(bij/((1+exp(bij**, i=0 to 1, j=0 to 3 */ reset fuzz; /* This corrects rounding errors. */ V=W*CovB*T(W; print V; quit; ods html close; REFERENCES 1 Multivariate Delta Method: Reference: Casella G, Berger RL. Statistical inference. nd ed. Pacific Grove, CA: Duxbury; 00. Sinco B., Kieffer E., Spencer M. Using the Delta Method to generate means and confidence intervals from a Linear Mixed Model on the original scale, when the analysis is done on the log scale. Presented at the American Statistical Association s Conference on Statistical Practice in New Orleans, /0/015. 6