# Longitudinal Modeling with Logistic Regression

Save this PDF as:

Size: px
Start display at page:

## Transcription

1 Newsom 1 Longitudinal Modeling with Logistic Regression Longitudinal designs involve repeated measurements of the same individuals over time There are two general classes of analyses that correspond to two definitions of change, the absolute level definition and the relative level definition Absolute Level Definition of Stability and Change One type of question involves whether the scores on a variable increase or decrease over time To answer this question, we can subtract one score from another, usually Y 2-1 = Y 2 Y 1, where the score at the first time point is subtracted from the score at the second time point Referred to as difference scores, gain scores, or sometimes changes scores The mean of the difference score indicates an average increase if it is positive and an average decrease if it is negative A variable is considered stable on average to the extent the average difference is 0, and the degree of change is measured by the extent to which the average difference is greater or less than 0 The average of the differences equals the differences of the averages, E( Y2 Y1) = Y2 Y1 This is true whether variables are continuous or binary, but with binary variables, the difference score has only three possible values, -1, 0, and +1 The downside to this definition of change is that regression toward the mean might be mistaken for changes over time and the change scores are sensitive to any random or temporary fluctuations in the values For a continuous variable, the statistical test for whether there is a significant difference on average, we would use a dependent-samples (paired) t-test or within-subjects ANOVA The within-subjects ANOVA extends the difference test to three or more time points For binary variables, where the mean is a proportion when using 0,1 coding the McNemar s (or Cochran s Q for more than two time points) test whether there is a significant difference between P(Y 2 = 1) P(Y 1 = 1) is greater than zero As we saw in earlier, this is the same as determining whether p 1+ = p +1 (and, equivalently that p 12 = p 21) 1 A loglinear analysis could also be used for this comparison For any analysis that is based on a log transformation, however, the concept of absolute change is further complicated When a log transformation is involved, the average difference between two binary variables and the difference in the averages (marginal proportions) are no longer equivalent Mathematically, this is because the difference between two logged averages does not equal the average of the log of the differences between two scores ( Y2 Y1) ( Y2) ( Y1) ln ln ln The difference on the left-hand side of the inequality, can be referred to as a conditional or subjectspecific value, whereas the difference on the right-hand side of the inequality above can be referred to as a marginal, unconditional, or population-averaged value 2 Relative Level Definition of Stability and Change An alternative way to think about stability and change is in terms of the correlation of a variable with itself over time Stability is defined in this way by the size of the correlation between Y 2 and Y 1, with r 12 = 10 indicating a perfectly stable variable There is some change if the correlation is less than perfect This definition is distinct from the absolute definition of change, because the correlation between the two variables can be perfect even if there is a large increase of decrease from Time 1 to Time 2 Adding 5, 10, or even 100 points to all of the Y 2 scores would not change the correlation This is because the correlation coefficient is more a function of the correspondence in the relative position of the scores in the sample than a metric of how well the scores match in absolute terms The focus under this definition 1 Refer to the Matched Pairs Analysis handout for more on this equivalence 2 2 Recall that from the quotient rule of logs, that ln ( Y2) ln ( Y1) ln Y =, instead Y1

2 Newsom 2 of stability and change has to do with the strength of the relationship over time, and it does not provide information about whether the scores increase or decrease over time Longitudinal analyses involve a correlation between Y 1 and Y 2 or autoregression with Y 1 predicting Y 2 The residual in an autoregression represents the degree to which the variable is not stable over time or changing in the relative level sense Modeling Absolute Level Change: Conditional Logistic Regression and GEE In addition to the McNemar test change in a binary variable over two or more time points can be assessed with a conditional logistic regression model The conditional logistic model (Cox, 1958) models examine change over time using time as a predictor of Y t, measured repeatedly at multiple time points ( ) logit PYit = 1 = αi + βxt This is a typical logistic regression, except for two things The dependent variable, Y it, has values for each person and each time point, indicated by the subscript ij, and the intercept α i has a different value for each person x t is some arbitrary time code, usually beginning 0,1, for however many time points are available The term conditional logistic is used because the regression is conditioned on subject To conduct the analysis, data must be transformed into a long (or person time) format If there are 100 cases measured at two time points, the data set will have 200 records i t Y ij In general, the data set has n T records A standard logistic regression procedure can be used to estimate the conditional logistic model if the analysis can be stratified by subject (eg, Proc Logistic in SAS) or survival analysis procedures can be used The regression coefficient represents the change in the logit for each unit change in x t, with positive values indicating an increase over time and negative values indicating a decrease over time The odds ratio is the odds increase or decrease with each unit increase in x t For two time points, it s the odds that Y = 1 increase from the first time point to the second Or, β ln n n = And notice that these are the same two cells, the discordant cells, that are compared in McNemar s test The distinction is that the conditional logistic represents a test of the subject-specific effect and the McNemar s chi-squared represents a test of the marginal or population-averaged effect The intercept α i gives the logit value of Y when X equals 0, or the first time point The logistic transformation can be used to obtain the estimate of the probability that Y = 1 at the first time point, where αi e = = 1 + e ( 1) PY it αi Generalized estimating equations (GEE; Liang & Zeger, 1986) approach to change over time works much the same way The GEE model accounts for correlated observations, and, for longitudinal data, it takes into account the dependence that occurs with multiple observations per person The analysis uses quasi-likelihood estimation rather than the usual Newton-Raphson maximum likelihood estimation of

4 Newsom 4 Testing Global Null Hypothesis: BETA=0 Test Chi-Square DF Pr > ChiSq Likelihood Ratio <0001 Score <0001 Wald <0001 The LOGISTIC Procedure Analysis of Maximum Likelihood Estimates Standard Wald Parameter DF Estimate Error Chi-Square Pr > ChiSq Intercept w1dep <0001 w1neg Odds Ratio Estimates Point 95% Wald Effect Estimate Confidence Limits w1dep w1neg Sample Write-Up Clinical depression measured at the six-month follow-up was regressed on negative social exchanges and clinical depression measured at baseline Both baseline predictors significantly predicted clinical depression at follow-up As expected, those who were depressed at baseline were more likely to be depressed at follow-up, b = 164, SE = 13, OR = 516, p < 001 Of greater interest was that those who reported more frequent negative social exchanges were more likely to be depressed at follow-up after controlling for initial depression, b = 45, SE = 20, OR = 157, p < 05, suggesting that negative social exchanges at baseline were predictive of an increase in clinical depression over the six-month interval Questions about absolute level change (difference of binary variable with no predictor) can be tested with the McNemar s test and conditional logistic Conditional logistic requires configuration of the data to a long (person period) data set In SAS, ID is used as a strata to allow the intercept to vary by case, α i A survival analysis approach is an alternative method for obtaining a conditional regression, which would be needed in SPSS (coxreg) and R (clogit in the survival package) I only illustrate the conditional logistic here for didactic reasons, so don t give code for either of these models I recommend growth curve models (or possibly GEE if there are only two time points) instead of conditional logistic for this general approach /* mcnemar's test */ proc freq; tables w1dep*w2dep /agree; w1dep(time 1 depression) w2dep(time 2 depression) Frequency Percent Row Pct Col Pct not depr depresse Total essed d not depressed depressed Total

5 Newsom 5 McNemar's Test ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ Statistic (S) DF 1 Pr > S <0001 /*reconfigure data set to be long format */ data one; set one; id = _N_; DATA long ; SET one ; t = 0 ; dep = w1dep ; OUTPUT ; t = 1 ; dep = w2dep; OUTPUT ; keep id t dep ; RUN; /*conditional logistic regression */ proc logistic data=long order=data descending; strata id; MODEL dep=t ; The LOGISTIC Procedure Conditional Analysis Testing Global Null Hypothesis: BETA=0 Test Chi-Square DF Pr > ChiSq Likelihood Ratio <0001 Score <0001 Wald <0001 Analysis of Conditional Maximum Likelihood Estimates Standard Wald Parameter DF Estimate Error Chi-Square Pr > ChiSq t <0001 Odds Ratio Estimates Point 95% Wald Effect Estimate Confidence Limits t Notice that the score test from the conditional logistic matches the McNemar s test Both are a test of the population-averaged (marginal) difference The Wald test from the conditional logistic does not match either, because it is a test of the subject-specific (conditional) effect References and Further Readings Cox, D R (1958) Two further applications of a model for binary regression Biometrika, 45(3/4), Hu, F B, Goldberg, J, Hedeker, D, Flay, B R, & Pentz, M A (1998) Comparison of population-averaged and subject-specific approaches for analyzing repeated binary outcomes American Journal of Epidemiology, 147, Hanley, J A, Negassa, A, & Forrester, J E (2003) Statistical analysis of correlated data using generalized estimating equations: an orientation American journal of epidemiology, 157(4), Huber, Peter J (1967) The behavior of maximum likelihood estimates under nonstandard conditions Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, Liang, K Y, & Zeger, S L (1986) Longitudinal data analysis using generalized linear models Biometrika, 73(1), Newsom (2012) Basic Longitudinal Analysis Approaches for Continuous and Categorical Variables In Newsom, JT, Jones, RN, & Hofer, SM (Eds) (2012) Longitudinal Data Analysis: A Practical Guide for Researchers in Aging, Health, and Social Science New York: Routledge (pp ) Raudenbush, S W, & Bryk, A S (2002) Hierarchical linear models: Applications and data analysis methods (Vol 1) Thousand Oaks, CA: Sage White, H (1980) A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity Econometrica, 48,

### Three-Way Contingency Tables

Newsom PSY 50/60 Categorical Data Analysis, Fall 06 Three-Way Contingency Tables Three-way contingency tables involve three binary or categorical variables. I will stick mostly to the binary case to keep

### Multinomial Logistic Regression Models

Stat 544, Lecture 19 1 Multinomial Logistic Regression Models Polytomous responses. Logistic regression can be extended to handle responses that are polytomous, i.e. taking r>2 categories. (Note: The word

### GEE for Longitudinal Data - Chapter 8

GEE for Longitudinal Data - Chapter 8 GEE: generalized estimating equations (Liang & Zeger, 1986; Zeger & Liang, 1986) extension of GLM to longitudinal data analysis using quasi-likelihood estimation method

### SAS Analysis Examples Replication C8. * SAS Analysis Examples Replication for ASDA 2nd Edition * Berglund April 2017 * Chapter 8 ;

SAS Analysis Examples Replication C8 * SAS Analysis Examples Replication for ASDA 2nd Edition * Berglund April 2017 * Chapter 8 ; libname ncsr "P:\ASDA 2\Data sets\ncsr\" ; data c8_ncsr ; set ncsr.ncsr_sub_13nov2015

### Contrasting Marginal and Mixed Effects Models Recall: two approaches to handling dependence in Generalized Linear Models:

Contrasting Marginal and Mixed Effects Models Recall: two approaches to handling dependence in Generalized Linear Models: Marginal models: based on the consequences of dependence on estimating model parameters.

### Model Estimation Example

Ronald H. Heck 1 EDEP 606: Multivariate Methods (S2013) April 7, 2013 Model Estimation Example As we have moved through the course this semester, we have encountered the concept of model estimation. Discussions

### Hierarchical Generalized Linear Models. ERSH 8990 REMS Seminar on HLM Last Lecture!

Hierarchical Generalized Linear Models ERSH 8990 REMS Seminar on HLM Last Lecture! Hierarchical Generalized Linear Models Introduction to generalized models Models for binary outcomes Interpreting parameter

### Stat 642, Lecture notes for 04/12/05 96

Stat 642, Lecture notes for 04/12/05 96 Hosmer-Lemeshow Statistic The Hosmer-Lemeshow Statistic is another measure of lack of fit. Hosmer and Lemeshow recommend partitioning the observations into 10 equal

### Generalized Linear Models for Non-Normal Data

Generalized Linear Models for Non-Normal Data Today s Class: 3 parts of a generalized model Models for binary outcomes Complications for generalized multivariate or multilevel models SPLH 861: Lecture

### Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011)

Ron Heck, Fall 2011 1 EDEP 768E: Seminar in Multilevel Modeling rev. January 3, 2012 (see footnote) Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October

### Repeated ordinal measurements: a generalised estimating equation approach

Repeated ordinal measurements: a generalised estimating equation approach David Clayton MRC Biostatistics Unit 5, Shaftesbury Road Cambridge CB2 2BW April 7, 1992 Abstract Cumulative logit and related

### COMPLEMENTARY LOG-LOG MODEL

COMPLEMENTARY LOG-LOG MODEL Under the assumption of binary response, there are two alternatives to logit model: probit model and complementary-log-log model. They all follow the same form π ( x) =Φ ( α

### Logistic Regression. Interpretation of linear regression. Other types of outcomes. 0-1 response variable: Wound infection. Usual linear regression

Logistic Regression Usual linear regression (repetition) y i = b 0 + b 1 x 1i + b 2 x 2i + e i, e i N(0,σ 2 ) or: y i N(b 0 + b 1 x 1i + b 2 x 2i,σ 2 ) Example (DGA, p. 336): E(PEmax) = 47.355 + 1.024

### CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA

Examples: Multilevel Modeling With Complex Survey Data CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA Complex survey data refers to data obtained by stratification, cluster sampling and/or

### Correlation and regression

1 Correlation and regression Yongjua Laosiritaworn Introductory on Field Epidemiology 6 July 2015, Thailand Data 2 Illustrative data (Doll, 1955) 3 Scatter plot 4 Doll, 1955 5 6 Correlation coefficient,

### STA6938-Logistic Regression Model

Dr. Ying Zhang STA6938-Logistic Regression Model Topic 2-Multiple Logistic Regression Model Outlines:. Model Fitting 2. Statistical Inference for Multiple Logistic Regression Model 3. Interpretation of

### LISA Short Course Series Generalized Linear Models (GLMs) & Categorical Data Analysis (CDA) in R. Liang (Sally) Shan Nov. 4, 2014

LISA Short Course Series Generalized Linear Models (GLMs) & Categorical Data Analysis (CDA) in R Liang (Sally) Shan Nov. 4, 2014 L Laboratory for Interdisciplinary Statistical Analysis LISA helps VT researchers

### Lab 11. Multilevel Models. Description of Data

Lab 11 Multilevel Models Henian Chen, M.D., Ph.D. Description of Data MULTILEVEL.TXT is clustered data for 386 women distributed across 40 groups. ID: 386 women, id from 1 to 386, individual level (level

### Longitudinal Data Analysis Using SAS Paul D. Allison, Ph.D. Upcoming Seminar: October 13-14, 2017, Boston, Massachusetts

Longitudinal Data Analysis Using SAS Paul D. Allison, Ph.D. Upcoming Seminar: October 13-14, 217, Boston, Massachusetts Outline 1. Opportunities and challenges of panel data. a. Data requirements b. Control

### Hypothesis testing, part 2. With some material from Howard Seltman, Blase Ur, Bilge Mutlu, Vibha Sazawal

Hypothesis testing, part 2 With some material from Howard Seltman, Blase Ur, Bilge Mutlu, Vibha Sazawal 1 CATEGORICAL IV, NUMERIC DV 2 Independent samples, one IV # Conditions Normal/Parametric Non-parametric

### An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012

An Introduction to Multilevel Models PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 Today s Class Concepts in Longitudinal Modeling Between-Person vs. +Within-Person

### Introduction to Generalized Models

Introduction to Generalized Models Today s topics: The big picture of generalized models Review of maximum likelihood estimation Models for binary outcomes Models for proportion outcomes Models for categorical

### Multilevel Statistical Models: 3 rd edition, 2003 Contents

Multilevel Statistical Models: 3 rd edition, 2003 Contents Preface Acknowledgements Notation Two and three level models. A general classification notation and diagram Glossary Chapter 1 An introduction

### Generalized linear models

Generalized linear models Douglas Bates November 01, 2010 Contents 1 Definition 1 2 Links 2 3 Estimating parameters 5 4 Example 6 5 Model building 8 6 Conclusions 8 7 Summary 9 1 Generalized Linear Models

### Lab 3: Two levels Poisson models (taken from Multilevel and Longitudinal Modeling Using Stata, p )

Lab 3: Two levels Poisson models (taken from Multilevel and Longitudinal Modeling Using Stata, p. 376-390) BIO656 2009 Goal: To see if a major health-care reform which took place in 1997 in Germany was

### Investigating Models with Two or Three Categories

Ronald H. Heck and Lynn N. Tabata 1 Investigating Models with Two or Three Categories For the past few weeks we have been working with discriminant analysis. Let s now see what the same sort of model might

### Research Design: Topic 18 Hierarchical Linear Modeling (Measures within Persons) 2010 R.C. Gardner, Ph.d.

Research Design: Topic 8 Hierarchical Linear Modeling (Measures within Persons) R.C. Gardner, Ph.d. General Rationale, Purpose, and Applications Linear Growth Models HLM can also be used with repeated

### EDF 7405 Advanced Quantitative Methods in Educational Research. Data are available on IQ of the child and seven potential predictors.

EDF 7405 Advanced Quantitative Methods in Educational Research Data are available on IQ of the child and seven potential predictors. Four are medical variables available at the birth of the child: Birthweight

### Sections 4.1, 4.2, 4.3

Sections 4.1, 4.2, 4.3 Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1/ 32 Chapter 4: Introduction to Generalized Linear Models Generalized linear

### Using Estimating Equations for Spatially Correlated A

Using Estimating Equations for Spatially Correlated Areal Data December 8, 2009 Introduction GEEs Spatial Estimating Equations Implementation Simulation Conclusion Typical Problem Assess the relationship

### Models for Binary Outcomes

Models for Binary Outcomes Introduction The simple or binary response (for example, success or failure) analysis models the relationship between a binary response variable and one or more explanatory variables.

### Introduction to SAS proc mixed

Faculty of Health Sciences Introduction to SAS proc mixed Analysis of repeated measurements, 2017 Julie Forman Department of Biostatistics, University of Copenhagen 2 / 28 Preparing data for analysis The

### Generalized Estimating Equations (gee) for glm type data

Generalized Estimating Equations (gee) for glm type data Søren Højsgaard mailto:sorenh@agrsci.dk Biometry Research Unit Danish Institute of Agricultural Sciences January 23, 2006 Printed: January 23, 2006

### 2/26/2017. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2

PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 When and why do we use logistic regression? Binary Multinomial Theory behind logistic regression Assessing the model Assessing predictors

### Interactions among Continuous Predictors

Interactions among Continuous Predictors Today s Class: Simple main effects within two-way interactions Conquering TEST/ESTIMATE/LINCOM statements Regions of significance Three-way interactions (and beyond

### Appendix: Computer Programs for Logistic Regression

Appendix: Computer Programs for Logistic Regression In this appendix, we provide examples of computer programs to carry out unconditional logistic regression, conditional logistic regression, polytomous

### Generalized Linear Modeling - Logistic Regression

1 Generalized Linear Modeling - Logistic Regression Binary outcomes The logit and inverse logit interpreting coefficients and odds ratios Maximum likelihood estimation Problem of separation Evaluating

### SAS/STAT 13.1 User s Guide. Introduction to Survey Sampling and Analysis Procedures

SAS/STAT 13.1 User s Guide Introduction to Survey Sampling and Analysis Procedures This document is an individual chapter from SAS/STAT 13.1 User s Guide. The correct bibliographic citation for the complete

### STAT 526 Advanced Statistical Methodology

STAT 526 Advanced Statistical Methodology Fall 2017 Lecture Note 10 Analyzing Clustered/Repeated Categorical Data 0-0 Outline Clustered/Repeated Categorical Data Generalized Linear Mixed Models Generalized

### Inverse Sampling for McNemar s Test

International Journal of Statistics and Probability; Vol. 6, No. 1; January 27 ISSN 1927-7032 E-ISSN 1927-7040 Published by Canadian Center of Science and Education Inverse Sampling for McNemar s Test

### Inference for Binomial Parameters

Inference for Binomial Parameters Dipankar Bandyopadhyay, Ph.D. Department of Biostatistics, Virginia Commonwealth University D. Bandyopadhyay (VCU) BIOS 625: Categorical Data & GLM 1 / 58 Inference for

### Binomial Model. Lecture 10: Introduction to Logistic Regression. Logistic Regression. Binomial Distribution. n independent trials

Lecture : Introduction to Logistic Regression Ani Manichaikul amanicha@jhsph.edu 2 May 27 Binomial Model n independent trials (e.g., coin tosses) p = probability of success on each trial (e.g., p =! =

### Linear Regression Models P8111

Linear Regression Models P8111 Lecture 25 Jeff Goldsmith April 26, 2016 1 of 37 Today s Lecture Logistic regression / GLMs Model framework Interpretation Estimation 2 of 37 Linear regression Course started

### The linear mixed model: introduction and the basic model

The linear mixed model: introduction and the basic model Analysis of Experimental Data AED The linear mixed model: introduction and the basic model of 39 Contents Data with non-independent observations

### Using the same data as before, here is part of the output we get in Stata when we do a logistic regression of Grade on Gpa, Tuce and Psi.

Logistic Regression, Part III: Hypothesis Testing, Comparisons to OLS Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 14, 2018 This handout steals heavily

### SAS/STAT 13.2 User s Guide. Introduction to Survey Sampling and Analysis Procedures

SAS/STAT 13.2 User s Guide Introduction to Survey Sampling and Analysis Procedures This document is an individual chapter from SAS/STAT 13.2 User s Guide. The correct bibliographic citation for the complete

### Machine Learning Linear Classification. Prof. Matteo Matteucci

Machine Learning Linear Classification Prof. Matteo Matteucci Recall from the first lecture 2 X R p Regression Y R Continuous Output X R p Y {Ω 0, Ω 1,, Ω K } Classification Discrete Output X R p Y (X)

### Longitudinal Data Analysis of Health Outcomes

Longitudinal Data Analysis of Health Outcomes Longitudinal Data Analysis Workshop Running Example: Days 2 and 3 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development

### SPSS Output. ANOVA a b Residual Coefficients a Standardized Coefficients

SPSS Output Homework 1-1e ANOVA a Sum of Squares df Mean Square F Sig. 1 Regression 351.056 1 351.056 11.295.002 b Residual 932.412 30 31.080 Total 1283.469 31 a. Dependent Variable: Sexual Harassment

### over Time line for the means). Specifically, & covariances) just a fixed variance instead. PROC MIXED: to 1000 is default) list models with TYPE=VC */

CLP 944 Example 4 page 1 Within-Personn Fluctuation in Symptom Severity over Time These data come from a study of weekly fluctuation in psoriasis severity. There was no intervention and no real reason

### Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent

Latent Variable Models for Binary Data Suppose that for a given vector of explanatory variables x, the latent variable, U, has a continuous cumulative distribution function F (u; x) and that the binary

### Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Topics: Summary of building unconditional models for time Missing predictors in MLM Effects of time-invariant predictors Fixed, systematically varying,

### 9 Generalized Linear Models

9 Generalized Linear Models The Generalized Linear Model (GLM) is a model which has been built to include a wide range of different models you already know, e.g. ANOVA and multiple linear regression models

### ESP 178 Applied Research Methods. 2/23: Quantitative Analysis

ESP 178 Applied Research Methods 2/23: Quantitative Analysis Data Preparation Data coding create codebook that defines each variable, its response scale, how it was coded Data entry for mail surveys and

### Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

Introduction to Regression Analysis Dr. Devlina Chatterjee 11 th August, 2017 What is regression analysis? Regression analysis is a statistical technique for studying linear relationships. One dependent

### Stat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010

1 Linear models Y = Xβ + ɛ with ɛ N (0, σ 2 e) or Y N (Xβ, σ 2 e) where the model matrix X contains the information on predictors and β includes all coefficients (intercept, slope(s) etc.). 1. Number of

### Analytic Methods for Applied Epidemiology: Framework and Contingency Table Analysis

Analytic Methods for Applied Epidemiology: Framework and Contingency Table Analysis 2014 Maternal and Child Health Epidemiology Training Pre-Training Webinar: Friday, May 16 2-4pm Eastern Kristin Rankin,

### Chapter 3 ANALYSIS OF RESPONSE PROFILES

Chapter 3 ANALYSIS OF RESPONSE PROFILES 78 31 Introduction In this chapter we present a method for analysing longitudinal data that imposes minimal structure or restrictions on the mean responses over

### Chapter 5: Logistic Regression-I

: Logistic Regression-I Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM [Acknowledgements to Tim Hanson and Haitao Chu] D. Bandyopadhyay

### Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Today s Topics: What happens to missing predictors Effects of time-invariant predictors Fixed vs. systematically varying vs. random effects Model building

### Practical Meta-Analysis -- Lipsey & Wilson

Overview of Meta-Analytic Data Analysis Transformations, Adjustments and Outliers The Inverse Variance Weight The Mean Effect Size and Associated Statistics Homogeneity Analysis Fixed Effects Analysis

### Multivariate Extensions of McNemar s Test

Multivariate Extensions of McNemar s Test Bernhard Klingenberg Department of Mathematics and Statistics, Williams College Williamstown, MA 01267, U.S.A. e-mail: bklingen@williams.edu and Alan Agresti Department

### Glossary for the Triola Statistics Series

Glossary for the Triola Statistics Series Absolute deviation The measure of variation equal to the sum of the deviations of each value from the mean, divided by the number of values Acceptance sampling

### The Application and Promise of Hierarchical Linear Modeling (HLM) in Studying First-Year Student Programs

The Application and Promise of Hierarchical Linear Modeling (HLM) in Studying First-Year Student Programs Chad S. Briggs, Kathie Lorentz & Eric Davis Education & Outreach University Housing Southern Illinois

### Penalized likelihood logistic regression with rare events

Penalized likelihood logistic regression with rare events Georg Heinze 1, Angelika Geroldinger 1, Rainer Puhr 2, Mariana Nold 3, Lara Lusa 4 1 Medical University of Vienna, CeMSIIS, Section for Clinical

### University of California, Berkeley

University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2009 Paper 251 Nonparametric population average models: deriving the form of approximate population

### ST3241 Categorical Data Analysis I Generalized Linear Models. Introduction and Some Examples

ST3241 Categorical Data Analysis I Generalized Linear Models Introduction and Some Examples 1 Introduction We have discussed methods for analyzing associations in two-way and three-way tables. Now we will

### A COEFFICIENT OF DETERMINATION FOR LOGISTIC REGRESSION MODELS

A COEFFICIENT OF DETEMINATION FO LOGISTIC EGESSION MODELS ENATO MICELI UNIVESITY OF TOINO After a brief presentation of the main extensions of the classical coefficient of determination ( ), a new index

### Chapter 1 Statistical Inference

Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations

### Models for Ordinal Response Data

Models for Ordinal Response Data Robin High Department of Biostatistics Center for Public Health University of Nebraska Medical Center Omaha, Nebraska Recommendations Analyze numerical data with a statistical

### Maximum Likelihood Estimation; Robust Maximum Likelihood; Missing Data with Maximum Likelihood

Maximum Likelihood Estimation; Robust Maximum Likelihood; Missing Data with Maximum Likelihood PRE 906: Structural Equation Modeling Lecture #3 February 4, 2015 PRE 906, SEM: Estimation Today s Class An

### Longitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 2017, Chicago, Illinois

Longitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 217, Chicago, Illinois Outline 1. Opportunities and challenges of panel data. a. Data requirements b. Control

### Categorical and Zero Inflated Growth Models

Categorical and Zero Inflated Growth Models Alan C. Acock* Summer, 2009 *Alan C. Acock, Department of Human Development and Family Sciences, Oregon State University, Corvallis OR 97331 (alan.acock@oregonstate.edu).

### A Re-Introduction to General Linear Models

A Re-Introduction to General Linear Models Today s Class: Big picture overview Why we are using restricted maximum likelihood within MIXED instead of least squares within GLM Linear model interpretation

### Logistic regression model for survival time analysis using time-varying coefficients

Logistic regression model for survival time analysis using time-varying coefficients Accepted in American Journal of Mathematical and Management Sciences, 2016 Kenichi SATOH ksatoh@hiroshima-u.ac.jp Research

### Generalized Linear Model under the Extended Negative Multinomial Model and Cancer Incidence

Generalized Linear Model under the Extended Negative Multinomial Model and Cancer Incidence Sunil Kumar Dhar Center for Applied Mathematics and Statistics, Department of Mathematical Sciences, New Jersey

### Correlation and simple linear regression S5

Basic medical statistics for clinical and eperimental research Correlation and simple linear regression S5 Katarzyna Jóźwiak k.jozwiak@nki.nl November 15, 2017 1/41 Introduction Eample: Brain size and

### The Common Factor Model. Measurement Methods Lecture 15 Chapter 9

The Common Factor Model Measurement Methods Lecture 15 Chapter 9 Today s Class Common Factor Model Multiple factors with a single test ML Estimation Methods New fit indices because of ML Estimation method

### Generalized linear models for binary data. A better graphical exploratory data analysis. The simple linear logistic regression model

Stat 3302 (Spring 2017) Peter F. Craigmile Simple linear logistic regression (part 1) [Dobson and Barnett, 2008, Sections 7.1 7.3] Generalized linear models for binary data Beetles dose-response example

### Logistic regression: Miscellaneous topics

Logistic regression: Miscellaneous topics April 11 Introduction We have covered two approaches to inference for GLMs: the Wald approach and the likelihood ratio approach I claimed that the likelihood ratio

### Comparing IRT with Other Models

Comparing IRT with Other Models Lecture #14 ICPSR Item Response Theory Workshop Lecture #14: 1of 45 Lecture Overview The final set of slides will describe a parallel between IRT and another commonly used

### 2. We care about proportion for categorical variable, but average for numerical one.

Probit Model 1. We apply Probit model to Bank data. The dependent variable is deny, a dummy variable equaling one if a mortgage application is denied, and equaling zero if accepted. The key regressor is

### STAT 5500/6500 Conditional Logistic Regression for Matched Pairs

STAT 5500/6500 Conditional Logistic Regression for Matched Pairs The data for the tutorial came from support.sas.com, The LOGISTIC Procedure: Conditional Logistic Regression for Matched Pairs Data :: SAS/STAT(R)

### Mixed- Model Analysis of Variance. Sohad Murrar & Markus Brauer. University of Wisconsin- Madison. Target Word Count: Actual Word Count: 2755

Mixed- Model Analysis of Variance Sohad Murrar & Markus Brauer University of Wisconsin- Madison The SAGE Encyclopedia of Educational Research, Measurement and Evaluation Target Word Count: 3000 - Actual

### Pseudo-score confidence intervals for parameters in discrete statistical models

Biometrika Advance Access published January 14, 2010 Biometrika (2009), pp. 1 8 C 2009 Biometrika Trust Printed in Great Britain doi: 10.1093/biomet/asp074 Pseudo-score confidence intervals for parameters

### 1 Introduction. 2 Example

Statistics: Multilevel modelling Richard Buxton. 2008. Introduction Multilevel modelling is an approach that can be used to handle clustered or grouped data. Suppose we are trying to discover some of the

### Exact McNemar s Test and Matching Confidence Intervals Michael P. Fay April 25,

Exact McNemar s Test and Matching Confidence Intervals Michael P. Fay April 25, 2016 1 McNemar s Original Test Consider paired binary response data. For example, suppose you have twins randomized to two

### CAMPBELL COLLABORATION

CAMPBELL COLLABORATION Random and Mixed-effects Modeling C Training Materials 1 Overview Effect-size estimates Random-effects model Mixed model C Training Materials Effect sizes Suppose we have computed

### Relating Latent Class Analysis Results to Variables not Included in the Analysis

Relating LCA Results 1 Running Head: Relating LCA Results Relating Latent Class Analysis Results to Variables not Included in the Analysis Shaunna L. Clark & Bengt Muthén University of California, Los

### Using Power Tables to Compute Statistical Power in Multilevel Experimental Designs

A peer-reviewed electronic journal. Copyright is retained by the first or sole author, who grants right of first publication to the Practical Assessment, Research & Evaluation. Permission is granted to

### Statistical Methods in Clinical Trials Categorical Data

Statistical Methods in Clinical Trials Categorical Data Types of Data quantitative Continuous Blood pressure Time to event Categorical sex qualitative Discrete No of relapses Ordered Categorical Pain level

### Regression models. Generalized linear models in R. Normal regression models are not always appropriate. Generalized linear models. Examples.

Regression models Generalized linear models in R Dr Peter K Dunn http://www.usq.edu.au Department of Mathematics and Computing University of Southern Queensland ASC, July 00 The usual linear regression

### A (Brief) Introduction to Crossed Random Effects Models for Repeated Measures Data

A (Brief) Introduction to Crossed Random Effects Models for Repeated Measures Data Today s Class: Review of concepts in multivariate data Introduction to random intercepts Crossed random effects models

### Ordinary Least Squares Regression Explained: Vartanian

Ordinary Least Squares Regression Eplained: Vartanian When to Use Ordinary Least Squares Regression Analysis A. Variable types. When you have an interval/ratio scale dependent variable.. When your independent

### arxiv: v1 [stat.me] 15 May 2011

Working Paper Propensity Score Analysis with Matching Weights Liang Li, Ph.D. arxiv:1105.2917v1 [stat.me] 15 May 2011 Associate Staff of Biostatistics Department of Quantitative Health Sciences, Cleveland

### The Multilevel Logit Model for Binary Dependent Variables Marco R. Steenbergen

The Multilevel Logit Model for Binary Dependent Variables Marco R. Steenbergen January 23-24, 2012 Page 1 Part I The Single Level Logit Model: A Review Motivating Example Imagine we are interested in voting