Modelling the Covariance

Jamie Monogan
Washington University in St Louis
February 9, 2010

Objectives
By the end of this meeting, participants should be able to:
- Use orthogonal polynomials to capture higher-order time effects
- Define the structure of common covariance pattern models
- Make a choice of covariance pattern model for real data

Longitudinal Variance and Covariance
- We expect between-individual variance to be greater than within-individual variance
- This is because repeated observations on the same individual ought to be similar
- The more similar observations are, the higher their covariance
- The more similar observations are, the lower the variance of their difference
- Illustration:
\[ \mathrm{Var}(Y_{i2} - Y_{i1}) = \sigma_1^2 + \sigma_2^2 - 2\rho_{12}\sigma_1\sigma_2 \]
- If we do not account for the correlation among repeated observations, our standard errors for estimates of change over time will usually be too large
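The illustration can be checked numerically. The sketch below is not part of the original slides; the function name and the standard deviations are hypothetical, chosen only to show how the variance of a within-individual difference shrinks as the correlation grows.

```python
def var_of_difference(sd1, sd2, rho):
    """Var(Y_i2 - Y_i1) = sd1^2 + sd2^2 - 2 * rho * sd1 * sd2."""
    return sd1 ** 2 + sd2 ** 2 - 2.0 * rho * sd1 * sd2


# Hypothetical values: a standard deviation of 2 at both waves.
for rho in (0.0, 0.5, 0.9):
    print(rho, var_of_difference(2.0, 2.0, rho))
```

With rho = 0 the two variances simply add; any positive correlation subtracts from that sum, which is why ignoring it overstates the uncertainty about change.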

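Longitudinal Variance and Covariance
- In regression terms, we account for this by defining $\Omega$ within the GLS estimator:
\[ \hat{\beta}_{GLS} = [x' \Omega^{-1} x]^{-1} x' \Omega^{-1} y \]
- Typically:
\[ \Omega = \begin{pmatrix} \Sigma & O & \cdots & O \\ O & \Sigma & \cdots & O \\ \vdots & \vdots & \ddots & \vdots \\ O & O & \cdots & \Sigma \end{pmatrix} \]
- Where $Y_i = (Y_{i1}, Y_{i2}, \ldots, Y_{in})'$, $\mathrm{Cov}(Y_i) = \Sigma$, and:
\[ O = \begin{pmatrix} 0 & 0 & \cdots & 0 \\ 0 & 0 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & 0 \end{pmatrix} \]
- Note: for n waves, both $\Sigma$ and $O$ are $n \times n$ matrices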
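As a rough sketch of the block-diagonal structure, the snippet below assembles Ω from a single Σ shared by every individual. The helper name `block_diagonal` and the 2-wave Σ are assumptions for illustration, not something taken from the slides.

```python
def block_diagonal(sigma, m):
    """Return Omega: m copies of the n x n matrix sigma on the diagonal, zeros elsewhere."""
    n = len(sigma)
    omega = [[0.0] * (n * m) for _ in range(n * m)]
    for b in range(m):              # one block per individual
        for j in range(n):
            for k in range(n):
                omega[b * n + j][b * n + k] = sigma[j][k]
    return omega


# Hypothetical covariance for n = 2 waves, repeated for 3 individuals.
sigma = [[4.0, 2.0],
         [2.0, 4.0]]
omega = block_diagonal(sigma, 3)
for row in omega:
    print(row)
```

The zero blocks encode the assumption that observations from different individuals are uncorrelated.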

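Unstructured Covariance
- No constraints on $\Sigma$ other than symmetry
\[ \mathrm{Cov}(Y_i) = \begin{pmatrix} \sigma_1^2 & \sigma_{12} & \cdots & \sigma_{1n} \\ \sigma_{21} & \sigma_2^2 & \cdots & \sigma_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ \sigma_{n1} & \sigma_{n2} & \cdots & \sigma_n^2 \end{pmatrix} \]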
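Because symmetry is the only constraint, an unstructured Σ for n waves has n(n + 1)/2 free parameters (n variances plus n(n − 1)/2 covariances). The short sketch below, with a hypothetical helper name, shows how quickly that count grows.

```python
def n_unstructured_params(n):
    """Free parameters in a symmetric n x n covariance matrix."""
    return n * (n + 1) // 2


for n in (2, 4, 8):
    print(n, n_unstructured_params(n))
```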

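Compound Symmetry
- Random effects for units
- I.e., how to deal with unit effects using GLS
\[ \mathrm{Cov}(Y_i) = \begin{pmatrix} \sigma^2 & \sigma^2\rho & \sigma^2\rho & \cdots & \sigma^2\rho \\ \sigma^2\rho & \sigma^2 & \sigma^2\rho & \cdots & \sigma^2\rho \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ \sigma^2\rho & \sigma^2\rho & \sigma^2\rho & \cdots & \sigma^2 \end{pmatrix} \]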
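One way to see the link to random effects: a random intercept with variance σ_b² plus independent residual noise with variance σ_e² implies compound symmetry with σ² = σ_b² + σ_e² and ρ = σ_b² / (σ_b² + σ_e²). The sketch below builds the matrix from those two components; the function name, argument names, and numbers are assumptions for illustration.

```python
def compound_symmetry_from_components(n, var_between, var_within):
    """Compound-symmetry covariance implied by a random intercept per unit.

    var_between: variance of the unit-level random intercept (assumed name)
    var_within:  variance of the independent residual (assumed name)
    """
    sigma2 = var_between + var_within
    rho = var_between / sigma2
    cov = [[sigma2 if j == k else sigma2 * rho for k in range(n)]
           for j in range(n)]
    return cov, sigma2, rho


cov, sigma2, rho = compound_symmetry_from_components(3, 3.0, 1.0)
for row in cov:
    print(row)
print("sigma2 =", sigma2, "rho =", rho)
```

Every off-diagonal entry equals the between-unit variance, so the whole matrix is governed by two parameters regardless of n.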

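Toeplitz
- Cannot be directly estimated in R
- For n waves, AR(n − 1) is equivalent
\[ \mathrm{Cov}(Y_i) = \begin{pmatrix} \sigma^2 & \sigma_1 & \sigma_2 & \cdots & \sigma_{n-1} \\ \sigma_1 & \sigma^2 & \sigma_1 & \cdots & \sigma_{n-2} \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ \sigma_{n-1} & \sigma_{n-2} & \sigma_{n-3} & \cdots & \sigma^2 \end{pmatrix} \]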
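In a Toeplitz matrix the covariance depends only on the lag |j − k| between waves, so the whole matrix is determined by its first row. A minimal sketch, with a hypothetical helper name and made-up values:

```python
def toeplitz_cov(first_row):
    """Build a Toeplitz covariance: entry (j, k) is first_row[|j - k|].

    first_row[0] is the common variance sigma^2; first_row[lag] is the
    covariance sigma_lag shared by all pairs of waves that far apart.
    """
    n = len(first_row)
    return [[first_row[abs(j - k)] for k in range(n)] for j in range(n)]


# Hypothetical values: sigma^2 = 4, sigma_1 = 2, sigma_2 = 1.
for row in toeplitz_cov([4.0, 2.0, 1.0]):
    print(row)
```

For n waves this uses n parameters, compared with n(n + 1)/2 for the unstructured case.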

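First-Order Autoregressive
- Shorthand: AR(1)
- A special case of Toeplitz that reduces to two parameters
- Assumption: $e_{ij} = \rho e_{ij-1} + w_{ij}$, where $w_{ij}$ is iid normal
\[ \mathrm{Cov}(Y_i) = \begin{pmatrix} \sigma^2 & \sigma^2\rho & \sigma^2\rho^2 & \cdots & \sigma^2\rho^{n-1} \\ \sigma^2\rho & \sigma^2 & \sigma^2\rho & \cdots & \sigma^2\rho^{n-2} \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ \sigma^2\rho^{n-1} & \sigma^2\rho^{n-2} & \sigma^2\rho^{n-3} & \cdots & \sigma^2 \end{pmatrix} \]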
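The AR(1) pattern can be sketched as a function of just σ² and ρ: the covariance between waves j and k is σ²ρ^|j − k|. The helper name and the values below are illustrative assumptions.

```python
def ar1_cov(n, sigma2, rho):
    """AR(1) covariance for n equally spaced waves: sigma2 * rho ** |j - k|."""
    return [[sigma2 * rho ** abs(j - k) for k in range(n)] for j in range(n)]


# Hypothetical values: sigma^2 = 4, rho = 0.5, four waves.
for row in ar1_cov(4, 4.0, 0.5):
    print(row)
```

The correlation decays geometrically with the lag, so waves far apart are assumed to be nearly uncorrelated even when adjacent waves are strongly related.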

Banded
- Simplification of Toeplitz that imposes zero covariance beyond a certain order
- Not easily estimated in R
- Toeplitz (2):
\[ \mathrm{Cov}(Y_i) = \begin{pmatrix} \sigma^2 & \sigma^2\rho_1 & 0 & \cdots & 0 \\ \sigma^2\rho_1 & \sigma^2 & \sigma^2\rho_1 & \cdots & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & 0 & \cdots & \sigma^2 \end{pmatrix} \]
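A banded structure can be sketched by keeping a few lag correlations and setting everything beyond them to zero. The helper below is hypothetical; passing a single correlation reproduces the Toeplitz (2) pattern shown on the slide.

```python
def banded_toeplitz(n, sigma2, rhos):
    """Banded Toeplitz covariance.

    rhos[k - 1] is the correlation at lag k; lags beyond len(rhos)
    are given zero covariance.
    """
    def cov_at(lag):
        if lag == 0:
            return sigma2
        if lag <= len(rhos):
            return sigma2 * rhos[lag - 1]
        return 0.0

    return [[cov_at(abs(j - k)) for k in range(n)] for j in range(n)]


# Hypothetical values: sigma^2 = 2, rho_1 = 0.5, four waves.
for row in banded_toeplitz(4, 2.0, [0.5]):
    print(row)
```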

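Exponential
- Also called continuous autoregressive
- A better choice for irregularly-spaced measurement intervals
\[ \mathrm{Cov}(Y_i) = \begin{pmatrix} \sigma^2 & \sigma^2\rho^{|t_1 - t_2|} & \sigma^2\rho^{|t_1 - t_3|} & \cdots & \sigma^2\rho^{|t_1 - t_n|} \\ \sigma^2\rho^{|t_2 - t_1|} & \sigma^2 & \sigma^2\rho^{|t_2 - t_3|} & \cdots & \sigma^2\rho^{|t_2 - t_n|} \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ \sigma^2\rho^{|t_n - t_1|} & \sigma^2\rho^{|t_n - t_2|} & \sigma^2\rho^{|t_n - t_3|} & \cdots & \sigma^2 \end{pmatrix} \]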
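The exponential pattern replaces the integer lag with the actual time elapsed between measurements. A minimal sketch, assuming a hypothetical helper name and made-up measurement times:

```python
def exponential_cov(times, sigma2, rho):
    """Exponential (continuous AR) covariance: sigma2 * rho ** |t_j - t_k|."""
    return [[sigma2 * rho ** abs(tj - tk) for tk in times] for tj in times]


# Hypothetical irregular measurement times (e.g. months 0, 1 and 3).
times = [0.0, 1.0, 3.0]
for row in exponential_cov(times, 1.0, 0.5):
    print(row)
```

When the times are equally spaced one unit apart, this reduces to the AR(1) pattern; with unequal gaps, waves further apart in time get a smaller covariance than waves that merely happen to be adjacent.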

Estimation in R: corClasses
- corSymm: general correlation matrix, with no additional structure
- corCompSymm: compound symmetry structure (econ random effects)
- corAR1: autoregressive process of order 1
- corARMA: autoregressive moving average process
- corCAR1: continuous autoregressive process (exponential)

Choosing a Covariance Pattern Model
- Covariance of repeated observations can depend on the specification of the mean model (the case for response profiles)
- Likelihood Ratio Test:
  - $H_0$: r constraints are true
  - $G^2 = 2(\hat{l}_f - \hat{l}_s) \sim \chi^2(r)$ for r restrictions, where $\hat{l}_f$ and $\hat{l}_s$ are the maximized log-likelihoods of the full and the simpler (constrained) model
  - For covariance testing: use REML!
  - Not ideal if imposing many zeros
- AIC: $-2\hat{l} + 2c$
  - For $\hat{l}$ the maximized REML log-likelihood and c the number of covariance parameters
  - The lowest value is the best fit, contingent on a parsimony penalty
- BIC (Schwarz's Criterion): $-2\hat{l} + \log(N^*)\,c$
  - For $\hat{l}$ the maximized REML log-likelihood, $N^* = N - p$ (for p the length of $\beta$), and c the number of covariance parameters
  - The lowest value is the best fit, contingent on a larger parsimony penalty
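The three criteria are simple functions of the maximized REML log-likelihoods. The sketch below uses made-up log-likelihoods and sample sizes, and the function and argument names are assumptions; in practice the log-likelihoods would come from the fitted models.

```python
import math


def lrt_statistic(loglik_full, loglik_simple):
    """G^2 = 2 * (l_f - l_s), referred to a chi-square with r degrees of freedom."""
    return 2.0 * (loglik_full - loglik_simple)


def aic(loglik, c):
    """AIC = -2 * l + 2 * c, with c the number of covariance parameters."""
    return -2.0 * loglik + 2.0 * c


def bic(loglik, c, n_total, p):
    """BIC = -2 * l + log(N*) * c, with N* = N - p as on the slide."""
    return -2.0 * loglik + math.log(n_total - p) * c


# Hypothetical fits with 4 waves: unstructured (c = 10) versus AR(1) (c = 2).
ll_un, c_un = -1200.0, 10
ll_ar1, c_ar1 = -1210.0, 2
n_total, p = 400, 8  # made-up N and length of beta

print("G^2 :", lrt_statistic(ll_un, ll_ar1), "with r =", c_un - c_ar1)
print("AIC :", aic(ll_un, c_un), aic(ll_ar1, c_ar1))
print("BIC :", bic(ll_un, c_un, n_total, p), bic(ll_ar1, c_ar1, n_total, p))
```

With these invented numbers AIC favours the unstructured model while BIC favours AR(1), which illustrates the heavier parsimony penalty that BIC applies.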

For Next Time
- Read FLW chapter 8
- With the Lead-Exposed Children data from Feb 2:
  - Run a response profile model
  - Do so with four covariance structures: unstructured, exponential, Toeplitz, and AR(1) (Don't report the results)
  - Report the AIC for the four models and explain which covariance structure you would use
  - Regardless of the AIC, would you believe Toeplitz or AR(1) could be appropriate for modeling the covariance of these data?