More about linear mixed models

Size: px
Start display at page:

Download "More about linear mixed models"

Transcription

1 Faculty of Health Sciences Contents More about linear mixed models Analysis of repeated measurements, NFA 2016 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen Linear mixed models in general Case: a cross-over study Model asssumptions and diagnostics Missing data and methods for handling it Suggested reading FLW (2011) chapters 10, 18, / 68 Outline Specification of linear mixed models (LMMs) Mixed refers to a mixture of fixed and random effects. Linear mixed models in general Case: a cross-over study Model assumptions and how to check them Missing data Appendix: Other methods for handling missing data Systematic variation Effect of covariates (time, treatment, gender, age, etc.), described by population parameters. Random variation: Random effects described by subject specific parameters. Serial correlation Measurement error Note: Interactions between systematic and random effects are always random. 3 / 68 4 / 68

2 The covariance/correlation structure Linear model specification of LMMs Random effects: Model repeated outcomes on subjects/clusters as a linear function Serial correlation (correlated error terms): Y ij = X ij β + Z ij b i + ε ij of systematic effect(s) of covariate(s) X ij. random effect(s) of covariates Z ij. with possibly correlated residual error terms ε ij s. Measurement error (uncorrelated error terms): We see this if the same outcome is measured twice at the same time (e.g. duplicates or triplicates). We assume that the b i s and ε i s are independent and normally distributed with mean zero and covariance given by: The G-matrix: Var(b i ) = G. The R-matrix: Var(ε i ) = R. 5 / 68 6 / 68 SAS: PROC MIXED Nonidentifiability model: describes the mean value structure (i.e. covariates / fixed effects) random: describes the random effects repeated: describes the residual covariance. Very flexible modeling framework! Example: It is possible to model, e.g. longitudinal series of measurements (2 level model)... with repeated series on each subject and with different treatments along the way (3 level model)... where subjects belong to different clusters (4 level model). Warning: Make sure you understand your model! Modeling random effects together with a residual error covariance may result in unidentifiable covariance parameters, i.e. nonconvergence or strange results, unless done with some care. Example: Compound symmetry can be specified as either of: RANDOM id; RANDOM intercept / SUBJECT=id; REPEATED time / TYPE=CS SUBJECT=id; in case two of these lines are included in the same program, results may be incomprehensible. 7 / 68 8 / 68

3 Advice for making mixed model analyses First make a plan (perhaps with help from a statistician): Think about what results you want to obtain and how you want to present them. Think about an appropriate model (and its assumptions). Then do the analysis (perhaps with help from a statistician): Try to be calm and systematic. Don t just put data in your favourite statistical program and run (away). Make as many graphical displays as you can think of. When interpreting your output, check whether the results look plausible - and whether they in fact answer the relevant scientific questions. Outline Linear mixed models in general Case: a cross-over study Model assumptions and how to check them Missing data Appendix: Other methods for handling missing data Most likely you will have to do this a few times over again: Something unexpected? Think about it and revise the plan. 9 / / 68 Example: Cross-over study of headache Picture ignoring period effect and pairing Patients with chronic headache are randomized into two groups: Both groups receive LNMMA and placebo, on two different days, with a suitable wash-out period in-between Group 1 was treated first with placebo (period 1), and then with LNMMA (period 2) Group 2 was treated first with LNMMA (period 1), and then with placebo (period 2) Pain was measured subjectively on a VAS-scale (small is good), at baseline and at 30, 60, 90 and 120 minutes after treatment. Reference: Ashina et al (1999), Lancet, pp / / 68

4 Model building for complicated cross over study Analysis based on summary statistics Fixed effects: time, treat*time, period (e.g. due to weather conditions) possibly a carry-over effect: treat*(time)*period? note the baseline measurement at time=0 Suggested covariance structure: We expect observations from the same period (and patient) to be more strongly correlated when they are close in time. We also expect correlation between observations from different periods, but this is most likely not as strong. RANDOM patient; REPEATED time / TYPE=SP(POW)(time) SUBJECT=patient*period; Note the non-equidistant time points. Unfortunately, we do not have access to the full data with the two series of repeated measurements over time. The analysis was based on summary statistics: Difference between average follow-up measurements and baseline, Y 30 + Y 60 + Y Y 0 for each period, we have paired data. negativ values are good (pain decreases). 13 / / 68 Observations vs. period and treatment Averages Sequence groups: Group 1 (P+A), Group 2 (A+P) Period 1 Period 2 Mean (SD) of LNMMA (41.0) (65.0) Mean (SD) of Placebo (41.5) 3.3 (17.8) Pairs are obviously correlated. 15 / 68 Seemingly much larger treatment effect in period 2 -?? 16 / 68

5 PROC MIXED: test of carry-over effect Interpretation of the carry-over effect PROC MIXED DATA=ashina; CLASS patient treat period; MODEL effect=treat period treat*period / SOLUTION CL DDFM=KR; RANDOM intercept / SUBJECT=patient; RUN; Effect treat period Estimate StdError DF t Value Pr > t Intercept treat lmmma treat placebo period period treat*period lmmma treat*period lmmma treat*period placebo treat*period placebo Estimated effect of LNMMA vs placebo is in period 2 (reference) and =-8.2 in period 1. The carry-over effect is usually interpreted as an different effect of placebo when given after the active treatment. This could be explained as a psychological effect, in the sense that subjects expect something better (namely what they experienced in the previous period). Estimate 62.3 (95%CI : 25.4to149.9), i.e. nonsignificant. The carry-over effect (placebo following active) has a positive value, corresponding to a higher intensity of the headache. 17 / / 68 Traditional approach Conclusion on treatment effect First test the hypothesis H 0 : no carry-over effect: Two-sample T-test group 1 vs group 2. Outcome: sum of "effect" in the two periods. Expected mean difference: carry over effect. Main effects of treatment and period cancels out. If this is accepted, test H 1 : no treatment effect: Two-sample T-test, group 1 vs group 2. Outcome: difference of "effect", period 1 minus 2. Expected mean difference: two times treatment effect. Main effects of period cancels out. Same results as the mixed model analysis when there are no missing data. Method Estimate (95% CI) P-value Only period (-53.99,37.59) 0.71 Only period ( ,-11.55) 0.02 Both periods (no carry-over) (-68.70,-9.97) 0.01 Comments on the design: Use a paired design for higher power. Power calculation: t-test of treatment effect. Assumption of no carry-over is argued from using a suitably long wash-out period.... not too long, though, since then we would loose some of the power (correlation) from the pairing. 19 / / 68

6 Outline Case: Calcium supplements (from lecture 2) Linear mixed models in general Case: a cross-over study Model assumptions and how to check them Missing data Appendix: Other methods for handling missing data Bone mineral density in (g/cm 3 ) vs years since baseline 21 / / 68 Suggested models for BMD data How can we choose between models? Response profiles: Unstructured mean and unstructured covariance (only for balanced data, not actual time of observation) Think about it or otherwise: By direct visual assessment of fit Spaghettiplots and predicted profiles. Linear growth over time with either: Compound symmetry covariance pattern (same as random intercept for each girl) Autoregressive covariance pattern (or other similar) Random regression (i.e. random intercept and slope for each girl) By use of residual diagnostics Automatic model checks, using ods graphics More extensice model checks using output data sets By testing against more flexible alternatives Models for the mean tested by the usual F-tests in output. Covariance patterns evaluated by χ 2 -tests on 2 log L 23 / / 68

7 Assumptions in linear mixed models Observation Y ij from subject i measured at occation j described as Y ij = X ij β + Z ij b i + ε ij with covariate(s) X ij (fixed effects) and Z ij (random effects). Model assumptions are: Independence between individuals Linearity in covariates X ij s and Z ij s. Residuals ε i are multivariate normal ε i N (0, R i ) and the covariance pattern (R-matrix) is correctly specified. Random effects b i are (multivariate) normal b i N (0, G) and their (co)variance (G-matrix) is correctly specified. Random effects b i are independent of covariates X i s. Importance of assumptions Important: Independence between individuals (normally not an issue) Linearity Independence between X ij and b i Appropriateness of the covariance structure (may be circumvented with balanced complete data by using the empirical sandwich estimator, see lecture 2). Less important (especially when the sample size is large): Normality of residuals ε i Normality of random effects b i 25 / / 68 Model diagnostics in proc mixed Population predictions (fixed effects) ods graphics on; title1 random regression (model diagnostics) ; proc mixed data=calcium plots=all; class grp girl; model bmd = obstime grp*obstime / solution cl ddfm=kr outpm=fitpm outp=fitp residual vciry; random intercept obstime / type=un subject=girl g; run; Generates panels to check stability of variance and normal distribution of residuals. Creates two output data sets for further model checking. outpm: Population predictions and ordinary residuals. outp: Subject predictions and conditional residuals. Is the predicted population mean reasonable? 27 / / 68

8 Subject predictions (fixed + random effects) Goodness of fit, ordinary residuals Observed minus predicted population mean: Y ij X ij ˆβ Are the predicted subject means reasonable? 29 / / 68 Goodness of fit, conditional residuals Observed minus predicted subject mean: Y ij (X ij ˆβ + Zijˆbi ) Residuals in proc mixed Raw residuals: Observed - Predicted. No good for checking normality if variance changes with time. Pearson residuals: (Observed - Predicted) / SD. Same variance, but residuals on same subject are correlated. Studentized residuals: (Observed - Predicted) / SD. Similar to Pearson residuals (the same in large samples) but takes estimation uncertainty in predictions into account when estimating SD. Residuals on same subject are correlated. Scaled residuals: vciry-option in proc mixed. Scales residuals by the square root of their covariance-matrix. Independent and N (0, 1) if the model is correct. 31 / / 68

9 Goodness of fit, scaled residuals Additional model checks Plots of residuals against individual covariates: Do we see deviations from linearity? (next slide) title1 Ordinary residuals vs time ; proc sgplot data=fitpm; loess Y=Resid X=obstime / group=grp; refline 0; run; Is the variance similar in different groups? (not shown) Systematic deviation seen in the residual plot. title1 Scaled residuals vs groups ; proc sgplot data=fitpm; vbox ScaledResid / group=grp; run; 33 / / 68 Effect of time: Ordinary residuals (outpm) Effect of time: Conditional residuals (outp) 35 / / 68

10 Effect of time, scaled residuals (vciry + outpm) Influential observations Beware of single observations or subjects with a large influence on the estimates (mean or covariance parameters) e.g. due to: An unusual combination of covariates X i or Z i Large ordinary residuals (i.e. outliers) may occur naturally... or due to model misspecification. Loads of diagnostics are generated by proc mixed using e.g.: model bmd=obstime grp*obstime / ddfm=kr influence(iter=10); Some deviation from linearity seen in ordinary and scaled residuals, consistently for the two groups 37 / 68 Note: This takes a while to run because the model is re-fitted without each subject in turn. 38 / 68 Cooks distance Real trouble In some cases valid analysis can be more difficult or outright impossible to do. Call for statistical aid if: Your data set is very small. The (co)variance depend on the covariates. E.g. patients react more similarly to placebo than to treatment (or the other way around). One or more covariates depend on previous outcome values. E.g. dose adjustment. There are missing data. Tentative limit for "influential": 4 n = There are unmeasured confounders. 39 / / 68

11 Outline Missing data Linear mixed models in general Case: a cross-over study Model assumptions and how to check them Missing data Appendix: Other methods for handling missing data Most investigations are planned to be balanced but almost inevitable turn out to have missing values, or patients who drop-out for some reason... Just by coincidence (blood sample lost or ruined). The patient moved away (may be worrysome). The patient has recovered (worrying, i.e. carrying information). The patient is too ill to show up (very serious, i.e. carrying unretrievable information). 41 / / 68 Look-alike missing data When you have missing data The most important question is WHY. Missing data is not to be confused with: Non-existent data due to death. Censored data such as measurements above or below the limit of detection. In other case observed data may be replaced by missing values, In case of non-compliance or switch to rescue medication outcomes no longer reflect the treatment effect... Specialized statistical analyses are needed to handle these kinds of data. Investigate: Make separate spaghettiplots for completers and drop-outs. Make a table comparing the distribution of covariates and other characteristics between the drop-outs and the completers. If possible, ask the patients! Speculate: Think about what differences there might be between completers and drop-outs in terms of unmeasured outcomes and confounders. How could these affect the results of an analysis. 43 / / 68

12 Types of missingness Case: Missing data in calcium study MCAR Missing completely at random. Unrelated to the outcome and the covariates. MAR Missing at random. Missingness may depend on covariates and possible also on previously observed outcome values. NNAR Not missing at random. Missingness depends on the unobserved outcome either directly or indirectly by depending on an unmeasured confounder. Patterns of missingness Monotone Drops out and stays out. Intermittent Comes back later. 45 / / 68 Case: Missing data in calcium study Biased average curves Think about it... Likely causes of drop-out: Family moves away or too busy to participate (MCAR, unless related to unmeasured confounders). Parents learn at the visit that BMD is low and decides to withdraw because they think the girl is on placebo but needs treatment (MAR). The average curve is representative of the whole population when data is balanced and complete. When there are missing data (MAR or NMAR) it is biased. Note: Missing at random (MAR) has to be argued - it cannot be tested or otherwise verified from the data. Spaghettiplots are of course always ok. 47 / / 68

13 Example of missing mechanism MAR Low values are good (e.g. blood pressure): When the patient learns he is doing well, he might decide he no longer needs treatment. Example of missing mechanism NMAR Low values are bad (e.g. lung function): When the patient is sufficiently ill, he drops out of the labour market. Result: Average stays high, treatment effect is underestimated 49 / 68 Result: Average stays high, healthy worker effect. 50 / 68 Methods for handling missing data Likelihood inference Up-to-date methods valid under MAR-assumption: Likelihood inference (default in LMMs, e.g. proc mixed) Inverse probablility weighting (IPW) Multiple imputations Historical approaches that has been abandonned due to likely bias: Complete case analysis LOCF (or LVCF): Last observation (value) carried forward Predicted value (e.g. mean) imputation. BUT: The optimal method of handling missing data is of course to avoid it in the first place by choosing a missing-proof study design. 51 / 68 Perform the usual LMM analysis (e.g. in proc mixed where likelihood inference is default). Valid under MAR-assumption if the model is correct (all model assumptions must be fulfilled). Note: it is important that all covariates that missingness depend on are included in the model. Also it is important that observations are included - don t delete the incomplete cases! Properties: Efficient - makes optimal use of the available observations. Also applies to generalized linear mixed models for non-normal outcomes (lectures 5 and 6). Recommended whenever possible 52 / 68

14 NMAR: Not missing at random Paired T-test with missing data Tryptase level before and after operation, for 120 patients No statistical method can make up for data being NMAR. It has been suggested to perform sensitivity analyses over a range of plausible models for the missingness / unobserved data. However: All such models rely on unverifiable assumptions that can never be checked with the data. These analyses currently cannot be performed with standard statistical software. Red observations are the singletons: Reference: Garvey et al. (2010a,b) 53 / / 68 Missing values Complete case analysis What would happen if some patients were only measured once? We randomly delete 55 (23%) of the observations. time NObs N NMiss :before :after Total Note: Missing data are missing completely at random (MCAR). Paired t-test on complete pairs: The TTEST Procedure Difference: logafter - logbefore N Mean Std Dev Std Err Minimum Maximum Mean 95% CL Mean Std Dev 95% CL Std Dev DF t Value Pr > t BUT: only 70 patients have observations both before and after operation; only 140 of the 185 available observations are used. 55 / / 68

15 Two-sample t-test This uses all available observations... The GLM Procedure Dependent Variable: logtryptase Sum of Source DF Squares Mean Square F Value Pr > F Model Error Corrected Total R-Square Coeff Var Root MSE logtryptase Mean Standard Parameter Estimate Error t Value Pr > t Intercept B <.0001 time 1:before B time 2:after B... Parameter 95% Confidence Limits Intercept time 1:before time 2:after.. BUT: Correlation within pairs (about 0.94) is neglected! 57 / 68 Likelihood inference A linear mixed model uses all available observations and accounts for correlation within pairs. Number of Observations Number of Observations Read 240 Number of Observations Used 185 Number of Observations Not Used 55 Covariance Parameter Estimates Cov Parm Estimate patient Between Residual Within Solution for Fixed Effects Standard Effect time Estimate Error DF t Value Pr > t Alpha Intercept < time 1:before time 2:after Effect time Lower Upper Intercept time 1:before time 2:after.. 58 / 68 Estimated decrease after operation (on log-scale) When should we use what approach? Analyses on incomplete data compared to full data (benchmark). Method Effect N Confidence Interval P-value Mixed model ( , ) Paired T-test ( , ) Two-sample T-test ( , ) 0.21 All data ( , ) < Note: Very similar results from the mixed model and paired t-test. Paired T-test: Almost right when the correlation is strong, and only few observations are missing. Unpaired T-test: Almost right when the correlation is weak, and many observations are missing. Mixed effects model: Always right. But: This depends on what kind of missing we are dealing with. All variables and covariates that missingness depend on must be included in the model. T-tests are biased unless data is MCAR, while the mixed model can handle MAR. In case of NMAR data, nothing is rigth. 59 / / 68

16 Outline Complete case analysis Linear mixed models in general Case: a cross-over study Make an analysis including only those individuals who are observed at all available time points. Default choice for oldfashioned software (e.g. MANOVA). Valid under MCAR-assumption Model assumptions and how to check them Missing data Appendix: Other methods for handling missing data Consequences: Likely biased if there are specific reasons for the missingness. Inefficient because partial information from non-completers is lost. Not recommended 61 / / 68 Last observation carried forward (LOCF) LOCF in clinical trials If an individual has no observed value at time t, replace the missing value by the previously observed value. For drop-outs, all subsequent values will be identical. Consequences: The time effect is most likely biased. The natural variation is obscurred. Definitely not recommended The LOCF is the easiest imputation approach for missing data to be understood by the non-statisticians. However, the LOCF approach has been the target for criticisms from the statisticians for its lack of a sound statistical foundation and for its biases in either direction (i.e., it is not necessarily conservative). After the National Academies published its draft report "The prevention and treatment of missing data in clinical trials, using LOCF approach seemed to be out-dated and markedly out of step with modern statistical thinking. Reference: 63 / / 68

17 Predicted value imputation Replace the missing values with more or less qualified guesses of what they might have been, e.g. Time average imputation: Replace missing value with average over remaining subjects Model prediction imputation: Replace missing values with predicted values from regression models including previous observations and other covariates. Consequences: May be biased if the prediction model is not correct. Underestimates SEs because imputed values are treated as if they were the actual observations and because predictions are less variable than genuine observations. Not recommended 65 / 68 Multiple imputations Similar to predicted value imputation only random error terms are added to the predictions (and the model parameters). 1. This is repeated a number of times and analysis is performed on each imputed dataset. 2. Finally estimates are averaged and SEs are computed according to Rubin s rule. Properties Valid under the MAR-assumption if the imputation model is correct and includes all covariates that missingness depend on. Results are very similar to likelihood inference, but SEs tend to be slightly conservative in small samples. Recommended, but likelihood inference in LMMs is valid under 66 / 68 the same assumptions and easier to do in practice. Inverse probability weighting (IPW) Inverse probability weighting (IPW) Intuition: If patients with a certain characteristic are twice as likely to drop out than others, then those who actually remain in the study must count for two. This requires a model for drop out (usually logistic regression) including relevant covariates and previous outcomes on which missingness may depend. Model the outcome "Y ij is observed" (i.e. not dropping out). Save the predicted probabilities ˆπ ij from this model (the probability that the next outcome Y ij is observed). Use the inverse probabilities ˆπ ij 1 as weights when analysing the available data using generalized estimating equations (lectures 5 and 6). Properties: Valid under MAR-assumption if both the model for the missingness and the model for the observations are correct. In case of monotone missingness, it suffices that one of the two models is correct (estimates are doubly robust). Recommended for handling monotone missing data in the population average models for non-normal outcomes (lectures 6). 67 / / 68

Faculty of Health Sciences. Correlated data. More about LMMs. Lene Theil Skovgaard. December 4, / 104

Faculty of Health Sciences. Correlated data. More about LMMs. Lene Theil Skovgaard. December 4, / 104 Faculty of Health Sciences Correlated data More about LMMs Lene Theil Skovgaard December 4, 2015 1 / 104 Further topics Model check and diagnostics Cross-over studies Paired T-tests with missing values

More information

Models for longitudinal data

Models for longitudinal data Faculty of Health Sciences Contents Models for longitudinal data Analysis of repeated measurements, NFA 016 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen

More information

Linear mixed models. Faculty of Health Sciences. Analysis of repeated measurements, 10th March Julie Lyng Forman & Lene Theil Skovgaard

Linear mixed models. Faculty of Health Sciences. Analysis of repeated measurements, 10th March Julie Lyng Forman & Lene Theil Skovgaard Faculty of Health Sciences Linear mixed models Analysis of repeated measurements, 10th March 2015 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen 1 / 80 Program

More information

Correlated data. Overview. Example: Swelling due to vaccine. Variance component models. Faculty of Health Sciences. Variance component models

Correlated data. Overview. Example: Swelling due to vaccine. Variance component models. Faculty of Health Sciences. Variance component models Faculty of Health Sciences Overview Correlated data Variance component models One-way anova with random variation The rabbit example Hierarchical models with several levels Random regression Lene Theil

More information

Faculty of Health Sciences. Correlated data. Variance component models. Lene Theil Skovgaard & Julie Lyng Forman.

Faculty of Health Sciences. Correlated data. Variance component models. Lene Theil Skovgaard & Julie Lyng Forman. Faculty of Health Sciences Correlated data Variance component models Lene Theil Skovgaard & Julie Lyng Forman November 27, 2018 1 / 84 Overview One-way anova with random variation The rabbit example Hierarchical

More information

Correlated data. Overview. Variance component models. Terminology for correlated measurements. Faculty of Health Sciences. Variance component models

Correlated data. Overview. Variance component models. Terminology for correlated measurements. Faculty of Health Sciences. Variance component models Faculty of Health Sciences Overview Correlated data Variance component models Lene Theil Skovgaard & Julie Lyng Forman November 28, 2017 One-way anova with random variation The rabbit example Hierarchical

More information

Faculty of Health Sciences. Correlated data. Variance component models. Lene Theil Skovgaard & Julie Lyng Forman.

Faculty of Health Sciences. Correlated data. Variance component models. Lene Theil Skovgaard & Julie Lyng Forman. Faculty of Health Sciences Correlated data Variance component models Lene Theil Skovgaard & Julie Lyng Forman November 28, 2017 1 / 96 Overview One-way anova with random variation The rabbit example Hierarchical

More information

Correlated data. Repeated measurements over time. Typical set-up for repeated measurements. Traditional presentation of data

Correlated data. Repeated measurements over time. Typical set-up for repeated measurements. Traditional presentation of data Faculty of Health Sciences Repeated measurements over time Correlated data NFA, May 22, 2014 Longitudinal measurements Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics University of

More information

Linear mixed models. Program. What are repeated measurements? Outline. Faculty of Health Sciences. Analysis of repeated measurements, 10th March 2015

Linear mixed models. Program. What are repeated measurements? Outline. Faculty of Health Sciences. Analysis of repeated measurements, 10th March 2015 university of copenhagen d e pa rt m e n t o f b i o s tat i s t i c s university of copenhagen d e pa rt m e n t o f b i o s tat i s t i c s Program Faculty of Health Sciences Topics: Linear mixed models

More information

Correlated data. Longitudinal data. Typical set-up for repeated measurements. Examples from literature, I. Faculty of Health Sciences

Correlated data. Longitudinal data. Typical set-up for repeated measurements. Examples from literature, I. Faculty of Health Sciences Faculty of Health Sciences Longitudinal data Correlated data Longitudinal measurements Outline Designs Models for the mean Covariance patterns Lene Theil Skovgaard November 27, 2015 Random regression Baseline

More information

Variance components and LMMs

Variance components and LMMs Faculty of Health Sciences Variance components and LMMs Analysis of repeated measurements, 4th December 2014 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen

More information

Variance components and LMMs

Variance components and LMMs Faculty of Health Sciences Topics for today Variance components and LMMs Analysis of repeated measurements, 4th December 04 Leftover from 8/: Rest of random regression example. New concepts for today:

More information

Introduction to SAS proc mixed

Introduction to SAS proc mixed Faculty of Health Sciences Introduction to SAS proc mixed Analysis of repeated measurements, 2017 Julie Forman Department of Biostatistics, University of Copenhagen Outline Data in wide and long format

More information

Correlated data. Overview. Cross-over study. Repetition. Faculty of Health Sciences. Variance component models, II. More on variance component models

Correlated data. Overview. Cross-over study. Repetition. Faculty of Health Sciences. Variance component models, II. More on variance component models Faculty of Health Sciences Overview Correlated data More on variance component models Variance component models, II Cross-over studies Non-normal data Comparing measurement devices Lene Theil Skovgaard

More information

Models for binary data

Models for binary data Faculty of Health Sciences Models for binary data Analysis of repeated measurements 2015 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen 1 / 63 Program for

More information

Analysis of variance and regression. May 13, 2008

Analysis of variance and regression. May 13, 2008 Analysis of variance and regression May 13, 2008 Repeated measurements over time Presentation of data Traditional ways of analysis Variance component model (the dogs revisited) Random regression Baseline

More information

Answer to exercise: Blood pressure lowering drugs

Answer to exercise: Blood pressure lowering drugs Answer to exercise: Blood pressure lowering drugs The data set bloodpressure.txt contains data from a cross-over trial, involving three different formulations of a drug for lowering of blood pressure:

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Today s Class (or 3): Summary of steps in building unconditional models for time What happens to missing predictors Effects of time-invariant predictors

More information

Variance component models part I

Variance component models part I Faculty of Health Sciences Variance component models part I Analysis of repeated measurements, 30th November 2012 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen

More information

Statistical Methods. Missing Data snijders/sm.htm. Tom A.B. Snijders. November, University of Oxford 1 / 23

Statistical Methods. Missing Data  snijders/sm.htm. Tom A.B. Snijders. November, University of Oxford 1 / 23 1 / 23 Statistical Methods Missing Data http://www.stats.ox.ac.uk/ snijders/sm.htm Tom A.B. Snijders University of Oxford November, 2011 2 / 23 Literature: Joseph L. Schafer and John W. Graham, Missing

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Topics: What happens to missing predictors Effects of time-invariant predictors Fixed vs. systematically varying vs. random effects Model building strategies

More information

Introduction to SAS proc mixed

Introduction to SAS proc mixed Faculty of Health Sciences Introduction to SAS proc mixed Analysis of repeated measurements, 2017 Julie Forman Department of Biostatistics, University of Copenhagen 2 / 28 Preparing data for analysis The

More information

Analysis of Incomplete Non-Normal Longitudinal Lipid Data

Analysis of Incomplete Non-Normal Longitudinal Lipid Data Analysis of Incomplete Non-Normal Longitudinal Lipid Data Jiajun Liu*, Devan V. Mehrotra, Xiaoming Li, and Kaifeng Lu 2 Merck Research Laboratories, PA/NJ 2 Forrest Laboratories, NY *jiajun_liu@merck.com

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Today s Topics: What happens to missing predictors Effects of time-invariant predictors Fixed vs. systematically varying vs. random effects Model building

More information

Multi-factor analysis of variance

Multi-factor analysis of variance Faculty of Health Sciences Outline Multi-factor analysis of variance Basic statistics for experimental researchers 2015 Two-way ANOVA and interaction Mathed samples ANOVA Random vs systematic variation

More information

Statistics for exp. medical researchers Regression and Correlation

Statistics for exp. medical researchers Regression and Correlation Faculty of Health Sciences Regression analysis Statistics for exp. medical researchers Regression and Correlation Lene Theil Skovgaard Sept. 28, 2015 Linear regression, Estimation and Testing Confidence

More information

Introduction to Crossover Trials

Introduction to Crossover Trials Introduction to Crossover Trials Stat 6500 Tutorial Project Isaac Blackhurst A crossover trial is a type of randomized control trial. It has advantages over other designed experiments because, under certain

More information

Time Invariant Predictors in Longitudinal Models

Time Invariant Predictors in Longitudinal Models Time Invariant Predictors in Longitudinal Models Longitudinal Data Analysis Workshop Section 9 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section

More information

General Linear Model (Chapter 4)

General Linear Model (Chapter 4) General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients

More information

Correlated data. Variance component models. Example: Evaluate vaccine. Traditional assumption so far. Faculty of Health Sciences

Correlated data. Variance component models. Example: Evaluate vaccine. Traditional assumption so far. Faculty of Health Sciences Faculty of Health Sciences Variance component models Definitions and motivation Correlated data Variance component models, I Lene Theil Skovgaard November 29, 2013 One-way anova with random variation The

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Topics: Summary of building unconditional models for time Missing predictors in MLM Effects of time-invariant predictors Fixed, systematically varying,

More information

Model Assumptions; Predicting Heterogeneity of Variance

Model Assumptions; Predicting Heterogeneity of Variance Model Assumptions; Predicting Heterogeneity of Variance Today s topics: Model assumptions Normality Constant variance Predicting heterogeneity of variance CLP 945: Lecture 6 1 Checking for Violations of

More information

Whether to use MMRM as primary estimand.

Whether to use MMRM as primary estimand. Whether to use MMRM as primary estimand. James Roger London School of Hygiene & Tropical Medicine, London. PSI/EFSPI European Statistical Meeting on Estimands. Stevenage, UK: 28 September 2015. 1 / 38

More information

Missing Data in Longitudinal Studies: Mixed-effects Pattern-Mixture and Selection Models

Missing Data in Longitudinal Studies: Mixed-effects Pattern-Mixture and Selection Models Missing Data in Longitudinal Studies: Mixed-effects Pattern-Mixture and Selection Models Hedeker D & Gibbons RD (1997). Application of random-effects pattern-mixture models for missing data in longitudinal

More information

Variance component models

Variance component models Faculty of Health Sciences Variance component models Analysis of repeated measurements, NFA 2016 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen Topics for

More information

Faculty of Health Sciences. Correlated data. Count variables. Lene Theil Skovgaard & Julie Lyng Forman. December 6, 2016

Faculty of Health Sciences. Correlated data. Count variables. Lene Theil Skovgaard & Julie Lyng Forman. December 6, 2016 Faculty of Health Sciences Correlated data Count variables Lene Theil Skovgaard & Julie Lyng Forman December 6, 2016 1 / 76 Modeling count outcomes Outline The Poisson distribution for counts Poisson models,

More information

An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012

An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 An Introduction to Multilevel Models PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 Today s Class Concepts in Longitudinal Modeling Between-Person vs. +Within-Person

More information

A Re-Introduction to General Linear Models (GLM)

A Re-Introduction to General Linear Models (GLM) A Re-Introduction to General Linear Models (GLM) Today s Class: You do know the GLM Estimation (where the numbers in the output come from): From least squares to restricted maximum likelihood (REML) Reviewing

More information

Introduction to Within-Person Analysis and RM ANOVA

Introduction to Within-Person Analysis and RM ANOVA Introduction to Within-Person Analysis and RM ANOVA Today s Class: From between-person to within-person ANOVAs for longitudinal data Variance model comparisons using 2 LL CLP 944: Lecture 3 1 The Two Sides

More information

Some methods for handling missing values in outcome variables. Roderick J. Little

Some methods for handling missing values in outcome variables. Roderick J. Little Some methods for handling missing values in outcome variables Roderick J. Little Missing data principles Likelihood methods Outline ML, Bayes, Multiple Imputation (MI) Robust MAR methods Predictive mean

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple

More information

ST 790, Homework 1 Spring 2017

ST 790, Homework 1 Spring 2017 ST 790, Homework 1 Spring 2017 1. In EXAMPLE 1 of Chapter 1 of the notes, it is shown at the bottom of page 22 that the complete case estimator for the mean µ of an outcome Y given in (1.18) under MNAR

More information

Analysis of variance and regression. December 4, 2007

Analysis of variance and regression. December 4, 2007 Analysis of variance and regression December 4, 2007 Variance component models Variance components One-way anova with random variation estimation interpretations Two-way anova with random variation Crossed

More information

Biostatistics Workshop Longitudinal Data Analysis. Session 4 GARRETT FITZMAURICE

Biostatistics Workshop Longitudinal Data Analysis. Session 4 GARRETT FITZMAURICE Biostatistics Workshop 2008 Longitudinal Data Analysis Session 4 GARRETT FITZMAURICE Harvard University 1 LINEAR MIXED EFFECTS MODELS Motivating Example: Influence of Menarche on Changes in Body Fat Prospective

More information

Applied Multivariate Statistical Modeling Prof. J. Maiti Department of Industrial Engineering and Management Indian Institute of Technology, Kharagpur

Applied Multivariate Statistical Modeling Prof. J. Maiti Department of Industrial Engineering and Management Indian Institute of Technology, Kharagpur Applied Multivariate Statistical Modeling Prof. J. Maiti Department of Industrial Engineering and Management Indian Institute of Technology, Kharagpur Lecture - 29 Multivariate Linear Regression- Model

More information

ECON Introductory Econometrics. Lecture 17: Experiments

ECON Introductory Econometrics. Lecture 17: Experiments ECON4150 - Introductory Econometrics Lecture 17: Experiments Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 13 Lecture outline 2 Why study experiments? The potential outcome framework.

More information

MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 2 DAVID C. HOWELL 4/1/2010

MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 2 DAVID C. HOWELL 4/1/2010 MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 2 DAVID C. HOWELL 4/1/2010 Part 1 of this document can be found at http://www.uvm.edu/~dhowell/methods/supplements/mixed Models for Repeated Measures1.pdf

More information

For more information about how to cite these materials visit

For more information about how to cite these materials visit Author(s): Kerby Shedden, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/

More information

Varians- og regressionsanalyse

Varians- og regressionsanalyse Faculty of Health Sciences Varians- og regressionsanalyse Variance component models Lene Theil Skovgaard Department of Biostatistics Variance component models Definitions and motivation One-way anova with

More information

MISSING or INCOMPLETE DATA

MISSING or INCOMPLETE DATA MISSING or INCOMPLETE DATA A (fairly) complete review of basic practice Don McLeish and Cyntha Struthers University of Waterloo Dec 5, 2015 Structure of the Workshop Session 1 Common methods for dealing

More information

Rerandomization to Balance Covariates

Rerandomization to Balance Covariates Rerandomization to Balance Covariates Kari Lock Morgan Department of Statistics Penn State University Joint work with Don Rubin University of Minnesota Biostatistics 4/27/16 The Gold Standard Randomized

More information

Correlation and Simple Linear Regression

Correlation and Simple Linear Regression Correlation and Simple Linear Regression Sasivimol Rattanasiri, Ph.D Section for Clinical Epidemiology and Biostatistics Ramathibodi Hospital, Mahidol University E-mail: sasivimol.rat@mahidol.ac.th 1 Outline

More information

2 Naïve Methods. 2.1 Complete or available case analysis

2 Naïve Methods. 2.1 Complete or available case analysis 2 Naïve Methods Before discussing methods for taking account of missingness when the missingness pattern can be assumed to be MAR in the next three chapters, we review some simple methods for handling

More information

Business Statistics. Lecture 10: Course Review

Business Statistics. Lecture 10: Course Review Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,

More information

Multivariate Survival Analysis

Multivariate Survival Analysis Multivariate Survival Analysis Previously we have assumed that either (X i, δ i ) or (X i, δ i, Z i ), i = 1,..., n, are i.i.d.. This may not always be the case. Multivariate survival data can arise in

More information

Regression: Main Ideas Setting: Quantitative outcome with a quantitative explanatory variable. Example, cont.

Regression: Main Ideas Setting: Quantitative outcome with a quantitative explanatory variable. Example, cont. TCELL 9/4/205 36-309/749 Experimental Design for Behavioral and Social Sciences Simple Regression Example Male black wheatear birds carry stones to the nest as a form of sexual display. Soler et al. wanted

More information

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall 1 Structural Nested Mean Models for Assessing Time-Varying Effect Moderation Daniel Almirall Center for Health Services Research, Durham VAMC & Dept. of Biostatistics, Duke University Medical Joint work

More information

36-309/749 Experimental Design for Behavioral and Social Sciences. Dec 1, 2015 Lecture 11: Mixed Models (HLMs)

36-309/749 Experimental Design for Behavioral and Social Sciences. Dec 1, 2015 Lecture 11: Mixed Models (HLMs) 36-309/749 Experimental Design for Behavioral and Social Sciences Dec 1, 2015 Lecture 11: Mixed Models (HLMs) Independent Errors Assumption An error is the deviation of an individual observed outcome (DV)

More information

Simple, Marginal, and Interaction Effects in General Linear Models: Part 1

Simple, Marginal, and Interaction Effects in General Linear Models: Part 1 Simple, Marginal, and Interaction Effects in General Linear Models: Part 1 PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 2: August 24, 2012 PSYC 943: Lecture 2 Today s Class Centering and

More information

Variance. Standard deviation VAR = = value. Unbiased SD = SD = 10/23/2011. Functional Connectivity Correlation and Regression.

Variance. Standard deviation VAR = = value. Unbiased SD = SD = 10/23/2011. Functional Connectivity Correlation and Regression. 10/3/011 Functional Connectivity Correlation and Regression Variance VAR = Standard deviation Standard deviation SD = Unbiased SD = 1 10/3/011 Standard error Confidence interval SE = CI = = t value for

More information

CHL 5225 H Crossover Trials. CHL 5225 H Crossover Trials

CHL 5225 H Crossover Trials. CHL 5225 H Crossover Trials CHL 55 H Crossover Trials The Two-sequence, Two-Treatment, Two-period Crossover Trial Definition A trial in which patients are randomly allocated to one of two sequences of treatments (either 1 then, or

More information

Longitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 2017, Chicago, Illinois

Longitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 2017, Chicago, Illinois Longitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 217, Chicago, Illinois Outline 1. Opportunities and challenges of panel data. a. Data requirements b. Control

More information

Checking model assumptions with regression diagnostics

Checking model assumptions with regression diagnostics @graemeleehickey www.glhickey.com graeme.hickey@liverpool.ac.uk Checking model assumptions with regression diagnostics Graeme L. Hickey University of Liverpool Conflicts of interest None Assistant Editor

More information

multilevel modeling: concepts, applications and interpretations

multilevel modeling: concepts, applications and interpretations multilevel modeling: concepts, applications and interpretations lynne c. messer 27 october 2010 warning social and reproductive / perinatal epidemiologist concepts why context matters multilevel models

More information

Longitudinal Data Analysis Using SAS Paul D. Allison, Ph.D. Upcoming Seminar: October 13-14, 2017, Boston, Massachusetts

Longitudinal Data Analysis Using SAS Paul D. Allison, Ph.D. Upcoming Seminar: October 13-14, 2017, Boston, Massachusetts Longitudinal Data Analysis Using SAS Paul D. Allison, Ph.D. Upcoming Seminar: October 13-14, 217, Boston, Massachusetts Outline 1. Opportunities and challenges of panel data. a. Data requirements b. Control

More information

36-309/749 Experimental Design for Behavioral and Social Sciences. Sep. 22, 2015 Lecture 4: Linear Regression

36-309/749 Experimental Design for Behavioral and Social Sciences. Sep. 22, 2015 Lecture 4: Linear Regression 36-309/749 Experimental Design for Behavioral and Social Sciences Sep. 22, 2015 Lecture 4: Linear Regression TCELL Simple Regression Example Male black wheatear birds carry stones to the nest as a form

More information

9 Correlation and Regression

9 Correlation and Regression 9 Correlation and Regression SW, Chapter 12. Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then retakes the

More information

E(Y ij b i ) = f(x ijβ i ), (13.1) β i = A i β + B i b i. (13.2)

E(Y ij b i ) = f(x ijβ i ), (13.1) β i = A i β + B i b i. (13.2) 1 Advanced topics 1.1 Introduction In this chapter, we conclude with brief overviews of several advanced topics. Each of these topics could realistically be the subject of an entire course! 1. Generalized

More information

Mixed-Effects Pattern-Mixture Models for Incomplete Longitudinal Data. Don Hedeker University of Illinois at Chicago

Mixed-Effects Pattern-Mixture Models for Incomplete Longitudinal Data. Don Hedeker University of Illinois at Chicago Mixed-Effects Pattern-Mixture Models for Incomplete Longitudinal Data Don Hedeker University of Illinois at Chicago This work was supported by National Institute of Mental Health Contract N44MH32056. 1

More information

Correlated data. Overview. Variance component models. Terminology for correlated measurements. Faculty of Health Sciences. Variance component models

Correlated data. Overview. Variance component models. Terminology for correlated measurements. Faculty of Health Sciences. Variance component models Faculty of Health Sciences Overview Correlated data Variance component models Lene Theil Skovgaard & Julie Lyng Forman November 29, 2016 One-way anova with random variation The rabbit example Hierarchical

More information

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept Interactions Lectures 1 & Regression Sometimes two variables appear related: > smoking and lung cancers > height and weight > years of education and income > engine size and gas mileage > GMAT scores and

More information

STATISTICS 174: APPLIED STATISTICS FINAL EXAM DECEMBER 10, 2002

STATISTICS 174: APPLIED STATISTICS FINAL EXAM DECEMBER 10, 2002 Time allowed: 3 HOURS. STATISTICS 174: APPLIED STATISTICS FINAL EXAM DECEMBER 10, 2002 This is an open book exam: all course notes and the text are allowed, and you are expected to use your own calculator.

More information

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 1: August 22, 2012

More information

INTRODUCTION TO MULTILEVEL MODELLING FOR REPEATED MEASURES DATA. Belfast 9 th June to 10 th June, 2011

INTRODUCTION TO MULTILEVEL MODELLING FOR REPEATED MEASURES DATA. Belfast 9 th June to 10 th June, 2011 INTRODUCTION TO MULTILEVEL MODELLING FOR REPEATED MEASURES DATA Belfast 9 th June to 10 th June, 2011 Dr James J Brown Southampton Statistical Sciences Research Institute (UoS) ADMIN Research Centre (IoE

More information

Acknowledgements. Outline. Marie Diener-West. ICTR Leadership / Team INTRODUCTION TO CLINICAL RESEARCH. Introduction to Linear Regression

Acknowledgements. Outline. Marie Diener-West. ICTR Leadership / Team INTRODUCTION TO CLINICAL RESEARCH. Introduction to Linear Regression INTRODUCTION TO CLINICAL RESEARCH Introduction to Linear Regression Karen Bandeen-Roche, Ph.D. July 17, 2012 Acknowledgements Marie Diener-West Rick Thompson ICTR Leadership / Team JHU Intro to Clinical

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 12: Frequentist properties of estimators (v4) Ramesh Johari ramesh.johari@stanford.edu 1 / 39 Frequentist inference 2 / 39 Thinking like a frequentist Suppose that for some

More information

Correlated data. Introduction. We expect students to... Aim of the course. Faculty of Health Sciences. NFA, May 19, 2014.

Correlated data. Introduction. We expect students to... Aim of the course. Faculty of Health Sciences. NFA, May 19, 2014. Faculty of Health Sciences Introduction Correlated data NFA, May 19, 2014 Introduction Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics University of Copenhagen The idea of the course

More information

Heteroskedasticity in Panel Data

Heteroskedasticity in Panel Data Essex Summer School in Social Science Data Analysis Panel Data Analysis for Comparative Research Heteroskedasticity in Panel Data Christopher Adolph Department of Political Science and Center for Statistics

More information

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features Yangxin Huang Department of Epidemiology and Biostatistics, COPH, USF, Tampa, FL yhuang@health.usf.edu January

More information

ECNS 561 Multiple Regression Analysis

ECNS 561 Multiple Regression Analysis ECNS 561 Multiple Regression Analysis Model with Two Independent Variables Consider the following model Crime i = β 0 + β 1 Educ i + β 2 [what else would we like to control for?] + ε i Here, we are taking

More information

Exam Applied Statistical Regression. Good Luck!

Exam Applied Statistical Regression. Good Luck! Dr. M. Dettling Summer 2011 Exam Applied Statistical Regression Approved: Tables: Note: Any written material, calculator (without communication facility). Attached. All tests have to be done at the 5%-level.

More information

with the usual assumptions about the error term. The two values of X 1 X 2 0 1

with the usual assumptions about the error term. The two values of X 1 X 2 0 1 Sample questions 1. A researcher is investigating the effects of two factors, X 1 and X 2, each at 2 levels, on a response variable Y. A balanced two-factor factorial design is used with 1 replicate. The

More information

A Sampling of IMPACT Research:

A Sampling of IMPACT Research: A Sampling of IMPACT Research: Methods for Analysis with Dropout and Identifying Optimal Treatment Regimes Marie Davidian Department of Statistics North Carolina State University http://www.stat.ncsu.edu/

More information

Heteroskedasticity in Panel Data

Heteroskedasticity in Panel Data Essex Summer School in Social Science Data Analysis Panel Data Analysis for Comparative Research Heteroskedasticity in Panel Data Christopher Adolph Department of Political Science and Center for Statistics

More information

Review of CLDP 944: Multilevel Models for Longitudinal Data

Review of CLDP 944: Multilevel Models for Longitudinal Data Review of CLDP 944: Multilevel Models for Longitudinal Data Topics: Review of general MLM concepts and terminology Model comparisons and significance testing Fixed and random effects of time Significance

More information

Bios 6648: Design & conduct of clinical research

Bios 6648: Design & conduct of clinical research Bios 6648: Design & conduct of clinical research Section 2 - Formulating the scientific and statistical design designs 2.5(b) Binary 2.5(c) Skewed baseline (a) Time-to-event (revisited) (b) Binary (revisited)

More information

Bayesian methods for missing data: part 1. Key Concepts. Nicky Best and Alexina Mason. Imperial College London

Bayesian methods for missing data: part 1. Key Concepts. Nicky Best and Alexina Mason. Imperial College London Bayesian methods for missing data: part 1 Key Concepts Nicky Best and Alexina Mason Imperial College London BAYES 2013, May 21-23, Erasmus University Rotterdam Missing Data: Part 1 BAYES2013 1 / 68 Outline

More information

Testing Indirect Effects for Lower Level Mediation Models in SAS PROC MIXED

Testing Indirect Effects for Lower Level Mediation Models in SAS PROC MIXED Testing Indirect Effects for Lower Level Mediation Models in SAS PROC MIXED Here we provide syntax for fitting the lower-level mediation model using the MIXED procedure in SAS as well as a sas macro, IndTest.sas

More information

ANOVA Longitudinal Models for the Practice Effects Data: via GLM

ANOVA Longitudinal Models for the Practice Effects Data: via GLM Psyc 943 Lecture 25 page 1 ANOVA Longitudinal Models for the Practice Effects Data: via GLM Model 1. Saturated Means Model for Session, E-only Variances Model (BP) Variances Model: NO correlation, EQUAL

More information

Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models

Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models EPSY 905: Multivariate Analysis Spring 2016 Lecture #12 April 20, 2016 EPSY 905: RM ANOVA, MANOVA, and Mixed Models

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

Machine Learning Linear Classification. Prof. Matteo Matteucci

Machine Learning Linear Classification. Prof. Matteo Matteucci Machine Learning Linear Classification Prof. Matteo Matteucci Recall from the first lecture 2 X R p Regression Y R Continuous Output X R p Y {Ω 0, Ω 1,, Ω K } Classification Discrete Output X R p Y (X)

More information

Shared Random Parameter Models for Informative Missing Data

Shared Random Parameter Models for Informative Missing Data Shared Random Parameter Models for Informative Missing Data Dean Follmann NIAID NIAID/NIH p. A General Set-up Longitudinal data (Y ij,x ij,r ij ) Y ij = outcome for person i on visit j R ij = 1 if observed

More information

review session gov 2000 gov 2000 () review session 1 / 38

review session gov 2000 gov 2000 () review session 1 / 38 review session gov 2000 gov 2000 () review session 1 / 38 Overview Random Variables and Probability Univariate Statistics Bivariate Statistics Multivariate Statistics Causal Inference gov 2000 () review

More information

Lecture Outline. Biost 518 Applied Biostatistics II. Choice of Model for Analysis. Choice of Model. Choice of Model. Lecture 10: Multiple Regression:

Lecture Outline. Biost 518 Applied Biostatistics II. Choice of Model for Analysis. Choice of Model. Choice of Model. Lecture 10: Multiple Regression: Biost 518 Applied Biostatistics II Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Lecture utline Choice of Model Alternative Models Effect of data driven selection of

More information

Statistical Distribution Assumptions of General Linear Models

Statistical Distribution Assumptions of General Linear Models Statistical Distribution Assumptions of General Linear Models Applied Multilevel Models for Cross Sectional Data Lecture 4 ICPSR Summer Workshop University of Colorado Boulder Lecture 4: Statistical Distributions

More information

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A. Linero and M. Daniels UF, UT-Austin SRC 2014, Galveston, TX 1 Background 2 Working model

More information

Lecture 7 Time-dependent Covariates in Cox Regression

Lecture 7 Time-dependent Covariates in Cox Regression Lecture 7 Time-dependent Covariates in Cox Regression So far, we ve been considering the following Cox PH model: λ(t Z) = λ 0 (t) exp(β Z) = λ 0 (t) exp( β j Z j ) where β j is the parameter for the the

More information

Introduction to Random Effects of Time and Model Estimation

Introduction to Random Effects of Time and Model Estimation Introduction to Random Effects of Time and Model Estimation Today s Class: The Big Picture Multilevel model notation Fixed vs. random effects of time Random intercept vs. random slope models How MLM =

More information

Models for Clustered Data

Models for Clustered Data Models for Clustered Data Edps/Psych/Soc 589 Carolyn J Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Spring 2019 Outline Notation NELS88 data Fixed Effects ANOVA

More information