Postgraduate course: Anova and Repeated measurements Day 2 (part 2) Niels Trolle Andersen, Dept. of Biostatistics, Aarhus University, June 2009
|
|
- Collin Gervase Manning
- 5 years ago
- Views:
Transcription
1 Postgraduate course in ANOVA and Repeated Measurements Day (part ) Repeated Measurements Niels Trolle Andersen Dept. of Biostatistics, Aarhus University CVP (mean and sd) Sd w (within) version 1 Sd w (within) version The mean curve and the sd is the same for the two versions The with-in subject variation is the relevant variation analyzing changes over time..so How can we estimate the with-in subject variation and between subject variation? sd total = (sd w + sd b ) sd b (between subjects) (relevant for comparison of group levels) Univariate Repeated Measurements ANOVA -using the anova command Notation (comparison of k groups, each subject measured p times ): Data y ijt where i=1,.,k (group i) j=1,., n i (n i sample size in group i) t=1,..,p ( timepoint t) Assumptions: Data between subjects (within and between groups) are independent and y ijt = α ij + τ it + ε ijt where (τ i1,.., τ ip ) is the mean curve for group i. α ij is the random the level for subject j in group i ε ijt is random variation (with in subject measurement error ) 3 Assumptions: 1. The ε ijt s are independent and normal distributed with mean 0 and standard deviation σ w ( with in subject ). The α ij s are independent with mean 0 and standard deviation σ B (between subject) 3. The α ij s are normal distributed Remark: Only assumption 1 is required in comparisons of changes (changes in two group or no changes at all), while all three assumptions are required in comparisons of levels in the groups. 4
2 5 6 Example (evf): If all three assumptions are fulfilled then : The variation of the data can be described by normal distributions with mean µ ijt and standard deviation σ T where µ ijt = τ it and σ T = σ B + σ W (independent of i,j,t) EVF measured 6 times, two groups. When we used the Manova command and our measurements are given in 6 variables : evf1, evf,..,evf6. (no of rows= number of subject) The use of anova requires that the data are organized in another way: One variable containing all the evf measurements and three variable describing which subject, group and timepoint the measurement reefers to. The correlation between any two different measurements on the same subject is σ B / σ T If id is a variable which identify the subjects (the rows) then The stata command: reshape long evf,i(id) j(time) will produce a data set (evf data) with two new variables evf and time and keeping e.g. the group variable (no of rows= 6*number of subject). (reshape wide split the data into several rows) reshape long evf,i(id) j(time) (note: j = ) Data wide -> long 7 Plotting the individual curves (in Stata): Plotting the mean curves (in Stata): generate gw = 100*group + time Number of obs. 30 -> 180 Number of variables 8 -> 4 sort group id time scatter evf time, connect(l) by(group) egen groupmean = mean(evf), by(gw) sort group time id Twoway /// j variable (6 values) -> time (connected groupmean time if group==1,symbol(oh)) xij variables: evf1 evf... evf6 -> evf (connected groupmean time if group==,symbol(o)), legend(label(1 "CPB") label( "Sham")) (reshape wide can produce dataset example 5 from example 6.) If we have the data in long format i.e. one variable containing all the measurements then Stata can be used to produce figures with individual and/or mean curves. evf time Graphs by group groupmean time CPB Sham
3 We want to test the hypothesis: H : Equal changes over time for the two groups i.e. two parallel curves. This test can be perfformed by a 3-way ANOVA with id (subject identification), time, group and group*time (interaction) in the model. 9 The Univariate Repeated Measurements Anova: set matsize 800, permanently Number of obs = 180 R-squared = Root MSE = Adj R-squared = Source Partial SS df MS F Prob > F Model group idgroup time time*group Residual Total The p value for testing H : Equal changes over time for the two groups i.e. two parallel curves Maybe necessary (between 10 and 800) 10 Output (continued) Between-subjects error term: idgroup Levels: 30 (8 df) Lowest b.s.e. variable: id Covariance pooled over: group (for repeated variable) 11 How can we use of the four/three p - values: Source df F Regular H-F G-G Box 1 Huynh-Feldt epsilon = Greenhouse-Geisser epsilon = Box's conservative epsilon = time time*group Residual 140 Source df F Regular H-F G-G Box time time*group Residual 140 Some corrections of the p-value have been proposed when the assumptions (about equal sd and correlation) aren t fulfilled. In the regular column the p - values from first part of the output is stated (the uncorrected p-value); the three others will normally be larger than this. The following has been proposed: If the regular/uncorrected p value is not significant (>0.05) then stop and accept (fail to reject) the hypothesis else If the G-G p value is significant (<0.05) then stop and reject the hypothesis else If the Box p value is significant (<0.05) then stop and reject the hypothesis else stop and accept (fail to reject) the hypothesis. Kirk (198)
4 If the hypothesis H: Time + group is rejected i.e. the changes over time depend on the groups we can test (for each group): H: no changes over time. anova evf id time if group==1, repeated(time) Number of obs = 90 R-squared = = Root MSE = Adj R-squared 13 Between-subjects error term: id Levels: 15 (14 df) Lowest b.s.e. variable: id Huynh-Feldt epsilon = Greenhouse-Geisser epsilon = Box's conservative epsilon = Source Partial SS df MS F Prob > F Model id time Residual Source df F Regular H-F G-G Box time Residual 70 Upper bound for the p - value anova evf id time if group==, repeated(time) anova evf id time if group==, repeated(time) Number of obs = 90 R-squared = Root MSE = Adj R-squared = Source Partial SS df MS F Prob > F Model id time Residual Total Between-subjects error term: id Lowest b.s.e. variable: id Levels: 15 (14 df) Source df Huynh-Feldt epsilon = Greenhouse-Geisser epsilon = Box's conservative epsilon = F Regular H-F G-G Box time Residual 70 Lower bound for the p - value Conclusion: We found a significant differerence between the groups with respect to changes over time p<0.004) We found a statistical significant changes over time in the CPB-group (p<0.006) but not in the Sham-group (p>0.19) 16
5 17 18 If we want to estimate the with-in and the between subject variation one can use the xtmixed command: We have four variables: Evf group id and time: xi: xtmixed evf i.time*i.group _all: R.id ///,nofetable noheader nogroup nostderr nolrtest We have sd W = sd B = and can calculate Part of the output: Random-effects Parameters Estimate Std. Err. [95% Conf. Interval] _all: Identity sd(r.id) estimate of the between subject variation sd(residual) estimate of the with-in subject variation s w = s B = sd T = sd B + sd W = s T = The (estimated) correlation between two measurements on the same subject s B / s T = 0.53 We can look at each group separately: 19 We can calculate estimates for both groups: 0 xi: xtmixed evf i.time _all: R.id if group==1 ///,nofetable noheader nogroup nostderr nolrtest Random-effects Parameters Estimate Std. Err. [95% Conf. Interval] _all: Identity grp 1 s W s B s T s W s B s T Correlation sd(r.id) sd(residual) xi: xtmixed evf i.time _all: R.id if group== ///,nofetable noheader nogroup nostderr nolrtest Random-effects Parameters Estimate Std. Err. [95% Conf. Interval] _all: Identity sd(r.id) sd(residual) We can see that the estimates for the between subject variation (s B ) are almost equal but the with - in subject variation (s W ) are different and hence also the total variation and the correlation. Remarks: The correlations are expected to be positive (why?), but in special cases one might get negative correlations. (weight of mice with limit amount of food and.) We can compare the estimates above with the standard deviations and correlation calculated from the 6 variables evf1, evf,.., evf6.
6 In order to check part of assumption 1 ( e ijt ) one can calculate the residuals and the predicted values and make the figures as in day 1: predict evf_pre if e(sample), xb predict evf_res if e(sample), resid scatter evf_res evf_pre, saving(q1,replace) qnorm evf_res, title ("residual probability-plot") saving(q,replace) graph combine q1.gph q.gph Linear prediction residual probability-plot Inverse Normal Remarks: Is the probability plots nice? Does the variation in the residuals depend on the predicted values? 1 or we can make the figures for the groups separately: scatter evf_res evf_pre if group==1, saving(q1,replace) qnorm evf_res if group==1, title ("residual probability-plot") saving(q,replace) scatter evf_res evf_pre if group==, saving(q3,replace) qnorm evf_res if group==, title ("residual probability-plot") saving(q4,replace) graph combine q1.gph q.gph q3.gph q4.gph Linear prediction Linear prediction residual probability-plot Inverse Normal residual probability-plot Inverse Normal Remarks: We can see that the variation in the residuals is larger in group 1 compared to group. The probability plots are nice. Does the variation in the residuals depends on the predicted values? 3 anova dist sex/idsex time sex*time, repeated(time) 4 groupmean time Boys Girls Number of obs = 108 R-squared = Root MSE = Adj R-squared = Source Partial SS df MS F Prob > F Model sex idsex time sex*time Residual Total
7 Between-subjects error term: idsex Lowest b.s.e. variable: id Levels: 7 (5 df) Covariance pooled over: sex (for repeated variable) Huynh-Feldt epsilon = *Huynh-Feldt epsilon reset to Greenhouse-Geisser epsilon = Box's conservative epsilon = Source df F Regular H-F G-G Box time sex*time Residual 75 Conclusion: We found no significant differerence between the groups (sex) with respect to changes over time p>.078) 5 Number of obs = 108 R-squared = Root MSE = Adj R-squared = Source Partial SS df MS F Prob > F Model sex idsex time sex*time Residual Total If we accept H we can test whether the two mean curves coincides. It is exactly the same test as day i.e. equal to a t-test on the average of the 4 measurements of distances. All three assumptions should be fulfilled. 6 Between-subjects error term: idsex Lowest b.s.e. variable: id Levels: 7 (5 df) Covariance pooled over: sex (for repeated variable) 7 Remarks: We have now more than one way to analyze the data. 8 Huynh-Feldt epsilon = *Huynh-Feldt epsilon reset to Greenhouse-Geisser epsilon = Box's conservative epsilon = Which one (if any) shall we choose? How can describe the analysis? How can we describe the results? Source df F Regular H-F G-G Box time sex*time Residual 75 If we accept H (parallel curves) we can test H4 (no changes over time) for both groups in one test. If we perform a test for each of the groups we can have to different answers or we can accept H4 for both groups separately due to low power. First of all we need to check the assumptions (Day 3). Depending of what we can assume we can try to answer the questions (Day 4).
Postgraduate course: Anova and Repeated measurements Day 2 (part 2) Mogens Erlandsen, Department of Biostatistics, Aarhus University, November 2010
30 CVP (mean and sd) Postgraduate course in ANOVA and Repeated Measurements Day Repeated measurements (part ) Mogens Erlandsen Deptartment of Biostatistics Aarhus University 5 0 15 10 0 1 3 4 5 6 7 8 9
More informationPostgraduate course: Anova and Repeated measurements Day 4 (part 2 )
Postgraduate course: Anova Repeated measurements Day (part ) Postgraduate course in ANOVA Repeated Measurements Day (part ) Summarizing homework exercises. Nielrolle Andersen Dept. of Biostatistics, Aarhus
More informationAnalysis of repeated measurements (KLMED8008)
Analysis of repeated measurements (KLMED8008) Eirik Skogvoll, MD PhD Professor and Consultant Institute of Circulation and Medical Imaging Dept. of Anaesthesiology and Emergency Medicine 1 Day 2 Practical
More informationSOCY5601 Handout 8, Fall DETECTING CURVILINEARITY (continued) CONDITIONAL EFFECTS PLOTS
SOCY5601 DETECTING CURVILINEARITY (continued) CONDITIONAL EFFECTS PLOTS More on use of X 2 terms to detect curvilinearity: As we have said, a quick way to detect curvilinearity in the relationship between
More informationLONGITUDINAL DATA ANALYSIS Homework I, 2005 SOLUTION. A = ( 2) = 36; B = ( 4) = 94. Therefore A B = 36 ( 94) = 3384.
LONGITUDINAL DATA ANALYSIS Homework I, 2005 SOLUTION 1. Suppose A and B are both 2 2 matrices with A = ( 6 3 2 5 ) ( 4 10, B = 7 6 (a) Verify that A B = AB. ) A = 6 5 3 ( 2) = 36; B = ( 4) 6 10 7 = 94.
More informationMonday 7 th Febraury 2005
Monday 7 th Febraury 2 Analysis of Pigs data Data: Body weights of 48 pigs at 9 successive follow-up visits. This is an equally spaced data. It is always a good habit to reshape the data, so we can easily
More informationANOVA approaches to Repeated Measures. repeated measures MANOVA (chapter 3)
ANOVA approaches to Repeated Measures univariate repeated-measures ANOVA (chapter 2) repeated measures MANOVA (chapter 3) Assumptions Interval measurement and normally distributed errors (homogeneous across
More information****Lab 4, Feb 4: EDA and OLS and WLS
****Lab 4, Feb 4: EDA and OLS and WLS ------- log: C:\Documents and Settings\Default\Desktop\LDA\Data\cows_Lab4.log log type: text opened on: 4 Feb 2004, 09:26:19. use use "Z:\LDA\DataLDA\cowsP.dta", clear.
More informationEmpirical Application of Simple Regression (Chapter 2)
Empirical Application of Simple Regression (Chapter 2) 1. The data file is House Data, which can be downloaded from my webpage. 2. Use stata menu File Import Excel Spreadsheet to read the data. Don t forget
More informationANOVA in SPSS. Hugo Quené. opleiding Taalwetenschap Universiteit Utrecht Trans 10, 3512 JK Utrecht.
ANOVA in SPSS Hugo Quené hugo.quene@let.uu.nl opleiding Taalwetenschap Universiteit Utrecht Trans 10, 3512 JK Utrecht 7 Oct 2005 1 introduction In this example I ll use fictitious data, taken from http://www.ruf.rice.edu/~mickey/psyc339/notes/rmanova.html.
More informationStatistical Modelling in Stata 5: Linear Models
Statistical Modelling in Stata 5: Linear Models Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester 07/11/2017 Structure This Week What is a linear model? How good is my model? Does
More informationTesting methodology. It often the case that we try to determine the form of the model on the basis of data
Testing methodology It often the case that we try to determine the form of the model on the basis of data The simplest case: we try to determine the set of explanatory variables in the model Testing for
More informationSTATISTICS 110/201 PRACTICE FINAL EXAM
STATISTICS 110/201 PRACTICE FINAL EXAM Questions 1 to 5: There is a downloadable Stata package that produces sequential sums of squares for regression. In other words, the SS is built up as each variable
More informationCorrelation and Simple Linear Regression
Correlation and Simple Linear Regression Sasivimol Rattanasiri, Ph.D Section for Clinical Epidemiology and Biostatistics Ramathibodi Hospital, Mahidol University E-mail: sasivimol.rat@mahidol.ac.th 1 Outline
More informationGeneral Linear Model (Chapter 4)
General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients
More informationProblem Set #3-Key. wage Coef. Std. Err. t P> t [95% Conf. Interval]
Problem Set #3-Key Sonoma State University Economics 317- Introduction to Econometrics Dr. Cuellar 1. Use the data set Wage1.dta to answer the following questions. a. For the regression model Wage i =
More informationBIOSTATS 640 Spring 2018 Unit 2. Regression and Correlation (Part 1 of 2) STATA Users
Unit Regression and Correlation 1 of - Practice Problems Solutions Stata Users 1. In this exercise, you will gain some practice doing a simple linear regression using a Stata data set called week0.dta.
More informationLecture 3: Multiple Regression. Prof. Sharyn O Halloran Sustainable Development U9611 Econometrics II
Lecture 3: Multiple Regression Prof. Sharyn O Halloran Sustainable Development Econometrics II Outline Basics of Multiple Regression Dummy Variables Interactive terms Curvilinear models Review Strategies
More informationANOVA Longitudinal Models for the Practice Effects Data: via GLM
Psyc 943 Lecture 25 page 1 ANOVA Longitudinal Models for the Practice Effects Data: via GLM Model 1. Saturated Means Model for Session, E-only Variances Model (BP) Variances Model: NO correlation, EQUAL
More informationTopic 12. The Split-plot Design and its Relatives (Part II) Repeated Measures [ST&D Ch. 16] 12.9 Repeated measures analysis
Topic 12. The Split-plot Design and its Relatives (Part II) Repeated Measures [ST&D Ch. 16] 12.9 Repeated measures analysis Sometimes researchers make multiple measurements on the same experimental unit.
More informationTopic 12. The Split-plot Design and its Relatives (continued) Repeated Measures
12.1 Topic 12. The Split-plot Design and its Relatives (continued) Repeated Measures 12.9 Repeated measures analysis Sometimes researchers make multiple measurements on the same experimental unit. We have
More informationAcknowledgements. Outline. Marie Diener-West. ICTR Leadership / Team INTRODUCTION TO CLINICAL RESEARCH. Introduction to Linear Regression
INTRODUCTION TO CLINICAL RESEARCH Introduction to Linear Regression Karen Bandeen-Roche, Ph.D. July 17, 2012 Acknowledgements Marie Diener-West Rick Thompson ICTR Leadership / Team JHU Intro to Clinical
More informationLab 10 - Binary Variables
Lab 10 - Binary Variables Spring 2017 Contents 1 Introduction 1 2 SLR on a Dummy 2 3 MLR with binary independent variables 3 3.1 MLR with a Dummy: different intercepts, same slope................. 4 3.2
More informationSection Least Squares Regression
Section 2.3 - Least Squares Regression Statistics 104 Autumn 2004 Copyright c 2004 by Mark E. Irwin Regression Correlation gives us a strength of a linear relationship is, but it doesn t tell us what it
More informationANCOVA. Psy 420 Andrew Ainsworth
ANCOVA Psy 420 Andrew Ainsworth What is ANCOVA? Analysis of covariance an extension of ANOVA in which main effects and interactions are assessed on DV scores after the DV has been adjusted for by the DV
More informationEconomics 326 Methods of Empirical Research in Economics. Lecture 14: Hypothesis testing in the multiple regression model, Part 2
Economics 326 Methods of Empirical Research in Economics Lecture 14: Hypothesis testing in the multiple regression model, Part 2 Vadim Marmer University of British Columbia May 5, 2010 Multiple restrictions
More informationProblem Set #5-Key Sonoma State University Dr. Cuellar Economics 317- Introduction to Econometrics
Problem Set #5-Key Sonoma State University Dr. Cuellar Economics 317- Introduction to Econometrics C1.1 Use the data set Wage1.dta to answer the following questions. Estimate regression equation wage =
More informationLab 6 - Simple Regression
Lab 6 - Simple Regression Spring 2017 Contents 1 Thinking About Regression 2 2 Regression Output 3 3 Fitted Values 5 4 Residuals 6 5 Functional Forms 8 Updated from Stata tutorials provided by Prof. Cichello
More information8. Nonstandard standard error issues 8.1. The bias of robust standard errors
8.1. The bias of robust standard errors Bias Robust standard errors are now easily obtained using e.g. Stata option robust Robust standard errors are preferable to normal standard errors when residuals
More informationLecture 7: OLS with qualitative information
Lecture 7: OLS with qualitative information Dummy variables Dummy variable: an indicator that says whether a particular observation is in a category or not Like a light switch: on or off Most useful values:
More informationRepeated-Measures ANOVA in SPSS Correct data formatting for a repeated-measures ANOVA in SPSS involves having a single line of data for each
Repeated-Measures ANOVA in SPSS Correct data formatting for a repeated-measures ANOVA in SPSS involves having a single line of data for each participant, with the repeated measures entered as separate
More informationWITHIN-PARTICIPANT EXPERIMENTAL DESIGNS
1 WITHIN-PARTICIPANT EXPERIMENTAL DESIGNS I. Single-factor designs: the model is: yij i j ij ij where: yij score for person j under treatment level i (i = 1,..., I; j = 1,..., n) overall mean βi treatment
More informationWorking with Stata Inference on the mean
Working with Stata Inference on the mean Nicola Orsini Biostatistics Team Department of Public Health Sciences Karolinska Institutet Dataset: hyponatremia.dta Motivating example Outcome: Serum sodium concentration,
More informationOutline. Linear OLS Models vs: Linear Marginal Models Linear Conditional Models. Random Intercepts Random Intercepts & Slopes
Lecture 2.1 Basic Linear LDA 1 Outline Linear OLS Models vs: Linear Marginal Models Linear Conditional Models Random Intercepts Random Intercepts & Slopes Cond l & Marginal Connections Empirical Bayes
More informationLab 07 Introduction to Econometrics
Lab 07 Introduction to Econometrics Learning outcomes for this lab: Introduce the different typologies of data and the econometric models that can be used Understand the rationale behind econometrics Understand
More informationChapter 8 (More on Assumptions for the Simple Linear Regression)
EXST3201 Chapter 8b Geaghan Fall 2005: Page 1 Chapter 8 (More on Assumptions for the Simple Linear Regression) Your textbook considers the following assumptions: Linearity This is not something I usually
More informationsociology 362 regression
sociology 36 regression Regression is a means of modeling how the conditional distribution of a response variable (say, Y) varies for different values of one or more independent explanatory variables (say,
More informationPLS205 Lab 2 January 15, Laboratory Topic 3
PLS205 Lab 2 January 15, 2015 Laboratory Topic 3 General format of ANOVA in SAS Testing the assumption of homogeneity of variances by "/hovtest" by ANOVA of squared residuals Proc Power for ANOVA One-way
More informationRepeated Measures Analysis of Variance
Repeated Measures Analysis of Variance Review Univariate Analysis of Variance Group A Group B Group C Repeated Measures Analysis of Variance Condition A Condition B Condition C Repeated Measures Analysis
More informationIntroductory Econometrics. Lecture 13: Hypothesis testing in the multiple regression model, Part 1
Introductory Econometrics Lecture 13: Hypothesis testing in the multiple regression model, Part 1 Jun Ma School of Economics Renmin University of China October 19, 2016 The model I We consider the classical
More informationOne-Way ANOVA. Some examples of when ANOVA would be appropriate include:
One-Way ANOVA 1. Purpose Analysis of variance (ANOVA) is used when one wishes to determine whether two or more groups (e.g., classes A, B, and C) differ on some outcome of interest (e.g., an achievement
More informationBinary Dependent Variables
Binary Dependent Variables In some cases the outcome of interest rather than one of the right hand side variables - is discrete rather than continuous Binary Dependent Variables In some cases the outcome
More informationECO220Y Simple Regression: Testing the Slope
ECO220Y Simple Regression: Testing the Slope Readings: Chapter 18 (Sections 18.3-18.5) Winter 2012 Lecture 19 (Winter 2012) Simple Regression Lecture 19 1 / 32 Simple Regression Model y i = β 0 + β 1 x
More informationFailure Time of System due to the Hot Electron Effect
of System due to the Hot Electron Effect 1 * exresist; 2 option ls=120 ps=75 nocenter nodate; 3 title of System due to the Hot Electron Effect ; 4 * TIME = failure time (hours) of a system due to drift
More informationTable 1: Fish Biomass data set on 26 streams
Math 221: Multiple Regression S. K. Hyde Chapter 27 (Moore, 5th Ed.) The following data set contains observations on the fish biomass of 26 streams. The potential regressors from which we wish to explain
More informationInterpreting coefficients for transformed variables
Interpreting coefficients for transformed variables! Recall that when both independent and dependent variables are untransformed, an estimated coefficient represents the change in the dependent variable
More informationLinear Modelling in Stata Session 6: Further Topics in Linear Modelling
Linear Modelling in Stata Session 6: Further Topics in Linear Modelling Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester 14/11/2017 This Week Categorical Variables Categorical
More informationSuggested Answers Problem set 4 ECON 60303
Suggested Answers Problem set 4 ECON 60303 Bill Evans Spring 04. A program that answers part A) is on the web page and is named psid_iv_comparison.do. Below are some key results and a summary table is
More information1 A Review of Correlation and Regression
1 A Review of Correlation and Regression SW, Chapter 12 Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then
More informationT. Mark Beasley One-Way Repeated Measures ANOVA handout
T. Mark Beasley One-Way Repeated Measures ANOVA handout Profile Analysis Example In the One-Way Repeated Measures ANOVA, two factors represent separate sources of variance. Their interaction presents an
More informationAt this point, if you ve done everything correctly, you should have data that looks something like:
This homework is due on July 19 th. Economics 375: Introduction to Econometrics Homework #4 1. One tool to aid in understanding econometrics is the Monte Carlo experiment. A Monte Carlo experiment allows
More informationSplineLinear.doc 1 # 9 Last save: Saturday, 9. December 2006
SplineLinear.doc 1 # 9 Problem:... 2 Objective... 2 Reformulate... 2 Wording... 2 Simulating an example... 3 SPSS 13... 4 Substituting the indicator function... 4 SPSS-Syntax... 4 Remark... 4 Result...
More informationPrepared by: Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies Universiti
Prepared by: Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies Universiti Putra Malaysia Serdang Use in experiment, quasi-experiment
More informationPractice: Basic Linear-Interactive Model
Practice: Basic Linear-Interactive Model Basic Linear-Interactive Model: eusup = b + b edu + b lftrt + b edu lftrt +... + ε Effect of edu? 0 edu lftrt eusup edu» For the record, the effect of lftrt: Std
More informationEssential of Simple regression
Essential of Simple regression We use simple regression when we are interested in the relationship between two variables (e.g., x is class size, and y is student s GPA). For simplicity we assume the relationship
More informationCorrelation and regression. Correlation and regression analysis. Measures of association. Why bother? Positive linear relationship
1 Correlation and regression analsis 12 Januar 2009 Monda, 14.00-16.00 (C1058) Frank Haege Department of Politics and Public Administration Universit of Limerick frank.haege@ul.ie www.frankhaege.eu Correlation
More informationNonrecursive Models Highlights Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised April 6, 2015
Nonrecursive Models Highlights Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised April 6, 2015 This lecture borrows heavily from Duncan s Introduction to Structural
More informationGLM Repeated Measures
GLM Repeated Measures Notation The GLM (general linear model) procedure provides analysis of variance when the same measurement or measurements are made several times on each subject or case (repeated
More informationAddition of Center Points to a 2 k Designs Section 6-6 page 271
to a 2 k Designs Section 6-6 page 271 Based on the idea of replicating some of the runs in a factorial design 2 level designs assume linearity. If interaction terms are added to model some curvature results
More informationAnalysis of Longitudinal Data: Comparison Between PROC GLM and PROC MIXED. Maribeth Johnson Medical College of Georgia Augusta, GA
Analysis of Longitudinal Data: Comparison Between PROC GLM and PROC MIXED Maribeth Johnson Medical College of Georgia Augusta, GA Overview Introduction to longitudinal data Describe the data for examples
More informationLecture 3: Multivariate Regression
Lecture 3: Multivariate Regression Rates, cont. Two weeks ago, we modeled state homicide rates as being dependent on one variable: poverty. In reality, we know that state homicide rates depend on numerous
More informationAnalysis of Variance. ภาว น ศ ร ประภาน ก ล คณะเศรษฐศาสตร มหาว ทยาล ยธรรมศาสตร
Analysis of Variance ภาว น ศ ร ประภาน ก ล คณะเศรษฐศาสตร มหาว ทยาล ยธรรมศาสตร pawin@econ.tu.ac.th Outline Introduction One Factor Analysis of Variance Two Factor Analysis of Variance ANCOVA MANOVA Introduction
More informationGeneral Linear Model
GLM V1 V2 V3 V4 V5 V11 V12 V13 V14 V15 /WSFACTOR=placeholders 2 Polynomial target 5 Polynomial /METHOD=SSTYPE(3) /EMMEANS=TABLES(OVERALL) /EMMEANS=TABLES(placeholders) COMPARE ADJ(SIDAK) /EMMEANS=TABLES(target)
More information(a) Briefly discuss the advantage of using panel data in this situation rather than pure crosssections
Answer Key Fixed Effect and First Difference Models 1. See discussion in class.. David Neumark and William Wascher published a study in 199 of the effect of minimum wages on teenage employment using a
More informationCovariance Structure Approach to Within-Cases
Covariance Structure Approach to Within-Cases Remember how the data file grapefruit1.data looks: Store sales1 sales2 sales3 1 62.1 61.3 60.8 2 58.2 57.9 55.1 3 51.6 49.2 46.2 4 53.7 51.5 48.3 5 61.4 58.7
More informationName: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam. June 8 th, 2016: 9am to 1pm
Name: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam June 8 th, 2016: 9am to 1pm Instructions: 1. This is exam is to be completed independently. Do not discuss your work with
More informationMixed Models for Longitudinal Binary Outcomes. Don Hedeker Department of Public Health Sciences University of Chicago.
Mixed Models for Longitudinal Binary Outcomes Don Hedeker Department of Public Health Sciences University of Chicago hedeker@uchicago.edu https://hedeker-sites.uchicago.edu/ Hedeker, D. (2005). Generalized
More informationHandout 12. Endogeneity & Simultaneous Equation Models
Handout 12. Endogeneity & Simultaneous Equation Models In which you learn about another potential source of endogeneity caused by the simultaneous determination of economic variables, and learn how to
More informationunadjusted model for baseline cholesterol 22:31 Monday, April 19,
unadjusted model for baseline cholesterol 22:31 Monday, April 19, 2004 1 Class Level Information Class Levels Values TRETGRP 3 3 4 5 SEX 2 0 1 Number of observations 916 unadjusted model for baseline cholesterol
More informationNotes on Maxwell & Delaney
Notes on Maxwell & Delaney PSY710 12 higher-order within-subject designs Chapter 11 discussed the analysis of data collected in experiments that had a single, within-subject factor. Here we extend those
More informationWarwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation
Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation Michele Aquaro University of Warwick This version: July 21, 2016 1 / 31 Reading material Textbook: Introductory
More information2.1. Consider the following production function, known in the literature as the transcendental production function (TPF).
CHAPTER Functional Forms of Regression Models.1. Consider the following production function, known in the literature as the transcendental production function (TPF). Q i B 1 L B i K i B 3 e B L B K 4 i
More informationsociology 362 regression
sociology 36 regression Regression is a means of studying how the conditional distribution of a response variable (say, Y) varies for different values of one or more independent explanatory variables (say,
More informationMeasurement Error. Often a data set will contain imperfect measures of the data we would ideally like.
Measurement Error Often a data set will contain imperfect measures of the data we would ideally like. Aggregate Data: (GDP, Consumption, Investment are only best guesses of theoretical counterparts and
More information1 Independent Practice: Hypothesis tests for one parameter:
1 Independent Practice: Hypothesis tests for one parameter: Data from the Indian DHS survey from 2006 includes a measure of autonomy of the women surveyed (a scale from 0-10, 10 being the most autonomous)
More informationComputer Exercise 3 Answers Hypothesis Testing
Computer Exercise 3 Answers Hypothesis Testing. reg lnhpay xper yearsed tenure ---------+------------------------------ F( 3, 6221) = 512.58 Model 457.732594 3 152.577531 Residual 1851.79026 6221.297667619
More informationUnderstanding the multinomial-poisson transformation
The Stata Journal (2004) 4, Number 3, pp. 265 273 Understanding the multinomial-poisson transformation Paulo Guimarães Medical University of South Carolina Abstract. There is a known connection between
More informationUnit 2 Regression and Correlation Practice Problems. SOLUTIONS Version STATA
PubHlth 640. Regression and Correlation Page 1 of 19 Unit Regression and Correlation Practice Problems SOLUTIONS Version STATA 1. A regression analysis of measurements of a dependent variable Y on an independent
More informationESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics
ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Introduction 2. The Sharp RD Design 3.
More informationProblem Set 1 ANSWERS
Economics 20 Prof. Patricia M. Anderson Problem Set 1 ANSWERS Part I. Multiple Choice Problems 1. If X and Z are two random variables, then E[X-Z] is d. E[X] E[Z] This is just a simple application of one
More informationIntroduction. Chapter 8
Chapter 8 Introduction In general, a researcher wants to compare one treatment against another. The analysis of variance (ANOVA) is a general test for comparing treatment means. When the null hypothesis
More informationSpecification Error: Omitted and Extraneous Variables
Specification Error: Omitted and Extraneous Variables Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 5, 05 Omitted variable bias. Suppose that the correct
More informationDescriptive Statistics
*following creates z scores for the ydacl statedp traitdp and rads vars. *specifically adding the /SAVE subcommand to descriptives will create z. *scores for whatever variables are in the command. DESCRIPTIVES
More informationPractice 2SLS with Artificial Data Part 1
Practice 2SLS with Artificial Data Part 1 Yona Rubinstein July 2016 Yona Rubinstein (LSE) Practice 2SLS with Artificial Data Part 1 07/16 1 / 16 Practice with Artificial Data In this note we use artificial
More information1 DV is normally distributed in the population for each level of the within-subjects factor 2 The population variances of the difference scores
One-way Prepared by: Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies Universiti Putra Malaysia Serdang The purpose is to test the
More informationECON2228 Notes 7. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 41
ECON2228 Notes 7 Christopher F Baum Boston College Economics 2014 2015 cfb (BC Econ) ECON2228 Notes 6 2014 2015 1 / 41 Chapter 8: Heteroskedasticity In laying out the standard regression model, we made
More informationLongitudinal data: simple univariate methods of analysis
Longitudinal data: simple univariate methods of analysis Danish version by Henrik Stryhn, June 1996 Department of Mathematcis and Physics, KVL Translation (and rewriting) by Ib Skovgaard, March 1998 (the
More informationEXST Regression Techniques Page 1. We can also test the hypothesis H :" œ 0 versus H :"
EXST704 - Regression Techniques Page 1 Using F tests instead of t-tests We can also test the hypothesis H :" œ 0 versus H :" Á 0 with an F test.! " " " F œ MSRegression MSError This test is mathematically
More informationLongitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 2017, Chicago, Illinois
Longitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 217, Chicago, Illinois Outline 1. Opportunities and challenges of panel data. a. Data requirements b. Control
More informationLab 11 - Heteroskedasticity
Lab 11 - Heteroskedasticity Spring 2017 Contents 1 Introduction 2 2 Heteroskedasticity 2 3 Addressing heteroskedasticity in Stata 3 4 Testing for heteroskedasticity 4 5 A simple example 5 1 1 Introduction
More informationUV Absorbance by Fish Slime
Data Set 1: UV Absorbance by Fish Slime Statistical Setting This handout describes a repeated-measures ANOVA, with two crossed amongsubjects factors and repeated-measures on a third (within-subjects) factor.
More informationConstruct factor from dummy variables Group 1 Group 2 Group 3 Group 4
THE UNIVERSITY OF MINNESOTA Statistics 5401 October 10, 2005 Multi-group Profile Analysis Example This handout provides an analysis of some artificial data from Example 5.9 on p. 240 of Multivariate Statistical
More information1998, Gregory Carey Repeated Measures ANOVA - 1. REPEATED MEASURES ANOVA (incomplete)
1998, Gregory Carey Repeated Measures ANOVA - 1 REPEATED MEASURES ANOVA (incomplete) Repeated measures ANOVA (RM) is a specific type of MANOVA. When the within group covariance matrix has a special form,
More informationExercices for Applied Econometrics A
QEM F. Gardes-C. Starzec-M.A. Diaye Exercices for Applied Econometrics A I. Exercice: The panel of households expenditures in Poland, for years 1997 to 2000, gives the following statistics for the whole
More informationAnswer all questions from part I. Answer two question from part II.a, and one question from part II.b.
B203: Quantitative Methods Answer all questions from part I. Answer two question from part II.a, and one question from part II.b. Part I: Compulsory Questions. Answer all questions. Each question carries
More informationBIOL 458 BIOMETRY Lab 8 - Nested and Repeated Measures ANOVA
BIOL 458 BIOMETRY Lab 8 - Nested and Repeated Measures ANOVA PART 1: NESTED ANOVA Nested designs are used when levels of one factor are not represented within all levels of another factor. Often this is
More informationProblem Set 10: Panel Data
Problem Set 10: Panel Data 1. Read in the data set, e11panel1.dta from the course website. This contains data on a sample or 1252 men and women who were asked about their hourly wage in two years, 2005
More informationLecture 4: Multivariate Regression, Part 2
Lecture 4: Multivariate Regression, Part 2 Gauss-Markov Assumptions 1) Linear in Parameters: Y X X X i 0 1 1 2 2 k k 2) Random Sampling: we have a random sample from the population that follows the above
More informationMultiple Regression: Inference
Multiple Regression: Inference The t-test: is ˆ j big and precise enough? We test the null hypothesis: H 0 : β j =0; i.e. test that x j has no effect on y once the other explanatory variables are controlled
More informationFixed and Random Effects Models: Vartanian, SW 683
: Vartanian, SW 683 Fixed and random effects models See: http://teaching.sociology.ul.ie/dcw/confront/node45.html When you have repeated observations per individual this is a problem and an advantage:
More information