Lecture 12: Interactions and Splines
|
|
- Amie Rose
- 6 years ago
- Views:
Transcription
1 Lecture 12: Interactions and Splines Sandy Eckel 12 May
2 Definition Effect Modification The phenomenon in which the relationship between the primary predictor and outcome varies across levels of another predictor We say the other predictor modifies the effect between the primary predictor and outcome In linear regression, coded by inclusion of interaction term between primary predictor and another predictor Also called interaction 2
3 Reminder: Nested models Parent model contains one set of variables Extended model adds one or more new variables to the parent model one variable added: compare models with t test two or more variables added: compare models with F test Return to the example of wage versus experience 3
4 Model 1 Wage on Experience and Gender E Wage i ] = βˆ + βˆ (Experience ) + βˆ (Gender ) [ 0 1 i 2 i This model allows the average wage to differ for men and women, but the difference in average wage between men and women is always the same regardless of experience level. Hourly Wage We have created two parallel regression lines by including the binary covariate gender Years of Experience Men's hourly wage fit2_men Women's hourly wage fit2_women 4
5 Model 1 Wage on experience and gender E Wage i ] = βˆ + βˆ (Experience ) + βˆ (Gender [ 0 1 i 2 i Gender (0=man, 1=woman) Model results: Source SS df MS Number of obs = F( 2, 531) = Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE = wagehr Coef. Std. Err. t P> t [95% Conf. Interval] educyrs gender (Intercept) ) 5
6 Model 2 Wage on experience and gender, include interaction Goal: Create a model that allows the average wage to differ for men and women the difference in average wage between men and women to change as experience level increases still will have two regression lines (one for each gender) no longer parallel E[Wagei ] = βˆ 0 + βˆ 1(Experiencei ) + βˆ 2(Genderi ) + βˆ 3(Genderi Experience i ) Interaction variable How do we create it? 6
7 Model 2: Creating the interaction variable Multiply the two X variables gender: 0 for men 1 for women gender*experience = 0*experience = 0 for men = 1*experience = experience for women 7
8 Model 2: R code and output Code in R to create and include interaction: > gendereduc <- gender*educ > lm( wagehr ~ educyrs + gender + gendereduc ) Output (similar to R s output) Source SS df MS Number of obs = F( 3, 530) = Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE = wagehr Coef. Std. Err. t P> t [95% Conf. Interval] educyrs gender gender_educ (Intercept)
9 Model 2: Interpretation in terms of gender value E[Wagei ] = βˆ 0 + βˆ 1(Experiencei ) + βˆ 2(Genderi ) + βˆ 3(Genderi Experience i ) Equation for men: E[ Wage E[ Wage i i ] = βˆ 0 + βˆ 1 (Experience ] = (Experience i ) i ) Gender: 0 for men 1 for women Equation for women: E[ Wage E[ Wage i i ] = ] = ( βˆ + βˆ ) + ( βˆ + βˆ ) 0 2 (Experience ( ) + ( ) (Experience β 2 : change in mean wage for women vs. men with no experience β 3 : change in slope (of experience) for women vs. men 1 3 i ) i ) 9
10 Model 2: Coefficient Interpretation E[Wagei ] = βˆ 0 + βˆ 1(Experiencei ) + βˆ 2(Genderi ) + βˆ 3(Genderi Experience β 0 : The average wage for men with no experience β 1 : The difference in average wage for a one year increase in experience among men β 2 : The difference in average wage between women and men with no experience i ) β 3 : The difference of the difference in average wage for a one year increase in experience between women and men the change in slope between women and men the slope for women is β 1 +β 3 the slope for men was β 1 10
11 Compare to model 1 In the parent model β 1 was slope for both men and women β 2 was difference between women & men at every experience level In the extended model (with interaction) β 1 is slope for men β 2 is difference between women & men for experience=0 β 3 is change in slope per year of experience between men & women 11
12 Is the change in slope statistically significant? Test model 1 vs. model 2 only 1 variable added use t test for that variable to compare models H 0 : β 3 =0 in the population From the t-statistic, p = 0.27 Fail to reject H 0 Conclude that model 1 is better 12
13 Model 3: Interaction of two binary predictors Model 2: continuous X, binary X, their interaction slope changes by group Model 3: binary X, binary X, their interaction difference in mean changes by group 13
14 Model 3: Wage on gender and married, include interaction E Wagei ] = βˆ 0 + βˆ 1(Genderi ) + βˆ 2( Marriedi ) + βˆ 3(Genderi Married gender: 0 for men 1 for women married: 0 if unmarried 1 if married [ i ) To create the interaction term: gender*married = 0*0 = 0 for unmarried men = 1*0 = 0 for unmarried women = 0*1 = 0 for married men = 1*1 = 1 for married women 14
15 Model 3: Results E[ Wagei ] = βˆ 0 + βˆ 1(Genderi ) + βˆ 2( Marriedi ) + βˆ 3(Genderi Marriedi) Source SS df MS Number of obs = F( 3, 530) = Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE = wagehr Coef. Std. Err. t P> t [95% Conf. Interval] gender married gender_mar~d (Intercept) Let s visualize what this implies! - basically, we ve created a model that allows for a different mean for each of the 4 possible types of people - the slopes (β 1 β 2 and β 3 ) represent differences in means between groups 15
16 Graph for Model 3 β 3 = Difference of differences Difference = β 1 Difference = β 2 Difference = β 1 +β 3 Mean hourly wage β 0 Difference = β 2 +β unmarried men unmarried women married men married women E[ Wagei ] = βˆ 0 + βˆ 1(Genderi ) + βˆ 2( Marriedi ) + βˆ 3(Genderi Marriedi) 16
17 Model 3: Interpretation β 0 : The average wage for unmarried men β 1 : The difference in average wage between unmarried women and unmarried men β 1 + β 3 : The difference in average wage between married women and married men β 3 : The difference of the difference in average wage between married women and married men and between unmarried women and unmarried men 17
18 Model 3: Interpretation β 0 : The average wage for unmarried men β 2 : The difference in average wage between married men and unmarried men β 2 + β 3 : The difference in average wage between married women and unmarried women β 3 : The difference of the difference in average wage between married women and unmarried women and between married men and unmarried men 18
19 Model 3: conclusion The interaction variable is statistically significantly different from 0 (p=0.001, CI: -4.9 to -1.3 ) The difference in mean hourly wage between women and men is greater for married people than for unmarried people. -or- The difference in mean hourly wage between married people and unmarried people is greater for men than for women. 19
20 Summary Interaction interaction=var1*var2 interaction variable changes interpretation of entire model with interaction, the effect of one variable changes according to the level of the second variable Test for interaction by testing new variable use t-test if significant (p<α, 0 not in CI), keep if not significant, go back to parent model without interaction variable 20
21 Flexibility in linear models In linear regression, we assume the outcome, Y, has a linear relationship with the predictors, X However, we have flexibility in defining the predictors transform X, such as X 2 or X 3 use linear splines to fit broken arrow models 21
22 Linear Splines: when are they needed? Example 1: Outcome is `out of pocket medical expenditures A researcher tells you most Health Management Organizations (HMOs) will usually pay for the first week of a hospital stay only She expects `out of pocket expenditures to increase dramitically if length of stay (LOS) was longer than one week How should we set up the model? Example 2: Outcome is medical expenditures In the US, older adults are eligible for Medicare, a national health insurance program, once they reach the age of 65 We expect study subjects medical expenditure patterns before age 65 to be different from the expenditures after age 65 How should we set up the model? 22
23 Example 1: The researcher hypothesized this regression line for length of stay (LOS) Expenditures Broken Arrow Model length of stay (days) Terminology: Called Linear spline Broken arrow Hockey stick 23
24 Defining the spline variable We define a new variable that checks to see if the slope is indeed different if LOS is greater than 7 Idea, create a term: (LOS-7) + = (LOS 7) if LOS>7 = 0 if LOS<=7 When you run the regression include both LOS (LOS-7) + The spline allows you to change the magnitude of the slope! 24
25 When to use a spline? When a continuous predictor is used, a typical regression equation assumes there is a straight-line relationship between X and Y in the population. If the relationship between X and Y is a bent line a curve adding a spline may more accurately model the relationship between X and Y 25
26 Visualizing the linear spline model 3500 Broken Arrow Model Expenditures Slope = β 1 Slope = β 1 + β length of stay (days) 26
27 The Model E(expenditures) = β 0 + β 1 LOS + β 2 (LOS-7) + Where: (LOS-7)+ = (LOS 7) if LOS>7 0 if LOS<=7 27
28 Then: E(expenditures LOS <= 7) = β 0 + β 1 LOS E(exp LOS > 7)= β 0 + β 1 LOS + β 2 (LOS - 7) = (β 0 - β 2 7)+ (β 1 + β 2 )LOS = β 0 * + β 1 *LOS 28
29 Back to modelling wages Standardized residuals age Do we need a spline on age? 29
30 How should we add the spline? Goal: let the regression line bend Model: E(Wage i ) = β 0 +β 1 (age-35)+β 2 (age-35) + What is (age-35) +? 0 if age<35 (age-35) if age>=35 30
31 Fitted model with spline at 35 Model: E(Wage i ) = β 0 +β 1 (age-35)+β 2 (age-35) + Source SS df MS Number of obs = F( 2, 530) = Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE = wagehr Coef. Std. Err. t P> t [95% Conf. Interval] age_cent age_spline (Intercept)
32 Fitted Graph (with spline) Solid line: fitted spline, Dotted line: visualize change in slope! Wage ($/hour) age 32
33 Working out what the spline term means... E(Wage i ) = (age-35)-0.37(age-35) + For a person under 35: E(Wage i ) = (age-35)-0.37(age-35) + = (age-35) For a person 35 or older: E(Wage i ) = (age-35)-0.37(age-35) + = (age-35) 0 (age-35) β 1 +β 2 = new slope for those over 35 33
34 Interpretation β 0 is the average wage for people who are 35 years old β 1 is the change in average wage per additional year of age for those under 35 β 2 is the difference in the change in average wage per additional year of age for those over age 35 as compared to those under age 35 β 2 is the change in the slope for over 35 vs. under 35 34
35 Better Interpretation The average wage for people who are 35 years old is $10.45/hour (95% CI: $9.75, 11.16) For each additional year of age, those under age 35 earn an average of $0.33 more per hour (95% CI: $0.24, $0.43) For each additional year of age, those over age 35 earn an average of $0.04 less per hour (95% CI: -$0.10, $0.01) 35
36 Is the change in slope statistically significant? One variable was added to create the change in slope compare nested models with t-test Coefficients Coef. Std. Err. t P> t [95% Conf. Interval] age_cent age_spline (Intercept) H 0 : spline is not needed (no change in slope in the population) p<0.001 or CI does not include 0: reject H 0 Conclude slope differs for those over vs. under 35 in population 36
37 Check the model assumptions again! Note: plots are omitted L Linear relationship With the spline, there is no longer any pattern in the residuals After removing the one outlier, no others appear to stand out I - Independence We cannot check this by looking at the data N Normality of the residuals The residuals are slightly skewed to positive values the estimated regression coefficients are still correct their confidence intervals may be misleading E Equal variance of the residuals across X The vertical spread of the residuals may be smaller for those under 25 years of age the estimated regression coefficients are still correct their confidence intervals may be misleading 37
38 Conclusions The increase in hourly wage with increasing age is statistically significant for those who recently entered the workforce (ages 18-35): for each additional year, these workers earn an average of 33 cents more per hour. However, this increase in wage with increasing age levels off for those over age 35, so that no appreciable increase in average wage is observed for those over age 35. One 21-year-old had much higher earnings ($44.50 per hour) than other young workers. This person s results were so unlike the rest of the sample that the observation was dropped from the analysis. It is possible that the data was incorrectly entered for this person, but we are unable to assess the data entry since the original completed surveys are unavailable. 38
39 A final word on Splines Splines are used to allow the regression line to bend the breakpoint is arbitrary and decided graphically the actual slope above and below the breakpoint is usually of more interest than the coefficient for the spline (i.e., the change in slope) 39
40 Lecture 12 Summary Effect modification Epidemiologic definition Include in model using interaction term 1 binary X, 1 continuous X 2 binary X Linear Splines The relationship between Y and a continuous X may be linear Allow for a change in slope at a certain point in the continuous X 40
Reminder: Nested models. Lecture 9: Interactions, Quadratic terms and Splines. Effect Modification. Model 1
Lecture 9: Interactons, Quadratc terms and Splnes An Manchakul amancha@jhsph.edu 3 Aprl 7 Remnder: Nested models Parent model contans one set of varables Extended model adds one or more new varables to
More informationCorrelation and Simple Linear Regression
Correlation and Simple Linear Regression Sasivimol Rattanasiri, Ph.D Section for Clinical Epidemiology and Biostatistics Ramathibodi Hospital, Mahidol University E-mail: sasivimol.rat@mahidol.ac.th 1 Outline
More informationProblem Set #3-Key. wage Coef. Std. Err. t P> t [95% Conf. Interval]
Problem Set #3-Key Sonoma State University Economics 317- Introduction to Econometrics Dr. Cuellar 1. Use the data set Wage1.dta to answer the following questions. a. For the regression model Wage i =
More informationLecture 12: Effect modification, and confounding in logistic regression
Lecture 12: Effect modification, and confounding in logistic regression Ani Manichaikul amanicha@jhsph.edu 4 May 2007 Today Categorical predictor create dummy variables just like for linear regression
More informationMathematics for Economics MA course
Mathematics for Economics MA course Simple Linear Regression Dr. Seetha Bandara Simple Regression Simple linear regression is a statistical method that allows us to summarize and study relationships between
More informationLab 10 - Binary Variables
Lab 10 - Binary Variables Spring 2017 Contents 1 Introduction 1 2 SLR on a Dummy 2 3 MLR with binary independent variables 3 3.1 MLR with a Dummy: different intercepts, same slope................. 4 3.2
More informationAcknowledgements. Outline. Marie Diener-West. ICTR Leadership / Team INTRODUCTION TO CLINICAL RESEARCH. Introduction to Linear Regression
INTRODUCTION TO CLINICAL RESEARCH Introduction to Linear Regression Karen Bandeen-Roche, Ph.D. July 17, 2012 Acknowledgements Marie Diener-West Rick Thompson ICTR Leadership / Team JHU Intro to Clinical
More informationOrdinary Least Squares Regression Explained: Vartanian
Ordinary Least Squares Regression Explained: Vartanian When to Use Ordinary Least Squares Regression Analysis A. Variable types. When you have an interval/ratio scale dependent variable.. When your independent
More information5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is
Practice Final Exam Last Name:, First Name:. Please write LEGIBLY. Answer all questions on this exam in the space provided (you may use the back of any page if you need more space). Show all work but do
More informationECON3150/4150 Spring 2015
ECON3150/4150 Spring 2015 Lecture 3&4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo January 29, 2015 1 / 67 Chapter 4 in S&W Section 17.1 in S&W (extended OLS assumptions) 2
More informationAnswer all questions from part I. Answer two question from part II.a, and one question from part II.b.
B203: Quantitative Methods Answer all questions from part I. Answer two question from part II.a, and one question from part II.b. Part I: Compulsory Questions. Answer all questions. Each question carries
More informationLinear Modelling in Stata Session 6: Further Topics in Linear Modelling
Linear Modelling in Stata Session 6: Further Topics in Linear Modelling Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester 14/11/2017 This Week Categorical Variables Categorical
More informationProblem Set #5-Key Sonoma State University Dr. Cuellar Economics 317- Introduction to Econometrics
Problem Set #5-Key Sonoma State University Dr. Cuellar Economics 317- Introduction to Econometrics C1.1 Use the data set Wage1.dta to answer the following questions. Estimate regression equation wage =
More informationECON3150/4150 Spring 2016
ECON3150/4150 Spring 2016 Lecture 4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo Last updated: January 26, 2016 1 / 49 Overview These lecture slides covers: The linear regression
More informationStatistical Modelling in Stata 5: Linear Models
Statistical Modelling in Stata 5: Linear Models Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester 07/11/2017 Structure This Week What is a linear model? How good is my model? Does
More informationECO220Y Simple Regression: Testing the Slope
ECO220Y Simple Regression: Testing the Slope Readings: Chapter 18 (Sections 18.3-18.5) Winter 2012 Lecture 19 (Winter 2012) Simple Regression Lecture 19 1 / 32 Simple Regression Model y i = β 0 + β 1 x
More informationSociology Exam 2 Answer Key March 30, 2012
Sociology 63993 Exam 2 Answer Key March 30, 2012 I. True-False. (20 points) Indicate whether the following statements are true or false. If false, briefly explain why. 1. A researcher has constructed scales
More informationInteraction effects between continuous variables (Optional)
Interaction effects between continuous variables (Optional) Richard Williams, University of Notre Dame, https://www.nd.edu/~rwilliam/ Last revised February 0, 05 This is a very brief overview of this somewhat
More informationProblem Set 10: Panel Data
Problem Set 10: Panel Data 1. Read in the data set, e11panel1.dta from the course website. This contains data on a sample or 1252 men and women who were asked about their hourly wage in two years, 2005
More informationGeneral Linear Model (Chapter 4)
General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients
More informationLecture 4 Scatterplots, Association, and Correlation
Lecture 4 Scatterplots, Association, and Correlation Previously, we looked at Single variables on their own One or more categorical variable In this lecture: We shall look at two quantitative variables.
More informationECON Introductory Econometrics. Lecture 5: OLS with One Regressor: Hypothesis Tests
ECON4150 - Introductory Econometrics Lecture 5: OLS with One Regressor: Hypothesis Tests Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 5 Lecture outline 2 Testing Hypotheses about one
More informationLecture 3: Multiple Regression. Prof. Sharyn O Halloran Sustainable Development U9611 Econometrics II
Lecture 3: Multiple Regression Prof. Sharyn O Halloran Sustainable Development Econometrics II Outline Basics of Multiple Regression Dummy Variables Interactive terms Curvilinear models Review Strategies
More informationsociology 362 regression
sociology 36 regression Regression is a means of studying how the conditional distribution of a response variable (say, Y) varies for different values of one or more independent explanatory variables (say,
More informationLab 6 - Simple Regression
Lab 6 - Simple Regression Spring 2017 Contents 1 Thinking About Regression 2 2 Regression Output 3 3 Fitted Values 5 4 Residuals 6 5 Functional Forms 8 Updated from Stata tutorials provided by Prof. Cichello
More informationLecture 4 Scatterplots, Association, and Correlation
Lecture 4 Scatterplots, Association, and Correlation Previously, we looked at Single variables on their own One or more categorical variables In this lecture: We shall look at two quantitative variables.
More informationS o c i o l o g y E x a m 2 A n s w e r K e y - D R A F T M a r c h 2 7,
S o c i o l o g y 63993 E x a m 2 A n s w e r K e y - D R A F T M a r c h 2 7, 2 0 0 9 I. True-False. (20 points) Indicate whether the following statements are true or false. If false, briefly explain
More informationMaking sense of Econometrics: Basics
Making sense of Econometrics: Basics Lecture 4: Qualitative influences and Heteroskedasticity Egypt Scholars Economic Society November 1, 2014 Assignment & feedback enter classroom at http://b.socrative.com/login/student/
More informationSection Least Squares Regression
Section 2.3 - Least Squares Regression Statistics 104 Autumn 2004 Copyright c 2004 by Mark E. Irwin Regression Correlation gives us a strength of a linear relationship is, but it doesn t tell us what it
More informationECON Interactions and Dummies
ECON 351 - Interactions and Dummies Maggie Jones 1 / 25 Readings Chapter 6: Section on Models with Interaction Terms Chapter 7: Full Chapter 2 / 25 Interaction Terms with Continuous Variables In some regressions
More informationSTATISTICS 110/201 PRACTICE FINAL EXAM
STATISTICS 110/201 PRACTICE FINAL EXAM Questions 1 to 5: There is a downloadable Stata package that produces sequential sums of squares for regression. In other words, the SS is built up as each variable
More informationmultilevel modeling: concepts, applications and interpretations
multilevel modeling: concepts, applications and interpretations lynne c. messer 27 october 2010 warning social and reproductive / perinatal epidemiologist concepts why context matters multilevel models
More informationsociology 362 regression
sociology 36 regression Regression is a means of modeling how the conditional distribution of a response variable (say, Y) varies for different values of one or more independent explanatory variables (say,
More informationRegression: Main Ideas Setting: Quantitative outcome with a quantitative explanatory variable. Example, cont.
TCELL 9/4/205 36-309/749 Experimental Design for Behavioral and Social Sciences Simple Regression Example Male black wheatear birds carry stones to the nest as a form of sexual display. Soler et al. wanted
More informationInference. ME104: Linear Regression Analysis Kenneth Benoit. August 15, August 15, 2012 Lecture 3 Multiple linear regression 1 1 / 58
Inference ME104: Linear Regression Analysis Kenneth Benoit August 15, 2012 August 15, 2012 Lecture 3 Multiple linear regression 1 1 / 58 Stata output resvisited. reg votes1st spend_total incumb minister
More informationSix Sigma Black Belt Study Guides
Six Sigma Black Belt Study Guides 1 www.pmtutor.org Powered by POeT Solvers Limited. Analyze Correlation and Regression Analysis 2 www.pmtutor.org Powered by POeT Solvers Limited. Variables and relationships
More informationInference for the Regression Coefficient
Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression line. We can shows that b 0 and b 1 are the unbiased estimates
More information36-309/749 Experimental Design for Behavioral and Social Sciences. Sep. 22, 2015 Lecture 4: Linear Regression
36-309/749 Experimental Design for Behavioral and Social Sciences Sep. 22, 2015 Lecture 4: Linear Regression TCELL Simple Regression Example Male black wheatear birds carry stones to the nest as a form
More informationIntroduction to Econometrics. Review of Probability & Statistics
1 Introduction to Econometrics Review of Probability & Statistics Peerapat Wongchaiwat, Ph.D. wongchaiwat@hotmail.com Introduction 2 What is Econometrics? Econometrics consists of the application of mathematical
More informationInference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3
Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details Section 10.1, 2, 3 Basic components of regression setup Target of inference: linear dependency
More informationLecture 14: Introduction to Poisson Regression
Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu 8 May 2007 1 / 52 Overview Modelling counts Contingency tables Poisson regression models 2 / 52 Modelling counts I Why
More informationModelling counts. Lecture 14: Introduction to Poisson Regression. Overview
Modelling counts I Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu Why count data? Number of traffic accidents per day Mortality counts in a given neighborhood, per week
More informationPre-Calculus Multiple Choice Questions - Chapter S8
1 If every man married a women who was exactly 3 years younger than he, what would be the correlation between the ages of married men and women? a Somewhat negative b 0 c Somewhat positive d Nearly 1 e
More informationOrdinary Least Squares Regression Explained: Vartanian
Ordinary Least Squares Regression Eplained: Vartanian When to Use Ordinary Least Squares Regression Analysis A. Variable types. When you have an interval/ratio scale dependent variable.. When your independent
More information1. The shoe size of five randomly selected men in the class is 7, 7.5, 6, 6.5 the shoe size of 4 randomly selected women is 6, 5.
Economics 3 Introduction to Econometrics Winter 2004 Professor Dobkin Name Final Exam (Sample) You must answer all the questions. The exam is closed book and closed notes you may use calculators. You must
More informationThursday Morning. Growth Modelling in Mplus. Using a set of repeated continuous measures of bodyweight
Thursday Morning Growth Modelling in Mplus Using a set of repeated continuous measures of bodyweight 1 Growth modelling Continuous Data Mplus model syntax refresher ALSPAC Confirmatory Factor Analysis
More informationECON Introductory Econometrics. Lecture 7: OLS with Multiple Regressors Hypotheses tests
ECON4150 - Introductory Econometrics Lecture 7: OLS with Multiple Regressors Hypotheses tests Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 7 Lecture outline 2 Hypothesis test for single
More informationCorrelation and regression. Correlation and regression analysis. Measures of association. Why bother? Positive linear relationship
1 Correlation and regression analsis 12 Januar 2009 Monda, 14.00-16.00 (C1058) Frank Haege Department of Politics and Public Administration Universit of Limerick frank.haege@ul.ie www.frankhaege.eu Correlation
More informationWarwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation
Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation Michele Aquaro University of Warwick This version: July 21, 2016 1 / 31 Reading material Textbook: Introductory
More information1 Warm-Up: 2 Adjusted R 2. Introductory Applied Econometrics EEP/IAS 118 Spring Sylvan Herskowitz Section #
Introductory Applied Econometrics EEP/IAS 118 Spring 2015 Sylvan Herskowitz Section #10 4-1-15 1 Warm-Up: Remember that exam you took before break? We had a question that said this A researcher wants to
More informationUniversity of California at Berkeley Fall Introductory Applied Econometrics Final examination. Scores add up to 125 points
EEP 118 / IAS 118 Elisabeth Sadoulet and Kelly Jones University of California at Berkeley Fall 2008 Introductory Applied Econometrics Final examination Scores add up to 125 points Your name: SID: 1 1.
More informationEconomics 326 Methods of Empirical Research in Economics. Lecture 14: Hypothesis testing in the multiple regression model, Part 2
Economics 326 Methods of Empirical Research in Economics Lecture 14: Hypothesis testing in the multiple regression model, Part 2 Vadim Marmer University of British Columbia May 5, 2010 Multiple restrictions
More informationInferences for Regression
Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In
More informationConfidence Interval for the mean response
Week 3: Prediction and Confidence Intervals at specified x. Testing lack of fit with replicates at some x's. Inference for the correlation. Introduction to regression with several explanatory variables.
More informationECON Introductory Econometrics. Lecture 17: Experiments
ECON4150 - Introductory Econometrics Lecture 17: Experiments Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 13 Lecture outline 2 Why study experiments? The potential outcome framework.
More informationGroup Comparisons: Differences in Composition Versus Differences in Models and Effects
Group Comparisons: Differences in Composition Versus Differences in Models and Effects Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 15, 2015 Overview.
More informationCorrelation & Simple Regression
Chapter 11 Correlation & Simple Regression The previous chapter dealt with inference for two categorical variables. In this chapter, we would like to examine the relationship between two quantitative variables.
More informationLecture 10: Introduction to Logistic Regression
Lecture 10: Introduction to Logistic Regression Ani Manichaikul amanicha@jhsph.edu 2 May 2007 Logistic Regression Regression for a response variable that follows a binomial distribution Recall the binomial
More informationCh 13 & 14 - Regression Analysis
Ch 3 & 4 - Regression Analysis Simple Regression Model I. Multiple Choice:. A simple regression is a regression model that contains a. only one independent variable b. only one dependent variable c. more
More informationGraduate Econometrics Lecture 4: Heteroskedasticity
Graduate Econometrics Lecture 4: Heteroskedasticity Department of Economics University of Gothenburg November 30, 2014 1/43 and Autocorrelation Consequences for OLS Estimator Begin from the linear model
More informationAMS 315/576 Lecture Notes. Chapter 11. Simple Linear Regression
AMS 315/576 Lecture Notes Chapter 11. Simple Linear Regression 11.1 Motivation A restaurant opening on a reservations-only basis would like to use the number of advance reservations x to predict the number
More informationIntroduction to Regression
Introduction to Regression ιατµηµατικό Πρόγραµµα Μεταπτυχιακών Σπουδών Τεχνο-Οικονοµικά Συστήµατα ηµήτρης Φουσκάκης Introduction Basic idea: Use data to identify relationships among variables and use these
More informationVariance Decomposition and Goodness of Fit
Variance Decomposition and Goodness of Fit 1. Example: Monthly Earnings and Years of Education In this tutorial, we will focus on an example that explores the relationship between total monthly earnings
More informationA discussion on multiple regression models
A discussion on multiple regression models In our previous discussion of simple linear regression, we focused on a model in which one independent or explanatory variable X was used to predict the value
More information1 Independent Practice: Hypothesis tests for one parameter:
1 Independent Practice: Hypothesis tests for one parameter: Data from the Indian DHS survey from 2006 includes a measure of autonomy of the women surveyed (a scale from 0-10, 10 being the most autonomous)
More informationSociology 63993, Exam 2 Answer Key [DRAFT] March 27, 2015 Richard Williams, University of Notre Dame,
Sociology 63993, Exam 2 Answer Key [DRAFT] March 27, 2015 Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ I. True-False. (20 points) Indicate whether the following statements
More informationINFERENCE FOR REGRESSION
CHAPTER 3 INFERENCE FOR REGRESSION OVERVIEW In Chapter 5 of the textbook, we first encountered regression. The assumptions that describe the regression model we use in this chapter are the following. We
More informationProblem Set 1 ANSWERS
Economics 20 Prof. Patricia M. Anderson Problem Set 1 ANSWERS Part I. Multiple Choice Problems 1. If X and Z are two random variables, then E[X-Z] is d. E[X] E[Z] This is just a simple application of one
More informationRegression #8: Loose Ends
Regression #8: Loose Ends Econ 671 Purdue University Justin L. Tobias (Purdue) Regression #8 1 / 30 In this lecture we investigate a variety of topics that you are probably familiar with, but need to touch
More information1: a b c d e 2: a b c d e 3: a b c d e 4: a b c d e 5: a b c d e. 6: a b c d e 7: a b c d e 8: a b c d e 9: a b c d e 10: a b c d e
Economics 102: Analysis of Economic Data Cameron Spring 2016 Department of Economics, U.C.-Davis Final Exam (A) Tuesday June 7 Compulsory. Closed book. Total of 58 points and worth 45% of course grade.
More informationData Analysis 1 LINEAR REGRESSION. Chapter 03
Data Analysis 1 LINEAR REGRESSION Chapter 03 Data Analysis 2 Outline The Linear Regression Model Least Squares Fit Measures of Fit Inference in Regression Other Considerations in Regression Model Qualitative
More informationApplied Statistics and Econometrics
Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple
More informationBusiness Statistics. Lecture 10: Course Review
Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,
More informationIntroductory Econometrics. Lecture 13: Hypothesis testing in the multiple regression model, Part 1
Introductory Econometrics Lecture 13: Hypothesis testing in the multiple regression model, Part 1 Jun Ma School of Economics Renmin University of China October 19, 2016 The model I We consider the classical
More informationLecture 24: Partial correlation, multiple regression, and correlation
Lecture 24: Partial correlation, multiple regression, and correlation Ernesto F. L. Amaral November 21, 2017 Advanced Methods of Social Research (SOCI 420) Source: Healey, Joseph F. 2015. Statistics: A
More informationLongitudinal Data Analysis of Health Outcomes
Longitudinal Data Analysis of Health Outcomes Longitudinal Data Analysis Workshop Running Example: Days 2 and 3 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development
More informationApplied Statistics and Econometrics
Applied Statistics and Econometrics Lecture 7 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 68 Outline of Lecture 7 1 Empirical example: Italian labor force
More informationLecture 7: OLS with qualitative information
Lecture 7: OLS with qualitative information Dummy variables Dummy variable: an indicator that says whether a particular observation is in a category or not Like a light switch: on or off Most useful values:
More informationSuggested Answers Problem set 4 ECON 60303
Suggested Answers Problem set 4 ECON 60303 Bill Evans Spring 04. A program that answers part A) is on the web page and is named psid_iv_comparison.do. Below are some key results and a summary table is
More informationAnalysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments.
Analysis of Covariance In some experiments, the experimental units (subjects) are nonhomogeneous or there is variation in the experimental conditions that are not due to the treatments. For example, a
More informationMeasurement Error. Often a data set will contain imperfect measures of the data we would ideally like.
Measurement Error Often a data set will contain imperfect measures of the data we would ideally like. Aggregate Data: (GDP, Consumption, Investment are only best guesses of theoretical counterparts and
More informationLecture 5: ANOVA and Correlation
Lecture 5: ANOVA and Correlation Ani Manichaikul amanicha@jhsph.edu 23 April 2007 1 / 62 Comparing Multiple Groups Continous data: comparing means Analysis of variance Binary data: comparing proportions
More informationPractice exam questions
Practice exam questions Nathaniel Higgins nhiggins@jhu.edu, nhiggins@ers.usda.gov 1. The following question is based on the model y = β 0 + β 1 x 1 + β 2 x 2 + β 3 x 3 + u. Discuss the following two hypotheses.
More information2.1. Consider the following production function, known in the literature as the transcendental production function (TPF).
CHAPTER Functional Forms of Regression Models.1. Consider the following production function, known in the literature as the transcendental production function (TPF). Q i B 1 L B i K i B 3 e B L B K 4 i
More informationReview of Multiple Regression
Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate
More information9. Linear Regression and Correlation
9. Linear Regression and Correlation Data: y a quantitative response variable x a quantitative explanatory variable (Chap. 8: Recall that both variables were categorical) For example, y = annual income,
More informationLab 3: Two levels Poisson models (taken from Multilevel and Longitudinal Modeling Using Stata, p )
Lab 3: Two levels Poisson models (taken from Multilevel and Longitudinal Modeling Using Stata, p. 376-390) BIO656 2009 Goal: To see if a major health-care reform which took place in 1997 in Germany was
More informationSTA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #6
STA 8 Applied Linear Models: Regression Analysis Spring 011 Solution for Homework #6 6. a) = 11 1 31 41 51 1 3 4 5 11 1 31 41 51 β = β1 β β 3 b) = 1 1 1 1 1 11 1 31 41 51 1 3 4 5 β = β 0 β1 β 6.15 a) Stem-and-leaf
More information1 A Review of Correlation and Regression
1 A Review of Correlation and Regression SW, Chapter 12 Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then
More informationEconometrics Homework 1
Econometrics Homework Due Date: March, 24. by This problem set includes questions for Lecture -4 covered before midterm exam. Question Let z be a random column vector of size 3 : z = @ (a) Write out z
More informationECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47
ECON2228 Notes 2 Christopher F Baum Boston College Economics 2014 2015 cfb (BC Econ) ECON2228 Notes 2 2014 2015 1 / 47 Chapter 2: The simple regression model Most of this course will be concerned with
More informationQuestion 1a 1b 1c 1d 1e 2a 2b 2c 2d 2e 2f 3a 3b 3c 3d 3e 3f M ult: choice Points
Economics 102: Analysis of Economic Data Cameron Spring 2016 May 12 Department of Economics, U.C.-Davis Second Midterm Exam (Version A) Compulsory. Closed book. Total of 30 points and worth 22.5% of course
More informationClassification & Regression. Multicollinearity Intro to Nominal Data
Multicollinearity Intro to Nominal Let s Start With A Question y = β 0 + β 1 x 1 +β 2 x 2 y = Anxiety Level x 1 = heart rate x 2 = recorded pulse Since we can all agree heart rate and pulse are related,
More informationIntroduction to Regression
Regression Introduction to Regression If two variables covary, we should be able to predict the value of one variable from another. Correlation only tells us how much two variables covary. In regression,
More informationProject Report for STAT571 Statistical Methods Instructor: Dr. Ramon V. Leon. Wage Data Analysis. Yuanlei Zhang
Project Report for STAT7 Statistical Methods Instructor: Dr. Ramon V. Leon Wage Data Analysis Yuanlei Zhang 77--7 November, Part : Introduction Data Set The data set contains a random sample of observations
More informationThe simple linear regression model discussed in Chapter 13 was written as
1519T_c14 03/27/2006 07:28 AM Page 614 Chapter Jose Luis Pelaez Inc/Blend Images/Getty Images, Inc./Getty Images, Inc. 14 Multiple Regression 14.1 Multiple Regression Analysis 14.2 Assumptions of the Multiple
More informationBasic econometrics. Tutorial 3. Dipl.Kfm. Johannes Metzler
Basic econometrics Tutorial 3 Dipl.Kfm. Introduction Some of you were asking about material to revise/prepare econometrics fundamentals. First of all, be aware that I will not be too technical, only as
More informationsociology sociology Scatterplots Quantitative Research Methods: Introduction to correlation and regression Age vs Income
Scatterplots Quantitative Research Methods: Introduction to correlation and regression Scatterplots can be considered as interval/ratio analogue of cross-tabs: arbitrarily many values mapped out in -dimensions
More informationReview of Statistics 101
Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods
More informationSimple Linear Regression Using Ordinary Least Squares
Simple Linear Regression Using Ordinary Least Squares Purpose: To approximate a linear relationship with a line. Reason: We want to be able to predict Y using X. Definition: The Least Squares Regression
More information