

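Multiple Regression

Multiple regression relates a response (dependent, output) variable $y$ to a set of explanatory (independent, input, predictor) variables $x_1, x_2, x_3, \ldots, x_q$. It is a technique for modeling the relationship between variables.

The response is the sum of a deterministic component and a random component:

$y = \mu_{y|x_1, x_2, x_3, \ldots, x_q} + \varepsilon$

Deterministic component: $\mu_{y|x_1, x_2, x_3, \ldots, x_q} = \alpha + \beta_1 x_1 + \beta_2 x_2 + \beta_3 x_3 + \cdots + \beta_q x_q$

Random component: $\varepsilon$

Multiple Linear Regression Model: $y = \alpha + \beta_1 x_1 + \beta_2 x_2 + \beta_3 x_3 + \cdots + \beta_q x_q + \varepsilon$

The parameters $\alpha, \beta_1, \beta_2, \beta_3, \ldots, \beta_q$ in the model can all be estimated by the least-squares estimators $\hat{\alpha}, \hat{\beta}_1, \hat{\beta}_2, \hat{\beta}_3, \ldots, \hat{\beta}_q$.

The Least-Squares Regression Equation: $\hat{y} = \hat{\alpha} + \hat{\beta}_1 x_1 + \hat{\beta}_2 x_2 + \hat{\beta}_3 x_3 + \cdots + \hat{\beta}_q x_q$

Example: Study weight ($y$) using age ($x_1$) and height ($x_2$).

Data: Age (months), height (inches), and weight (pounds) were recorded for a group of school children.

[Scatter plots: Weight versus Age, and Weight versus Height.]

The scatter plots show that both age and height are linearly related to weight.

Model: $y = \alpha + \beta_1 x_1 + \beta_2 x_2 + \varepsilon$, where $y$: Weight, $x_1$: Age, $x_2$: Height.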
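The least-squares estimation described on this page can be sketched in a few lines. This is a minimal illustration, assuming NumPy is available; the helper names `fit_least_squares` and `predict` and the small age, height, and weight arrays are hypothetical and are not the data behind the SPSS output.

```python
import numpy as np

def fit_least_squares(X, y):
    """Return [alpha_hat, beta_1_hat, ..., beta_q_hat] for y = alpha + X @ beta + error."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    # Prepend a column of ones so the first coefficient is the intercept alpha.
    design = np.column_stack([np.ones(len(y)), X])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return coef

def predict(coef, x_new):
    """Evaluate y_hat = alpha_hat + sum_j beta_j_hat * x_j for one observation."""
    return float(coef[0] + np.dot(coef[1:], x_new))

# Hypothetical, noise-free data generated from weight = -100 + 0.2*age + 3*height.
age = np.array([140.0, 150.0, 160.0, 170.0, 180.0])
height = np.array([56.0, 59.0, 58.0, 63.0, 62.0])
weight = -100.0 + 0.2 * age + 3.0 * height

coef = fit_least_squares(np.column_stack([age, height]), weight)
print(coef)                          # estimated alpha, beta_1, beta_2
print(predict(coef, [150.0, 60.0]))  # estimated average weight at age 150, height 60
```

Because the toy data contain no random error, the fit recovers the generating coefficients; with real data the estimates would differ from the true parameters by sampling error.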

Evaluation of the Model (SPSS Output)

Model Summary (columns: R, R Square, Adjusted R Square, Std. Error of the Estimate): R = .794
a. Predictors: (Constant), Age, Height

Coefficient of determination: the percentage of variability in the response variable (Weight) that can be described by the predictor variables (Age, Height) through the model.

ANOVA (rows: Regression, Residual, Total; columns: Sum of Squares, df, Mean Square, F, Sig.)
a. Predictors: (Constant), Age, Height
b. Dependent Variable: Weight

Test for significance of the model: p-value = .000 < .05
H0: The model is insignificant (the $\beta_i$'s are all zero).
Ha: The model is significant (some $\beta_i$'s are not zero).

Model Estimation (SPSS Output)

Inference for the regression model:
H0: $\alpha = 0$ vs. Ha: $\alpha \neq 0$, p-value = .000 < .05
H0: $\beta_1 = 0$ vs. Ha: $\beta_1 \neq 0$, p-value = .000 < .05
H0: $\beta_2 = 0$ vs. Ha: $\beta_2 \neq 0$, p-value = .000 < .05

Collinearity statistics: a tolerance less than 0.1, or a VIF (Variance Inflation Factor) greater than 10, implies serious collinearity. Collinearity occurs when there are significant correlations between pairs of independent variables in the model.

Collinearity: There is no significant collinearity in the model.

Tests for regression coefficients: the parameters in the model, $\alpha$, $\beta_1$, and $\beta_2$, are all statistically significant.

Least-squares regression equation: $\hat{y} = \hat{\alpha} + \hat{\beta}_1 x_1 + \hat{\beta}_2 x_2$ (for estimating the expected response value)

The average weight (lb) of children who are 44 months old and 55 inches tall, as estimated by the model, would be $\hat{y} = \hat{\alpha} + \hat{\beta}_1 \times 44 + \hat{\beta}_2 \times 55$.

How to interpret $\alpha$, $\beta_1$, and $\beta_2$?
$\alpha$ is the constant, or y-intercept, of the model. It is the average value of the response when both predictor variables are 0.
$\beta_1$ is the rate of change of expected (average) weight per unit change of age, adjusted for the height variable.
$\beta_2$ is the rate of change of expected (average) weight per unit change of height, adjusted for the age variable.
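The quantities in the Model Summary and ANOVA tables are tied together by standard formulas, which the following sketch makes explicit. The function name `anova_summary` and the sums of squares passed to it are hypothetical; they are not the values from the SPSS output.

```python
def anova_summary(ss_regression, ss_residual, n, q):
    """Summarize a regression ANOVA table for n observations and q predictors."""
    ss_total = ss_regression + ss_residual
    df_regression = q
    df_residual = n - q - 1
    ms_regression = ss_regression / df_regression
    ms_residual = ss_residual / df_residual
    r_square = ss_regression / ss_total  # coefficient of determination
    adjusted = 1 - (1 - r_square) * (n - 1) / df_residual
    return {
        "R": r_square ** 0.5,
        "R Square": r_square,
        "Adjusted R Square": adjusted,
        "Std. Error of the Estimate": ms_residual ** 0.5,
        "F": ms_regression / ms_residual,  # tests H0: all beta_i are zero
    }

# Hypothetical sums of squares for a model with two predictors.
summary = anova_summary(ss_regression=600.0, ss_residual=400.0, n=43, q=2)
for name, value in summary.items():
    print(f"{name}: {value:.3f}")
```

The F statistic is compared with an F distribution on (q, n - q - 1) degrees of freedom to obtain the Sig. column.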

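Other possible models ($y$: Weight, $x_1$: Age, $x_2$: Height):

$y = \alpha + \beta_1 x_1 + \varepsilon$
$y = \alpha + \beta_2 x_2 + \varepsilon$

With interaction term:

$y = \alpha + \beta_1 x_1 + \beta_2 x_2 + \beta_3 x_1 x_2 + \varepsilon$
$y = \alpha + \beta_1 x_1 + \beta_3 x_1 x_2 + \varepsilon$
$y = \alpha + \beta_2 x_2 + \beta_3 x_1 x_2 + \varepsilon$

Here $\beta_3 x_1 x_2$ is the interaction term.

Model estimation with interaction between Age and Height, for the model $y = \alpha + \beta_1 x_1 + \beta_2 x_2 + \beta_3 x_1 x_2 + \varepsilon$:

[SPSS coefficients table; the interaction variable is labeled INTAG_HT.]

A high VIF implies very serious collinearity. The interaction term should not be in the model.

Model: $y = \alpha + \beta_1 x_1 + \beta_2 x_2 + \varepsilon$, where $y$: Weight, $x_1$: Age, $x_2$: Height.

Prediction equation: $\hat{y} = \hat{\alpha} + \hat{\beta}_1 x_1 + \hat{\beta}_2 x_2$

Is the model above a good model for estimating a child's weight based on age and height, for the population from which the sample was taken?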
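To see why an interaction column can inflate the VIF, the sketch below computes the VIF of every predictor by regressing it on the remaining ones. It assumes NumPy; the function `vif` and the simulated age and height values are hypothetical (drawn independently for simplicity, which real children's data would not be).

```python
import numpy as np

def vif(X):
    """Return the VIF of each column of X (predictor columns only, no constant)."""
    X = np.asarray(X, dtype=float)
    n, p = X.shape
    values = []
    for j in range(p):
        target = X[:, j]
        # Regress predictor j on a constant plus all the other predictors.
        others = np.column_stack([np.ones(n), np.delete(X, j, axis=1)])
        coef, *_ = np.linalg.lstsq(others, target, rcond=None)
        resid = target - others @ coef
        # Tolerance = 1 - R_j^2 = residual SS / total SS of predictor j.
        tolerance = (resid @ resid) / np.sum((target - target.mean()) ** 2)
        values.append(1.0 / tolerance)
    return np.array(values)

rng = np.random.default_rng(0)
age = rng.uniform(130, 200, size=200)    # hypothetical ages in months
height = rng.uniform(50, 70, size=200)   # hypothetical heights in inches

vif_main = vif(np.column_stack([age, height]))
vif_with_interaction = vif(np.column_stack([age, height, age * height]))
print("without interaction:", vif_main)
print("with interaction:   ", vif_with_interaction)
```

The product column is close to a linear combination of its two factors when neither is centered at zero, which is what drives the VIF values far above 10 in this simulation.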

If only the male children's data or only the female children's data are modeled, the SPSS coefficients table for each model $y = \alpha + \beta_1 x_1 + \beta_2 x_2 + \varepsilon$ would be:

For boys: [SPSS coefficients table]
Is there serious collinearity? Explain using the statistics in the table.
Write the weight prediction equation using age and height as predictor variables.
Find the average weight for boys who are 44 months old and 55 inches tall.

For girls: [SPSS coefficients table]
Is there serious collinearity? Explain using the statistics in the table.
Write the weight prediction equation using age and height as predictor variables.
Find the average weight for girls who are 44 months old and 55 inches tall.

Indicator Variables

Indicator variables are binary variables that take only two possible values, 0 and 1, and can be used for including categorical variables in the model. Here, Male: 1, Female: 0.

Group Statistics for Weight by Gender (rows: Male, Female; columns: N, Mean, Std. Deviation, Std. Error Mean)

Model (a model for the two-independent-samples situation under the equal-variances condition): $y = \alpha + \beta_1 x_1 + \varepsilon$

When $x_1 = 0$: $y = \alpha + \varepsilon$
When $x_1 = 1$: $y = \alpha + \beta_1 + \varepsilon$

The difference between the averages of the two categories is $\beta_1$.

where $y$: Weight, $x_1$: Gender ($x_1 = 0$ for female, $x_1 = 1$ for male)

SPSS output for linear regression with gender as the predictor variable: [SPSS coefficients table]

SPSS output for the two-independent-samples t-test comparing the mean weight between male and female children:

Independent Samples Test for Weight (rows: Equal variances assumed, Equal variances not assumed; columns: Levene's Test for Equality of Variances (F, Sig.); t-test for Equality of Means (t, df, Sig. (2-tailed), Mean Difference, Std. Error Difference, 95% Confidence Interval of the Difference: Lower, Upper))

The relation between the Weight and Gender variables is insignificant; that is, there is no significant difference between the average weights of male and female children.
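The claim that the slope on a 0/1 indicator equals the difference between the two group averages can be checked numerically. The sketch below uses only the standard library; `simple_regression` and the seven weights are hypothetical illustration values, not the children's data.

```python
def simple_regression(x, y):
    """Least-squares intercept and slope for y = alpha + beta * x + error."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    beta = sxy / sxx
    alpha = y_bar - beta * x_bar
    return alpha, beta

# Hypothetical data: gender coded 0 = female, 1 = male.
gender = [0, 0, 0, 1, 1, 1, 1]
weight = [95.0, 102.0, 100.0, 104.0, 99.0, 110.0, 107.0]

alpha_hat, beta_hat = simple_regression(gender, weight)

female = [w for g, w in zip(gender, weight) if g == 0]
male = [w for g, w in zip(gender, weight) if g == 1]
female_mean = sum(female) / len(female)
male_mean = sum(male) / len(male)

print(alpha_hat, female_mean)             # intercept equals the female mean
print(beta_hat, male_mean - female_mean)  # slope equals the difference in means
```

This is why the t-test for the slope in this model and the equal-variances two-sample t-test lead to the same conclusion.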

Use of Indicator Variables in the Regression Model

Age, Height, and Gender variables as predictor variables

Model: $y = \alpha + \beta_1 x_1 + \beta_2 x_2 + \beta_3 x_3 + \varepsilon$, where $y$: Weight, $x_1$: Age, $x_2$: Height, $x_3$: Gender ($x_3 = 0$ for Female; $x_3 = 1$ for Male)

Model Summary (columns: R, R Square, Adjusted R Square, Std. Error of the Estimate): R = .794
a. Predictors: (Constant), Age, Height, Gender

[Scatter plot of Weight with separate markers for Male and Female children.]

With the Age and Height variables in the model, the Gender variable becomes insignificant. When the difference in average weight between genders is adjusted for the age and height variables, the difference is statistically insignificant.

Age, Height, Gender, and Age-Height interaction variables as predictor variables

[SPSS coefficients table; the interaction variable is labeled INTAG_HT.]

Adding the interaction term to the model increases the VIF in the model estimation. The model without the interaction term would be better.

Age and Gender as Predictor Variables

Model: $y = \alpha + \beta_1 x_1 + \beta_2 x_2 + \varepsilon$, where $y$: Weight, $x_1$: Age, $x_2$: Gender ($x_2 = 0$ for Female; $x_2 = 1$ for Male)

Model Summary (columns: R, R Square, Adjusted R Square, Std. Error of the Estimate): R = .645
a. Predictors: (Constant), Age, Gender

[Scatter plot of Weight with separate markers for Male and Female children.]

Age and Gender are both significant variables when used for predicting weight. There is a significant difference in average weight between genders when adjusted for the age variable.

Exercise: What would be the average weight for 4-year-old boys using the model above?
Model:
Prediction:

Age, Gender, and Age-Gender interaction variables as predictor variables

[SPSS coefficients table; the interaction variable is labeled INTGN_AG.]

Adding the interaction term to the model increases the VIF in the model estimation. The model without the interaction term would be better.

Common mistake: using the internally coded values of a categorical explanatory variable directly in the linear regression calculation. The proper way to include a categorical variable is to use indicator variables. For a categorical variable with k categories, one should set up k - 1 indicator variables.

Example: A survey question asked Race with 3 possible responses, White = 1, Black = 2, Hispanic = 3. One can set up an indicator variable $x_1$ so that $x_1 = 1$ represents White and $x_1 = 0$ otherwise, and another indicator $x_2$ such that $x_2 = 1$ represents Black and $x_2 = 0$ otherwise; $x_1 = 0$ and $x_2 = 0$ together represent Hispanic. The survey also asked "Your Body Fat Percentage" and "Number of hours of exercise per week".

Model: $y = \alpha + \beta_1 x_1 + \beta_2 x_2 + \beta_3 x_3 + \varepsilon$, where $y$: Body Fat Percentage, $x_1$ and $x_2$: Race indicators, $x_3$: Number of hours of exercise per week.

Interpretation of the model:
Race: White, $x_1 = 1$ and $x_2 = 0$: $y = \alpha + \beta_1 + \beta_3 x_3 + \varepsilon$
Race: Black, $x_1 = 0$ and $x_2 = 1$: $y = \alpha + \beta_2 + \beta_3 x_3 + \varepsilon$
Race: Hispanic, $x_1 = 0$ and $x_2 = 0$: $y = \alpha + \beta_3 x_3 + \varepsilon$

Exercise: Suppose that the estimated parameter values for the model are the following: $\hat{\alpha} = 20$, $\hat{\beta}_1 = 2.[\ldots]$, $\hat{\beta}_2 = [\ldots].3$, $\hat{\beta}_3 = [\ldots]$.
Write down the prediction equation:
Estimate the average body fat for a white person who exercises 0 hours per week:
Estimate the average body fat for a black person who exercises 0 hours per week:
Estimate the average body fat for a Hispanic person who exercises 0 hours per week:
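The indicator coding for the three-category Race variable can be written out as a small sketch. The function names and the coefficient values used below are hypothetical placeholders chosen for illustration; they are not the estimates from the exercise.

```python
RACE_CODES = {1: "White", 2: "Black", 3: "Hispanic"}

def race_indicators(code):
    """Map the survey code to (x1, x2); Hispanic (code 3) is the reference category."""
    if code not in RACE_CODES:
        raise ValueError(f"unknown race code: {code}")
    x1 = 1 if code == 1 else 0  # White indicator
    x2 = 1 if code == 2 else 0  # Black indicator
    return x1, x2

def predict_body_fat(code, hours, alpha, beta1, beta2, beta3):
    """y_hat = alpha + beta1*x1 + beta2*x2 + beta3*x3, with x3 = exercise hours."""
    x1, x2 = race_indicators(code)
    return alpha + beta1 * x1 + beta2 * x2 + beta3 * hours

# Hypothetical coefficients, for illustration only.
params = dict(alpha=25.0, beta1=1.0, beta2=2.0, beta3=-0.5)

for code, label in RACE_CODES.items():
    print(label, race_indicators(code), predict_body_fat(code, 4.0, **params))
```

Feeding the raw codes 1, 2, 3 into the model as a single numeric column would instead force the Black group to lie exactly halfway between the White and Hispanic groups, which is the mistake described above.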

Example: Study female life expectancy using percentage of urbanization and birth rate.

[Scatter plots: Female life expectancy versus Percent urban, 1992, and Female life expectancy versus Births per 1000 population, 1992.]

Model: $y = \alpha + \beta_1 \cdot \text{Birth rate} + \beta_2 \cdot \text{Percent urbanization} + \varepsilon$, where $x_1$: Birth rate, $x_2$: Percent urbanization.

Evaluation of the model (SPSS output)

Model Summary (columns: R, R Square, Adjusted R Square, Std. Error of the Estimate): R = .904
a. Predictors: (Constant), Births per 1000 population, 1992, Percent urban, 1992

Coefficient of determination: the percentage of variability in the response variable (female life expectancy) that can be described by the predictor variables (birth rate, percentage of urbanization) through the model.

ANOVA (rows: Regression, Residual, Total; columns: Sum of Squares, df, Mean Square, F, Sig.)
a. Predictors: (Constant), Births per 1000 population, 1992, Percent urban, 1992
b. Dependent Variable: Female life expectancy 1992

Test for significance of the model: p-value = .000 < .05
H0: The model is insignificant (the $\beta_i$'s are all zero).
Ha: The model is significant (some $\beta_i$'s are not zero).

Model estimation (SPSS output)

[SPSS coefficients table; rows: (Constant), Births per 1000 population, 1992, Percent urban, 1992. a. Dependent Variable: Female life expectancy 1992]

Inference for the regression model:
H0: $\alpha = 0$ vs. Ha: $\alpha \neq 0$, p-value = .000 < .05
H0: $\beta_1 = 0$ vs. Ha: $\beta_1 \neq 0$, p-value = .000 < .05
H0: $\beta_2 = 0$ vs. Ha: $\beta_2 \neq 0$, p-value = .000 < .05

Collinearity statistics: a tolerance less than 0.1, or a VIF (Variance Inflation Factor) greater than 10, implies serious collinearity. Collinearity means significant correlations between pairs of independent variables in the model.

Tests for regression coefficients: the parameters in the model, $\alpha$, $\beta_1$, and $\beta_2$, are all statistically significant.

Least-squares regression equation: $\hat{y} = \hat{\alpha} + \hat{\beta}_1 x_1 + \hat{\beta}_2 x_2$ (for estimating the expected response value)

The average female life expectancy for countries whose birth rate per 1000 is 30 and whose percentage of urbanization is [...] would be $\hat{y} = \hat{\alpha} + \hat{\beta}_1 \times 30 + \hat{\beta}_2 \times [\ldots]$ (estimated by the model).

How to interpret $\alpha$, $\beta_1$, and $\beta_2$?
$\alpha$ is the constant, or y-intercept, of the model. It is the average value of the response variable when both predictor variables are 0.
$\beta_1$ is the rate of change of expected (average) life expectancy per unit change of birth rate, adjusted for percentage of urbanization.
$\beta_2$ is the rate of change of expected (average) life expectancy per unit change of percentage of urbanization, adjusted for the birth rate.

Other possible models ($x_1$: Birth rate, $x_2$: Percent urbanization):

$y = \alpha + \beta_1 x_1 + \varepsilon$
$y = \alpha + \beta_2 x_2 + \varepsilon$

With interaction effect:

$y = \alpha + \beta_1 x_1 + \beta_2 x_2 + \beta_3 x_1 x_2 + \varepsilon$
$y = \alpha + \beta_1 x_1 + \beta_3 x_1 x_2 + \varepsilon$
$y = \alpha + \beta_2 x_2 + \beta_3 x_1 x_2 + \varepsilon$

Here $\beta_3 x_1 x_2$ is the interaction term.

Understanding female life expectancy and how it is related to the explanatory variables: Birth Rate, Urbanization, Phones, Doctors, and GDP.

[Scatter-plot matrices before and after log transformation of Phones, Doctors, and GDP. Before transformation: Female life expectancy, Births per 1000 population, Percent urban, 1992, Phones per 100 people, Doctors per 10,000 people, GDP per capita. After transformation: Female life expectancy, Births per 1000 population, Percent urban, 1992, Natural log of phones per 100 people, Natural log of doctors per 10,000 people, Natural log of GDP.]

Model Summary (columns: R, R Square, Adjusted R Square, Std. Error of the Estimate, Durbin-Watson): R = .934
a. Predictors: (Constant), Natural log of GDP, Percent urban, 1992, Births per 1000 population, 1992, Natural log of doctors per 10000, Natural log of phones per 100 people
b. Dependent Variable: Female life expectancy 1992
The Durbin-Watson statistic is used to check the independence of the errors.

ANOVA (rows: Regression, Residual, Total; columns: Sum of Squares, df, Mean Square, F, Sig.)
a. Predictors: (Constant), Natural log of GDP, Percent urban, 1992, Births per 1000 population, 1992, Natural log of doctors per 10000, Natural log of phones per 100 people
b. Dependent Variable: Female life expectancy 1992

[SPSS coefficients table; rows: (Constant), Births per 1000 population, 1992, Percent urban, 1992, Natural log of phones per 100 people, Natural log of doctors per 10000, Natural log of GDP. a. Dependent Variable: Female life expectancy 1992]

Multicollinearity: Tolerance measures the strength of the linear relation between the independent variables. It is better for the tolerance to be higher than 0.1. VIF is the reciprocal of Tolerance.
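The reason for log-transforming Phones, Doctors, and GDP is that a linear model fits better when the response is roughly linear in the predictor. The sketch below illustrates this with the Pearson correlation before and after the transformation, using only the standard library. The GDP values and the generating rule for life expectancy are hypothetical, constructed so that the response is exactly linear in ln(GDP).

```python
import math

def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    syy = sum((yi - y_bar) ** 2 for yi in y)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical GDP per capita values spanning two orders of magnitude.
gdp = [200.0, 500.0, 1000.0, 2500.0, 5000.0, 10000.0, 20000.0]
# Hypothetical response, built to be exactly linear in ln(GDP).
life_expectancy = [40.0 + 4.0 * math.log(g) for g in gdp]

r_raw = pearson_r(gdp, life_expectancy)
r_log = pearson_r([math.log(g) for g in gdp], life_expectancy)
print(f"correlation with GDP:     {r_raw:.3f}")
print(f"correlation with ln(GDP): {r_log:.3f}")
```

In practice the relationship will not be exact, but a scatter plot that straightens out after the transformation is the same effect.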

Step-wise selection

ANOVA (Models 1, 2, and 3, each with rows Regression, Residual, Total; columns: Sum of Squares, df, Mean Square, F, Sig.)
a. Predictors (Model 1): (Constant), Natural log of phones per 100 people
b. Predictors (Model 2): (Constant), Natural log of phones per 100 people, Births per 1000 population, 1992
c. Predictors (Model 3): (Constant), Natural log of phones per 100 people, Births per 1000 population, 1992, Natural log of doctors per 10000
d. Dependent Variable: Female life expectancy 1992

[SPSS coefficients table for Models 1, 2, and 3, with the same predictors as above; columns include B, Std. Error, and Beta. a. Dependent Variable: Female life expectancy 1992]

What are the significant factors that are related to female life expectancy?

In stepwise regression, a large number of tests are performed, which leads to a higher probability of a Type I or Type II error. It should be used when one wants to determine the important independent variables from a large number of potentially useful variables in the modeling process.
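A simplified sketch of the selection idea follows. It implements only the forward (entry) steps and uses a fixed F-to-enter threshold; the stepwise procedure in SPSS also removes variables and uses its own entry and removal criteria, so this is an illustration under stated assumptions rather than a reproduction of that procedure. NumPy is assumed; the function names, the threshold of 4.0, and the simulated data are hypothetical.

```python
import numpy as np

def residual_ss(columns, y):
    """Residual sum of squares after regressing y on a constant plus the given columns."""
    design = np.column_stack([np.ones(len(y))] + columns)
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    resid = y - design @ coef
    return float(resid @ resid)

def forward_select(X, y, names, f_to_enter=4.0):
    """Add, one at a time, the predictor with the largest partial F statistic."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    n, p = X.shape
    selected = []
    current_ss = residual_ss([], y)
    while len(selected) < p:
        best = None
        for j in range(p):
            if j in selected:
                continue
            trial_ss = residual_ss([X[:, k] for k in selected + [j]], y)
            df_resid = n - (len(selected) + 1) - 1
            # Partial F: reduction in residual SS relative to the new residual mean square.
            f_stat = (current_ss - trial_ss) / (trial_ss / df_resid)
            if best is None or f_stat > best[0]:
                best = (f_stat, j, trial_ss)
        if best[0] < f_to_enter:
            break  # no remaining candidate improves the fit enough
        selected.append(best[1])
        current_ss = best[2]
    return [names[j] for j in selected]

# Hypothetical data: y depends on x_a (strongly) and x_b; x_c is irrelevant.
rng = np.random.default_rng(1)
X = rng.normal(size=(120, 3))
y = 2.0 + 3.0 * X[:, 0] - 2.0 * X[:, 1] + rng.normal(size=120)

chosen = forward_select(X, y, names=["x_a", "x_b", "x_c"])
print(chosen)
```

Because every step runs a test on every remaining candidate, an irrelevant variable can occasionally pass the threshold by chance, which is the multiple-testing concern raised above.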

Use of regression analysis
1. Description (model, system, relation): relation between life expectancy, birth rate, GDP, ...; relation between salary, rank, years of service, ...
2. Control: died too young, underpaid, overpaid, ...
3. Prediction: life expectancy, salary for newcomers, future salary, ...
4. Variable screening (important factors): What are the important factors affecting salary or life expectancy?

Construction of regression models:
1. Hypothesize the form of the model for $\mu_{y|x_1, x_2, x_3, \ldots, x_q}$:
   a) Select the predictor variables.
   b) Decide the functional form of the regression equation.
   c) Define the scope of the model (design range).
2. Collect the sample data (observations, experiments).
3. Use the sample to estimate the unknown parameters in the model.
4. Specify the probability distribution of the random error.
5. Statistically check the usefulness of the model.
6. Apply the model in decision making.
7. Review the model with new data.

What is a linear model?

Examples of linear models:
$y = \beta_0 + \beta_1 x + \varepsilon$
$y = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \varepsilon$
$y = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \beta_3 x_1 x_2 + \varepsilon$
$y = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \beta_3 x_1 x_2 + \beta_4 x_1^2 + \beta_5 x_2^2 + \varepsilon$
$y = \beta_0 + \beta_1 \ln(x) + \varepsilon$
$y = \beta_0 + \beta_1 e^{x} + \varepsilon$

A model is linear if it is linear in terms of its parameters.
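Because linearity refers to the parameters, a model with product and squared terms can still be fitted by ordinary least squares once those terms are computed as extra columns. The sketch below does this for the second-order model listed above. It assumes NumPy; `second_order_design`, the coefficient vector `true_beta`, and the simulated inputs are hypothetical.

```python
import numpy as np

def second_order_design(x1, x2):
    """Columns: 1, x1, x2, x1*x2, x1^2, x2^2 (nonlinear in x, linear in the betas)."""
    x1 = np.asarray(x1, dtype=float)
    x2 = np.asarray(x2, dtype=float)
    return np.column_stack([np.ones_like(x1), x1, x2, x1 * x2, x1 ** 2, x2 ** 2])

rng = np.random.default_rng(0)
x1 = rng.uniform(-2.0, 2.0, size=50)
x2 = rng.uniform(-2.0, 2.0, size=50)

# Hypothetical coefficients beta_0 ... beta_5; the response is generated without noise.
true_beta = np.array([1.0, 2.0, -1.0, 0.5, 0.3, -0.2])
design = second_order_design(x1, x2)
y = design @ true_beta

beta_hat, *_ = np.linalg.lstsq(design, y, rcond=None)
print(beta_hat)
```

The same approach covers the $\ln(x)$ and $e^{x}$ examples: transform the predictor first, then fit a straight line to the transformed column.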

Chapter 9 - Correlation and Regression

Chapter 9 - Correlation and Regression Chapter 9 - Correlation and Regression 9. Scatter diagram of percentage of LBW infants (Y) and high-risk fertility rate (X ) in Vermont Health Planning Districts. 9.3 Correlation between percentage of

More information

Chapter 13. Multiple Regression and Model Building

Chapter 13. Multiple Regression and Model Building Chapter 13 Multiple Regression and Model Building Multiple Regression Models The General Multiple Regression Model y x x x 0 1 1 2 2... k k y is the dependent variable x, x,..., x 1 2 k the model are the

More information

Multiple Regression. More Hypothesis Testing. More Hypothesis Testing The big question: What we really want to know: What we actually know: We know:

Multiple Regression. More Hypothesis Testing. More Hypothesis Testing The big question: What we really want to know: What we actually know: We know: Multiple Regression Ψ320 Ainsworth More Hypothesis Testing What we really want to know: Is the relationship in the population we have selected between X & Y strong enough that we can use the relationship

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore What is Multiple Linear Regression Several independent variables may influence the change in response variable we are trying to study. When several independent variables are included in the equation, the

More information

Multiple linear regression S6

Multiple linear regression S6 Basic medical statistics for clinical and experimental research Multiple linear regression S6 Katarzyna Jóźwiak k.jozwiak@nki.nl November 15, 2017 1/42 Introduction Two main motivations for doing multiple

More information

Unit 6 - Introduction to linear regression

Unit 6 - Introduction to linear regression Unit 6 - Introduction to linear regression Suggested reading: OpenIntro Statistics, Chapter 7 Suggested exercises: Part 1 - Relationship between two numerical variables: 7.7, 7.9, 7.11, 7.13, 7.15, 7.25,

More information

Chapter 14 Student Lecture Notes 14-1

Chapter 14 Student Lecture Notes 14-1 Chapter 14 Student Lecture Notes 14-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 14 Multiple Regression Analysis and Model Building Chap 14-1 Chapter Goals After completing this

More information

Practical Biostatistics

Practical Biostatistics Practical Biostatistics Clinical Epidemiology, Biostatistics and Bioinformatics AMC Multivariable regression Day 5 Recap Describing association: Correlation Parametric technique: Pearson (PMCC) Non-parametric:

More information

Review of Multiple Regression

Review of Multiple Regression Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate

More information

Analysing data: regression and correlation S6 and S7

Analysing data: regression and correlation S6 and S7 Basic medical statistics for clinical and experimental research Analysing data: regression and correlation S6 and S7 K. Jozwiak k.jozwiak@nki.nl 2 / 49 Correlation So far we have looked at the association

More information

A discussion on multiple regression models

A discussion on multiple regression models A discussion on multiple regression models In our previous discussion of simple linear regression, we focused on a model in which one independent or explanatory variable X was used to predict the value

More information

Answer Key. 9.1 Scatter Plots and Linear Correlation. Chapter 9 Regression and Correlation. CK-12 Advanced Probability and Statistics Concepts 1

Answer Key. 9.1 Scatter Plots and Linear Correlation. Chapter 9 Regression and Correlation. CK-12 Advanced Probability and Statistics Concepts 1 9.1 Scatter Plots and Linear Correlation Answers 1. A high school psychologist wants to conduct a survey to answer the question: Is there a relationship between a student s athletic ability and his/her

More information

Multiple Regression and Model Building (cont d) + GIS Lecture 21 3 May 2006 R. Ryznar

Multiple Regression and Model Building (cont d) + GIS Lecture 21 3 May 2006 R. Ryznar Multiple Regression and Model Building (cont d) + GIS 11.220 Lecture 21 3 May 2006 R. Ryznar Model Summary b 1-[(SSE/n-k+1)/(SST/n-1)] Model 1 Adjusted Std. Error of R R Square R Square the Estimate.991

More information

Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3

Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3 Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details Section 10.1, 2, 3 Basic components of regression setup Target of inference: linear dependency

More information

Chapter 3 Multiple Regression Complete Example

Chapter 3 Multiple Regression Complete Example Department of Quantitative Methods & Information Systems ECON 504 Chapter 3 Multiple Regression Complete Example Spring 2013 Dr. Mohammad Zainal Review Goals After completing this lecture, you should be

More information

Single and multiple linear regression analysis

Single and multiple linear regression analysis Single and multiple linear regression analysis Marike Cockeran 2017 Introduction Outline of the session Simple linear regression analysis SPSS example of simple linear regression analysis Additional topics

More information

Multiple linear regression

Multiple linear regression Multiple linear regression Course MF 930: Introduction to statistics June 0 Tron Anders Moger Department of biostatistics, IMB University of Oslo Aims for this lecture: Continue where we left off. Repeat

More information

Final Exam - Solutions

Final Exam - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your

More information

General linear models. One and Two-way ANOVA in SPSS Repeated measures ANOVA Multiple linear regression

General linear models. One and Two-way ANOVA in SPSS Repeated measures ANOVA Multiple linear regression General linear models One and Two-way ANOVA in SPSS Repeated measures ANOVA Multiple linear regression 2-way ANOVA in SPSS Example 14.1 2 3 2-way ANOVA in SPSS Click Add 4 Repeated measures The stroop

More information

Two-Way ANOVA. Chapter 15

Two-Way ANOVA. Chapter 15 Two-Way ANOVA Chapter 15 Interaction Defined An interaction is present when the effects of one IV depend upon a second IV Interaction effect : The effect of each IV across the levels of the other IV When

More information

Chapter 7 Student Lecture Notes 7-1

Chapter 7 Student Lecture Notes 7-1 Chapter 7 Student Lecture Notes 7- Chapter Goals QM353: Business Statistics Chapter 7 Multiple Regression Analysis and Model Building After completing this chapter, you should be able to: Explain model

More information

y response variable x 1, x 2,, x k -- a set of explanatory variables

y response variable x 1, x 2,, x k -- a set of explanatory variables 11. Multiple Regression and Correlation y response variable x 1, x 2,, x k -- a set of explanatory variables In this chapter, all variables are assumed to be quantitative. Chapters 12-14 show how to incorporate

More information

Project Report for STAT571 Statistical Methods Instructor: Dr. Ramon V. Leon. Wage Data Analysis. Yuanlei Zhang

Project Report for STAT571 Statistical Methods Instructor: Dr. Ramon V. Leon. Wage Data Analysis. Yuanlei Zhang Project Report for STAT7 Statistical Methods Instructor: Dr. Ramon V. Leon Wage Data Analysis Yuanlei Zhang 77--7 November, Part : Introduction Data Set The data set contains a random sample of observations

More information

: The model hypothesizes a relationship between the variables. The simplest probabilistic model: or.

: The model hypothesizes a relationship between the variables. The simplest probabilistic model: or. Chapter Simple Linear Regression : comparing means across groups : presenting relationships among numeric variables. Probabilistic Model : The model hypothesizes an relationship between the variables.

More information

Regression ( Kemampuan Individu, Lingkungan kerja dan Motivasi)

Regression ( Kemampuan Individu, Lingkungan kerja dan Motivasi) Regression (, Lingkungan kerja dan ) Descriptive Statistics Mean Std. Deviation N 3.87.333 32 3.47.672 32 3.78.585 32 s Pearson Sig. (-tailed) N Kemampuan Lingkungan Individu Kerja.000.432.49.432.000.3.49.3.000..000.000.000..000.000.000.

More information

Chapter 4. Regression Models. Learning Objectives

Chapter 4. Regression Models. Learning Objectives Chapter 4 Regression Models To accompany Quantitative Analysis for Management, Eleventh Edition, by Render, Stair, and Hanna Power Point slides created by Brian Peterson Learning Objectives After completing

More information

Correlation and Regression Bangkok, 14-18, Sept. 2015

Correlation and Regression Bangkok, 14-18, Sept. 2015 Analysing and Understanding Learning Assessment for Evidence-based Policy Making Correlation and Regression Bangkok, 14-18, Sept. 2015 Australian Council for Educational Research Correlation The strength

More information

REVIEW 8/2/2017 陈芳华东师大英语系

REVIEW 8/2/2017 陈芳华东师大英语系 REVIEW Hypothesis testing starts with a null hypothesis and a null distribution. We compare what we have to the null distribution, if the result is too extreme to belong to the null distribution (p

More information

Unit 11: Multiple Linear Regression

Unit 11: Multiple Linear Regression Unit 11: Multiple Linear Regression Statistics 571: Statistical Methods Ramón V. León 7/13/2004 Unit 11 - Stat 571 - Ramón V. León 1 Main Application of Multiple Regression Isolating the effect of a variable

More information

QUANTITATIVE STATISTICAL METHODS: REGRESSION AND FORECASTING JOHANNES LEDOLTER VIENNA UNIVERSITY OF ECONOMICS AND BUSINESS ADMINISTRATION SPRING 2013

QUANTITATIVE STATISTICAL METHODS: REGRESSION AND FORECASTING JOHANNES LEDOLTER VIENNA UNIVERSITY OF ECONOMICS AND BUSINESS ADMINISTRATION SPRING 2013 QUANTITATIVE STATISTICAL METHODS: REGRESSION AND FORECASTING JOHANNES LEDOLTER VIENNA UNIVERSITY OF ECONOMICS AND BUSINESS ADMINISTRATION SPRING 3 Introduction Objectives of course: Regression and Forecasting

More information

Multiple Regression Analysis

Multiple Regression Analysis Multiple Regression Analysis Where as simple linear regression has 2 variables (1 dependent, 1 independent): y ˆ = a + bx Multiple linear regression has >2 variables (1 dependent, many independent): ˆ

More information

Simple Linear Regression Using Ordinary Least Squares

Simple Linear Regression Using Ordinary Least Squares Simple Linear Regression Using Ordinary Least Squares Purpose: To approximate a linear relationship with a line. Reason: We want to be able to predict Y using X. Definition: The Least Squares Regression

More information

Chapter 4: Regression Models

Chapter 4: Regression Models Sales volume of company 1 Textbook: pp. 129-164 Chapter 4: Regression Models Money spent on advertising 2 Learning Objectives After completing this chapter, students will be able to: Identify variables,

More information

Inferences for Regression

Inferences for Regression Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In

More information

Chapter Goals. To understand the methods for displaying and describing relationship among variables. Formulate Theories.

Chapter Goals. To understand the methods for displaying and describing relationship among variables. Formulate Theories. Chapter Goals To understand the methods for displaying and describing relationship among variables. Formulate Theories Interpret Results/Make Decisions Collect Data Summarize Results Chapter 7: Is There

More information

Basic Statistics Exercises 66

Basic Statistics Exercises 66 Basic Statistics Exercises 66 42. Suppose we are interested in predicting a person's height from the person's length of stride (distance between footprints). The following data is recorded for a random

More information

Example: Forced Expiratory Volume (FEV) Program L13. Example: Forced Expiratory Volume (FEV) Example: Forced Expiratory Volume (FEV)

Example: Forced Expiratory Volume (FEV) Program L13. Example: Forced Expiratory Volume (FEV) Example: Forced Expiratory Volume (FEV) Program L13 Relationships between two variables Correlation, cont d Regression Relationships between more than two variables Multiple linear regression Two numerical variables Linear or curved relationship?

More information

Parametric Test. Multiple Linear Regression Spatial Application I: State Homicide Rates Equations taken from Zar, 1984.

Parametric Test. Multiple Linear Regression Spatial Application I: State Homicide Rates Equations taken from Zar, 1984. Multiple Linear Regression Spatial Application I: State Homicide Rates Equations taken from Zar, 984. y ˆ = a + b x + b 2 x 2K + b n x n where n is the number of variables Example: In an earlier bivariate

More information

Unit 6 - Simple linear regression

Unit 6 - Simple linear regression Sta 101: Data Analysis and Statistical Inference Dr. Çetinkaya-Rundel Unit 6 - Simple linear regression LO 1. Define the explanatory variable as the independent variable (predictor), and the response variable

More information

Multiple Regression. Peerapat Wongchaiwat, Ph.D.

Multiple Regression. Peerapat Wongchaiwat, Ph.D. Peerapat Wongchaiwat, Ph.D. wongchaiwat@hotmail.com The Multiple Regression Model Examine the linear relationship between 1 dependent (Y) & 2 or more independent variables (X i ) Multiple Regression Model

More information

Data Analysis 1 LINEAR REGRESSION. Chapter 03

Data Analysis 1 LINEAR REGRESSION. Chapter 03 Data Analysis 1 LINEAR REGRESSION Chapter 03 Data Analysis 2 Outline The Linear Regression Model Least Squares Fit Measures of Fit Inference in Regression Other Considerations in Regression Model Qualitative

More information

Prepared by: Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies Universiti

Prepared by: Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies Universiti Prepared by: Prof Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies Universiti Putra Malaysia Serdang M L Regression is an extension to

More information

9. Linear Regression and Correlation

9. Linear Regression and Correlation 9. Linear Regression and Correlation Data: y a quantitative response variable x a quantitative explanatory variable (Chap. 8: Recall that both variables were categorical) For example, y = annual income,

More information

Ecn Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman. Midterm 2. Name: ID Number: Section:

Ecn Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman. Midterm 2. Name: ID Number: Section: Ecn 102 - Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman Midterm 2 You have until 10:20am to complete this exam. Please remember to put your name,

More information
