Example: Forced Expiratory Volume (FEV) Program L13. Example: Forced Expiratory Volume (FEV) Example: Forced Expiratory Volume (FEV)

Size: px
Start display at page:

Download "Example: Forced Expiratory Volume (FEV) Program L13. Example: Forced Expiratory Volume (FEV) Example: Forced Expiratory Volume (FEV)"

Transcription

1 Program L13 Relationships between two variables Correlation, cont d Regression Relationships between more than two variables Multiple linear regression Two numerical variables Linear or curved relationship? 1 2 Assumptions for Pearson s correlation coefficient: Random samples Both variables are numerical No extreme values Assumptions for Spearman s correlation coefficient: Random samples The variables can at least be ranked (ordered) 3 The two variables Correlations Height (cm) FEV Height (cm) Pearson Correlation 1,869 ** Sig. (2-tailed), N FEV Pearson Correlation,869 ** 1 Sig. (2-tailed), N **. Correlation is significant at the.1 level (2-tailed). Pearson s correlation coefficient ( r ) The correlation between the two variables height and lung capacity (FEV) is quite strong (r =.869). The correlation is significantly different from zero (low P-value). P - value (H a : ρ ) 4 Quantitative Methods Fall 215 1

2 Testing the significance of the correlation A hypothesis test can be performed to test the significance of the correlation coefficient. If there is no correlation, the correlation coefficient is zero. H : ρ = : ρ H a The correlation in the population is denoted by the greek letter ρ (rho) If the P-value is low enough (usually compared to a significance level of 5%), the null hypothesis is rejected and we state that the correlation is significant (or significantly different from zero). 5 The two variables Correlations Height (cm) Spearman's rho Height (cm) Correlation Coefficient 1,,888 ** FEV Sig. (2-tailed)., N FEV Correlation Coefficient,888 ** 1, **. Correlation is significant at the.1 level (2-tailed). Sig. (2-tailed),. N Spearman s correlation coefficient ( r s ) P - value (H a : ρ s ) The correlation between the two variables height and lung capacity (FEV) is quite strong also when measured by the Spearman coefficient (r s =.888). The correlation is significantly different from zero (low P-value). 6 Nonsense correlation Limitations of the correlation coefficient (Pearson) A numerical relationship does not necessarily mean that there is a causal relationship X = Age of school children Y = Height of the same children X = Shoe size of shool girls Y = Drawing skills Z = Age r > X Y Cause-effect r > X Y Z Nonsense correlation 7 The correlation coefficient measures how well the relationship is captured by a straight line, but does not give any indication about which line. Two relationships with the same correlation coefficient: We can only see whether the direction is positive or negative, i.e. if the line is sloping upwards or downwards. 8 Quantitative Methods Fall 215 2

3 Relationship between two quantitative variables If a scatterplot shows a linear relationship, we can summarize this overall pattern by drawing a line on the scatterplot. A regression line summarizes the relationship between two variables in a specific setting; when one of the variables helps explain or predict the other. Linear regression Weight Vikt (kg) Age Ålder (years) 9 We have two variables that we want to investigate the relationship between. Eg: X=Age (years), Y=Weight (kg) 1 Linear regression Regression: Cause and effect Weight Vikt (kg) Age Ålder (years) A straight line can be used to summarize this relationhip. Is there a direction in the relationship? Determine which variable causes effects in the other. What is cause and effect when we study... Height and Weight of 19 year old boys? Price and Milage of used cars? First determine the direction of the relationship. Next find the straight line which best fits the scatter plot, i.e. determine location and direction of the line Quantitative Methods Fall 215 3

4 Regression: Terminology Linear regression The X-variable is assumed to effect the Y-variable. The X-variable is called the explanatory variable (the one that explains the other). The Y-variable is called the response variable (the one that responds to or depends on the other). Response variable (Y) Weight Vikt (kg) Age Ålder (years) Explanatory variable (X) The regression line The regression line For a quantitative variable, we often use a sample to get information about the population mean µ. If there is a relationship between X and Y, then regression analysis can use the information in the X-variable to improve our estimate of the mean of Y. The regression line can be viewed as point estimates of the mean of Y for different values of the X-variable. The line can be seen as point estimates of mean weight for different values of age Quantitative Methods Fall 215 4

5 What is a straight line? The mathematical description of a straight line is y = b + b 1 x What is a straight line? b is called the intercept and determines where the line crosses the Y-axis, i.e. the value of Y when X=. Intercept, b (the value of Y=weight where the line crosses the Y-axis, i.e.when Age=) What is a straight line? b 1 determines the slope of the line, the amount by which Y changes when X increases by one unit. b 1 =the amount by which Y (weight) changes when X increases by one unit What is the best line? So, we want to find a line that fits the data in the best possible way. We want the line to be in the middle of the scatter, and we want it to be as close to the individuals ( data points ) as possible. X (age) increases by one unit (1 year) 19 2 Quantitative Methods Fall 215 5

6 What is the best line? Least Squares Regression As when calculating the variance, the distances are squared, and the line that makes the sum of all squared distances as small as possible is considered to be the best line. This is called the least-squares regression line Linear regression Intercept Explanatory variable (X) Model Coefficients a Unstandardized Coefficients Standardized Coefficients B Std. Error Beta 1 (Constant) -6,88,514-11,846, Height (cm),56,3,869 17,264, a. Dependent Variable: FEV Dependent variable (Y) The effect of Height, b 1 The value of FEV (Y) when Height= (b ) t Sig. The P-value of the coefficients b and b 1 23 In SPSS: Analyze >> Regression >> Linear 24 Quantitative Methods Fall 215 6

7 Testing the significance of the coefficients -Linear regression A hypothesis test can be performed to test the significance of the regression coefficients. If X does not help explain Y, the value of b 1 is. H H a β : 1 = : β 1 The coefficient in the population is denoted by the greek letter β (beta) If the P-value is low enough (usually compared to a significance level of 5%), the null hypothesis is rejected and we state that the coefficient is significant (or significantly different from zero). 25 SPSS gives us the least-squares regression line Estimated from our sample y = b + b 1 x FEV = Height This means that for every cm taller the children are, the lung capacity increases by on average.56 liters (significantly different from zero, P-value <.5). Or, for every dm taller the children are, the lung capacity increases by.56 liters on average. 26 -Linear regression -Linear regression SPSS gives us the least-squares regression line Estimated from our sample y = b + b 1 x FEV = Height The intercept means that when the height is zero, the lung capacity is liters. Negative lung capacity??? 27 The regression line is fitted to the available data, it does not fit to any imaginary data outside the range of existing values. The intercept is the value of Y when X is zero. Height= is not a value that can exist, and it is also far outside our range of values (minimum = 12 cm). 28 Quantitative Methods Fall 215 7

8 Extrapolation Interpretation of the intercept (b ) If we include the value we can see how far it is from the range of existing values. The intercept only denotes the point where the regression line crosses the Y-axis and is not affected by whether this point has a relevant interpretation or not. Drawing conclusions about values outside the range of the sample is called extrapolating. Such predictions are often not accurate and should be avoided Interpretation of the regression coefficients (b and b 1 ) The intercept b is the estimated value of the Y-variable when X is. If X= is far outside the range of values in the sample, or the X-variable cannot take the value, the intercept (b ) does not have an interpretation. The slope b 1 is an estimate of how the population mean of the Y-variable changes when the X-variable increases by one unit. Predicting average values of Y We have the least-squares regression line for the relationship between Height and FEV FEV = Height Say that we are interested in the average lung capacity (FEV) of 14 cm tall children. Plug in the value Height=14 in our regression equation FEV = x 14 We estimate the average lung capacity for 14 cm tall children to x 14 =1.752 litres 32 Quantitative Methods Fall 215 8

9 How strong is the relationship? How much better does the estimation of the mean of the Y-variable (µ y ) get when we use information from the X-variable? The coefficient of determination (R 2 ) We can calculate the share of the total variation in the Y-variable which is explained by the regression. We define the coefficient of determination, R 2 For simple regression (using only one X-variable) the coefficient of determination equals the square of the correlation coefficient, r 2 = R 2. This relationship does not hold for multiple regression (when you have two or more X-variables) The coefficient of determination (R 2 ) The coefficient of determination shows how much of the total variation in the Y-variable that is being explained by the X-variable(s). R 2 1 ( 1%) -coefficient of determination Model R R Square Model Summary Adjusted R Square Std. Error of the Estimate 1,869 a,754,752,47933 a. Predictors: (Constant), Height (cm) Coefficient of determination (R 2 ) 35 In SPSS: Analyze >> Regression >> Linear. Y=dependent variable, X=independent. 36 Quantitative Methods Fall 215 9

10 -coefficient of determination R 2 =.754, i.e. the coefficient of determination is 75.4%. Interpretation: The individuals in the sample have different lung capacity measured by FEV, i.e. there exists a variation in lung capacity. They are also of different height, i.e. there exists a variation in height. The standard deviation (from the perspective of regression) The standard deviation is a measure of the average distance from the mean. In regression analysis, we are interested in the average distance from the regression line, i.e. the standard error of the regression prediction. The coefficient of determination tells us that 75.4% of the variation in lung capacity can be explained by the individuals different heights standard error of the prediction - standard error of the prediction Model R R Square Model Summary Adjusted R Square Std. Error of the Estimate 1,869 a,754,752,47933 a. Predictors: (Constant), Height (cm) The standard error of the regression prediction, i.e. the average distance from the regression line The standard error has to be interpreted in relation to the values of the variable in the sample. Lung capacity takes values between 1 and 6 l in the sample Average distance from the line is.479 liters In SPSS: Analyze >> Regression >> Linear. Y=dependent variable, X=independent Quantitative Methods Fall 215 1

11 - standard error of the prediction The standard error of the prediction is.479 liters. Interpretation: The children have different lung capacity, and different heights. A regression line shows the average lung capacity for different heights of the children. In our sample, the children s lung capacity is on average.479 liters more or less than the regression line. Half a liter is not very much in relation to the values that the variable (FEV) takes in the sample, 1-6 liters. The standard deviation of FEV (without using the information about height) is approx. 1 liter. Multiple linear regression In the previous example we managed to explain about 75% (R 2 =.754) of the variation in lung capacity (measured by FEV) of a sample of children, using their differences in height as the explanatory variable. While this is a very large share, it also means that 25% of the variation in lung capacity is unexplained Multiple linear regression Example: House prices There is often more than one factor behind the variation in the response variable Y. We can try to explain more by adding more explanatory variables (X-variables) in the model, thus making it into a multiple linear regression model. A real estate agent wants to make a prediction of the price of a house in a particular area. She knows that the price is affected by e.g. the size of the house. A sample of 19 recently sold houses is available. Y = house price (kr) X = house size (m 2 ) 43 Quantitative Methods Fall

12 Example: House prices Example: House prices (Constant) House size Price = House size The average price of houses in this area increases with 11 5 kr for every extra m 2 (interpretation of the slope b 1 ). The intercept (b ) cannot be interpreted here since there are no houses with a living area of m % of the variation in price between different houses can be explained by their differences in living area (interpretation of the coefficient of determination R 2 ). 31.3% of the variation in prices is still unexplained. 46 Multiple linear regression Two (or more) explanatory variables A multiple linear regression model with two explanatory variables is written Two X-variables y ˆ = b + + b1x1 b2x2 The coefficients b 1 and b 2 are interpreted as marginal changes, i.e. b 1 is the change in y when x 1 is changed one unit, and x 2 is being kept constant. b is the value of Y when all X-variables are zero. Multiple linear regression equation A model with two explanatory variables y y ˆ = b + + b1x1 b2x2 x 2 47 x 1 Quantitative Methods Fall

13 Example: House prices Dummy variables Can we find any other explanatory variables that might help explain the variation in house prices? The existence of a garage is likely to have an effect on the price of the house, and the agent wants to take this information into account. But whether a house has a garage or not is not a quantitative variable, it is a categorical variable (there either is a garage or not). We can quantify a categorical variable with only two categories as a variable that can take the values or 1, where 1 denotes the property of interest (eg. 1= the house has a garage, and = the house has no garage ). Such variables are called dummy variables and can be used as explanatory variables in regression analysis Example: House prices Example: House prices We expand the model to include the dummy variable Garage Y = house price (kr) X 1 = house size (m 2 ) X 2 = Garage 1 If the house has a garage If the house doesn t have a garage Price = House size Garage A house with one extra m 2 is sold for on average 1 4 kr more, keeping the other variable constant (i.e. not making any changes to the garage variable). A house with a garage cost on average 222 kr more than one without, keeping the other variable constant (i.e. not making any changes to the house size variable). Quantitative Methods Fall

14 What characterizes good explanatory variables? Strong correlation with the response variable. If the pattern of variation in the explanatory variable and the response variable corresponds, i.e. they covary, then one can be used to predict the other. Weak correlation with other explanatory variables. If the explanatory variables are strongly correlated, it is likely that they explain the same part of the variation in the response variable. Which explanatory variables to include in the model? The goal is to predict the value of the response variable. Use only those explanatory variables (X-variables) which have a substance-relation to the response variable (Y). Try not to use several variables which explain the same variation in the response variable Home assignment Deadline for supplements: Dec 17 th 17.. For supplements: READ THE INSTRUCTIONS again! Clearly mark your changes compared to the original report. 55 Quantitative Methods Fall

Correlation. A statistics method to measure the relationship between two variables. Three characteristics

Correlation. A statistics method to measure the relationship between two variables. Three characteristics Correlation Correlation A statistics method to measure the relationship between two variables Three characteristics Direction of the relationship Form of the relationship Strength/Consistency Direction

More information

Can you tell the relationship between students SAT scores and their college grades?

Can you tell the relationship between students SAT scores and their college grades? Correlation One Challenge Can you tell the relationship between students SAT scores and their college grades? A: The higher SAT scores are, the better GPA may be. B: The higher SAT scores are, the lower

More information

Business Statistics. Lecture 10: Correlation and Linear Regression

Business Statistics. Lecture 10: Correlation and Linear Regression Business Statistics Lecture 10: Correlation and Linear Regression Scatterplot A scatterplot shows the relationship between two quantitative variables measured on the same individuals. It displays the Form

More information

Correlation Analysis

Correlation Analysis Simple Regression Correlation Analysis Correlation analysis is used to measure strength of the association (linear relationship) between two variables Correlation is only concerned with strength of the

More information

REVIEW 8/2/2017 陈芳华东师大英语系

REVIEW 8/2/2017 陈芳华东师大英语系 REVIEW Hypothesis testing starts with a null hypothesis and a null distribution. We compare what we have to the null distribution, if the result is too extreme to belong to the null distribution (p

More information

Multiple Regression. More Hypothesis Testing. More Hypothesis Testing The big question: What we really want to know: What we actually know: We know:

Multiple Regression. More Hypothesis Testing. More Hypothesis Testing The big question: What we really want to know: What we actually know: We know: Multiple Regression Ψ320 Ainsworth More Hypothesis Testing What we really want to know: Is the relationship in the population we have selected between X & Y strong enough that we can use the relationship

More information

Chapter 4: Regression Models

Chapter 4: Regression Models Sales volume of company 1 Textbook: pp. 129-164 Chapter 4: Regression Models Money spent on advertising 2 Learning Objectives After completing this chapter, students will be able to: Identify variables,

More information

MATH ASSIGNMENT 2: SOLUTIONS

MATH ASSIGNMENT 2: SOLUTIONS MATH 204 - ASSIGNMENT 2: SOLUTIONS (a) Fitting the simple linear regression model to each of the variables in turn yields the following results: we look at t-tests for the individual coefficients, and

More information

Overview. Overview. Overview. Specific Examples. General Examples. Bivariate Regression & Correlation

Overview. Overview. Overview. Specific Examples. General Examples. Bivariate Regression & Correlation Bivariate Regression & Correlation Overview The Scatter Diagram Two Examples: Education & Prestige Correlation Coefficient Bivariate Linear Regression Line SPSS Output Interpretation Covariance ou already

More information

Correlation and simple linear regression S5

Correlation and simple linear regression S5 Basic medical statistics for clinical and eperimental research Correlation and simple linear regression S5 Katarzyna Jóźwiak k.jozwiak@nki.nl November 15, 2017 1/41 Introduction Eample: Brain size and

More information

Practical Biostatistics

Practical Biostatistics Practical Biostatistics Clinical Epidemiology, Biostatistics and Bioinformatics AMC Multivariable regression Day 5 Recap Describing association: Correlation Parametric technique: Pearson (PMCC) Non-parametric:

More information

Chapter 4. Regression Models. Learning Objectives

Chapter 4. Regression Models. Learning Objectives Chapter 4 Regression Models To accompany Quantitative Analysis for Management, Eleventh Edition, by Render, Stair, and Hanna Power Point slides created by Brian Peterson Learning Objectives After completing

More information

Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables.

Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables. Regression Analysis BUS 735: Business Decision Making and Research 1 Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn how to estimate

More information

SPSS Output. ANOVA a b Residual Coefficients a Standardized Coefficients

SPSS Output. ANOVA a b Residual Coefficients a Standardized Coefficients SPSS Output Homework 1-1e ANOVA a Sum of Squares df Mean Square F Sig. 1 Regression 351.056 1 351.056 11.295.002 b Residual 932.412 30 31.080 Total 1283.469 31 a. Dependent Variable: Sexual Harassment

More information

Key Concepts. Correlation (Pearson & Spearman) & Linear Regression. Assumptions. Correlation parametric & non-para. Correlation

Key Concepts. Correlation (Pearson & Spearman) & Linear Regression. Assumptions. Correlation parametric & non-para. Correlation Correlation (Pearson & Spearman) & Linear Regression Azmi Mohd Tamil Key Concepts Correlation as a statistic Positive and Negative Bivariate Correlation Range Effects Outliers Regression & Prediction Directionality

More information

Finding Relationships Among Variables

Finding Relationships Among Variables Finding Relationships Among Variables BUS 230: Business and Economic Research and Communication 1 Goals Specific goals: Re-familiarize ourselves with basic statistics ideas: sampling distributions, hypothesis

More information

Lecture (chapter 13): Association between variables measured at the interval-ratio level

Lecture (chapter 13): Association between variables measured at the interval-ratio level Lecture (chapter 13): Association between variables measured at the interval-ratio level Ernesto F. L. Amaral April 9 11, 2018 Advanced Methods of Social Research (SOCI 420) Source: Healey, Joseph F. 2015.

More information

ECON 497 Midterm Spring

ECON 497 Midterm Spring ECON 497 Midterm Spring 2009 1 ECON 497: Economic Research and Forecasting Name: Spring 2009 Bellas Midterm You have three hours and twenty minutes to complete this exam. Answer all questions and explain

More information

Correlation and Linear Regression

Correlation and Linear Regression Correlation and Linear Regression Correlation: Relationships between Variables So far, nearly all of our discussion of inferential statistics has focused on testing for differences between group means

More information

1 Correlation and Inference from Regression

1 Correlation and Inference from Regression 1 Correlation and Inference from Regression Reading: Kennedy (1998) A Guide to Econometrics, Chapters 4 and 6 Maddala, G.S. (1992) Introduction to Econometrics p. 170-177 Moore and McCabe, chapter 12 is

More information

Area1 Scaled Score (NAPLEX) .535 ** **.000 N. Sig. (2-tailed)

Area1 Scaled Score (NAPLEX) .535 ** **.000 N. Sig. (2-tailed) Institutional Assessment Report Texas Southern University College of Pharmacy and Health Sciences "An Analysis of 2013 NAPLEX, P4-Comp. Exams and P3 courses The following analysis illustrates relationships

More information

Regression Analysis. BUS 735: Business Decision Making and Research

Regression Analysis. BUS 735: Business Decision Making and Research Regression Analysis BUS 735: Business Decision Making and Research 1 Goals and Agenda Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn

More information

Review of Multiple Regression

Review of Multiple Regression Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate

More information

Topic 10 - Linear Regression

Topic 10 - Linear Regression Topic 10 - Linear Regression Least squares principle Hypothesis tests/confidence intervals/prediction intervals for regression 1 Linear Regression How much should you pay for a house? Would you consider

More information

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220 Dr. Mohammad Zainal Chapter Goals After completing

More information

: The model hypothesizes a relationship between the variables. The simplest probabilistic model: or.

: The model hypothesizes a relationship between the variables. The simplest probabilistic model: or. Chapter Simple Linear Regression : comparing means across groups : presenting relationships among numeric variables. Probabilistic Model : The model hypothesizes an relationship between the variables.

More information

Chapter Learning Objectives. Regression Analysis. Correlation. Simple Linear Regression. Chapter 12. Simple Linear Regression

Chapter Learning Objectives. Regression Analysis. Correlation. Simple Linear Regression. Chapter 12. Simple Linear Regression Chapter 12 12-1 North Seattle Community College BUS21 Business Statistics Chapter 12 Learning Objectives In this chapter, you learn:! How to use regression analysis to predict the value of a dependent

More information

Linear Correlation and Regression Analysis

Linear Correlation and Regression Analysis Linear Correlation and Regression Analysis Set Up the Calculator 2 nd CATALOG D arrow down DiagnosticOn ENTER ENTER SCATTER DIAGRAM Positive Linear Correlation Positive Correlation Variables will tend

More information

Regression Models. Chapter 4. Introduction. Introduction. Introduction

Regression Models. Chapter 4. Introduction. Introduction. Introduction Chapter 4 Regression Models Quantitative Analysis for Management, Tenth Edition, by Render, Stair, and Hanna 008 Prentice-Hall, Inc. Introduction Regression analysis is a very valuable tool for a manager

More information

SIMPLE REGRESSION ANALYSIS. Business Statistics

SIMPLE REGRESSION ANALYSIS. Business Statistics SIMPLE REGRESSION ANALYSIS Business Statistics CONTENTS Ordinary least squares (recap for some) Statistical formulation of the regression model Assessing the regression model Testing the regression coefficients

More information

x3,..., Multiple Regression β q α, β 1, β 2, β 3,..., β q in the model can all be estimated by least square estimators

x3,..., Multiple Regression β q α, β 1, β 2, β 3,..., β q in the model can all be estimated by least square estimators Multiple Regression Relating a response (dependent, input) y to a set of explanatory (independent, output, predictor) variables x, x 2, x 3,, x q. A technique for modeling the relationship between variables.

More information

TOPIC 9 SIMPLE REGRESSION & CORRELATION

TOPIC 9 SIMPLE REGRESSION & CORRELATION TOPIC 9 SIMPLE REGRESSION & CORRELATION Basic Linear Relationships Mathematical representation: Y = a + bx X is the independent variable [the variable whose value we can choose, or the input variable].

More information

Acknowledgements. Outline. Marie Diener-West. ICTR Leadership / Team INTRODUCTION TO CLINICAL RESEARCH. Introduction to Linear Regression

Acknowledgements. Outline. Marie Diener-West. ICTR Leadership / Team INTRODUCTION TO CLINICAL RESEARCH. Introduction to Linear Regression INTRODUCTION TO CLINICAL RESEARCH Introduction to Linear Regression Karen Bandeen-Roche, Ph.D. July 17, 2012 Acknowledgements Marie Diener-West Rick Thompson ICTR Leadership / Team JHU Intro to Clinical

More information

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006 Chapter 17 Simple Linear Regression and Correlation 17.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Chapter 16. Simple Linear Regression and Correlation

Chapter 16. Simple Linear Regression and Correlation Chapter 16 Simple Linear Regression and Correlation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Correlation and Regression Bangkok, 14-18, Sept. 2015

Correlation and Regression Bangkok, 14-18, Sept. 2015 Analysing and Understanding Learning Assessment for Evidence-based Policy Making Correlation and Regression Bangkok, 14-18, Sept. 2015 Australian Council for Educational Research Correlation The strength

More information

Ordinary Least Squares Regression Explained: Vartanian

Ordinary Least Squares Regression Explained: Vartanian Ordinary Least Squares Regression Eplained: Vartanian When to Use Ordinary Least Squares Regression Analysis A. Variable types. When you have an interval/ratio scale dependent variable.. When your independent

More information

Simple Linear Regression

Simple Linear Regression Simple Linear Regression 1 Correlation indicates the magnitude and direction of the linear relationship between two variables. Linear Regression: variable Y (criterion) is predicted by variable X (predictor)

More information

Answer Key. 9.1 Scatter Plots and Linear Correlation. Chapter 9 Regression and Correlation. CK-12 Advanced Probability and Statistics Concepts 1

Answer Key. 9.1 Scatter Plots and Linear Correlation. Chapter 9 Regression and Correlation. CK-12 Advanced Probability and Statistics Concepts 1 9.1 Scatter Plots and Linear Correlation Answers 1. A high school psychologist wants to conduct a survey to answer the question: Is there a relationship between a student s athletic ability and his/her

More information

Chapter Goals. To understand the methods for displaying and describing relationship among variables. Formulate Theories.

Chapter Goals. To understand the methods for displaying and describing relationship among variables. Formulate Theories. Chapter Goals To understand the methods for displaying and describing relationship among variables. Formulate Theories Interpret Results/Make Decisions Collect Data Summarize Results Chapter 7: Is There

More information

Comparing Quantitative Variables

Comparing Quantitative Variables Comparing Quantitative Variables Lecture 8 January 29, 2018 Four Stages of Statistics Data Collection Displaying and Summarizing Data One Categorical Two Categorical One Quantitative One Categorical and

More information

THE PEARSON CORRELATION COEFFICIENT

THE PEARSON CORRELATION COEFFICIENT CORRELATION Two variables are said to have a relation if knowing the value of one variable gives you information about the likely value of the second variable this is known as a bivariate relation There

More information

Analysing data: regression and correlation S6 and S7

Analysing data: regression and correlation S6 and S7 Basic medical statistics for clinical and experimental research Analysing data: regression and correlation S6 and S7 K. Jozwiak k.jozwiak@nki.nl 2 / 49 Correlation So far we have looked at the association

More information

Review 6. n 1 = 85 n 2 = 75 x 1 = x 2 = s 1 = 38.7 s 2 = 39.2

Review 6. n 1 = 85 n 2 = 75 x 1 = x 2 = s 1 = 38.7 s 2 = 39.2 Review 6 Use the traditional method to test the given hypothesis. Assume that the samples are independent and that they have been randomly selected ) A researcher finds that of,000 people who said that

More information

Chapter 10. Regression. Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania

Chapter 10. Regression. Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania Chapter 10 Regression Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania Scatter Diagrams A graph in which pairs of points, (x, y), are

More information

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information. STA441: Spring 2018 Multiple Regression This slide show is a free open source document. See the last slide for copyright information. 1 Least Squares Plane 2 Statistical MODEL There are p-1 explanatory

More information

14: Correlation. Introduction Scatter Plot The Correlational Coefficient Hypothesis Test Assumptions An Additional Example

14: Correlation. Introduction Scatter Plot The Correlational Coefficient Hypothesis Test Assumptions An Additional Example 14: Correlation Introduction Scatter Plot The Correlational Coefficient Hypothesis Test Assumptions An Additional Example Introduction Correlation quantifies the extent to which two quantitative variables,

More information

WORKSHOP 3 Measuring Association

WORKSHOP 3 Measuring Association WORKSHOP 3 Measuring Association Concepts Analysing Categorical Data o Testing of Proportions o Contingency Tables & Tests o Odds Ratios Linear Association Measures o Correlation o Simple Linear Regression

More information

Basic Business Statistics 6 th Edition

Basic Business Statistics 6 th Edition Basic Business Statistics 6 th Edition Chapter 12 Simple Linear Regression Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of a dependent variable based

More information

Bivariate Regression Analysis. The most useful means of discerning causality and significance of variables

Bivariate Regression Analysis. The most useful means of discerning causality and significance of variables Bivariate Regression Analysis The most useful means of discerning causality and significance of variables Purpose of Regression Analysis Test causal hypotheses Make predictions from samples of data Derive

More information

Relationship Between Interval and/or Ratio Variables: Correlation & Regression. Sorana D. BOLBOACĂ

Relationship Between Interval and/or Ratio Variables: Correlation & Regression. Sorana D. BOLBOACĂ Relationship Between Interval and/or Ratio Variables: Correlation & Regression Sorana D. BOLBOACĂ OUTLINE Correlation Definition Deviation Score Formula, Z score formula Hypothesis Test Regression - Intercept

More information

CHAPTER 4 DESCRIPTIVE MEASURES IN REGRESSION AND CORRELATION

CHAPTER 4 DESCRIPTIVE MEASURES IN REGRESSION AND CORRELATION STP 226 ELEMENTARY STATISTICS CHAPTER 4 DESCRIPTIVE MEASURES IN REGRESSION AND CORRELATION Linear Regression and correlation allows us to examine the relationship between two or more quantitative variables.

More information

Regression Models. Chapter 4

Regression Models. Chapter 4 Chapter 4 Regression Models To accompany Quantitative Analysis for Management, Eleventh Edition, by Render, Stair, and Hanna Power Point slides created by Brian Peterson Introduction Regression analysis

More information

Multiple linear regression S6

Multiple linear regression S6 Basic medical statistics for clinical and experimental research Multiple linear regression S6 Katarzyna Jóźwiak k.jozwiak@nki.nl November 15, 2017 1/42 Introduction Two main motivations for doing multiple

More information

Chapter 16. Simple Linear Regression and dcorrelation

Chapter 16. Simple Linear Regression and dcorrelation Chapter 16 Simple Linear Regression and dcorrelation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

LI EAR REGRESSIO A D CORRELATIO

LI EAR REGRESSIO A D CORRELATIO CHAPTER 6 LI EAR REGRESSIO A D CORRELATIO Page Contents 6.1 Introduction 10 6. Curve Fitting 10 6.3 Fitting a Simple Linear Regression Line 103 6.4 Linear Correlation Analysis 107 6.5 Spearman s Rank Correlation

More information

Regression line. Regression. Regression line. Slope intercept form review 9/16/09

Regression line. Regression. Regression line. Slope intercept form review 9/16/09 Regression FPP 10 kind of Regression line Correlation coefficient a nice numerical summary of two quantitative variables It indicates direction and strength of association But does it quantify the association?

More information

Chapter 4 Describing the Relation between Two Variables

Chapter 4 Describing the Relation between Two Variables Chapter 4 Describing the Relation between Two Variables 4.1 Scatter Diagrams and Correlation The is the variable whose value can be explained by the value of the or. A is a graph that shows the relationship

More information

Important note: Transcripts are not substitutes for textbook assignments. 1

Important note: Transcripts are not substitutes for textbook assignments. 1 In this lesson we will cover correlation and regression, two really common statistical analyses for quantitative (or continuous) data. Specially we will review how to organize the data, the importance

More information

Multiple linear regression

Multiple linear regression Multiple linear regression Course MF 930: Introduction to statistics June 0 Tron Anders Moger Department of biostatistics, IMB University of Oslo Aims for this lecture: Continue where we left off. Repeat

More information

MULTIPLE REGRESSION ANALYSIS AND OTHER ISSUES. Business Statistics

MULTIPLE REGRESSION ANALYSIS AND OTHER ISSUES. Business Statistics MULTIPLE REGRESSION ANALYSIS AND OTHER ISSUES Business Statistics CONTENTS Multiple regression Dummy regressors Assumptions of regression analysis Predicting with regression analysis Old exam question

More information

determine whether or not this relationship is.

determine whether or not this relationship is. Section 9-1 Correlation A correlation is a between two. The data can be represented by ordered pairs (x,y) where x is the (or ) variable and y is the (or ) variable. There are several types of correlations

More information

Chapter 12 - Part I: Correlation Analysis

Chapter 12 - Part I: Correlation Analysis ST coursework due Friday, April - Chapter - Part I: Correlation Analysis Textbook Assignment Page - # Page - #, Page - # Lab Assignment # (available on ST webpage) GOALS When you have completed this lecture,

More information

Correlation and Regression

Correlation and Regression Elementary Statistics A Step by Step Approach Sixth Edition by Allan G. Bluman http://www.mhhe.com/math/stat/blumanbrief SLIDES PREPARED BY LLOYD R. JAISINGH MOREHEAD STATE UNIVERSITY MOREHEAD KY Updated

More information

Statistics for Managers using Microsoft Excel 6 th Edition

Statistics for Managers using Microsoft Excel 6 th Edition Statistics for Managers using Microsoft Excel 6 th Edition Chapter 13 Simple Linear Regression 13-1 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of

More information

Correlation and the Analysis of Variance Approach to Simple Linear Regression

Correlation and the Analysis of Variance Approach to Simple Linear Regression Correlation and the Analysis of Variance Approach to Simple Linear Regression Biometry 755 Spring 2009 Correlation and the Analysis of Variance Approach to Simple Linear Regression p. 1/35 Correlation

More information

STAT 3900/4950 MIDTERM TWO Name: Spring, 2015 (print: first last ) Covered topics: Two-way ANOVA, ANCOVA, SLR, MLR and correlation analysis

STAT 3900/4950 MIDTERM TWO Name: Spring, 2015 (print: first last ) Covered topics: Two-way ANOVA, ANCOVA, SLR, MLR and correlation analysis STAT 3900/4950 MIDTERM TWO Name: Spring, 205 (print: first last ) Covered topics: Two-way ANOVA, ANCOVA, SLR, MLR and correlation analysis Instructions: You may use your books, notes, and SPSS/SAS. NO

More information

Chapter 2: Looking at Data Relationships (Part 3)

Chapter 2: Looking at Data Relationships (Part 3) Chapter 2: Looking at Data Relationships (Part 3) Dr. Nahid Sultana Chapter 2: Looking at Data Relationships 2.1: Scatterplots 2.2: Correlation 2.3: Least-Squares Regression 2.5: Data Analysis for Two-Way

More information

Chs. 16 & 17: Correlation & Regression

Chs. 16 & 17: Correlation & Regression Chs. 16 & 17: Correlation & Regression With the shift to correlational analyses, we change the very nature of the question we are asking of our data. Heretofore, we were asking if a difference was likely

More information

Chapter 12 : Linear Correlation and Linear Regression

Chapter 12 : Linear Correlation and Linear Regression Chapter 1 : Linear Correlation and Linear Regression Determining whether a linear relationship exists between two quantitative variables, and modeling the relationship with a line, if the linear relationship

More information

Lecture 9: Linear Regression

Lecture 9: Linear Regression Lecture 9: Linear Regression Goals Develop basic concepts of linear regression from a probabilistic framework Estimating parameters and hypothesis testing with linear models Linear regression in R Regression

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

Basics of Experimental Design. Review of Statistics. Basic Study. Experimental Design. When an Experiment is Not Possible. Studying Relations

Basics of Experimental Design. Review of Statistics. Basic Study. Experimental Design. When an Experiment is Not Possible. Studying Relations Basics of Experimental Design Review of Statistics And Experimental Design Scientists study relation between variables In the context of experiments these variables are called independent and dependent

More information

regression analysis is a type of inferential statistics which tells us whether relationships between two or more variables exist

regression analysis is a type of inferential statistics which tells us whether relationships between two or more variables exist regression analysis is a type of inferential statistics which tells us whether relationships between two or more variables exist sales $ (y - dependent variable) advertising $ (x - independent variable)

More information

Bivariate Relationships Between Variables

Bivariate Relationships Between Variables Bivariate Relationships Between Variables BUS 735: Business Decision Making and Research 1 Goals Specific goals: Detect relationships between variables. Be able to prescribe appropriate statistical methods

More information

Correlation: Relationships between Variables

Correlation: Relationships between Variables Correlation Correlation: Relationships between Variables So far, nearly all of our discussion of inferential statistics has focused on testing for differences between group means However, researchers are

More information

IT 403 Practice Problems (2-2) Answers

IT 403 Practice Problems (2-2) Answers IT 403 Practice Problems (2-2) Answers #1. Which of the following is correct with respect to the correlation coefficient (r) and the slope of the leastsquares regression line (Choose one)? a. They will

More information

Simple Linear Regression Using Ordinary Least Squares

Simple Linear Regression Using Ordinary Least Squares Simple Linear Regression Using Ordinary Least Squares Purpose: To approximate a linear relationship with a line. Reason: We want to be able to predict Y using X. Definition: The Least Squares Regression

More information

About Bivariate Correlations and Linear Regression

About Bivariate Correlations and Linear Regression About Bivariate Correlations and Linear Regression TABLE OF CONTENTS About Bivariate Correlations and Linear Regression... 1 What is BIVARIATE CORRELATION?... 1 What is LINEAR REGRESSION... 1 Bivariate

More information

Linear Regression with one Regressor

Linear Regression with one Regressor 1 Linear Regression with one Regressor Covering Chapters 4.1 and 4.2. We ve seen the California test score data before. Now we will try to estimate the marginal effect of STR on SCORE. To motivate these

More information

Lecture 15: Chapter 10

Lecture 15: Chapter 10 Lecture 15: Chapter 10 C C Moxley UAB Mathematics 20 July 15 10.1 Pairing Data In Chapter 9, we talked about pairing data in a natural way. In this Chapter, we will essentially be discussing whether these

More information

Biostatistics. Chapter 11 Simple Linear Correlation and Regression. Jing Li

Biostatistics. Chapter 11 Simple Linear Correlation and Regression. Jing Li Biostatistics Chapter 11 Simple Linear Correlation and Regression Jing Li jing.li@sjtu.edu.cn http://cbb.sjtu.edu.cn/~jingli/courses/2018fall/bi372/ Dept of Bioinformatics & Biostatistics, SJTU Review

More information

Ordinary Least Squares Regression Explained: Vartanian

Ordinary Least Squares Regression Explained: Vartanian Ordinary Least Squares Regression Explained: Vartanian When to Use Ordinary Least Squares Regression Analysis A. Variable types. When you have an interval/ratio scale dependent variable.. When your independent

More information

Regression: Main Ideas Setting: Quantitative outcome with a quantitative explanatory variable. Example, cont.

Regression: Main Ideas Setting: Quantitative outcome with a quantitative explanatory variable. Example, cont. TCELL 9/4/205 36-309/749 Experimental Design for Behavioral and Social Sciences Simple Regression Example Male black wheatear birds carry stones to the nest as a form of sexual display. Soler et al. wanted

More information

Chapter 10-Regression

Chapter 10-Regression Chapter 10-Regression 10.1 Regression equation predicting infant mortality from income Y = Infant mortality X = Income Y = 6.70 s Y = 0.698 s 2 Y = 0.487 X = 46.00 s X = 6.289 s 2 X = 39.553 cov XY = 2.7245

More information

MATH 1070 Introductory Statistics Lecture notes Relationships: Correlation and Simple Regression

MATH 1070 Introductory Statistics Lecture notes Relationships: Correlation and Simple Regression MATH 1070 Introductory Statistics Lecture notes Relationships: Correlation and Simple Regression Objectives: 1. Learn the concepts of independent and dependent variables 2. Learn the concept of a scatterplot

More information

Reminder: Student Instructional Rating Surveys

Reminder: Student Instructional Rating Surveys Reminder: Student Instructional Rating Surveys You have until May 7 th to fill out the student instructional rating surveys at https://sakai.rutgers.edu/portal/site/sirs The survey should be available

More information

9. Linear Regression and Correlation

9. Linear Regression and Correlation 9. Linear Regression and Correlation Data: y a quantitative response variable x a quantitative explanatory variable (Chap. 8: Recall that both variables were categorical) For example, y = annual income,

More information

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box.

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. FINAL EXAM ** Two different ways to submit your answer sheet (i) Use MS-Word and place it in a drop-box. (ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. Deadline: December

More information

(4) 1. Create dummy variables for Town. Name these dummy variables A and B. These 0,1 variables now indicate the location of the house.

(4) 1. Create dummy variables for Town. Name these dummy variables A and B. These 0,1 variables now indicate the location of the house. Exam 3 Resource Economics 312 Introductory Econometrics Please complete all questions on this exam. The data in the spreadsheet: Exam 3- Home Prices.xls are to be used for all analyses. These data are

More information

36-309/749 Experimental Design for Behavioral and Social Sciences. Sep. 22, 2015 Lecture 4: Linear Regression

36-309/749 Experimental Design for Behavioral and Social Sciences. Sep. 22, 2015 Lecture 4: Linear Regression 36-309/749 Experimental Design for Behavioral and Social Sciences Sep. 22, 2015 Lecture 4: Linear Regression TCELL Simple Regression Example Male black wheatear birds carry stones to the nest as a form

More information

Chapter 9 - Correlation and Regression

Chapter 9 - Correlation and Regression Chapter 9 - Correlation and Regression 9. Scatter diagram of percentage of LBW infants (Y) and high-risk fertility rate (X ) in Vermont Health Planning Districts. 9.3 Correlation between percentage of

More information

Describing the Relationship between Two Variables

Describing the Relationship between Two Variables 1 Describing the Relationship between Two Variables Key Definitions Scatter : A graph made to show the relationship between two different variables (each pair of x s and y s) measured from the same equation.

More information

THE ROYAL STATISTICAL SOCIETY 2008 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE (MODULAR FORMAT) MODULE 4 LINEAR MODELS

THE ROYAL STATISTICAL SOCIETY 2008 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE (MODULAR FORMAT) MODULE 4 LINEAR MODELS THE ROYAL STATISTICAL SOCIETY 008 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE (MODULAR FORMAT) MODULE 4 LINEAR MODELS The Society provides these solutions to assist candidates preparing for the examinations

More information

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1 Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 10 Correlation and Regression 10-1 Overview 10-2 Correlation 10-3 Regression 10-4

More information

Ch 13 & 14 - Regression Analysis

Ch 13 & 14 - Regression Analysis Ch 3 & 4 - Regression Analysis Simple Regression Model I. Multiple Choice:. A simple regression is a regression model that contains a. only one independent variable b. only one dependent variable c. more

More information

Applied Regression Modeling: A Business Approach Chapter 2: Simple Linear Regression Sections

Applied Regression Modeling: A Business Approach Chapter 2: Simple Linear Regression Sections Applied Regression Modeling: A Business Approach Chapter 2: Simple Linear Regression Sections 2.1 2.3 by Iain Pardoe 2.1 Probability model for and 2 Simple linear regression model for and....................................

More information

Correlation and Regression Analysis. Linear Regression and Correlation. Correlation and Linear Regression. Three Questions.

Correlation and Regression Analysis. Linear Regression and Correlation. Correlation and Linear Regression. Three Questions. 10/8/18 Correlation and Regression Analysis Correlation Analysis is the study of the relationship between variables. It is also defined as group of techniques to measure the association between two variables.

More information

Draft Proof - Do not copy, post, or distribute. Chapter Learning Objectives REGRESSION AND CORRELATION THE SCATTER DIAGRAM

Draft Proof - Do not copy, post, or distribute. Chapter Learning Objectives REGRESSION AND CORRELATION THE SCATTER DIAGRAM 1 REGRESSION AND CORRELATION As we learned in Chapter 9 ( Bivariate Tables ), the differential access to the Internet is real and persistent. Celeste Campos-Castillo s (015) research confirmed the impact

More information