Is economic freedom related to economic growth?

Size: px
Start display at page:

Download "Is economic freedom related to economic growth?"

Transcription

1 Is economic freedom related to economic growth? It is an article of faith among supporters of capitalism: economic freedom leads to economic growth. The publication Economic Freedom of the World: 2003 Annual Report (James Gwartney and Robert Lawson with Neil Emerick, Economic Freedom of the World: 2003 Annual Report, Vancouver: The Fraser Institute, 2003; data retrieved from is devoted to the examination of economic freedom across the world (economic freedom being defined as freedom to trade (an open trade policy, with minimal barriers to imports and minimal subsidies to domestic industries), freedom to invest (few restrictions on foreign investment), freedom to operate a business without overly burdensome regulation, and secure property rights. The following quote from page xi of that publication summarizes the beliefs of the authors: The trend toward [economic] liberalization, that is, continued undisturbed. This suggests it is anything but a passing fad, the artifact of some economic bubble. Rather, it represents a deep worldwide consensus that the path to prosperity lies in the verities of open trade, sound money, international flows of goods and capital (and labor), market-determined prices, sensible regulation, and the protection of property rights. Do the data support the existence of these benefits of economic freedom? This is obviously a complex issue, but regression methods can be useful in looking at the evidence. Consider the following plot of the 2000 per capita Gross Domestic Product (GDP, in 1995 U.S. dollars) versus the 2000 Economic Freedom score. Note, by the way, that by using data from 2000, we are avoiding the effects of the worldwide economic downturn that started in The data comprise a near-census of all of the countries of the world, so when we talk about predicting per capita GDP (for example), we are actually treating these data as a sample from a reasonably stable ongoing process. c 2008, Jeffrey S. Simonoff 1

2 The freedom score is on a scale from 1 to 5, with lower values corresponding to more economic freedom. Presumably we would expect an inverse relationship here, and that is what we see. On the other hand, it is distinctly nonlinear, as the following fitted line plot shows: c 2008, Jeffrey S. Simonoff 2

3 It is very apparent from this plot that a straight-line model is not appropriate for these data. We ll address that point in a little while, but for now I m going to ignore the obvious evidence, and fit a least squares linear regression model anyway. Here is regression output for this model. Regression Analysis: Per capita GDP versus 2000 freedom score The regression equation is Per capita GDP = freedom score 150 cases used 6 cases contain missing values Predictor Coef SE Coef T P Constant fre S = 8428 R-Sq = 45.6% R-Sq(adj) = 45.2% c 2008, Jeffrey S. Simonoff 3

4 Analysis of Variance Source DF SS MS F P Regression Residual Error Total The t-test for the slope and the F-test are of course equivalent here, and they imply that there is a strongly statistically significant relationship between freedom score and per capita GDP, as the p-value is quite small. We see that the estimated slope is This says that the estimated decrease in per capita GDP for a one point increase in the freedom score is $10,831. What does this mean? A one point increase in the freedom score is a large one, as it corresponds to a full category difference in a five-point scale. The following output can help in interpreting the coefficient: Descriptive Statistics: 2000 freedom score Variable N N* Mean Median TrMean StDev 2000 fre Variable SE Mean Minimum Maximum Q1 Q fre As you can see, the mean and median score are right in the middle, at (about) 3. Roughly one-quarter of the countries fall.5 below this, and roughly one-quarter fall.5 above it, so a one-unit change in the freedom index corresponds, for example, to the difference between a country at the 25th percentile and one at the 75th percentile. A per capita GDP difference of over $10,000 certainly seems to be meaningful from a practical point of view. The intercept of is the estimated expected per capita GDP when the freedom score is 0; since this is impossible, this coefficient has no physical meaning. Does this model provide useful predictive power? The standard error of the estimate is $8428, implying (by the standard assumptions) that this model can predict per capita GDP to within ±(2)(8428) = ±$16856 roughly 95% of the time. Is this a noteworthy accomplishment? Consider the following output: c 2008, Jeffrey S. Simonoff 4

5 Descriptive Statistics: Per capita GDP Variable N N* Mean Median TrMean StDev Per capi Variable SE Mean Minimum Maximum Q1 Q3 Per capi The range of per capita GDP values is more than $55,000, so this improvement in predictive power might be considered moderate. This is consistent with the R 2, which tells us that a bit less than half of the variability in per capita GDP is accounted for by economic freedom. Of course, in actual fact, all of this is useless, since the linear model is inappropriate here. The scatter plot made this obvious, but there are other clues also. The target variable is very long right-tailed: c 2008, Jeffrey S. Simonoff 5

6 This was obvious from the descriptive statistics (the mean is more than four times the median), and also could have been seen in the original scatter plot. Residual plots reinforce the problems: increasing variability of the residuals with the fitted values, and a long right tail: c 2008, Jeffrey S. Simonoff 6

7 All of this evidence is pointing us in one direction to analyze these data in a logged scale. Here is a scatter plot of logged per capita GDP (logs base 10) versus economic freedom: c 2008, Jeffrey S. Simonoff 7

8 Now, that s more like it! Here is a fitted line plot, followed by regression output: c 2008, Jeffrey S. Simonoff 8

9 Regression Analysis: Logged GDPpc versus 2000 freedom score The regression equation is Logged GDPpc = freedom score 150 cases used 6 cases contain missing values Predictor Coef SE Coef T P Constant fre S = R-Sq = 59.1% R-Sq(adj) = 58.8% Analysis of Variance Source DF SS MS F P Regression Residual Error Total c 2008, Jeffrey S. Simonoff 9

10 How do we interpret these results? The R 2 tells us that the index accounts for almost 60% of the variability in logged per capita GDP, a reasonably large amount. Once again the intercept is meaningless, but what about the slope? The slope coefficient is.764; this says that a one-unit increase in the economic freedom index is associated with an expected.764 unit decrease in the logged per capita GDP. We ve already talked about what a oneunit increase in the index means, but what does a.764 unit decrease in logged per capita GDP mean? The key is to remember what logarithms mean. Recall, for example, that log = 1, and log = 2. The logs of these two numbers differs by 1, which is telling us that the numbers themselves differ by a multiplicative factor of 10 1, or 10 (that is 100 is ten times 10). Two numbers whose logs differ by.764 differ by a multiplicative factor of = 5.8; that is, one is 5.8 times the other. Equivalently, if one number has log.764 lower than another, it is a multiplicative factor of =.172 smaller. Of course, we see that the regression itself is highly statistically significant. So, what does our.764 coefficient mean? It says that a one-unit increase in the economic freedom index is associated with multiplying the expected per capita GDP by , or.172; that is, a one-unit increase in the index is associated with an expected 82.8% decrease in per capita GDP. This is quite a lot! [What if our predictor had also been a logged variable? First, we can just recognize that the coefficients continue to mean what they always mean. Say the fitted model was log Y = log X (that is, a regression was fit the logarithm (base 10) of Y as the target, and the logarithm of X as the predictor, for some variables X and Y ). The intercept is the estimated expected value of the target (log Y ) when the predictor (log X) equals 0; that is, when X = 1 (since log 1 = 0). So, in this hypothetical case, the estimated Y when X = 1 is = Whether this means anything or not depends on whether X = 1 is a meaningful condition. The slope tells us the estimated change in log Y when log X increases by 1, but that just corresponds to X being multiplied by 10. Therefore, in this hypothetical case, multiplying X by 10 is associated with multiplying Y by 10.6 = Of course, this is also an elasticity, which means that a 1% change in X is estimated to be associated with a.6% change in Y.] We need to adjust our thinking regarding the standard error of the estimate in the same way. It is.453, which we know means that we should be able to predict logged per capita GDP to within ±(2)(.4532) = ±.9064 roughly 95% of the time, but what does that c 2008, Jeffrey S. Simonoff 10

11 mean? We just use the same argument as before; since =.124 and = 8.06, this standard error of the estimate says that knowing the economic freedom index allows us to predict per capita GDP to within a multiplicative factor of (roughly) 8, roughly 95% of the time. So, for example, a country with economic freedom index equaling 3 has predicted logged per capita GDP of = , or predicted per capita GDP of = $2334. The standard error of the estimate tells us that we wouldn t be surprised if the actual per capita GDP was as much as 8 times less that ($289) to as much as 8 times more that ($18,810). This wide a range might surprise you, but it s inherent to the fact that per capita GDP is very long right-tailed; remember, the range of per capita GDP values is ($116,$55744), so the largest value is more than 480 times the smallest! Residual plots look okay for this model: c 2008, Jeffrey S. Simonoff 11

12 This is a situation where it is natural to consider the use of confidence and prediction intervals. Consider the case of Libya. Its 200 freedom score was 4.85, but its per capita GDP is not given in the data. What does the regression model imply for that value? Here are confidence and prediction interval results: Predicted Values for New Observations New Obs Fit SE Fit 95% CI 95% PI (1.7517, ) (1.0361, )X X denotes a point that is an outlier in the predictors. Values of Predictors for New Observations c 2008, Jeffrey S. Simonoff 12

13 2000 New freedom Obs score The confidence interval would correspond to an interval for the average response for all observations in the population with 2000 freedom scores equal to 4.85, but since the response variable is in the log scale it is not meaningful here. On the other hand, the prediction interval is useful here. The 95% interval for logged per capita GDP of (1.0361, ) converts to an interval of (10.87, ). Note that the interval is asymmetric around the geometric mean of = 89.99, which reflects the much higher natural variability in per capita GDP at the upper end compared to at the lower end. Note that the ratio of the high to the low end of the interval is roughly 68.6, which corresponds closely to our earlier rough prediction interval of being able to predict per capita GDP to within a multiplicative factor of about 8. Thus, it would seem that we re done, except for one thing this analysis doesn t really address that quote at the beginning of this handout. It s not surprising that wealth and economic freedom would go together, but that doesn t necessarily say anything about whether it is in a country s economic best interest to become more economically free. As the quote says, it s the path to prosperity that matters; that is, economic growth, not current economic status. Is economic freedom related to economic growth? Here is a plot of 2000 growth in GDP versus 2000 economic freedom index: c 2008, Jeffrey S. Simonoff 13

14 This is not very encouraging, as the fitted line plot shows: c 2008, Jeffrey S. Simonoff 14

15 There is apparently very little relationship between one-year economic growth and economic freedom. Now, of course, one-year growth might be too variable a measure, and longer-term growth measure might be more useful. Still, this result is disappointing, especially since it came at a time of general economic prosperity. Regression Analysis: GDP growth rate versus 2000 freedom score The regression equation is GDP growth rate = freedom score 153 cases used 3 cases contain missing values Predictor Coef SE Coef T P Constant fre S = R-Sq = 0.8% R-Sq(adj) = 0.1% c 2008, Jeffrey S. Simonoff 15

16 Analysis of Variance Source DF SS MS F P Regression Residual Error Total With an R 2 less than 1%, there s not much need to try to interpret the slope coefficient (in any event, it s positive, which would imply that more freedom is associated with less growth!). The regression is of course not close to statistically significant. Do the regression assumptions seem reasonable here? Maybe not: c 2008, Jeffrey S. Simonoff 16

17 There are two clearly unusual points: Equatorial Guinea and Turkmenistan. Each of these countries had very high growth in 2000 (16.9% and 17.6%, respectively), despite repressive economic situations (freedom scores 4.05 and 4.3, respectively). Equatorial Guinea s GDP growth is unusual in that extensive offshore oil exploitation only began in 1997, while Turkmenistan s high growth came from oil and natural gas exports. It is important to explore whether these two countries have had a strong effect on the overall model, since they are not at all typical of the general pattern. We can do this by omitting them and seeing how things are affected, being sure to report that this is what we have done. Overall, not much changes; in fact, the relationship becomes even weaker: c 2008, Jeffrey S. Simonoff 17

18 Regression Analysis: GDP growth rate versus 2000 freedom score The regression equation is GDP growth rate = freedom score 151 cases used 3 cases contain missing values Predictor Coef SE Coef T P Constant fre S = R-Sq = 0.0% R-Sq(adj) = 0.0% Analysis of Variance Source DF SS MS F P Regression Residual Error Total c 2008, Jeffrey S. Simonoff 18

19 The model now seems to fit fine, so we should just accept it as reflecting what is really going on here: c 2008, Jeffrey S. Simonoff 19

20 So what have we learned? The results are decidedly mixed. There is a clear relationship between a country s current level of wealth and economic freedom, although this is accompanied by a good deal of variability. There is apparently no relationship between economic freedom and short-term GDP growth. Whether there is a relationship between economic freedom and long-term growth cannot be addressed with these data. c 2008, Jeffrey S. Simonoff 20

21 MINITAB commands Although it is possible to omit observations in a sample by simply highlighting them in the data worksheet and pressing the delete key, this is generally not advisable, since then the observation cannot be recovered without reopening the original file (and if you save the data before doing that, the observation is gone completely). A better approach is to create a subset of the worksheet that has the observations you want; this will create a new worksheet that can be analyzed, but the original worksheet will still be there as well. Click on Data Subset Worksheet. You can give the new worksheet an identifying name if you like under Name:. Click the radio button next to Specify which rows to exclude, click the radio button next to Specify rows:, and enter the row numbers of the outliers in the associated box. Note that there is a good deal of flexibility in the subsetting; you can identify rows to include or exclude, identify them by some condition (for example, observations with values of a predictor greater than 10), or brush them on a scatter plot and identify them that way, in addition to specifying them by row number(s). c 2008, Jeffrey S. Simonoff 21

, (1) e i = ˆσ 1 h ii. c 2016, Jeffrey S. Simonoff 1

, (1) e i = ˆσ 1 h ii. c 2016, Jeffrey S. Simonoff 1 Regression diagnostics As is true of all statistical methodologies, linear regression analysis can be a very effective way to model data, as along as the assumptions being made are true. For the regression

More information

Inference with Simple Regression

Inference with Simple Regression 1 Introduction Inference with Simple Regression Alan B. Gelder 06E:071, The University of Iowa 1 Moving to infinite means: In this course we have seen one-mean problems, twomean problems, and problems

More information

INFERENCE FOR REGRESSION

INFERENCE FOR REGRESSION CHAPTER 3 INFERENCE FOR REGRESSION OVERVIEW In Chapter 5 of the textbook, we first encountered regression. The assumptions that describe the regression model we use in this chapter are the following. We

More information

10 Model Checking and Regression Diagnostics

10 Model Checking and Regression Diagnostics 10 Model Checking and Regression Diagnostics The simple linear regression model is usually written as i = β 0 + β 1 i + ɛ i where the ɛ i s are independent normal random variables with mean 0 and variance

More information

Lecture 18: Simple Linear Regression

Lecture 18: Simple Linear Regression Lecture 18: Simple Linear Regression BIOS 553 Department of Biostatistics University of Michigan Fall 2004 The Correlation Coefficient: r The correlation coefficient (r) is a number that measures the strength

More information

Confidence Interval for the mean response

Confidence Interval for the mean response Week 3: Prediction and Confidence Intervals at specified x. Testing lack of fit with replicates at some x's. Inference for the correlation. Introduction to regression with several explanatory variables.

More information

The Flight of the Space Shuttle Challenger

The Flight of the Space Shuttle Challenger The Flight of the Space Shuttle Challenger On January 28, 1986, the space shuttle Challenger took off on the 25 th flight in NASA s space shuttle program. Less than 2 minutes into the flight, the spacecraft

More information

1 Introduction to Minitab

1 Introduction to Minitab 1 Introduction to Minitab Minitab is a statistical analysis software package. The software is freely available to all students and is downloadable through the Technology Tab at my.calpoly.edu. When you

More information

Multiple Regression Examples

Multiple Regression Examples Multiple Regression Examples Example: Tree data. we have seen that a simple linear regression of usable volume on diameter at chest height is not suitable, but that a quadratic model y = β 0 + β 1 x +

More information

Answers to Homework 4

Answers to Homework 4 Answers to Homework 4 1. (a) Let p be the true probability of an auto having automatic transmission. The observed proportion of autos is.847, and a 95% confidence interval for p has the form p ± z.025

More information

CHAPTER 5 FUNCTIONAL FORMS OF REGRESSION MODELS

CHAPTER 5 FUNCTIONAL FORMS OF REGRESSION MODELS CHAPTER 5 FUNCTIONAL FORMS OF REGRESSION MODELS QUESTIONS 5.1. (a) In a log-log model the dependent and all explanatory variables are in the logarithmic form. (b) In the log-lin model the dependent variable

More information

Conditions for Regression Inference:

Conditions for Regression Inference: AP Statistics Chapter Notes. Inference for Linear Regression We can fit a least-squares line to any data relating two quantitative variables, but the results are useful only if the scatterplot shows a

More information

SMAM 319 Exam 1 Name. 1.Pick the best choice for the multiple choice questions below (10 points 2 each)

SMAM 319 Exam 1 Name. 1.Pick the best choice for the multiple choice questions below (10 points 2 each) SMAM 319 Exam 1 Name 1.Pick the best choice for the multiple choice questions below (10 points 2 each) A b In Metropolis there are some houses for sale. Superman and Lois Lane are interested in the average

More information

Business Statistics. Lecture 9: Simple Regression

Business Statistics. Lecture 9: Simple Regression Business Statistics Lecture 9: Simple Regression 1 On to Model Building! Up to now, class was about descriptive and inferential statistics Numerical and graphical summaries of data Confidence intervals

More information

Ch 13 & 14 - Regression Analysis

Ch 13 & 14 - Regression Analysis Ch 3 & 4 - Regression Analysis Simple Regression Model I. Multiple Choice:. A simple regression is a regression model that contains a. only one independent variable b. only one dependent variable c. more

More information

28. SIMPLE LINEAR REGRESSION III

28. SIMPLE LINEAR REGRESSION III 28. SIMPLE LINEAR REGRESSION III Fitted Values and Residuals To each observed x i, there corresponds a y-value on the fitted line, y = βˆ + βˆ x. The are called fitted values. ŷ i They are the values of

More information

Correlation & Simple Regression

Correlation & Simple Regression Chapter 11 Correlation & Simple Regression The previous chapter dealt with inference for two categorical variables. In this chapter, we would like to examine the relationship between two quantitative variables.

More information

Interaction effects for continuous predictors in regression modeling

Interaction effects for continuous predictors in regression modeling Interaction effects for continuous predictors in regression modeling Testing for interactions The linear regression model is undoubtedly the most commonly-used statistical model, and has the advantage

More information

Regression, part II. I. What does it all mean? A) Notice that so far all we ve done is math.

Regression, part II. I. What does it all mean? A) Notice that so far all we ve done is math. Regression, part II I. What does it all mean? A) Notice that so far all we ve done is math. 1) One can calculate the Least Squares Regression Line for anything, regardless of any assumptions. 2) But, if

More information

Model Building Chap 5 p251

Model Building Chap 5 p251 Model Building Chap 5 p251 Models with one qualitative variable, 5.7 p277 Example 4 Colours : Blue, Green, Lemon Yellow and white Row Blue Green Lemon Insects trapped 1 0 0 1 45 2 0 0 1 59 3 0 0 1 48 4

More information

School of Mathematical Sciences. Question 1. Best Subsets Regression

School of Mathematical Sciences. Question 1. Best Subsets Regression School of Mathematical Sciences MTH5120 Statistical Modelling I Practical 9 and Assignment 8 Solutions Question 1 Best Subsets Regression Response is Crime I n W c e I P a n A E P U U l e Mallows g E P

More information

Multiple Linear Regression

Multiple Linear Regression Andrew Lonardelli December 20, 2013 Multiple Linear Regression 1 Table Of Contents Introduction: p.3 Multiple Linear Regression Model: p.3 Least Squares Estimation of the Parameters: p.4-5 The matrix approach

More information

Stat 501, F. Chiaromonte. Lecture #8

Stat 501, F. Chiaromonte. Lecture #8 Stat 501, F. Chiaromonte Lecture #8 Data set: BEARS.MTW In the minitab example data sets (for description, get into the help option and search for "Data Set Description"). Wild bears were anesthetized,

More information

The simple linear regression model discussed in Chapter 13 was written as

The simple linear regression model discussed in Chapter 13 was written as 1519T_c14 03/27/2006 07:28 AM Page 614 Chapter Jose Luis Pelaez Inc/Blend Images/Getty Images, Inc./Getty Images, Inc. 14 Multiple Regression 14.1 Multiple Regression Analysis 14.2 Assumptions of the Multiple

More information

SMAM 314 Practice Final Examination Winter 2003

SMAM 314 Practice Final Examination Winter 2003 SMAM 314 Practice Final Examination Winter 2003 You may use your textbook, one page of notes and a calculator. Please hand in the notes with your exam. 1. Mark the following statements True T or False

More information

9 One-Way Analysis of Variance

9 One-Way Analysis of Variance 9 One-Way Analysis of Variance SW Chapter 11 - all sections except 6. The one-way analysis of variance (ANOVA) is a generalization of the two sample t test to k 2 groups. Assume that the populations of

More information

Multiple Regression Methods

Multiple Regression Methods Chapter 1: Multiple Regression Methods Hildebrand, Ott and Gray Basic Statistical Ideas for Managers Second Edition 1 Learning Objectives for Ch. 1 The Multiple Linear Regression Model How to interpret

More information

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #6

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #6 STA 8 Applied Linear Models: Regression Analysis Spring 011 Solution for Homework #6 6. a) = 11 1 31 41 51 1 3 4 5 11 1 31 41 51 β = β1 β β 3 b) = 1 1 1 1 1 11 1 31 41 51 1 3 4 5 β = β 0 β1 β 6.15 a) Stem-and-leaf

More information

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X.

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X. Estimating σ 2 We can do simple prediction of Y and estimation of the mean of Y at any value of X. To perform inferences about our regression line, we must estimate σ 2, the variance of the error term.

More information

The Simple Linear Regression Model

The Simple Linear Regression Model The Simple Linear Regression Model Lesson 3 Ryan Safner 1 1 Department of Economics Hood College ECON 480 - Econometrics Fall 2017 Ryan Safner (Hood College) ECON 480 - Lesson 3 Fall 2017 1 / 77 Bivariate

More information

MULTIPLE REGRESSION METHODS

MULTIPLE REGRESSION METHODS DEPARTMENT OF POLITICAL SCIENCE AND INTERNATIONAL RELATIONS Posc/Uapp 816 MULTIPLE REGRESSION METHODS I. AGENDA: A. Residuals B. Transformations 1. A useful procedure for making transformations C. Reading:

More information

1 A Review of Correlation and Regression

1 A Review of Correlation and Regression 1 A Review of Correlation and Regression SW, Chapter 12 Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then

More information

Stat 529 (Winter 2011) A simple linear regression (SLR) case study. Mammals brain weights and body weights

Stat 529 (Winter 2011) A simple linear regression (SLR) case study. Mammals brain weights and body weights Stat 529 (Winter 2011) A simple linear regression (SLR) case study Reading: Sections 8.1 8.4, 8.6, 8.7 Mammals brain weights and body weights Questions of interest Scatterplots of the data Log transforming

More information

SMAM 314 Computer Assignment 5 due Nov 8,2012 Data Set 1. For each of the following data sets use Minitab to 1. Make a scatterplot.

SMAM 314 Computer Assignment 5 due Nov 8,2012 Data Set 1. For each of the following data sets use Minitab to 1. Make a scatterplot. SMAM 314 Computer Assignment 5 due Nov 8,2012 Data Set 1. For each of the following data sets use Minitab to 1. Make a scatterplot. 2. Fit the linear regression line. Regression Analysis: y versus x y

More information

9. Linear Regression and Correlation

9. Linear Regression and Correlation 9. Linear Regression and Correlation Data: y a quantitative response variable x a quantitative explanatory variable (Chap. 8: Recall that both variables were categorical) For example, y = annual income,

More information

Continuous distributions

Continuous distributions Continuous distributions In contrast to discrete random variables, like the Binomial distribution, in many situations the possible values of a random variable cannot be counted. For example, the measurement

More information

SMAM 314 Exam 42 Name

SMAM 314 Exam 42 Name SMAM 314 Exam 42 Name Mark the following statements True (T) or False (F) (10 points) 1. F A. The line that best fits points whose X and Y values are negatively correlated should have a positive slope.

More information

Analysis of Bivariate Data

Analysis of Bivariate Data Analysis of Bivariate Data Data Two Quantitative variables GPA and GAES Interest rates and indices Tax and fund allocation Population size and prison population Bivariate data (x,y) Case corr&reg 2 Independent

More information

STAT 212 Business Statistics II 1

STAT 212 Business Statistics II 1 STAT 1 Business Statistics II 1 KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA STAT 1: BUSINESS STATISTICS II Semester 091 Final Exam Thursday Feb

More information

Correlation and Regression

Correlation and Regression Correlation and Regression Dr. Bob Gee Dean Scott Bonney Professor William G. Journigan American Meridian University 1 Learning Objectives Upon successful completion of this module, the student should

More information

Chapter 14. Simple Linear Regression Preliminary Remarks The Simple Linear Regression Model

Chapter 14. Simple Linear Regression Preliminary Remarks The Simple Linear Regression Model Chapter 14 Simple Linear Regression 14.1 Preliminary Remarks We have only three lectures to introduce the ideas of regression. Worse, they are the last three lectures when you have many other things academic

More information

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation?

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation? Did You Mean Association Or Correlation? AP Statistics Chapter 8 Be careful not to use the word correlation when you really mean association. Often times people will incorrectly use the word correlation

More information

42 GEO Metro Japan

42 GEO Metro Japan Statistics 101 106 Lecture 11 (17 November 98) c David Pollard Page 1 Read M&M Chapters 2 and 11 again. Section leaders will decide how much of Chapters 12 and 13 to cover formally; they will assign the

More information

LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION

LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION In this lab you will learn how to use Excel to display the relationship between two quantitative variables, measure the strength and direction of the

More information

Data Set 8: Laysan Finch Beak Widths

Data Set 8: Laysan Finch Beak Widths Data Set 8: Finch Beak Widths Statistical Setting This handout describes an analysis of covariance (ANCOVA) involving one categorical independent variable (with only two levels) and one quantitative covariate.

More information

Assumptions, Diagnostics, and Inferences for the Simple Linear Regression Model with Normal Residuals

Assumptions, Diagnostics, and Inferences for the Simple Linear Regression Model with Normal Residuals Assumptions, Diagnostics, and Inferences for the Simple Linear Regression Model with Normal Residuals 4 December 2018 1 The Simple Linear Regression Model with Normal Residuals In previous class sessions,

More information

Confidence intervals

Confidence intervals Confidence intervals We now want to take what we ve learned about sampling distributions and standard errors and construct confidence intervals. What are confidence intervals? Simply an interval for which

More information

Introduction to Regression

Introduction to Regression Introduction to Regression Using Mult Lin Regression Derived variables Many alternative models Which model to choose? Model Criticism Modelling Objective Model Details Data and Residuals Assumptions 1

More information

Chapter 9. Correlation and Regression

Chapter 9. Correlation and Regression Chapter 9 Correlation and Regression Lesson 9-1/9-2, Part 1 Correlation Registered Florida Pleasure Crafts and Watercraft Related Manatee Deaths 100 80 60 40 20 0 1991 1993 1995 1997 1999 Year Boats in

More information

Time series and Forecasting

Time series and Forecasting Chapter 2 Time series and Forecasting 2.1 Introduction Data are frequently recorded at regular time intervals, for instance, daily stock market indices, the monthly rate of inflation or annual profit figures.

More information

Fitting a Straight Line to Data

Fitting a Straight Line to Data Fitting a Straight Line to Data Thanks for your patience. Finally we ll take a shot at real data! The data set in question is baryonic Tully-Fisher data from http://astroweb.cwru.edu/sparc/btfr Lelli2016a.mrt,

More information

Basic Business Statistics 6 th Edition

Basic Business Statistics 6 th Edition Basic Business Statistics 6 th Edition Chapter 12 Simple Linear Regression Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of a dependent variable based

More information

MULTIPLE LINEAR REGRESSION IN MINITAB

MULTIPLE LINEAR REGRESSION IN MINITAB MULTIPLE LINEAR REGRESSION IN MINITAB This document shows a complicated Minitab multiple regression. It includes descriptions of the Minitab commands, and the Minitab output is heavily annotated. Comments

More information

SMAM 319 Exam1 Name. a B.The equation of a line is 3x + y =6. The slope is a. -3 b.3 c.6 d.1/3 e.-1/3

SMAM 319 Exam1 Name. a B.The equation of a line is 3x + y =6. The slope is a. -3 b.3 c.6 d.1/3 e.-1/3 SMAM 319 Exam1 Name 1. Pick the best choice. (10 points-2 each) _c A. A data set consisting of fifteen observations has the five number summary 4 11 12 13 15.5. For this data set it is definitely true

More information

9 Correlation and Regression

9 Correlation and Regression 9 Correlation and Regression SW, Chapter 12. Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then retakes the

More information

22S39: Class Notes / November 14, 2000 back to start 1

22S39: Class Notes / November 14, 2000 back to start 1 Model diagnostics Interpretation of fitted regression model 22S39: Class Notes / November 14, 2000 back to start 1 Model diagnostics 22S39: Class Notes / November 14, 2000 back to start 2 Model diagnostics

More information

LAB 3 INSTRUCTIONS SIMPLE LINEAR REGRESSION

LAB 3 INSTRUCTIONS SIMPLE LINEAR REGRESSION LAB 3 INSTRUCTIONS SIMPLE LINEAR REGRESSION In this lab you will first learn how to display the relationship between two quantitative variables with a scatterplot and also how to measure the strength of

More information

Nature vs. nurture? Lecture 18 - Regression: Inference, Outliers, and Intervals. Regression Output. Conditions for inference.

Nature vs. nurture? Lecture 18 - Regression: Inference, Outliers, and Intervals. Regression Output. Conditions for inference. Understanding regression output from software Nature vs. nurture? Lecture 18 - Regression: Inference, Outliers, and Intervals In 1966 Cyril Burt published a paper called The genetic determination of differences

More information

Lab 3 A Quick Introduction to Multiple Linear Regression Psychology The Multiple Linear Regression Model

Lab 3 A Quick Introduction to Multiple Linear Regression Psychology The Multiple Linear Regression Model Lab 3 A Quick Introduction to Multiple Linear Regression Psychology 310 Instructions.Work through the lab, saving the output as you go. You will be submitting your assignment as an R Markdown document.

More information

Important note: Transcripts are not substitutes for textbook assignments. 1

Important note: Transcripts are not substitutes for textbook assignments. 1 In this lesson we will cover correlation and regression, two really common statistical analyses for quantitative (or continuous) data. Specially we will review how to organize the data, the importance

More information

Warm-up Using the given data Create a scatterplot Find the regression line

Warm-up Using the given data Create a scatterplot Find the regression line Time at the lunch table Caloric intake 21.4 472 30.8 498 37.7 335 32.8 423 39.5 437 22.8 508 34.1 431 33.9 479 43.8 454 42.4 450 43.1 410 29.2 504 31.3 437 28.6 489 32.9 436 30.6 480 35.1 439 33.0 444

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 y 1 2 3 4 5 6 7 x Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 32 Suhasini Subba Rao Previous lecture We are interested in whether a dependent

More information

LECTURE 15: SIMPLE LINEAR REGRESSION I

LECTURE 15: SIMPLE LINEAR REGRESSION I David Youngberg BSAD 20 Montgomery College LECTURE 5: SIMPLE LINEAR REGRESSION I I. From Correlation to Regression a. Recall last class when we discussed two basic types of correlation (positive and negative).

More information

Sampling, Frequency Distributions, and Graphs (12.1)

Sampling, Frequency Distributions, and Graphs (12.1) 1 Sampling, Frequency Distributions, and Graphs (1.1) Design: Plan how to obtain the data. What are typical Statistical Methods? Collect the data, which is then subjected to statistical analysis, which

More information

Topic 1. Definitions

Topic 1. Definitions S Topic. Definitions. Scalar A scalar is a number. 2. Vector A vector is a column of numbers. 3. Linear combination A scalar times a vector plus a scalar times a vector, plus a scalar times a vector...

More information

Lecture 10: F -Tests, ANOVA and R 2

Lecture 10: F -Tests, ANOVA and R 2 Lecture 10: F -Tests, ANOVA and R 2 1 ANOVA We saw that we could test the null hypothesis that β 1 0 using the statistic ( β 1 0)/ŝe. (Although I also mentioned that confidence intervals are generally

More information

Inference for the Regression Coefficient

Inference for the Regression Coefficient Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression line. We can shows that b 0 and b 1 are the unbiased estimates

More information

Section 3: Simple Linear Regression

Section 3: Simple Linear Regression Section 3: Simple Linear Regression Carlos M. Carvalho The University of Texas at Austin McCombs School of Business http://faculty.mccombs.utexas.edu/carlos.carvalho/teaching/ 1 Regression: General Introduction

More information

Steps to take to do the descriptive part of regression analysis:

Steps to take to do the descriptive part of regression analysis: STA 2023 Simple Linear Regression: Least Squares Model Steps to take to do the descriptive part of regression analysis: A. Plot the data on a scatter plot. Describe patterns: 1. Is there a strong, moderate,

More information

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc.

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc. Chapter 8 Linear Regression Copyright 2010 Pearson Education, Inc. Fat Versus Protein: An Example The following is a scatterplot of total fat versus protein for 30 items on the Burger King menu: Copyright

More information

Section 4: Multiple Linear Regression

Section 4: Multiple Linear Regression Section 4: Multiple Linear Regression Carlos M. Carvalho The University of Texas at Austin McCombs School of Business http://faculty.mccombs.utexas.edu/carlos.carvalho/teaching/ 1 The Multiple Regression

More information

ANOVA: Analysis of Variation

ANOVA: Analysis of Variation ANOVA: Analysis of Variation The basic ANOVA situation Two variables: 1 Categorical, 1 Quantitative Main Question: Do the (means of) the quantitative variables depend on which group (given by categorical

More information

Q Lecture Introduction to Regression

Q Lecture Introduction to Regression Q3 2009 1 Before/After Transformation 2 Construction Role of T-ratios Formally, even under Null Hyp: H : 0, ˆ, being computed from k t k SE ˆ ˆ y values themselves containing random error, will sometimes

More information

SIMPLE TWO VARIABLE REGRESSION

SIMPLE TWO VARIABLE REGRESSION DEPARTMENT OF POLITICAL SCIENCE AND INTERNATIONAL RELATIONS Posc/Uapp 816 SIMPLE TWO VARIABLE REGRESSION I. AGENDA: A. Causal inference and non-experimental research B. Least squares principle C. Regression

More information

23. Inference for regression

23. Inference for regression 23. Inference for regression The Practice of Statistics in the Life Sciences Third Edition 2014 W. H. Freeman and Company Objectives (PSLS Chapter 23) Inference for regression The regression model Confidence

More information

Problem Set 1 ANSWERS

Problem Set 1 ANSWERS Economics 20 Prof. Patricia M. Anderson Problem Set 1 ANSWERS Part I. Multiple Choice Problems 1. If X and Z are two random variables, then E[X-Z] is d. E[X] E[Z] This is just a simple application of one

More information

Regression, Part I. - In correlation, it would be irrelevant if we changed the axes on our graph.

Regression, Part I. - In correlation, it would be irrelevant if we changed the axes on our graph. Regression, Part I I. Difference from correlation. II. Basic idea: A) Correlation describes the relationship between two variables, where neither is independent or a predictor. - In correlation, it would

More information

Models with qualitative explanatory variables p216

Models with qualitative explanatory variables p216 Models with qualitative explanatory variables p216 Example gen = 1 for female Row gpa hsm gen 1 3.32 10 0 2 2.26 6 0 3 2.35 8 0 4 2.08 9 0 5 3.38 8 0 6 3.29 10 0 7 3.21 8 0 8 2.00 3 0 9 3.18 9 0 10 2.34

More information

Analysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments.

Analysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments. Analysis of Covariance In some experiments, the experimental units (subjects) are nonhomogeneous or there is variation in the experimental conditions that are not due to the treatments. For example, a

More information

Activity #12: More regression topics: LOWESS; polynomial, nonlinear, robust, quantile; ANOVA as regression

Activity #12: More regression topics: LOWESS; polynomial, nonlinear, robust, quantile; ANOVA as regression Activity #12: More regression topics: LOWESS; polynomial, nonlinear, robust, quantile; ANOVA as regression Scenario: 31 counts (over a 30-second period) were recorded from a Geiger counter at a nuclear

More information

Business Statistics. Lecture 10: Course Review

Business Statistics. Lecture 10: Course Review Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,

More information

Keppel, G. & Wickens, T. D. Design and Analysis Chapter 4: Analytical Comparisons Among Treatment Means

Keppel, G. & Wickens, T. D. Design and Analysis Chapter 4: Analytical Comparisons Among Treatment Means Keppel, G. & Wickens, T. D. Design and Analysis Chapter 4: Analytical Comparisons Among Treatment Means 4.1 The Need for Analytical Comparisons...the between-groups sum of squares averages the differences

More information

Topic 10 - Linear Regression

Topic 10 - Linear Regression Topic 10 - Linear Regression Least squares principle Hypothesis tests/confidence intervals/prediction intervals for regression 1 Linear Regression How much should you pay for a house? Would you consider

More information

MATH 1150 Chapter 2 Notation and Terminology

MATH 1150 Chapter 2 Notation and Terminology MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the

More information

Chapter 8. Linear Regression. The Linear Model. Fat Versus Protein: An Example. The Linear Model (cont.) Residuals

Chapter 8. Linear Regression. The Linear Model. Fat Versus Protein: An Example. The Linear Model (cont.) Residuals Chapter 8 Linear Regression Copyright 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 8-1 Copyright 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Fat Versus

More information

LINEAR REGRESSION ANALYSIS. MODULE XVI Lecture Exercises

LINEAR REGRESSION ANALYSIS. MODULE XVI Lecture Exercises LINEAR REGRESSION ANALYSIS MODULE XVI Lecture - 44 Exercises Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Exercise 1 The following data has been obtained on

More information

STATISTICS 110/201 PRACTICE FINAL EXAM

STATISTICS 110/201 PRACTICE FINAL EXAM STATISTICS 110/201 PRACTICE FINAL EXAM Questions 1 to 5: There is a downloadable Stata package that produces sequential sums of squares for regression. In other words, the SS is built up as each variable

More information

This document contains 3 sets of practice problems.

This document contains 3 sets of practice problems. P RACTICE PROBLEMS This document contains 3 sets of practice problems. Correlation: 3 problems Regression: 4 problems ANOVA: 8 problems You should print a copy of these practice problems and bring them

More information

1. An article on peanut butter in Consumer reports reported the following scores for various brands

1. An article on peanut butter in Consumer reports reported the following scores for various brands SMAM 314 Review Exam 1 1. An article on peanut butter in Consumer reports reported the following scores for various brands Creamy 56 44 62 36 39 53 50 65 45 40 56 68 41 30 40 50 50 56 65 56 45 40 Crunchy

More information

STA Module 5 Regression and Correlation. Learning Objectives. Learning Objectives (Cont.) Upon completing this module, you should be able to:

STA Module 5 Regression and Correlation. Learning Objectives. Learning Objectives (Cont.) Upon completing this module, you should be able to: STA 2023 Module 5 Regression and Correlation Learning Objectives Upon completing this module, you should be able to: 1. Define and apply the concepts related to linear equations with one independent variable.

More information

appstats27.notebook April 06, 2017

appstats27.notebook April 06, 2017 Chapter 27 Objective Students will conduct inference on regression and analyze data to write a conclusion. Inferences for Regression An Example: Body Fat and Waist Size pg 634 Our chapter example revolves

More information

MBA Statistics COURSE #4

MBA Statistics COURSE #4 MBA Statistics 51-651-00 COURSE #4 Simple and multiple linear regression What should be the sales of ice cream? Example: Before beginning building a movie theater, one must estimate the daily number of

More information

Start with review, some new definitions, and pictures on the white board. Assumptions in the Normal Linear Regression Model

Start with review, some new definitions, and pictures on the white board. Assumptions in the Normal Linear Regression Model Start with review, some new definitions, and pictures on the white board. Assumptions in the Normal Linear Regression Model A1: There is a linear relationship between X and Y. A2: The error terms (and

More information

Continuous distributions

Continuous distributions Continuous distributions In contrast to discrete random variables, like the Binomial distribution, in many situations the possible values of a random variable cannot be counted. For example, the measurement

More information

Histogram of Residuals. Residual Normal Probability Plot. Reg. Analysis Check Model Utility. (con t) Check Model Utility. Inference.

Histogram of Residuals. Residual Normal Probability Plot. Reg. Analysis Check Model Utility. (con t) Check Model Utility. Inference. Steps for Regression Simple Linear Regression Make a Scatter plot Does it make sense to plot a line? Check Residual Plot (Residuals vs. X) Are there any patterns? Check Histogram of Residuals Is it Normal?

More information

Simple Linear Regression. Steps for Regression. Example. Make a Scatter plot. Check Residual Plot (Residuals vs. X)

Simple Linear Regression. Steps for Regression. Example. Make a Scatter plot. Check Residual Plot (Residuals vs. X) Simple Linear Regression 1 Steps for Regression Make a Scatter plot Does it make sense to plot a line? Check Residual Plot (Residuals vs. X) Are there any patterns? Check Histogram of Residuals Is it Normal?

More information

WISE Regression/Correlation Interactive Lab. Introduction to the WISE Correlation/Regression Applet

WISE Regression/Correlation Interactive Lab. Introduction to the WISE Correlation/Regression Applet WISE Regression/Correlation Interactive Lab Introduction to the WISE Correlation/Regression Applet This tutorial focuses on the logic of regression analysis with special attention given to variance components.

More information

Multiple Regression an Introduction. Stat 511 Chap 9

Multiple Regression an Introduction. Stat 511 Chap 9 Multiple Regression an Introduction Stat 511 Chap 9 1 case studies meadowfoam flowers brain size of mammals 2 case study 1: meadowfoam flowering designed experiment carried out in a growth chamber general

More information

Interpret Standard Deviation. Outlier Rule. Describe the Distribution OR Compare the Distributions. Linear Transformations SOCS. Interpret a z score

Interpret Standard Deviation. Outlier Rule. Describe the Distribution OR Compare the Distributions. Linear Transformations SOCS. Interpret a z score Interpret Standard Deviation Outlier Rule Linear Transformations Describe the Distribution OR Compare the Distributions SOCS Using Normalcdf and Invnorm (Calculator Tips) Interpret a z score What is an

More information

R 2 and F -Tests and ANOVA

R 2 and F -Tests and ANOVA R 2 and F -Tests and ANOVA December 6, 2018 1 Partition of Sums of Squares The distance from any point y i in a collection of data, to the mean of the data ȳ, is the deviation, written as y i ȳ. Definition.

More information