Chapter 8 Handout: Interval Estimates and Hypothesis Testing

Size: px
Start display at page:

Download "Chapter 8 Handout: Interval Estimates and Hypothesis Testing"

Transcription

1 Chapter 8 Handout: Interval Estimates and Hypothesis esting Preview Clint s Assignment: aking Stock General Properties of the Ordinary Least Squares (OLS) Estimation Procedure Estimate Reliability: Interval Estimate Question o Normal Distribution versus the Student t-distribution: One Last Complication o Assessing the Reliability of a Coefficient Estimate: Applying the Student t-distribution heory Assessment: Hypothesis esting Summary: he Ordinary Least Squares (OLS) Estimation Procedure o Regression Model and the Role of the Error erm o Standard Ordinary Least Squares (OLS) Premises o Ordinary Least Squares (OLS) Estimation Procedure: hree Important Parts Value of the coefficient itself Variance of the error term s probability distribution Variance of the coefficient estimate s probability distribution o Properties of the Ordinary Least Squares (OLS) Estimation Procedure When the Standard Ordinary Least Squares (OLS) Premises Are Met: Each estimation procedure is unbiased. he estimation procedure for the coefficient value is the best linear unbiased estimation procedure (BLUE). Causation versus Correlation Assignment: aking Stock heory: Additional studying increases quiz scores. Regression Model: y t = Const + x x t + e t o y t = Quiz score Const = Points for showing up o x t = Minutes studied x = Points for each minute studied o e t = Error term e t reflects random influences: Mean[e t ] = 0 Ordinary Least Squares (OLS) Estimation Procedure: Apply the ordinary least squares estimation procedure; use data from Professor Lord s first quiz to estimate the values of Const and x by finding the best fitting line, the equation that minimizes the sum of squared residuals: First Quiz Data Student x y Ordinary Least Squares (OLS) Estimates: Esty = x b Const = y b x x = = 63 5 b Const = Estimated points for showing up = 63 ( yt y)( xt x) t x ( xt x) t1 Clint s Assignment Coefficient Reliability: How reliable is the coefficient estimate calculated from the results of the first quiz? hat is, how confident should Clint be that the coefficient estimate, 1.2, will be close to the actual value? heory Confidence: How much confidence should Clint have in the theory that additional studying increases quiz scores? b 1.2 b x = Estimated points for each minute studied = 1.2

2 2 General Properties of the Ordinary Least Squares (OLS) Estimation Procedure When the standard ordinary least squares (OLS) premises are satisfied, the following equations describe the coefficient estimate s general properties, the estimate s probability distribution: Mean[b x ] = x Var[ e] Var[ bx ] 2 ( x x) Importance of the Probability Distribution s Mean (Center) and Variance (Spread) Mean: When the mean of the estimate s probability distribution, Mean[b x ], equals the actual value of the coefficient, x, the estimation procedure is unbiased. o Unbiased does not mean that the estimate will equal the actual value. In fact, we can be all but certain that the estimate will not equal the actual value. o What unbiased does mean that the estimation procedure does not systematically underestimate or overestimate the actual value. Formally, the mean of the estimate s probability distribution equals the actual value. Furthermore, if the estimate s probability distribution is symmetric, the chances that the estimate is too high equal the chances that it is too low. Variance: When the estimation procedure is unbiased, the variance of the estimate s probability distribution, Var[b x ], determines the reliability of the estimate. o As the variance decreases, the estimate is more likely to be close to the actual value. o he Problem: In reality, neither Clint nor we know the variance of the error term s probability distribution which is needed to calculate the variance of the coefficient estimate s probability distribution. How can of Clint proceed? o Solution: A wo Step Approach Step 1: Estimate the variance of the error term s probability distribution. Step 2: Use the estimate of the variance of the error term s probability distribution to estimate the variance of the coefficient estimate s probability distribution. Clint s Assignment: Coefficient Reliability. How reliable is the coefficient estimate, 1.2, calculated from the first quiz? hat is, how confident should Clint be that the coefficient estimate, 1.2, will be close to the actual value? Interval Estimate Question: What is the probability that the coefficient estimate, 1.2, lies within of the actual coefficient value? One Last Complication Unfortunately, before we can tackle Clint s assignment, we must address one last complication: o use the normal distribution we We do not know the actual value must know the actual value of the of the variance for the coefficient variance (and the standard deviation) estimate s probability distribution. of the random variable s probability distribution. We must estimate its variance. t1 t We cannot use the normal distribution when dealing with the coefficient estimate. Instead we must use another distribution, the Student t-distribution.

3 3 Normal Distribution Versus the Student t-distribution Estimating the standard deviation introduces an additional element of uncertainty; hence, the t-distribution is more than the normal distribution. Probability Distribution of Random Variable Normal he Student t-distribution s spread depends on the _. As the degrees of freedom increase, we have information.. Hence, the distribution s spread is. Distribution Mean Value of Random Variable he Normal Distribution s z and the Student t-distribution s t: When the standard deviation, is known, we use the normal distribution: z = Value of Random Variable Distribution Mean Standard Deviation of Random Variable = Number of Standard Deviations from the Mean o calculate probabilities with the normal distribution we need only the value of. When the standard deviation is not known it must be estimated; we must use the Student t- distribution rather than the normal distribution: t = Value of Random Variable Distribution Mean Estimated Standard Deviation of Random Variable = Number of Estimated Standard Deviations (Standard Errors) from the Mean o calculate probabilities with the Student t-distribution we need the value of _. the _.

4 4 Clint s Assignment: Coefficient Reliability. How reliable is the coefficient estimate, 1.2, calculated from the first quiz? hat is, how confident should Clint be that the coefficient estimate, 1.2, will be close to the actual value? Interval Estimate Question: What is the probability that the coefficient estimate, 1.2, lies within of the actual coefficient value? First Blank: We begin by filling in the first blank, choosing our close to criterion. Suppose that we choose 1.5: Close o Criterion = So, we write 1.50 in the first blank. Next, draw the picture. Question: Why does the mean of the coefficient estimate s probability distribution equal the actual coefficient value, x? Probability that the estimate is within of the actual value β x t = Second Blank: Calculate the probability. Ordinary Least Squares (OLS) Dependent Variable: y Explanatory Variable(s): Estimate SE t-statistic Prob x Const Number of Observations 3 Convert 1.50 into standard errors: 1.50 = _ SE s. he probability that the he probability that the estimate lies within 1.50 of = estimate lies within SE s of the actual value. the actual value. Between t s of and Degrees of Freedom = Sample Size Number of Estimated Parameters = _ _ = _ Using the Econometrics Lab: Lab 8.1b Lab 8.1a Degrees of Freedom: Left tail: t: Right tail: t: Left tail probability = SE s SE s Actual Value = β x Student t-distribution Mean[b x ] = β x SE[b x ] = DF = β x + t = b x Right tail probability = he probability that the estimate lies within 1.5 of the actual value: 1.00 ( + ) = 1.00 =.

5 5 Clint s Assignment: heory Confidence. How much confidence should Clint have in the theory that additional studying increases quiz scores? heory: Additional studying increases quiz scores. Step 0: Construct a model reflecting the theory to be tested: y t = Const + x x t + e t y t = quiz score x t = minutes studied e t = error term Const reflects points for showing up x reflects points for each minute studied he theory suggests that x should be positive. heory: x > 0. Step 1: Collect data, run the regression, and interpret the estimates First Quiz Data Student x y Ordinary Least Squares (OLS) Estimates: Esty = x b Const = y b x x = = 63 5 b Const = Estimated points for showing up = 63 b ( yt y)( xt x) t x ( xt x) t1 1.2 b x = Estimated points for each minute studied = 1.2 Ordinary Least Squares (OLS) Dependent Variable: y Explanatory Variable(s): Estimate SE t-statistic Prob x Const Number of Observations 3 Sum Squared Residuals SE of Regression Estimated Equation: Esty = x Interpretation: Focus on the parameter estimates: he estimate of the constant suggests that students receive points for showing up; he estimate of the coefficient suggests that students receive additional points for each additional minute of studied. Critical Result: he coefficient estimate equals. he sign of the coefficient estimate suggests that additional studying increases quiz scores. his evidence our theory.

6 6 Step 2: Play the cynic, challenge the evidence, and construct the null and alternative hypotheses. Cynic s View: Sure the coefficient estimate from the first quiz data was positive, but this result was just the luck of the draw. In fact, studying actually has no impact on a student s quiz score; the actual coefficient, x, equals 0. H 0 : x = 0 Cynic is correct: Studying has no impact on a student s quiz score H 1 : x > 0 Cynic is incorrect: Additional studying increases quiz scores Lab 8.2 Probability Distribution of Coefficient Estimates Question: Can we rule out the possibility that the cynic might be correct? _ If the actual coefficient, x, equals 0: Mean[b x ] = 0 he distribution mean (center) would equal 0. Why? If β x = 0 Prob[b x > 0].50 herefore, the probability that the coefficient estimate from one repetition of the experiment (quiz) would be positive equals about _. Step 3: Formulate the question to assess the cynic s view. 0 b x Question to Assess Cynic s View: Generic Question: What is the probability that the results would be like those we actually obtained (or even stronger), if the cynic is correct and studying actually has no impact? Specific Question: he regression s coefficient estimate was 1.2: What is the probability that the coefficient estimate in one regression would be 1.2 or more, if H 0 were actually true (if the actual coefficient, x, equals 0)? Answer: Prob[Results IF Cynic Correct] or equivalently rue] he size of this probability determines whether or not we reject the null hypothesis: rue] small that H 0 is true rue] large that H 0 is true H 0 H 0 rue]: What is the probability that the coefficient estimate in one regression would be 1.2 or more, if H 0 were actually true (if the actual coefficient, x, equals 0)?

7 7 Step 4: Use the general properties of the estimation procedure, the properties of the probability distribution of the estimates, to calculate rue]. Since ordinary least squares estimation procedure for the Probability Distribution coefficient value is unbiased, the mean of the probability distribution for the estimate equals. If the null hypothesis were true, the actual coefficient would equal. he standard error equals _. he degrees of freedom equal. Student t-distribution Mean = SE = DF = b x OLS estimation Assume H 0 Standard Number of Number of procedure unbiased is true Error Observations Parameters Mean[b x ] = = SE[b x ] = DF = = Econometrics Lab: t = _ Lab 8.3 rue] =. An easier way to calculate rue]: Most statistical software packages make this easy for us. Focus attention on the t-statistic and Prob. columns. Ordinary Least Squares (OLS) Dependent Variable: y Explanatory Variable(s): Estimate SE t-statistic Prob x Const Number of Observations 3 Sum Squared Residuals SE of Regression Estimated Equation: Esty = x t-statistic Column: What does the t-statistic column report? Question: How many standard errors (estimated standard deviations) does the coefficient estimate, 1,2, lie from 0? Answer: Coefficient Column Std. Error Column = = Question: What does the t-statistic column equal? Answer: _

8 8 Ordinary Least Squares (OLS) Dependent Variable: y Explanatory Variable(s): Estimate SE t-statistic Prob x Const Number of Observations 3 Sum Squared Residuals SE of Regression Estimated Equation: Esty = x Prob. Column: Question: What does the Prob. Column Report? Question: What is the tails probability, the probability that the coefficient estimate, b x, resulting from one regression would will lie at least 1.2 from 0, if the actual coefficient, x, equals 0. Answer: Probability Distribution Student t-distribution Mean = SE = DF = Question: What does the Prob. column equal? Answer: _ b x Prob. Column Answer: rue] = 2 Step 5: Decide on the standard of proof, a significance level = 2. he significance level is the dividing line between the probability being small and the probability being large. rue] Less han Significance Level rue] that H 0 is true rue] Greater han Significance Level rue] that H 0 is true H 0 H 0 Would we reject H 0 at a 1 percent (.01) significance level? Would we reject H 0 at a 5 percent (.05) significance level? Would we reject H 0 at a 10 percent (.10) significance level? At the traditional significance levels we reject the null hypothesis that studying has no impact on quiz scores.

9 9 Summary: he Ordinary Least Squares Estimation Procedure Now, let us sum up what we have learned about the ordinary least squares (OLS) estimation procedure: Regression Model y t = Const + x x t + e t y t = Dependent variable x t = Explanatory variable e t = Error term t = 1, 2,, = Sample size Role of the Error erm he error term is a random variable; it represents random influences. he mean of the error term s probability distribution equals 0: Mean[e t ] = 0 Standard Ordinary Least Squares (OLS) Premises Error erm Equal Variance Premise: he variance of the error term s probability distribution for each observation is the same; all the variances equal Var[e]: Var[e 1 ] = Var[e 2 ] = = Var[e ] = Var[e] Error erm/error erm Independence Premise: he error terms are independent: Cov[e i, e j ] = 0. Knowing the value of the error term from one observation does not help you predict the value of the error term for any other observation. Explanatory Variable/Error erm Independence Premise: he explanatory variables, the x t s, and the error terms, the e t s, are not correlated. Knowing the value of an observation s explanatory variable does not help you predict the value of that observation s error term. Ordinary least squares estimation procedure (OLS) includes three procedures to estimate the: Values of the regression parameters, x and Const : ( yt y)( xt x) t1 bx 2 ( x x) and b Const = y b x x Variance of the error term s probability distribution, Var[e]: EstVar[e] = t1 t SSR Degrees of Freedom Variance of the coefficient estimate s probability distribution, Var[b x ]: EstVar[ b ] x EstVar[ e] t1 ( x x) t 2 Properties of the Ordinary Least Squares Estimation Procedure: When the standard regression premises are met: Each estimation procedure is unbiased; each estimation procedure does not systematically underestimate or overestimate the actual value. he ordinary least squares (OLS) estimation procedure for the coefficient value is the best linear unbiased estimation procedure (BLUE).

10 10 Causation versus Correlation Our theory and Step 0 illustrate the important distinction between causation and correlation: heory: Additional studying increases quiz scores. Step 0: Formulate a model reflecting the theory to be tested. y t = Const + x x t + e t y t = Quiz score Const reflects points for showing up x t = Minutes studied x reflects points for each minute studied e t = Error term he theory suggests that x is positive. heory: x > 0. Increase in studying (x t ) _ Our model is a model. Quiz score to increase (y t ) Question: Does causation imply correlation? If our theory is correct, does knowing the number of minutes a student studies help us to predict his/her quiz score? _ If our theory is correct, does knowing a student s quiz score helps us predict the number of minutes he/she has studied? _ Question: Does correlation necessarily imply causation? Consider the win Cities, Minneapolis and St. Paul. Is rainfall in Minneapolis and St. Paul correlated? o Does knowing whether or not it rains in Minneapolis help us predict whether or not it will rain in St. Paul? o Does knowing whether or not it rains in St. Paul help us predict whether or not it will rain in Minneapolis? Is there a causal relationship between rainfall in Minneapolis and St. Paul? o o Does rain in Minneapolis cause rain in St. Paul? Does rain in St. Paul cause rain in Minneapolis? Summary: Causation versus Correlation Causation correlation. Correlation causation.

Wednesday, September 26 Handout: Estimating the Variance of an Estimate s Probability Distribution

Wednesday, September 26 Handout: Estimating the Variance of an Estimate s Probability Distribution Amherst College Department of Economics Economics 60 Fall 2012 Wednesday, September 26 Handout: Estimating the Variance of an Estimate s Probability Distribution Preview: Review: Ordinary Least Squares

More information

Wednesday, September 19 Handout: Ordinary Least Squares Estimation Procedure The Mechanics

Wednesday, September 19 Handout: Ordinary Least Squares Estimation Procedure The Mechanics Amherst College Department of Economics Economics Fall 2012 Wednesday, September 19 Handout: Ordinary Least Squares Estimation Procedure he Mechanics Preview Best Fitting Line: Income and Savings Clint

More information

Chapter 11 Handout: Hypothesis Testing and the Wald Test

Chapter 11 Handout: Hypothesis Testing and the Wald Test Chapter 11 Handout: Hypothesis Testing and the Wald Test Preview No Money Illusion Theory: Calculating True] o Clever Algebraic Manipulation o Wald Test Restricted Regression Reflects Unrestricted Regression

More information

Monday, November 26: Explanatory Variable Explanatory Premise, Bias, and Large Sample Properties

Monday, November 26: Explanatory Variable Explanatory Premise, Bias, and Large Sample Properties Amherst College Department of Economics Economics 360 Fall 2012 Monday, November 26: Explanatory Variable Explanatory Premise, Bias, and Large Sample Properties Chapter 18 Outline Review o Regression Model

More information

Chapter 5: Ordinary Least Squares Estimation Procedure The Mechanics Chapter 5 Outline Best Fitting Line Clint s Assignment Simple Regression Model o

Chapter 5: Ordinary Least Squares Estimation Procedure The Mechanics Chapter 5 Outline Best Fitting Line Clint s Assignment Simple Regression Model o Chapter 5: Ordinary Least Squares Estimation Procedure The Mechanics Chapter 5 Outline Best Fitting Line Clint s Assignment Simple Regression Model o Parameters of the Model o Error Term and Random Influences

More information

Wednesday, October 17 Handout: Hypothesis Testing and the Wald Test

Wednesday, October 17 Handout: Hypothesis Testing and the Wald Test Amherst College Department of Economics Economics 360 Fall 2012 Wednesday, October 17 Handout: Hypothesis Testing and the Wald Test Preview No Money Illusion Theory: Calculating True] o Clever Algebraic

More information

Wednesday, October 10 Handout: One-Tailed Tests, Two-Tailed Tests, and Logarithms

Wednesday, October 10 Handout: One-Tailed Tests, Two-Tailed Tests, and Logarithms Amherst College Department of Economics Economics 360 Fall 2012 Wednesday, October 10 Handout: One-Tailed Tests, Two-Tailed Tests, and Logarithms Preview A One-Tailed Hypothesis Test: The Downward Sloping

More information

[Mean[e j ] Mean[e i ]]

[Mean[e j ] Mean[e i ]] Amherst College Department of Economics Economics 360 Fall 202 Solutions: Wednesday, September 26. Assume that the standard ordinary least square (OLS) premises are met. Let (x i, y i ) and (, y j ) be

More information

Hint: The following equation converts Celsius to Fahrenheit: F = C where C = degrees Celsius F = degrees Fahrenheit

Hint: The following equation converts Celsius to Fahrenheit: F = C where C = degrees Celsius F = degrees Fahrenheit Amherst College Department of Economics Economics 360 Fall 2014 Exam 1: Solutions 1. (10 points) The following table in reports the summary statistics for high and low temperatures in Key West, FL from

More information

Solutions: Monday, October 15

Solutions: Monday, October 15 Amherst College Department of Economics Economics 360 Fall 2012 1. Consider Nebraska petroleum consumption. Solutions: Monday, October 15 Petroleum Consumption Data for Nebraska: Annual time series data

More information

Monday, October 15 Handout: Multiple Regression Analysis Introduction

Monday, October 15 Handout: Multiple Regression Analysis Introduction Amherst College Department of Economics Economics 360 Fall 2012 Monday, October 15 Handout: Multiple Regression Analysis Introduction Review Simple and Multiple Regression Analysis o Distinction between

More information

An Introduction to Econometrics. A Self-contained Approach. Frank Westhoff. The MIT Press Cambridge, Massachusetts London, England

An Introduction to Econometrics. A Self-contained Approach. Frank Westhoff. The MIT Press Cambridge, Massachusetts London, England An Introduction to Econometrics A Self-contained Approach Frank Westhoff The MIT Press Cambridge, Massachusetts London, England How to Use This Book xvii 1 Descriptive Statistics 1 Chapter 1 Prep Questions

More information

Chapter 10: Multiple Regression Analysis Introduction

Chapter 10: Multiple Regression Analysis Introduction Chapter 10: Multiple Regression Analysis Introduction Chapter 10 Outline Simple versus Multiple Regression Analysis Goal of Multiple Regression Analysis A One-Tailed Test: Downward Sloping Demand Theory

More information

Lectures 5 & 6: Hypothesis Testing

Lectures 5 & 6: Hypothesis Testing Lectures 5 & 6: Hypothesis Testing in which you learn to apply the concept of statistical significance to OLS estimates, learn the concept of t values, how to use them in regression work and come across

More information

Chapter 15: Other Regression Statistics and Pitfalls

Chapter 15: Other Regression Statistics and Pitfalls Chapter 15: Other Regression Statistics and Pitfalls Chapter 15 Outline Two-Tailed Confidence Intervals o Confidence Interval Approach: Which Theories Are Consistent with the Data? o A Confidence Interval

More information

Multiple Regression Analysis

Multiple Regression Analysis Chapter 4 Multiple Regression Analysis The simple linear regression covered in Chapter 2 can be generalized to include more than one variable. Multiple regression analysis is an extension of the simple

More information

ECO220Y Simple Regression: Testing the Slope

ECO220Y Simple Regression: Testing the Slope ECO220Y Simple Regression: Testing the Slope Readings: Chapter 18 (Sections 18.3-18.5) Winter 2012 Lecture 19 (Winter 2012) Simple Regression Lecture 19 1 / 32 Simple Regression Model y i = β 0 + β 1 x

More information

Multiple Regression Analysis

Multiple Regression Analysis Multiple Regression Analysis y = β 0 + β 1 x 1 + β 2 x 2 +... β k x k + u 2. Inference 0 Assumptions of the Classical Linear Model (CLM)! So far, we know: 1. The mean and variance of the OLS estimators

More information

The Simple Linear Regression Model

The Simple Linear Regression Model The Simple Linear Regression Model Lesson 3 Ryan Safner 1 1 Department of Economics Hood College ECON 480 - Econometrics Fall 2017 Ryan Safner (Hood College) ECON 480 - Lesson 3 Fall 2017 1 / 77 Bivariate

More information

Econ 3790: Business and Economics Statistics. Instructor: Yogesh Uppal

Econ 3790: Business and Economics Statistics. Instructor: Yogesh Uppal Econ 3790: Business and Economics Statistics Instructor: Yogesh Uppal yuppal@ysu.edu Sampling Distribution of b 1 Expected value of b 1 : Variance of b 1 : E(b 1 ) = 1 Var(b 1 ) = σ 2 /SS x Estimate of

More information

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS In our work on hypothesis testing, we used the value of a sample statistic to challenge an accepted value of a population parameter. We focused only

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

Linear Regression with 1 Regressor. Introduction to Econometrics Spring 2012 Ken Simons

Linear Regression with 1 Regressor. Introduction to Econometrics Spring 2012 Ken Simons Linear Regression with 1 Regressor Introduction to Econometrics Spring 2012 Ken Simons Linear Regression with 1 Regressor 1. The regression equation 2. Estimating the equation 3. Assumptions required for

More information

Amherst College Department of Economics Economics 360 Fall 2012

Amherst College Department of Economics Economics 360 Fall 2012 Amherst College Department of Economics Economics 360 Fall 2012 Monday, December 3: Omitted Variables and the Instrumental Variable Estimation Procedure Chapter 20 Outline Revisit Omitted Explanatory Variable

More information

Statistical Inference with Regression Analysis

Statistical Inference with Regression Analysis Introductory Applied Econometrics EEP/IAS 118 Spring 2015 Steven Buck Lecture #13 Statistical Inference with Regression Analysis Next we turn to calculating confidence intervals and hypothesis testing

More information

Mathematical Notation Math Introduction to Applied Statistics

Mathematical Notation Math Introduction to Applied Statistics Mathematical Notation Math 113 - Introduction to Applied Statistics Name : Use Word or WordPerfect to recreate the following documents. Each article is worth 10 points and should be emailed to the instructor

More information

Lecture 11: Simple Linear Regression

Lecture 11: Simple Linear Regression Lecture 11: Simple Linear Regression Readings: Sections 3.1-3.3, 11.1-11.3 Apr 17, 2009 In linear regression, we examine the association between two quantitative variables. Number of beers that you drink

More information

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies The t-test: So Far: Sampling distribution benefit is that even if the original population is not normal, a sampling distribution based on this population will be normal (for sample size > 30). Benefit

More information

Econometrics. 4) Statistical inference

Econometrics. 4) Statistical inference 30C00200 Econometrics 4) Statistical inference Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Confidence intervals of parameter estimates Student s t-distribution

More information

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses. 1 Review: Let X 1, X,..., X n denote n independent random variables sampled from some distribution might not be normal!) with mean µ) and standard deviation σ). Then X µ σ n In other words, X is approximately

More information

AMS 7 Correlation and Regression Lecture 8

AMS 7 Correlation and Regression Lecture 8 AMS 7 Correlation and Regression Lecture 8 Department of Applied Mathematics and Statistics, University of California, Santa Cruz Suumer 2014 1 / 18 Correlation pairs of continuous observations. Correlation

More information

Single Sample Means. SOCY601 Alan Neustadtl

Single Sample Means. SOCY601 Alan Neustadtl Single Sample Means SOCY601 Alan Neustadtl The Central Limit Theorem If we have a population measured by a variable with a mean µ and a standard deviation σ, and if all possible random samples of size

More information

ECON Introductory Econometrics. Lecture 5: OLS with One Regressor: Hypothesis Tests

ECON Introductory Econometrics. Lecture 5: OLS with One Regressor: Hypothesis Tests ECON4150 - Introductory Econometrics Lecture 5: OLS with One Regressor: Hypothesis Tests Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 5 Lecture outline 2 Testing Hypotheses about one

More information

FinQuiz Notes

FinQuiz Notes Reading 10 Multiple Regression and Issues in Regression Analysis 2. MULTIPLE LINEAR REGRESSION Multiple linear regression is a method used to model the linear relationship between a dependent variable

More information

Wednesday, November 7 Handout: Heteroskedasticity

Wednesday, November 7 Handout: Heteroskedasticity Amhers College Deparmen of Economics Economics 360 Fall 202 Wednesday, November 7 Handou: Heeroskedasiciy Preview Review o Regression Model o Sandard Ordinary Leas Squares (OLS) Premises o Esimaion Procedures

More information

ECON Introductory Econometrics. Lecture 7: OLS with Multiple Regressors Hypotheses tests

ECON Introductory Econometrics. Lecture 7: OLS with Multiple Regressors Hypotheses tests ECON4150 - Introductory Econometrics Lecture 7: OLS with Multiple Regressors Hypotheses tests Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 7 Lecture outline 2 Hypothesis test for single

More information

Applied Econometrics. Professor Bernard Fingleton

Applied Econometrics. Professor Bernard Fingleton Applied Econometrics Professor Bernard Fingleton 1 Causation & Prediction 2 Causation One of the main difficulties in the social sciences is estimating whether a variable has a true causal effect Data

More information

ECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47

ECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47 ECON2228 Notes 2 Christopher F Baum Boston College Economics 2014 2015 cfb (BC Econ) ECON2228 Notes 2 2014 2015 1 / 47 Chapter 2: The simple regression model Most of this course will be concerned with

More information

appstats27.notebook April 06, 2017

appstats27.notebook April 06, 2017 Chapter 27 Objective Students will conduct inference on regression and analyze data to write a conclusion. Inferences for Regression An Example: Body Fat and Waist Size pg 634 Our chapter example revolves

More information

Chapter 13: Dummy and Interaction Variables

Chapter 13: Dummy and Interaction Variables Chapter 13: Dummy and eraction Variables Chapter 13 Outline Preliminary Mathematics: Averages and Regressions Including Only a Constant An Example: Discrimination in Academia o Average Salaries o Dummy

More information

Chapter 10. Regression. Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania

Chapter 10. Regression. Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania Chapter 10 Regression Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania Scatter Diagrams A graph in which pairs of points, (x, y), are

More information

Simple Linear Regression: One Quantitative IV

Simple Linear Regression: One Quantitative IV Simple Linear Regression: One Quantitative IV Linear regression is frequently used to explain variation observed in a dependent variable (DV) with theoretically linked independent variables (IV). For example,

More information

y ˆ i = ˆ " T u i ( i th fitted value or i th fit)

y ˆ i = ˆ  T u i ( i th fitted value or i th fit) 1 2 INFERENCE FOR MULTIPLE LINEAR REGRESSION Recall Terminology: p predictors x 1, x 2,, x p Some might be indicator variables for categorical variables) k-1 non-constant terms u 1, u 2,, u k-1 Each u

More information

Stats Review Chapter 14. Mary Stangler Center for Academic Success Revised 8/16

Stats Review Chapter 14. Mary Stangler Center for Academic Success Revised 8/16 Stats Review Chapter 14 Revised 8/16 Note: This review is meant to highlight basic concepts from the course. It does not cover all concepts presented by your instructor. Refer back to your notes, unit

More information

Mathematics for Economics MA course

Mathematics for Economics MA course Mathematics for Economics MA course Simple Linear Regression Dr. Seetha Bandara Simple Regression Simple linear regression is a statistical method that allows us to summarize and study relationships between

More information

Chapter 27 Summary Inferences for Regression

Chapter 27 Summary Inferences for Regression Chapter 7 Summary Inferences for Regression What have we learned? We have now applied inference to regression models. Like in all inference situations, there are conditions that we must check. We can test

More information

Review of Multiple Regression

Review of Multiple Regression Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate

More information

Regression Analysis: Exploring relationships between variables. Stat 251

Regression Analysis: Exploring relationships between variables. Stat 251 Regression Analysis: Exploring relationships between variables Stat 251 Introduction Objective of regression analysis is to explore the relationship between two (or more) variables so that information

More information

Interpret Standard Deviation. Outlier Rule. Describe the Distribution OR Compare the Distributions. Linear Transformations SOCS. Interpret a z score

Interpret Standard Deviation. Outlier Rule. Describe the Distribution OR Compare the Distributions. Linear Transformations SOCS. Interpret a z score Interpret Standard Deviation Outlier Rule Linear Transformations Describe the Distribution OR Compare the Distributions SOCS Using Normalcdf and Invnorm (Calculator Tips) Interpret a z score What is an

More information

Chapter 12 - Lecture 2 Inferences about regression coefficient

Chapter 12 - Lecture 2 Inferences about regression coefficient Chapter 12 - Lecture 2 Inferences about regression coefficient April 19th, 2010 Facts about slope Test Statistic Confidence interval Hypothesis testing Test using ANOVA Table Facts about slope In previous

More information

Warm-up Using the given data Create a scatterplot Find the regression line

Warm-up Using the given data Create a scatterplot Find the regression line Time at the lunch table Caloric intake 21.4 472 30.8 498 37.7 335 32.8 423 39.5 437 22.8 508 34.1 431 33.9 479 43.8 454 42.4 450 43.1 410 29.2 504 31.3 437 28.6 489 32.9 436 30.6 480 35.1 439 33.0 444

More information

Inferences for Regression

Inferences for Regression Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In

More information

School of Mathematical Sciences. Question 1

School of Mathematical Sciences. Question 1 School of Mathematical Sciences MTH5120 Statistical Modelling I Practical 8 and Assignment 7 Solutions Question 1 Figure 1: The residual plots do not contradict the model assumptions of normality, constant

More information

9.5 t test: one μ, σ unknown

9.5 t test: one μ, σ unknown GOALS: 1. Recognize the assumptions for a 1 mean t test (srs, nd or large sample size, population stdev. NOT known). 2. Understand that the actual p value (area in the tail past the test statistic) is

More information

1 Correlation and Inference from Regression

1 Correlation and Inference from Regression 1 Correlation and Inference from Regression Reading: Kennedy (1998) A Guide to Econometrics, Chapters 4 and 6 Maddala, G.S. (1992) Introduction to Econometrics p. 170-177 Moore and McCabe, chapter 12 is

More information

s e, which is large when errors are large and small Linear regression model

s e, which is large when errors are large and small Linear regression model Linear regression model we assume that two quantitative variables, x and y, are linearly related; that is, the the entire population of (x, y) pairs are related by an ideal population regression line y

More information

Chapter 12: Model Specification and Development

Chapter 12: Model Specification and Development Chapter 12: Model Specification and Development Chapter 12 Outline Model Specification: Ramsey REgression Specification Error Test (RESET) o RESET Logic o Linear Demand Model o Constant Elasticity Demand

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple

More information

Can you tell the relationship between students SAT scores and their college grades?

Can you tell the relationship between students SAT scores and their college grades? Correlation One Challenge Can you tell the relationship between students SAT scores and their college grades? A: The higher SAT scores are, the better GPA may be. B: The higher SAT scores are, the lower

More information

Correlation and Linear Regression

Correlation and Linear Regression Correlation and Linear Regression Correlation: Relationships between Variables So far, nearly all of our discussion of inferential statistics has focused on testing for differences between group means

More information

The t-statistic. Student s t Test

The t-statistic. Student s t Test The t-statistic 1 Student s t Test When the population standard deviation is not known, you cannot use a z score hypothesis test Use Student s t test instead Student s t, or t test is, conceptually, very

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 15: Examples of hypothesis tests (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 32 The recipe 2 / 32 The hypothesis testing recipe In this lecture we repeatedly apply the

More information

Note on Bivariate Regression: Connecting Practice and Theory. Konstantin Kashin

Note on Bivariate Regression: Connecting Practice and Theory. Konstantin Kashin Note on Bivariate Regression: Connecting Practice and Theory Konstantin Kashin Fall 2012 1 This note will explain - in less theoretical terms - the basics of a bivariate linear regression, including testing

More information

Chapter 15 Panel Data Models. Pooling Time-Series and Cross-Section Data

Chapter 15 Panel Data Models. Pooling Time-Series and Cross-Section Data Chapter 5 Panel Data Models Pooling Time-Series and Cross-Section Data Sets of Regression Equations The topic can be introduced wh an example. A data set has 0 years of time series data (from 935 to 954)

More information

Multiple Regression Analysis: Inference MULTIPLE REGRESSION ANALYSIS: INFERENCE. Sampling Distributions of OLS Estimators

Multiple Regression Analysis: Inference MULTIPLE REGRESSION ANALYSIS: INFERENCE. Sampling Distributions of OLS Estimators 1 2 Multiple Regression Analysis: Inference MULTIPLE REGRESSION ANALYSIS: INFERENCE Hüseyin Taştan 1 1 Yıldız Technical University Department of Economics These presentation notes are based on Introductory

More information

Wooldridge, Introductory Econometrics, 4th ed. Chapter 2: The simple regression model

Wooldridge, Introductory Econometrics, 4th ed. Chapter 2: The simple regression model Wooldridge, Introductory Econometrics, 4th ed. Chapter 2: The simple regression model Most of this course will be concerned with use of a regression model: a structure in which one or more explanatory

More information

Chapter 14: Omitted Explanatory Variables, Multicollinearity, and Irrelevant Explanatory Variables

Chapter 14: Omitted Explanatory Variables, Multicollinearity, and Irrelevant Explanatory Variables Chapter 14: Omitted Explanatory Variables, Multicollinearity, and Irrelevant Explanatory Variables Chapter 14 Outline Review o Unbiased Estimation Procedures Estimates and Random Variables Mean of the

More information

Ch 2: Simple Linear Regression

Ch 2: Simple Linear Regression Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component

More information

Harvard University. Rigorous Research in Engineering Education

Harvard University. Rigorous Research in Engineering Education Statistical Inference Kari Lock Harvard University Department of Statistics Rigorous Research in Engineering Education 12/3/09 Statistical Inference You have a sample and want to use the data collected

More information

HYPOTHESIS TESTING. Hypothesis Testing

HYPOTHESIS TESTING. Hypothesis Testing MBA 605 Business Analytics Don Conant, PhD. HYPOTHESIS TESTING Hypothesis testing involves making inferences about the nature of the population on the basis of observations of a sample drawn from the population.

More information

ECON3150/4150 Spring 2015

ECON3150/4150 Spring 2015 ECON3150/4150 Spring 2015 Lecture 3&4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo January 29, 2015 1 / 67 Chapter 4 in S&W Section 17.1 in S&W (extended OLS assumptions) 2

More information

Inference for Regression

Inference for Regression Inference for Regression Section 9.4 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 13b - 3339 Cathy Poliak, Ph.D. cathy@math.uh.edu

More information

Important note: Transcripts are not substitutes for textbook assignments. 1

Important note: Transcripts are not substitutes for textbook assignments. 1 In this lesson we will cover correlation and regression, two really common statistical analyses for quantitative (or continuous) data. Specially we will review how to organize the data, the importance

More information

2.1: Inferences about β 1

2.1: Inferences about β 1 Chapter 2 1 2.1: Inferences about β 1 Test of interest throughout regression: Need sampling distribution of the estimator b 1. Idea: If b 1 can be written as a linear combination of the responses (which

More information

INFERENCE FOR REGRESSION

INFERENCE FOR REGRESSION CHAPTER 3 INFERENCE FOR REGRESSION OVERVIEW In Chapter 5 of the textbook, we first encountered regression. The assumptions that describe the regression model we use in this chapter are the following. We

More information

Statistical Inference. Part IV. Statistical Inference

Statistical Inference. Part IV. Statistical Inference Part IV Statistical Inference As of Oct 5, 2017 Sampling Distributions of the OLS Estimator 1 Statistical Inference Sampling Distributions of the OLS Estimator Testing Against One-Sided Alternatives Two-Sided

More information

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit LECTURE 6 Introduction to Econometrics Hypothesis testing & Goodness of fit October 25, 2016 1 / 23 ON TODAY S LECTURE We will explain how multiple hypotheses are tested in a regression model We will define

More information

Statistical Analysis How do we know if it works? Group workbook: Cartoon from XKCD.com. Subscribe!

Statistical Analysis How do we know if it works? Group workbook: Cartoon from XKCD.com. Subscribe! Statistical Analysis How do we know if it works? Group workbook: Cartoon from XKCD.com. Subscribe! http://www.xkcd.com/552/ Significant Concepts We structure the presentation and processing of data to

More information

Homework Set 2, ECO 311, Fall 2014

Homework Set 2, ECO 311, Fall 2014 Homework Set 2, ECO 311, Fall 2014 Due Date: At the beginning of class on October 21, 2014 Instruction: There are twelve questions. Each question is worth 2 points. You need to submit the answers of only

More information

Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.

Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6. Chapter 7 Reading 7.1, 7.2 Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.112 Introduction In Chapter 5 and 6, we emphasized

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 5 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 44 Outline of Lecture 5 Now that we know the sampling distribution

More information

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01 An Analysis of College Algebra Exam s December, 000 James D Jones Math - Section 0 An Analysis of College Algebra Exam s Introduction Students often complain about a test being too difficult. Are there

More information

THE PEARSON CORRELATION COEFFICIENT

THE PEARSON CORRELATION COEFFICIENT CORRELATION Two variables are said to have a relation if knowing the value of one variable gives you information about the likely value of the second variable this is known as a bivariate relation There

More information

Chapter 4: Regression Models

Chapter 4: Regression Models Sales volume of company 1 Textbook: pp. 129-164 Chapter 4: Regression Models Money spent on advertising 2 Learning Objectives After completing this chapter, students will be able to: Identify variables,

More information

Regression with a Single Regressor: Hypothesis Tests and Confidence Intervals

Regression with a Single Regressor: Hypothesis Tests and Confidence Intervals Regression with a Single Regressor: Hypothesis Tests and Confidence Intervals (SW Chapter 5) Outline. The standard error of ˆ. Hypothesis tests concerning β 3. Confidence intervals for β 4. Regression

More information

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X.

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X. Estimating σ 2 We can do simple prediction of Y and estimation of the mean of Y at any value of X. To perform inferences about our regression line, we must estimate σ 2, the variance of the error term.

More information

Chapte The McGraw-Hill Companies, Inc. All rights reserved.

Chapte The McGraw-Hill Companies, Inc. All rights reserved. 12er12 Chapte Bivariate i Regression (Part 1) Bivariate Regression Visual Displays Begin the analysis of bivariate data (i.e., two variables) with a scatter plot. A scatter plot - displays each observed

More information

Chapter 10. Simple Linear Regression and Correlation

Chapter 10. Simple Linear Regression and Correlation Chapter 10. Simple Linear Regression and Correlation In the two sample problems discussed in Ch. 9, we were interested in comparing values of parameters for two distributions. Regression analysis is the

More information

Heteroskedasticity. Part VII. Heteroskedasticity

Heteroskedasticity. Part VII. Heteroskedasticity Part VII Heteroskedasticity As of Oct 15, 2015 1 Heteroskedasticity Consequences Heteroskedasticity-robust inference Testing for Heteroskedasticity Weighted Least Squares (WLS) Feasible generalized Least

More information

POLI 443 Applied Political Research

POLI 443 Applied Political Research POLI 443 Applied Political Research Session 4 Tests of Hypotheses The Normal Curve Lecturer: Prof. A. Essuman-Johnson, Dept. of Political Science Contact Information: aessuman-johnson@ug.edu.gh College

More information

Lecture (chapter 13): Association between variables measured at the interval-ratio level

Lecture (chapter 13): Association between variables measured at the interval-ratio level Lecture (chapter 13): Association between variables measured at the interval-ratio level Ernesto F. L. Amaral April 9 11, 2018 Advanced Methods of Social Research (SOCI 420) Source: Healey, Joseph F. 2015.

More information

LAB 2. HYPOTHESIS TESTING IN THE BIOLOGICAL SCIENCES- Part 2

LAB 2. HYPOTHESIS TESTING IN THE BIOLOGICAL SCIENCES- Part 2 LAB 2. HYPOTHESIS TESTING IN THE BIOLOGICAL SCIENCES- Part 2 Data Analysis: The mean egg masses (g) of the two different types of eggs may be exactly the same, in which case you may be tempted to accept

More information

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples Objective Section 9.4 Inferences About Two Means (Matched Pairs) Compare of two matched-paired means using two samples from each population. Hypothesis Tests and Confidence Intervals of two dependent means

More information

CHAPTER 10 Comparing Two Populations or Groups

CHAPTER 10 Comparing Two Populations or Groups CHAPTER 10 Comparing Two Populations or Groups 10. Comparing Two Means The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Comparing Two Means Learning

More information

CHAPTER 10 Comparing Two Populations or Groups

CHAPTER 10 Comparing Two Populations or Groups CHAPTER 10 Comparing Two Populations or Groups 10.2 Comparing Two Means The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Comparing Two Means Learning

More information

Ordinary Least Squares Regression Explained: Vartanian

Ordinary Least Squares Regression Explained: Vartanian Ordinary Least Squares Regression Explained: Vartanian When to Use Ordinary Least Squares Regression Analysis A. Variable types. When you have an interval/ratio scale dependent variable.. When your independent

More information

Ch 3: Multiple Linear Regression

Ch 3: Multiple Linear Regression Ch 3: Multiple Linear Regression 1. Multiple Linear Regression Model Multiple regression model has more than one regressor. For example, we have one response variable and two regressor variables: 1. delivery

More information

Lecture 30. DATA 8 Summer Regression Inference

Lecture 30. DATA 8 Summer Regression Inference DATA 8 Summer 2018 Lecture 30 Regression Inference Slides created by John DeNero (denero@berkeley.edu) and Ani Adhikari (adhikari@berkeley.edu) Contributions by Fahad Kamran (fhdkmrn@berkeley.edu) and

More information

Probability and Statistics

Probability and Statistics Probability and Statistics Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be CHAPTER 4: IT IS ALL ABOUT DATA 4a - 1 CHAPTER 4: IT

More information

Quantitative Methods for Economics, Finance and Management (A86050 F86050)

Quantitative Methods for Economics, Finance and Management (A86050 F86050) Quantitative Methods for Economics, Finance and Management (A86050 F86050) Matteo Manera matteo.manera@unimib.it Marzio Galeotti marzio.galeotti@unimi.it 1 This material is taken and adapted from Guy Judge

More information