Inference about the Slope and Intercept

Similar documents
Regression Models - Introduction

STATISTICAL DATA ANALYSIS IN EXCEL

Chapter 13 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics

Regression Models - Introduction

LINEAR REGRESSION ANALYSIS

Random Vectors. 1 Joint distribution of a random vector. 1 Joint distribution of a random vector

Research Design - - Topic 15a Introduction to Multivariate Analyses 2009 R.C. Gardner, Ph.D.

Inference for the Regression Coefficient

Linear correlation. Contents. 1 Linear correlation. 1.1 Introduction. Anthony Tanbakuchi Department of Mathematics Pima Community College

Chapter 1: Linear Regression with One Predictor Variable also known as: Simple Linear Regression Bivariate Linear Regression

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X.

0.24 adults 2. (c) Prove that, regardless of the possible values of and, the covariance between X and Y is equal to zero. Show all work.

Systems of Linear Equations: Solving by Graphing

6. Vector Random Variables

STAT Chapter 11: Regression

Joint ] X 5) P[ 6) P[ (, ) = y 2. x 1. , y. , ( x, y ) 2, (

Variance. Standard deviation VAR = = value. Unbiased SD = SD = 10/23/2011. Functional Connectivity Correlation and Regression.

Simple and Multiple Linear Regression

11. Regression and Least Squares

Topic - 12 Linear Regression and Correlation

The data can be downloaded as an Excel file under Econ2130 at

Lecture 1 Linear Regression with One Predictor Variable.p2

Dependence and scatter-plots. MVE-495: Lecture 4 Correlation and Regression

Scatter plot of data from the study. Linear Regression

Scatter plot of data from the study. Linear Regression

Correlation and Linear Regression

Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3

INF Introduction to classifiction Anne Solberg Based on Chapter 2 ( ) in Duda and Hart: Pattern Classification

Confidence Intervals, Testing and ANOVA Summary

University of California, Los Angeles Department of Statistics. Joint probability distributions

Correlation: Relationships between Variables

Chapter 14 Simple Linear Regression (A)

Inference for Regression Simple Linear Regression

Correlation and regression. Correlation and regression analysis. Measures of association. Why bother? Positive linear relationship

Review of Probability

Regression. Estimation of the linear function (straight line) describing the linear component of the joint relationship between two variables X and Y.

STAT5044: Regression and Anova. Inyoung Kim

Ch. 5 Joint Probability Distributions and Random Samples

Two-dimensional Random Vectors

Simple Linear Regression

Topic 14: Inference in Multiple Regression

Covariance and Correlation Class 7, Jeremy Orloff and Jonathan Bloom

Inference for Regression Inference about the Regression Model and Using the Regression Line

Correlation and the Analysis of Variance Approach to Simple Linear Regression

Statistical Techniques II EXST7015 Simple Linear Regression

Correlation analysis 2: Measures of correlation

Psychology 282 Lecture #4 Outline Inferences in SLR

Business Statistics. Lecture 10: Correlation and Linear Regression

Psychology 282 Lecture #3 Outline

Reminder: Student Instructional Rating Surveys

General Linear Model (Chapter 4)

Bivariate distributions

Chapter 2 Inferences in Simple Linear Regression

Weighted Least Squares

MA 575 Linear Models: Cedric E. Ginestet, Boston University Midterm Review Week 7

STAT 3A03 Applied Regression With SAS Fall 2017

Chapter 12 - Lecture 2 Inferences about regression coefficient

Chapter 11. Correlation and Regression

Part 6: Multivariate Normal and Linear Models

Chapter 1 Linear Regression with One Predictor

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.

Statistics for Engineers Lecture 9 Linear Regression

Simple Linear Regression

INF Anne Solberg One of the most challenging topics in image analysis is recognizing a specific object in an image.

Econ 3790: Statistics Business and Economics. Instructor: Yogesh Uppal

Chapter 11 Specification Error Analysis

18 Equilibrium equation of a laminated plate (a laminate)

The Random Effects Model Introduction

Copyright, 2008, R.E. Kass, E.N. Brown, and U. Eden REPRODUCTION OR CIRCULATION REQUIRES PERMISSION OF THE AUTHORS

Linear models and their mathematical foundations: Simple linear regression

TMA4255 Applied Statistics V2016 (5)

f X, Y (x, y)dx (x), where f(x,y) is the joint pdf of X and Y. (x) dx

RELATIONS AND FUNCTIONS through

First Year Examination Department of Statistics, University of Florida

,i = 1,2,L, p. For a sample of size n, let the columns of data be

Forestry 430 Advanced Biometrics and FRST 533 Problems in Statistical Methods Course Materials 2010

The Simple Linear Regression Model

Topic 10 - Linear Regression

Problems. Suppose both models are fitted to the same data. Show that SS Res, A SS Res, B

Categorical Predictor Variables

9. Linear Regression and Correlation

Linear regression Class 25, Jeremy Orloff and Jonathan Bloom

Review 6. n 1 = 85 n 2 = 75 x 1 = x 2 = s 1 = 38.7 s 2 = 39.2

Analysis of Longitudinal Data. Patrick J. Heagerty PhD Department of Biostatistics University of Washington

Probability Densities in Data Mining

Chapter 9 Regression. 9.1 Simple linear regression Linear models Least squares Predictions and residuals.

Simple linear regression

STA121: Applied Regression Analysis

MA 575 Linear Models: Cedric E. Ginestet, Boston University Non-parametric Inference, Polynomial Regression Week 9, Lecture 2

IES 612/STA 4-573/STA Winter 2008 Week 1--IES 612-STA STA doc

2 Statistical Estimation: Basic Concepts

Can you tell the relationship between students SAT scores and their college grades?

3.0 PROBABILITY, RANDOM VARIABLES AND RANDOM PROCESSES

Correlation and simple linear regression S5

Applied Econometrics (QEM)

Correlation. Bret Hanlon and Bret Larget. Department of Statistics University of Wisconsin Madison. December 6, Correlation 1 / 25

The Multiple Regression Model

14.1 Systems of Linear Equations in Two Variables

Scatter Plot Quadrants. Setting. Data pairs of two attributes X & Y, measured at N sampling units:

Transcription:

Inference about the Slope and Intercept Recall, we have established that the least square estimates and 0 are linear combinations of the Y i s. Further, we have showed that the are unbiased and have the following variances Var n X S XX S XX ˆ and Var ˆ 0 In order to make inference we assume that ε i s have a Normal distribution, that is ε i ~ N(0, σ ). This in turn means that the Y i s are normall distributed. ˆ ˆ ˆ ˆ Since both 0 and are linear combination of the Y i s the also have a Normal distribution. STA30/00 - week 3

Inference for β in Normal Error Regression Model The least square estimate of β is, because it is a linear combination of normall distributed random variables (Y i s) we have the following result: ˆ N ~, S XX We estimate the variance of b S /S XX where S is the MSE which has n- df. Claim: The distribution of is t with n- df. Proof: ˆ ˆ S ˆ S XX STA30/00 - week 3

Tests and CIs for β The hpothesis of interest about the slope in a Normal linear regression model is H 0 : β = 0. The test statistic for this hpothesis is b 0 b tstat S S. Eb S XX We compare the above test statistic to a t with n- df distribution to obtain the P-value. Further, 00(-α)% CI for β is: S b t n ; S b tn; S Eb. XX STA30/00 - week 3 3

Important Comment Similar results can be obtained about the intercept in a Normal linear regression model. See the book for more details. However, in man cases the intercept does not have an practical meaning and therefore it is not necessar to make inference about it. STA30/00 - week 3 4

Eample We have Data on Violent and Propert Crimes in 3 US Metropolitan Areas.The data contains the following three variables: violcrim = number of violent crimes propcrim = number of propert crimes popn = population in 000's We are interested in the relationship between the size of the cit and the number of violent crimes. STA30/00 - week 3 5

Comments Regarding the Crime Eample A regression model fit to data from all 3 cities finds a statisticall significant linear relationship between numbers of violent crimes in American cities and their populations. For each increase of,000 in population, the number of violent crimes increases b 0.093 on average. 48.6% of the variation in number of violent crimes can be eplained b its relationship with population. Because these data are observational, i.e. collected without eperimental intervention, it cannot be said that larger populations cause larger numbers of crimes, but onl that such an association appears to eist. However, this linear relationship is mostl determined b New York Cit whose population and number of violent crimes are much larger than an other cit, and thus accounts for a large fraction of the variation in the data. When New York is removed from the analsis there is no longer a statisticall significant linear relationship and the linear relationship with population eplains less than 9% of the variation in number of violent crimes. STA30/00 - week 3 6

STA30/00 - week 3 7 Bivariate Normal Distribution X and Y are jointl normall distributed if their joint densit is where - < < and - < <. Can show that the marginal distributions are: and ρ is the correlation between X and Y, i.e., ep, f, ~,, ~ N Y N X Y X Y E Y X E X E var var

Properties of Bivariate Normal Distribution It can be shown that the conditional distribution of Y given X = is: Y X ~ N, Linear combinations of X and Y are normall distributed. A zero covariance between an X and Y implies that the are statisticall independent. Note that this is not true in general for an two random variables. STA30/00 - week 3 8

Sample Correlation If X and Y are random variables, and we would like a smmetric measure of the direction and strength of the linear relationship between them we can use correlation. Based on n observed pairs ( i, i ) i =,,n, the estimate of the population correlation ρ is the Pearson s Product-Moment Correlation given b r It is the MLE of ρ. n i i i i i STA30/00 - week 3 9

Facts about r It measures the strength of the linear relationship between X and Y. It is distribution free. r is alwas a number between and. r = 0 indicates no linear association. r = or indicates that the points fall perfectl on a straight line with negative slope. r = or indicates that the points fall perfectl on a straight line with positive slope. The strength of the linear relationship increases as r moves awa from 0. STA30/00 - week 3 0

STA30/00 - week 3

Relationship between Regression and Correlation STA30/00 - week 3