Lecture 14 Simple Linear Regression
|
|
- Dwain Greer
- 6 years ago
- Views:
Transcription
1 Lecture 4 Simple Linear Regression Ordinary Least Squares (OLS) Consider the following simple linear regression model where, for each unit i, Y i is the dependent variable (response). X i is the independent variable (predictor). Y i α + βx i + ε i ε i is the error between the observed Y i and what the model predicts. This model describes the relation between X i and Y i using an intercept and a slope parameter. The error ε i Y i α βx i is also known as the residual because it measures the amount of variation in Y i not explained by the model. We saw last class that there exists ˆα and ˆβ that minimize the sum of ε 2 i. Specifically, we wish to find ˆα and ˆβ such that n ( Y i (ˆα + ˆβX ) 2 i ) i is the minimum. Using calculus, it turns out that ˆα Ȳ ˆβ X ˆβ (Xi X)(Y i Ȳ ). OLS and Transformation If we center the predictor, Xi X i X, then X i has mean zero. Therefore, ˆα Ȳ ˆβ Xi (Y i Ȳ ) X2 i. By horizontally shifting the value of X i, note that β β, but the intercept changed to the overall average of Y i Consider the linear transformation Z i a + bx i with Z a + b X. Consider the linear model Y i α + β Z i + ε i
2 Let ˆα and ˆβ be the estimates using X as the predictor. Then the intercept and slope using Z as the predictor are ˆβ (Zi Z)(Y i Ȳ ) (Zi Z) 2 (a + bxi a b X)(Y i Ȳ ) (a + b X a b X) 2 (bxi b X)(Y i Ȳ ) (b X b X) 2 b (X i X)(Y i Ȳ ) b 2 ( X X) 2 b ˆβ and ˆα Ȳ ˆβ Z Ȳ ˆβ b (a + b X) Ȳ ˆβ X a ˆβ b ˆα a b ˆβ We can get the same results by substituting X i Z i a b into the linear model. Y i α + βx i + ε i ( ) Zi a Y i α + β + ε i b Y i α aβ b + β b Z i + ε i Y i α + β Z i + ε i Properties of OLS Given the estimates ˆα and ˆβ, we can define () the estimated predicted value Ŷi and (2) the estimated residual ˆε i. Ŷ i ˆα + ˆβX i ˆε i Y i Ŷi Y i ˆα ˆβX i The least squared estimates have the following properties.. i ˆε i 0 n n ˆε i (Y i ˆα ˆβX n i ) Y i nˆα ˆβ i i i i nȳ nˆα n ˆβ X n(ȳ ˆα ˆβ X) n(ȳ (Ȳ ˆβ X) ˆβ X) 0 n X i 2
3 2. X and Ȳ is always on the fitted line. ˆα + ˆβ X (Ȳ ˆβ X) + ˆβ X Ȳ 3. ˆβ rxy s Y, where s Y and s X are the sample standard deviation of X and Y, and r XY is s X the correlation between X and Y. Note that the sample correlation is given by: r XY (Xi X)(Y i Ȳ ) (n )s X s Y. Therefore, ˆβ (Xi X)(Y i Ȳ ) n n s Y s Y Cov(X, Y ) s 2 X Cov(X, Y ) s X s Y r XY s Y s X s Y s Y s X s X s Y s X s X s X 4. ˆβ is a weighted sum of Yi. ˆβ (Xi X)(Y i Ȳ ) (Xi X)Y i (Xi X) 2 (Xi X)Y i (Xi X)Ȳ Ȳ (X i X) (Xi X)Y i 0 ( X i X ) Y i w i Y i, w i X i X 3
4 5. ˆα is a weighted sum of Y i. ˆα Ȳ ˆβ X n Y i X w i Y i v i Y i, v i n X(X i X) 6. X i ˆε i 0. Xi ˆε i (Y i Ŷi)X i (Y i ˆα ˆβX i )X i (Y i (Ȳ ˆβ X) ˆβX i )X i ((Y i Ȳ ) ˆβ(X i X))X i X i (Y i Ȳ ) ˆβ X i (X i X) ˆβ (X i X) 2 ˆβ X i (X i X) ˆβ [ X i (X i X) ˆβ [ X 2 i 2X i X + X2 X 2 i + X i X) ˆβ [ X2 X i X ˆβ X ( X Xi ) 0 Note that ˆβ (Xi X)(Y i Ȳ ) Xi (Y i Ȳ ) 7. Partition of total variation. (Y i Ȳ ) Y i Ȳ + Ŷi Ŷi (Ŷi Ȳ ) + (Y i Ŷi) (Y i Ȳ ): total deviation of the observed Y i from a null model (β 0). (Ŷ Ȳ ): deviation of the modeled value Ŷi (β 0) from a null model (β 0). (Ŷ Y i): deviation of the modeled value from the observed value. Total Variations (Sum of Squares) (Y i Ȳ )2 : total variation: (n ) V (Y ). (Ŷ Ȳ )2 : variation explained by the model 4
5 (Ŷ Y i) 2 : variation not explained by the model; residual sum of squares (SSE), ˆε 2 i Partition of Total Variation (Yi Ȳ )2 (Ŷi Ȳ )2 + (Y i Ŷi) 2 Proof: (Yi Ȳ )2 [ 2 (Ŷi Ȳ ) + (Y i Ŷi) (Ŷi Ȳ )2 + (Y i Ŷi) (Ŷi Ȳ )(Y i Ŷi) (Ŷi Ȳ )2 + (Y i Ŷi) 2 where, ( Ŷ i Ȳ )(Y i Ŷi) [ (ˆα + ˆβX i ) (ˆα + [ ˆβ(Xi X) ˆε i ˆβ X) ˆε i ˆβ X i ˆε i ˆβ X ˆε i Alternative formula for ˆε 2 i (Yi Ŷi) 2 (Y i Ȳ )2 (Ŷi Ȳ )2 (Y i Ȳ )2 (ˆα + ˆβX i Ȳ )2 (Y i Ȳ )2 (Ȳ + βx X + ˆβX i Ȳ )2 (Y i Ȳ )2 [ ˆβ(Xi X) (Y i Ȳ )2 ˆβ 2 (X i X) 2 or (Yi Ŷi) 2 [ (Y i Ȳ (Xi )2 X)(Y i Ȳ ) Variation Explained by the Model: 5
6 An easy way to calculate (Ŷi Ȳ )2 is ( Ŷ i Ȳ )2 (Y i Ȳ )2 (Y i Ŷi) 2 [ (Y i Ȳ (Xi )2 X)(Y i Ȳ ) 2 [ (Xi X)(Y i Ȳ ) 2 0. Coefficient of Determination r 2 : r 2 variation explained by the model total variation ( Ŷ i Ȳ )2 (Yi Ȳ )2 (a) r 2 is related the residual variance. (Yi Ŷi) 2 (Y i Ȳ )2 (Ŷi Ȳ )2 [ (Yi Ȳ )2 (Ŷi Ȳ )2 (Yi Ȳ )2 (Yi Ȳ )2 ( r 2 ) (Y i Ȳ )2 Diving the right-hand side by n 2 and the left-hand side by n, for large n s 2 ( r 2 )Var(Y ) (b) r 2 is the square of the sample correlation between X and Y. ( r 2 Ŷ i Ȳ )2 (Yi Ȳ )2 [ (Xi X)(Y i Ȳ ) 2 (Yi Ȳ )2 (X i X) 2 [ (Xi X)(Y i Ȳ ) (Yi Ȳ )2 2 r 2 XY Statistical Inference for OLS Estimates Parameters ˆα and ˆβ can be estimated for any given sample of data. Therefore, we also need to consider their sampling distributions because each sample of (X i, Y i ) pairs will result in different estimates of α and β. 6
7 Consider the following distribution assumption on the error, Y i α + βx i + ε i ε i iid N (0, σ 2 ). The above is now a statistical model that describes the distribution of Y i given X i. Specifically, we assume the observed Y i is error-prone but centered around the linear model for each value of X i. Y i iid N (α + βx i, σ 2 ) The error distribution results in the following assumptions.. Homoscedasticity: variance of Y i is the same for any X i. V (Y i ) V (α + βx i + ε i ) V (ε i ) σ 2 2. Linearity: the mean of Y i is linear with respect to X i E(Y i ) E(α + βx i + ε i ) α + βx i + E(ε i ) α + βx i 3. Independence: Y i are independent of each other. Cov(Y i, Y j ) Cov(α + βx i + ε i, α + βx j + ε j ) Cov(ε i, ε j ) 0 We can now investigate the bias and variance of OLS estimators ˆα and ˆβ. We assume that X i are known constants. ˆβ is unbiased. E[ ˆβ [ E wi Y i, w i X i X w i E[Y i w i (α + βx i ) α w i + β w i X i β 7
8 Because wi wi X i (Xi X) 0 Xi (X i X) Xi (X i X) X(Xi X) Xi (X i X) Xi (X i X) X (X i X) Xi (X i X) Xi (X i X) 0 Xi (X i X) ˆα is unbiased. E[ˆα E[Ȳ ˆβ X E[Ȳ XE[ ˆβ [ E Yi β n X E [α + βxi β n X n (nα + nβx i) β X α + β X β X α ˆβ σ 2 has variance n i (X i X ) 2 σ 2 (n )s 2, where σ 2 is the residual variance and s 2 x is the x sample variance of x,..., x n. where V [ ˆβ [ V wi Y i w 2 i w 2 i V [Y i σ 2 w 2 i, w i X i X [ (X i X) (Xi 2 2 X) 2 8
9 ( ) ˆα has variance σ 2 n + X2 n i (X i X ) 2 V [ˆα V [ vi Y i, v i n X(X i X) v 2 i V [Y i σ ( 2 n X(X i X) ) 2 σ 2 [ n 2 X 2 (X i X) 2 [ (X i X) X(X i X) n (X i X) 2 σ 2 [ n X2 [ (X i X) 2 2 ( ) σ 2 n + X2 n i (X i X ) 2 2 X n (X i X) 2 (Xi X) ˆα and ˆβ are negatively correlated. Cov(ˆα, ˆβ) ( Cov vi Y i, ) w i Y i v i w i Cov (Y i, Y i ) v i w i σ 2 σ ( 2 n [ ( σ 2, v i n X(X i X), w i X(X i X) ) ( Xi X ) X i X ) n (X i X) 2 X σ 2 [ (Xi X) n (X i X) 2 σ 2 [ 0 X (X i X) 2 [ (X i X) 2 2 σ 2 X [ (X i X) 2 2 X(X i X) X i X X i X Therefore, the sampling distributions for the regression parameters are: ˆα N [ ) n (α, σ 2 + X2 n i (X i X ) 2 ˆβ N ( β, σ 2 ) n i (X i X ) 2 9
10 However, in most cases, we don t know what σ 2 and will need to estimate it from the data. One unbiased estimator for σ 2 is obtained from the residuals. s 2 n 2 N (Y i Ŷi) 2 By replacing σ 2 with s 2, we have the following sampling distributions for ˆα and ˆβ. ( [ ) ˆα t n 2 α, s 2 n + X2 s n i (X i X ˆβ ) 2 t n 2 (β, 2 ) n i (X i X ) 2 i Note that our degrees of freedom is n 2 because we estimated two parameters (α and β) in order to calculate the residuals. Similarly recall that (Ŷi Y i ) 0. Knowing that ˆα and ˆβ follow a t-distribution with degrees of freedom n-2, we are able to construct confidence interval and carry out hypothesis test as before. For example, a 90% confidence interval for β is given by ˆβ ± t 5%,n 2 s 2 n i (X i X ) 2. Similarly, consider testing the hypotheses H 0 β β 0 versus H A β β 0. The corresponding t-test statistic is given by ˆβ β 0 s 2 n i (X i X ) 2 t n 2. Prediction Based on the sampling distribution of ˆα and ˆβ we can also investigate the uncertainty associated with Ŷ0 ˆα + ˆβX 0 on the fitted line for any X 0. Specifically, [ n Ŷ 0 N (α + X 0 β, σ 2 + (X 0 X) 2 ). Note that α + X 0 β is the mean of Y given X 0. The textbook uses the notation µ 0. Replacing the unknown with our estimates, we have ( [ Ŷ 0 t n 2 ˆα + X 0 ˆβ, s 2 n + (X 0 X) 2 ). The fitted value Ŷ0 ˆα + ˆβX 0 has mean α + X 0 β and variance σ 2 [ n + (X 0 X) 2. E(Ŷ0) E(ˆα + ˆβX 0 ) E(ˆα) + X 0 E( ˆβ) α + X 0 β 0
11 V (Ŷ0) V (ˆα + ˆβX 0 ) V (ˆα) + X 2 0V ( ˆβ) + X 0 Cov(ˆα, ˆβ) ( ) σ 2 n + X2 n i (X i X ) 2 + σ 2 X0 2 n i (X i X ) 2 X 0 X 2σ2 [ σ 2 n + (X 0 X) 2 We can also consider the uncertainty of a new observation Y0 at X 0. In other words, what values of Y0 do I expect to see at X 0? Note that this is different from Ŷ0 which is the fitted value that represents the mean of Y at X 0. Since Y 0 α + βx 0 + ε, ε N (0, σ 2 ), by replacing the unknown parameters with our estimates, we obtain ( [ Y0 t n 2 ˆα + ˆβX 0, s 2 + s 2 n + (X 0 X) 2 ) Again, the above distribution describes the actual observable data, instead of model parameters; interval estimates are also known as prediction intervals, instead of confidence intervals..
Scatter plot of data from the study. Linear Regression
1 2 Linear Regression Scatter plot of data from the study. Consider a study to relate birthweight to the estriol level of pregnant women. The data is below. i Weight (g / 100) i Weight (g / 100) 1 7 25
More informationScatter plot of data from the study. Linear Regression
1 2 Linear Regression Scatter plot of data from the study. Consider a study to relate birthweight to the estriol level of pregnant women. The data is below. i Weight (g / 100) i Weight (g / 100) 1 7 25
More informationCh 2: Simple Linear Regression
Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component
More informationSimple Linear Regression
Simple Linear Regression In simple linear regression we are concerned about the relationship between two variables, X and Y. There are two components to such a relationship. 1. The strength of the relationship.
More informationCorrelation Analysis
Simple Regression Correlation Analysis Correlation analysis is used to measure strength of the association (linear relationship) between two variables Correlation is only concerned with strength of the
More informationAMS 315/576 Lecture Notes. Chapter 11. Simple Linear Regression
AMS 315/576 Lecture Notes Chapter 11. Simple Linear Regression 11.1 Motivation A restaurant opening on a reservations-only basis would like to use the number of advance reservations x to predict the number
More informationSimple Linear Regression Analysis
LINEAR REGRESSION ANALYSIS MODULE II Lecture - 6 Simple Linear Regression Analysis Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Prediction of values of study
More informationSimple and Multiple Linear Regression
Sta. 113 Chapter 12 and 13 of Devore March 12, 2010 Table of contents 1 Simple Linear Regression 2 Model Simple Linear Regression A simple linear regression model is given by Y = β 0 + β 1 x + ɛ where
More informationSimple Linear Regression
Simple Linear Regression September 24, 2008 Reading HH 8, GIll 4 Simple Linear Regression p.1/20 Problem Data: Observe pairs (Y i,x i ),i = 1,...n Response or dependent variable Y Predictor or independent
More informationInference for Regression
Inference for Regression Section 9.4 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 13b - 3339 Cathy Poliak, Ph.D. cathy@math.uh.edu
More informationMeasuring the fit of the model - SSR
Measuring the fit of the model - SSR Once we ve determined our estimated regression line, we d like to know how well the model fits. How far/close are the observations to the fitted line? One way to do
More informationSimple Linear Regression
Simple Linear Regression Reading: Hoff Chapter 9 November 4, 2009 Problem Data: Observe pairs (Y i,x i ),i = 1,... n Response or dependent variable Y Predictor or independent variable X GOALS: Exploring
More informationReview: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.
1 Review: Let X 1, X,..., X n denote n independent random variables sampled from some distribution might not be normal!) with mean µ) and standard deviation σ). Then X µ σ n In other words, X is approximately
More informationSimple linear regression
Simple linear regression Biometry 755 Spring 2008 Simple linear regression p. 1/40 Overview of regression analysis Evaluate relationship between one or more independent variables (X 1,...,X k ) and a single
More informationSTAT5044: Regression and Anova. Inyoung Kim
STAT5044: Regression and Anova Inyoung Kim 2 / 47 Outline 1 Regression 2 Simple Linear regression 3 Basic concepts in regression 4 How to estimate unknown parameters 5 Properties of Least Squares Estimators:
More informationSummer School in Statistics for Astronomers V June 1 - June 6, Regression. Mosuk Chow Statistics Department Penn State University.
Summer School in Statistics for Astronomers V June 1 - June 6, 2009 Regression Mosuk Chow Statistics Department Penn State University. Adapted from notes prepared by RL Karandikar Mean and variance Recall
More informationMultivariate Regression (Chapter 10)
Multivariate Regression (Chapter 10) This week we ll cover multivariate regression and maybe a bit of canonical correlation. Today we ll mostly review univariate multivariate regression. With multivariate
More information3. Linear Regression With a Single Regressor
3. Linear Regression With a Single Regressor Econometrics: (I) Application of statistical methods in empirical research Testing economic theory with real-world data (data analysis) 56 Econometrics: (II)
More informationRegression Models - Introduction
Regression Models - Introduction In regression models, two types of variables that are studied: A dependent variable, Y, also called response variable. It is modeled as random. An independent variable,
More informationRegression Models - Introduction
Regression Models - Introduction In regression models there are two types of variables that are studied: A dependent variable, Y, also called response variable. It is modeled as random. An independent
More informationSimple Linear Regression
Simple Linear Regression ST 430/514 Recall: A regression model describes how a dependent variable (or response) Y is affected, on average, by one or more independent variables (or factors, or covariates)
More information[y i α βx i ] 2 (2) Q = i=1
Least squares fits This section has no probability in it. There are no random variables. We are given n points (x i, y i ) and want to find the equation of the line that best fits them. We take the equation
More informationCh 3: Multiple Linear Regression
Ch 3: Multiple Linear Regression 1. Multiple Linear Regression Model Multiple regression model has more than one regressor. For example, we have one response variable and two regressor variables: 1. delivery
More informationTopic 16 Interval Estimation
Topic 16 Interval Estimation Additional Topics 1 / 9 Outline Linear Regression Interpretation of the Confidence Interval 2 / 9 Linear Regression For ordinary linear regression, we have given least squares
More informationECON 3150/4150 spring term 2014: Exercise set for the first seminar and DIY exercises for the first few weeks of the course
ECON 3150/4150 spring term 2014: Exercise set for the first seminar and DIY exercises for the first few weeks of the course Ragnar Nymoen 13 January 2014. Exercise set to seminar 1 (week 6, 3-7 Feb) This
More informationMFin Econometrics I Session 4: t-distribution, Simple Linear Regression, OLS assumptions and properties of OLS estimators
MFin Econometrics I Session 4: t-distribution, Simple Linear Regression, OLS assumptions and properties of OLS estimators Thilo Klein University of Cambridge Judge Business School Session 4: Linear regression,
More informationwhere x and ȳ are the sample means of x 1,, x n
y y Animal Studies of Side Effects Simple Linear Regression Basic Ideas In simple linear regression there is an approximately linear relation between two variables say y = pressure in the pancreas x =
More informationProblems. Suppose both models are fitted to the same data. Show that SS Res, A SS Res, B
Simple Linear Regression 35 Problems 1 Consider a set of data (x i, y i ), i =1, 2,,n, and the following two regression models: y i = β 0 + β 1 x i + ε, (i =1, 2,,n), Model A y i = γ 0 + γ 1 x i + γ 2
More informationMath 3330: Solution to midterm Exam
Math 3330: Solution to midterm Exam Question 1: (14 marks) Suppose the regression model is y i = β 0 + β 1 x i + ε i, i = 1,, n, where ε i are iid Normal distribution N(0, σ 2 ). a. (2 marks) Compute the
More informationINTRODUCING LINEAR REGRESSION MODELS Response or Dependent variable y
INTRODUCING LINEAR REGRESSION MODELS Response or Dependent variable y Predictor or Independent variable x Model with error: for i = 1,..., n, y i = α + βx i + ε i ε i : independent errors (sampling, measurement,
More informationSimple Linear Regression: The Model
Simple Linear Regression: The Model task: quantifying the effect of change X in X on Y, with some constant β 1 : Y = β 1 X, linear relationship between X and Y, however, relationship subject to a random
More informationSTAT420 Midterm Exam. University of Illinois Urbana-Champaign October 19 (Friday), :00 4:15p. SOLUTIONS (Yellow)
STAT40 Midterm Exam University of Illinois Urbana-Champaign October 19 (Friday), 018 3:00 4:15p SOLUTIONS (Yellow) Question 1 (15 points) (10 points) 3 (50 points) extra ( points) Total (77 points) Points
More informationLecture 10 Multiple Linear Regression
Lecture 10 Multiple Linear Regression STAT 512 Spring 2011 Background Reading KNNL: 6.1-6.5 10-1 Topic Overview Multiple Linear Regression Model 10-2 Data for Multiple Regression Y i is the response variable
More informationLinear regression. We have that the estimated mean in linear regression is. ˆµ Y X=x = ˆβ 0 + ˆβ 1 x. The standard error of ˆµ Y X=x is.
Linear regression We have that the estimated mean in linear regression is The standard error of ˆµ Y X=x is where x = 1 n s.e.(ˆµ Y X=x ) = σ ˆµ Y X=x = ˆβ 0 + ˆβ 1 x. 1 n + (x x)2 i (x i x) 2 i x i. The
More informationBasic Business Statistics 6 th Edition
Basic Business Statistics 6 th Edition Chapter 12 Simple Linear Regression Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of a dependent variable based
More informationSimple Linear Regression for the Climate Data
Prediction Prediction Interval Temperature 0.2 0.0 0.2 0.4 0.6 0.8 320 340 360 380 CO 2 Simple Linear Regression for the Climate Data What do we do with the data? y i = Temperature of i th Year x i =CO
More informationInference in Regression Analysis
Inference in Regression Analysis Dr. Frank Wood Frank Wood, fwood@stat.columbia.edu Linear Regression Models Lecture 4, Slide 1 Today: Normal Error Regression Model Y i = β 0 + β 1 X i + ǫ i Y i value
More informationECON The Simple Regression Model
ECON 351 - The Simple Regression Model Maggie Jones 1 / 41 The Simple Regression Model Our starting point will be the simple regression model where we look at the relationship between two variables In
More informationStatistics for Engineers Lecture 9 Linear Regression
Statistics for Engineers Lecture 9 Linear Regression Chong Ma Department of Statistics University of South Carolina chongm@email.sc.edu April 17, 2017 Chong Ma (Statistics, USC) STAT 509 Spring 2017 April
More informationInferences for Regression
Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In
More informationApplied Regression. Applied Regression. Chapter 2 Simple Linear Regression. Hongcheng Li. April, 6, 2013
Applied Regression Chapter 2 Simple Linear Regression Hongcheng Li April, 6, 2013 Outline 1 Introduction of simple linear regression 2 Scatter plot 3 Simple linear regression model 4 Test of Hypothesis
More informationSimple Linear Regression. (Chs 12.1, 12.2, 12.4, 12.5)
10 Simple Linear Regression (Chs 12.1, 12.2, 12.4, 12.5) Simple Linear Regression Rating 20 40 60 80 0 5 10 15 Sugar 2 Simple Linear Regression Rating 20 40 60 80 0 5 10 15 Sugar 3 Simple Linear Regression
More informationData Analysis and Statistical Methods Statistics 651
y 1 2 3 4 5 6 7 x Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 32 Suhasini Subba Rao Previous lecture We are interested in whether a dependent
More informationIntroduction to Econometrics Midterm Examination Fall 2005 Answer Key
Introduction to Econometrics Midterm Examination Fall 2005 Answer Key Please answer all of the questions and show your work Clearly indicate your final answer to each question If you think a question is
More informationECON 450 Development Economics
ECON 450 Development Economics Statistics Background University of Illinois at Urbana-Champaign Summer 2017 Outline 1 Introduction 2 3 4 5 Introduction Regression analysis is one of the most important
More informationTwo-Variable Regression Model: The Problem of Estimation
Two-Variable Regression Model: The Problem of Estimation Introducing the Ordinary Least Squares Estimator Jamie Monogan University of Georgia Intermediate Political Methodology Jamie Monogan (UGA) Two-Variable
More informationSTAT2012 Statistical Tests 23 Regression analysis: method of least squares
23 Regression analysis: method of least squares L23 Regression analysis The main purpose of regression is to explore the dependence of one variable (Y ) on another variable (X). 23.1 Introduction (P.532-555)
More informationSimple Linear Regression. Material from Devore s book (Ed 8), and Cengagebrain.com
12 Simple Linear Regression Material from Devore s book (Ed 8), and Cengagebrain.com The Simple Linear Regression Model The simplest deterministic mathematical relationship between two variables x and
More informationLecture 3: Multiple Regression
Lecture 3: Multiple Regression R.G. Pierse 1 The General Linear Model Suppose that we have k explanatory variables Y i = β 1 + β X i + β 3 X 3i + + β k X ki + u i, i = 1,, n (1.1) or Y i = β j X ji + u
More informationApplied Regression Analysis
Applied Regression Analysis Chapter 3 Multiple Linear Regression Hongcheng Li April, 6, 2013 Recall simple linear regression 1 Recall simple linear regression 2 Parameter Estimation 3 Interpretations of
More informationMultiple Linear Regression
Multiple Linear Regression Simple linear regression tries to fit a simple line between two variables Y and X. If X is linearly related to Y this explains some of the variability in Y. In most cases, there
More informationCategorical Predictor Variables
Categorical Predictor Variables We often wish to use categorical (or qualitative) variables as covariates in a regression model. For binary variables (taking on only 2 values, e.g. sex), it is relatively
More informationLECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit
LECTURE 6 Introduction to Econometrics Hypothesis testing & Goodness of fit October 25, 2016 1 / 23 ON TODAY S LECTURE We will explain how multiple hypotheses are tested in a regression model We will define
More informationSimple Linear Regression
Simple Linear Regression ST 370 Regression models are used to study the relationship of a response variable and one or more predictors. The response is also called the dependent variable, and the predictors
More informationRegression and correlation. Correlation & Regression, I. Regression & correlation. Regression vs. correlation. Involve bivariate, paired data, X & Y
Regression and correlation Correlation & Regression, I 9.07 4/1/004 Involve bivariate, paired data, X & Y Height & weight measured for the same individual IQ & exam scores for each individual Height of
More informationInference for the Regression Coefficient
Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression line. We can shows that b 0 and b 1 are the unbiased estimates
More information1. Simple Linear Regression
1. Simple Linear Regression Suppose that we are interested in the average height of male undergrads at UF. We put each male student s name (population) in a hat and randomly select 100 (sample). Then their
More informationLecture 5: Clustering, Linear Regression
Lecture 5: Clustering, Linear Regression Reading: Chapter 10, Sections 3.1-3.2 STATS 202: Data mining and analysis October 4, 2017 1 / 22 .0.0 5 5 1.0 7 5 X2 X2 7 1.5 1.0 0.5 3 1 2 Hierarchical clustering
More informationChapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression
Chapter 14 Student Lecture Notes 14-1 Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Multiple Regression QMIS 0 Dr. Mohammad Zainal Chapter Goals After completing
More informationRegression Analysis Chapter 2 Simple Linear Regression
Regression Analysis Chapter 2 Simple Linear Regression Dr. Bisher Mamoun Iqelan biqelan@iugaza.edu.ps Department of Mathematics The Islamic University of Gaza 2010-2011, Semester 2 Dr. Bisher M. Iqelan
More informationLinear Regression. In this problem sheet, we consider the problem of linear regression with p predictors and one intercept,
Linear Regression In this problem sheet, we consider the problem of linear regression with p predictors and one intercept, y = Xβ + ɛ, where y t = (y 1,..., y n ) is the column vector of target values,
More informationCorrelation and Regression
Correlation and Regression October 25, 2017 STAT 151 Class 9 Slide 1 Outline of Topics 1 Associations 2 Scatter plot 3 Correlation 4 Regression 5 Testing and estimation 6 Goodness-of-fit STAT 151 Class
More informationChapter 14 Simple Linear Regression (A)
Chapter 14 Simple Linear Regression (A) 1. Characteristics Managerial decisions often are based on the relationship between two or more variables. can be used to develop an equation showing how the variables
More informationLecture 6 Multiple Linear Regression, cont.
Lecture 6 Multiple Linear Regression, cont. BIOST 515 January 22, 2004 BIOST 515, Lecture 6 Testing general linear hypotheses Suppose we are interested in testing linear combinations of the regression
More informationEstimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X.
Estimating σ 2 We can do simple prediction of Y and estimation of the mean of Y at any value of X. To perform inferences about our regression line, we must estimate σ 2, the variance of the error term.
More information13 Simple Linear Regression
B.Sc./Cert./M.Sc. Qualif. - Statistics: Theory and Practice 3 Simple Linear Regression 3. An industrial example A study was undertaken to determine the effect of stirring rate on the amount of impurity
More informationSTAT 3A03 Applied Regression With SAS Fall 2017
STAT 3A03 Applied Regression With SAS Fall 2017 Assignment 2 Solution Set Q. 1 I will add subscripts relating to the question part to the parameters and their estimates as well as the errors and residuals.
More informationLinear Models and Estimation by Least Squares
Linear Models and Estimation by Least Squares Jin-Lung Lin 1 Introduction Causal relation investigation lies in the heart of economics. Effect (Dependent variable) cause (Independent variable) Example:
More informationChapter 10. Simple Linear Regression and Correlation
Chapter 10. Simple Linear Regression and Correlation In the two sample problems discussed in Ch. 9, we were interested in comparing values of parameters for two distributions. Regression analysis is the
More informationLecture 2 Simple Linear Regression STAT 512 Spring 2011 Background Reading KNNL: Chapter 1
Lecture Simple Linear Regression STAT 51 Spring 011 Background Reading KNNL: Chapter 1-1 Topic Overview This topic we will cover: Regression Terminology Simple Linear Regression with a single predictor
More informationLinear Regression & Correlation
Linear Regression & Correlation Jamie Monogan University of Georgia Introduction to Data Analysis Jamie Monogan (UGA) Linear Regression & Correlation POLS 7012 1 / 25 Objectives By the end of these meetings,
More informationSimple Linear Regression
Simple Linear Regression Christopher Ting Christopher Ting : christophert@smu.edu.sg : 688 0364 : LKCSB 5036 January 7, 017 Web Site: http://www.mysmu.edu/faculty/christophert/ Christopher Ting QF 30 Week
More informationLinear Regression Model. Badr Missaoui
Linear Regression Model Badr Missaoui Introduction What is this course about? It is a course on applied statistics. It comprises 2 hours lectures each week and 1 hour lab sessions/tutorials. We will focus
More informationStat 135, Fall 2006 A. Adhikari HOMEWORK 10 SOLUTIONS
Stat 135, Fall 2006 A. Adhikari HOMEWORK 10 SOLUTIONS 1a) The model is cw i = β 0 + β 1 el i + ɛ i, where cw i is the weight of the ith chick, el i the length of the egg from which it hatched, and ɛ i
More informationLecture 5: Clustering, Linear Regression
Lecture 5: Clustering, Linear Regression Reading: Chapter 10, Sections 3.1-3.2 STATS 202: Data mining and analysis October 4, 2017 1 / 22 Hierarchical clustering Most algorithms for hierarchical clustering
More informationFitting a regression model
Fitting a regression model We wish to fit a simple linear regression model: y = β 0 + β 1 x + ɛ. Fitting a model means obtaining estimators for the unknown population parameters β 0 and β 1 (and also for
More informationECO220Y Simple Regression: Testing the Slope
ECO220Y Simple Regression: Testing the Slope Readings: Chapter 18 (Sections 18.3-18.5) Winter 2012 Lecture 19 (Winter 2012) Simple Regression Lecture 19 1 / 32 Simple Regression Model y i = β 0 + β 1 x
More informationChapter 1 Linear Regression with One Predictor
STAT 525 FALL 2018 Chapter 1 Linear Regression with One Predictor Professor Min Zhang Goals of Regression Analysis Serve three purposes Describes an association between X and Y In some applications, the
More informationQuantitative Analysis of Financial Markets. Summary of Part II. Key Concepts & Formulas. Christopher Ting. November 11, 2017
Summary of Part II Key Concepts & Formulas Christopher Ting November 11, 2017 christopherting@smu.edu.sg http://www.mysmu.edu/faculty/christophert/ Christopher Ting 1 of 16 Why Regression Analysis? Understand
More informationAn Introduction to Bayesian Linear Regression
An Introduction to Bayesian Linear Regression APPM 5720: Bayesian Computation Fall 2018 A SIMPLE LINEAR MODEL Suppose that we observe explanatory variables x 1, x 2,..., x n and dependent variables y 1,
More informationThe Slow Convergence of OLS Estimators of α, β and Portfolio. β and Portfolio Weights under Long Memory Stochastic Volatility
The Slow Convergence of OLS Estimators of α, β and Portfolio Weights under Long Memory Stochastic Volatility New York University Stern School of Business June 21, 2018 Introduction Bivariate long memory
More informationMATH11400 Statistics Homepage
MATH11400 Statistics 1 2010 11 Homepage http://www.stats.bris.ac.uk/%7emapjg/teach/stats1/ 4. Linear Regression 4.1 Introduction So far our data have consisted of observations on a single variable of interest.
More informationCorrelation 1. December 4, HMS, 2017, v1.1
Correlation 1 December 4, 2017 1 HMS, 2017, v1.1 Chapter References Diez: Chapter 7 Navidi, Chapter 7 I don t expect you to learn the proofs what will follow. Chapter References 2 Correlation The sample
More informationLecture 19 Multiple (Linear) Regression
Lecture 19 Multiple (Linear) Regression Thais Paiva STA 111 - Summer 2013 Term II August 1, 2013 1 / 30 Thais Paiva STA 111 - Summer 2013 Term II Lecture 19, 08/01/2013 Lecture Plan 1 Multiple regression
More informationLinear models and their mathematical foundations: Simple linear regression
Linear models and their mathematical foundations: Simple linear regression Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term 2018/19 1/21 Introduction
More informationSTAT Chapter 11: Regression
STAT 515 -- Chapter 11: Regression Mostly we have studied the behavior of a single random variable. Often, however, we gather data on two random variables. We wish to determine: Is there a relationship
More informationLectures on Simple Linear Regression Stat 431, Summer 2012
Lectures on Simple Linear Regression Stat 43, Summer 0 Hyunseung Kang July 6-8, 0 Last Updated: July 8, 0 :59PM Introduction Previously, we have been investigating various properties of the population
More informationTopic 12 Overview of Estimation
Topic 12 Overview of Estimation Classical Statistics 1 / 9 Outline Introduction Parameter Estimation Classical Statistics Densities and Likelihoods 2 / 9 Introduction In the simplest possible terms, the
More informationTutorial 6: Linear Regression
Tutorial 6: Linear Regression Rob Nicholls nicholls@mrc-lmb.cam.ac.uk MRC LMB Statistics Course 2014 Contents 1 Introduction to Simple Linear Regression................ 1 2 Parameter Estimation and Model
More informationSample Problems. Note: If you find the following statements true, you should briefly prove them. If you find them false, you should correct them.
Sample Problems 1. True or False Note: If you find the following statements true, you should briefly prove them. If you find them false, you should correct them. (a) The sample average of estimated residuals
More information11 Hypothesis Testing
28 11 Hypothesis Testing 111 Introduction Suppose we want to test the hypothesis: H : A q p β p 1 q 1 In terms of the rows of A this can be written as a 1 a q β, ie a i β for each row of A (here a i denotes
More informationEC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)
1 EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) Taisuke Otsu London School of Economics Summer 2018 A.1. Summation operator (Wooldridge, App. A.1) 2 3 Summation operator For
More informationLinear Regression. In this lecture we will study a particular type of regression model: the linear regression model
1 Linear Regression 2 Linear Regression In this lecture we will study a particular type of regression model: the linear regression model We will first consider the case of the model with one predictor
More informationTMA4255 Applied Statistics V2016 (5)
TMA4255 Applied Statistics V2016 (5) Part 2: Regression Simple linear regression [11.1-11.4] Sum of squares [11.5] Anna Marie Holand To be lectured: January 26, 2016 wiki.math.ntnu.no/tma4255/2016v/start
More informationStatistics for Managers using Microsoft Excel 6 th Edition
Statistics for Managers using Microsoft Excel 6 th Edition Chapter 13 Simple Linear Regression 13-1 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of
More informationProbability and Statistics Notes
Probability and Statistics Notes Chapter Seven Jesse Crawford Department of Mathematics Tarleton State University Spring 2011 (Tarleton State University) Chapter Seven Notes Spring 2011 1 / 42 Outline
More informationLecture 5: Clustering, Linear Regression
Lecture 5: Clustering, Linear Regression Reading: Chapter 10, Sections 3.1-2 STATS 202: Data mining and analysis Sergio Bacallado September 19, 2018 1 / 23 Announcements Starting next week, Julia Fukuyama
More informationReview of Statistics
Review of Statistics Topics Descriptive Statistics Mean, Variance Probability Union event, joint event Random Variables Discrete and Continuous Distributions, Moments Two Random Variables Covariance and
More informationSTAT 511. Lecture : Simple linear regression Devore: Section Prof. Michael Levine. December 3, Levine STAT 511
STAT 511 Lecture : Simple linear regression Devore: Section 12.1-12.4 Prof. Michael Levine December 3, 2018 A simple linear regression investigates the relationship between the two variables that is not
More informationLecture 13. Simple Linear Regression
1 / 27 Lecture 13 Simple Linear Regression October 28, 2010 2 / 27 Lesson Plan 1. Ordinary Least Squares 2. Interpretation 3 / 27 Motivation Suppose we want to approximate the value of Y with a linear
More information