Lesson 11: Simple Linear Regression

ECON1003, Ka-fu WONG, December 2, 2004

In previous lessons, we mainly covered the estimation of the population mean (or expected value) and inference about it. Sometimes we are interested in allowing the expected value to vary with some variables. For instance, in the discussion of mean income, we may want to know how income is related to the level of education. Knowing how income is related to the level of education allows us to better predict a person's income given his or her educational background. In addition, Human Capital Theory in economics tells us there should be a positive relation between income and education, which is viewed as human capital. [1]

Recall that if we have several random variables described by a multivariate distribution, we can talk about conditional expectations. Recall the definition of conditional expectation for the discrete case with two random variables X and Y.

Definition 1 (Conditional Expectation): For two discrete random variables that are jointly distributed with a bivariate probability distribution, the conditional expectation or conditional mean $E(X \mid Y = y_j)$ is computed by the formula:

$$E(X \mid Y = y_j) = \sum_x x\, P_{X|Y}(x \mid y_j) = x_1 P_{X|Y}(x_1 \mid y_j) + x_2 P_{X|Y}(x_2 \mid y_j) + \dots + x_N P_{X|Y}(x_N \mid y_j)$$

Sometimes, we write $\mu_{X|Y=y_j} = E(X \mid Y = y_j)$.

The unconditional expectation or mean of X is related to the conditional mean:

$$E(X) = \sum_y E(X \mid Y = y)\, P_Y(y) = E[E(X \mid Y)]$$

[1] See Becker, Gary (1964): Human Capital, 1st edition (NBER).

For continuous random variables, the conditional expectation or conditional mean $E(X \mid Y = y)$ is computed by the formula:

$$E(X \mid Y = y) = \int_x x\, f(x \mid y)\, dx$$

In the human capital example, Y will be income and X will be years of schooling.

When our interest is the expected value (also called the population mean) of a random variable, we use the sample average as an estimator. When our interest is the conditional expected value (also called the conditional population mean), we can use the conditional sample average as an estimator.

Example 1 (conditional sample average I): Suppose we have the following sample of 10 observations with two variables, monthly earnings (Y, in dollars) and gender (X, male=1, female=2).

[Table: Obs #, X, Y for the 10 observations; the values did not survive in this transcription.]

The sample average income conditional on male is $(y_1 + y_2 + \dots + y_6)/6$. The sample average income conditional on female is $(y_7 + y_8 + \dots + y_{10})/4$. Based on these sample averages, we take the first as our estimate of what a typical male earns per month and the second as our estimate of what a typical female earns.

Gender (X) | Conditional mean E(Y | X = x) | Conditional sample mean Ê(Y | X = x)
1          | E(Y | X = 1)                  | $(y_1 + y_2 + \dots + y_6)/6$
2          | E(Y | X = 2)                  | $(y_7 + y_8 + \dots + y_{10})/4$

The general formula to compute the conditional sample average is

$$\hat{E}(Y \mid X = x) = \frac{\sum_{i:\, x_i = x} y_i}{\#(X = x)}$$

where $\#(X = x)$ is the number of observations with $x_i = x$.
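The general formula can be sketched in a few lines of Python. This is a minimal illustration rather than part of the original lesson: the function name `conditional_sample_means` and the five data points are hypothetical, chosen only to show the grouping and averaging step.

```python
from collections import defaultdict


def conditional_sample_means(x, y):
    """Return {value of X: average of the y_i whose x_i equals that value}."""
    totals = defaultdict(float)  # running sum of y for each value of x
    counts = defaultdict(int)    # number of observations for each value of x
    for xi, yi in zip(x, y):
        totals[xi] += yi
        counts[xi] += 1
    return {value: totals[value] / counts[value] for value in totals}


# Hypothetical data (NOT the lesson's sample): gender codes and monthly earnings.
gender = [1, 1, 1, 2, 2]
earnings = [1000.0, 1200.0, 1400.0, 900.0, 1100.0]

means = conditional_sample_means(gender, earnings)
print(means)
```

A value of X that never appears in the data simply has no entry in the result, which mirrors the fact that a conditional sample average cannot be computed without observations.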

Example 2 (conditional sample average II): Suppose we have the following sample of 40 observations with two variables, monthly earnings (Y, in dollars) and years of schooling (X, from 0 to 20).

[Table: Obs #, X, Y for the 40 observations; the values did not survive in this transcription.]

The sample average income conditional on 0 years of schooling is $(y_1 + y_2)/2$. Similarly, we can compute the sample average income conditional on different years of schooling.

[Table: X, Ê(Y | X = x), and number of observations for each value of X; the values did not survive in this transcription.]

We make several observations about the above example.

1. Some of the conditional sample averages are based on only one observation. Using such an average as an estimate of the conditional expectation is very imprecise.
2. Some of the conditional sample averages are missing because no data are available. For instance, we have no observation with X = 7.

What can we do to improve our estimation of the conditional mean? It turns out that the estimation may be improved if we are willing to assume some relationship between X and Y. A linear relationship is commonly assumed between two variables:

$$E(Y \mid X) = \beta_0 + \beta_1 X \qquad (1)$$

In Example 2, we used 40 observations to produce 20 conditional means. On average, we have two observations to estimate each conditional mean, and we could not estimate the population mean conditional on X = 7 because there is no observation with X = 7. If we are willing to assume a linear relationship between $E(Y \mid X)$ and X as in equation (1), we only need to estimate two parameters, $\beta_0$ and $\beta_1$. On average, we will be using 20 observations to estimate each parameter. Once we have the estimates of $\beta_0$ and $\beta_1$, we can produce the conditional mean of Y for each X we are interested in. In addition, we are able to estimate the population mean conditional on X = 7 even though we have no observation with X = 7. [2]

Simulation 1 (Linear expectation): We simulate 10 observations for each X with different variances of the error term.

$$E(Y \mid X) = 3 + 2X, \qquad \epsilon \sim N(0, \sigma^2), \qquad Y = E(Y \mid X) + \epsilon$$

X is assumed to take the discrete values 1, 2, ..., 9. The observations so generated are plotted below. Note that the expected values lie on the straight lines.

[2] Following this logic, the estimation of the conditional mean will be improved if we are willing to assume any relationship between X and Y such that the number of parameters is greatly reduced. For instance, we may assume $E(Y \mid X) = \beta_0 + \beta_1 X + \beta_2 X^2$. However, note that if the true conditional expectation is not related to X as assumed, using the assumed relationship to estimate $E(Y \mid X)$ will give wrong answers. Thus, the choice of the functional form of $E(Y \mid X)$ is extremely important. That is why we often check the linearity assumption with a scatter plot of Y against X. When $E(Y \mid X)$ is assumed to have a specific functional form with a set of parameters, the regression is called parametric. When $E(Y \mid X)$ is not assumed to have any specific functional form (and hence has no assumed parameters), the regression is called nonparametric.
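Simulation 1 can be reproduced along the following lines. This is a sketch under stated assumptions: the function name `simulate_linear`, the seed, and the use of Python's standard `random` module are illustrative choices, not taken from the lesson. Only the mean function 3 + 2X, the normal errors, the values X = 1, ..., 9 and the 10 draws per X come from the text.

```python
import random


def simulate_linear(sigma, n_per_x=10, seed=0):
    """Draw n_per_x observations of Y = 3 + 2X + eps for each X in 1..9.

    eps is normal with mean 0 and standard deviation sigma.
    The seed is a hypothetical choice that makes the draws repeatable.
    """
    rng = random.Random(seed)
    data = []
    for x in range(1, 10):
        conditional_mean = 3 + 2 * x  # E(Y | X = x)
        for _ in range(n_per_x):
            data.append((x, conditional_mean + rng.gauss(0.0, sigma)))
    return data


# sigma = 2 corresponds to an error variance of sigma^2 = 4.
sample = simulate_linear(sigma=2.0)
print(len(sample), sample[:3])
```

Setting `sigma=0.0` removes the noise entirely, so every simulated point sits exactly on the line 3 + 2X; larger values of `sigma` spread the points around that line.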

[Figure 1: Distribution of data from a linear regression model. Panel (a): $\sigma^2$ value not preserved; panel (b): $\sigma^2 = 4$.]

Simulation 2 (Non-linear expectation): We simulate 10 observations for each X with different variances of the error term. The source prints the same mean function as in Simulation 1,

$$E(Y \mid X) = 3 + 2X, \qquad \epsilon \sim N(0, \sigma^2), \qquad Y = E(Y \mid X) + \epsilon$$

although the heading and Figure 2 indicate that the conditional mean here is a non-linear function of X. X is assumed to take the discrete values 1, 2, ..., 9. The observations so generated are plotted below. Note that the expected values lie on the curve.

[Figure 2: Distribution of data from a non-linear regression model. Panel (a): $\sigma^2$ value not preserved; panel (b): $\sigma^2 = 100$.]

Example 3 (Which datasets are from a linear expectation model): Guess which datasets are likely to be from a linear expectation model.

[Figure 3: Datasets from four different models. Panels: (a) Dataset #1, (b) Dataset #2, (c) Dataset #3, (d) Dataset #4.]

It turns out that the datasets are drawn from the simulations reported above. Dataset #3 is unlikely to be from a linear model. However, one could easily conclude that Dataset #4 is from a linear model, because the nonlinearity is mild relative to the dispersion of the data.

Given that we believe that the underlying model is linear, how do we estimate $\beta_0$ and $\beta_1$?

1 Estimation of the simple linear model

There are at least two approaches to estimating the linear model:

1. The method of moments
2. Ordinary least squares

It turns out that the two approaches yield the same estimator for the parameters $\beta_0$ and $\beta_1$.

1.1 The method of moments

Suppose we have n observations of (X, Y) pairs. We can imagine that the observations of Y are random draws from a normal distribution with mean $E(Y \mid X)$ and some variance $\sigma^2$, i.e.,

$$Y \sim N(E(Y \mid X), \sigma^2)$$

Let $\epsilon = Y - E(Y \mid X)$. We have

$$\epsilon \sim N(0, \sigma^2)$$

Thus, a random draw of Y is like a random draw of $\epsilon$ plus $E(Y \mid X)$, and the assumed linear model (1) means

$$Y = \beta_0 + \beta_1 X + \epsilon \qquad (2)$$

Note that $\epsilon$ has zero mean, i.e., $E(\epsilon) = 0$. Thus, one can use the condition $E(\epsilon) = 0$ as one criterion to estimate the parameters. The problem is that one equation can be used to solve for one coefficient only (either $\beta_0$ or $\beta_1$). To solve for (get estimates of) two coefficients, we need another condition. One possibility is to assume that $\epsilon$ is drawn independently of X, so that $E(\epsilon \mid X) = 0$. $E(\epsilon \mid X) = 0$ implies $E(X\epsilon \mid X) = 0$ and hence $E(X\epsilon) = 0$. [3] Thus, we have two conditions, $E(\epsilon) = 0$ and $E(X\epsilon) = 0$:

$$E(\epsilon) = E(Y - \beta_0 - \beta_1 X) = E(Y) - \beta_0 - \beta_1 E(X) = 0$$

$$E(\epsilon X) = E[(Y - \beta_0 - \beta_1 X)X] = E(YX) - \beta_0 E(X) - \beta_1 E(X^2) = 0$$

Two equations are just enough to solve for the two coefficients $\beta_0$ and $\beta_1$.

If we have a data sample of n observations, how do we estimate $\beta_0$ and $\beta_1$? Note that $E(\cdot)$ is really the population average. In our estimation, we do not observe $\epsilon$, $\beta_0$ and $\beta_1$. What we have are only the observations $(x_i, y_i)$, $i = 1, \dots, n$. We can use the sample analog (i.e., replace the population average with the sample average) to estimate the parameters. That is, we define $e_i = y_i - b_0 - b_1 x_i$, compute the corresponding sample averages and set them equal to zero. Here $b_0$, $b_1$ and $e_i$ are the sample analogs of $\beta_0$, $\beta_1$ and $\epsilon$ in the model. Our objective is to find $b_0$ and $b_1$, and hence $e_i$.

$$\hat{E}(e) = \frac{1}{n}\sum_{i=1}^{n} e_i = 0, \qquad \hat{E}(Xe) = \frac{1}{n}\sum_{i=1}^{n} x_i e_i = 0$$

Let's verify this method with something we are familiar with: the estimation of the population mean.

Example 4 (Estimation of population mean): Suppose $Y \sim N(\beta_0, \sigma^2)$. Thus, $\beta_0$ is the population mean of Y. We have n observations $y_i$, $i = 1, 2, \dots, n$. We want to estimate the population mean of Y. Fitting this into the linear model framework, we write

$$Y = \beta_0 + \epsilon \qquad (3)$$

Thus, we have only one parameter to estimate, i.e., $\beta_0$. Let $b_0$ be an estimate of $\beta_0$. First, we write $e_i = y_i - b_0$. Second, we compute the sample average of $e_i$ and set it to zero:

$$\frac{1}{n}\sum_{i=1}^{n} e_i = 0 \;\Rightarrow\; \frac{1}{n}\sum_{i=1}^{n} (y_i - b_0) = 0 \;\Rightarrow\; \frac{1}{n}\sum_{i=1}^{n} y_i - b_0 = 0 \;\Rightarrow\; b_0 = \frac{1}{n}\sum_{i=1}^{n} y_i$$

Thus, the method yields the sample average as an estimator of $\beta_0$.

Example 5 (Estimation of the linear model): Suppose $Y \sim N(\beta_0 + \beta_1 X, \sigma^2)$. Thus, $\beta_0 + \beta_1 X$ is the population mean of Y conditional on X. We have n observations $(x_i, y_i)$, $i = 1, \dots, n$. We want to estimate the linear relationship of the conditional population mean. This is exactly the linear model framework of equation (1). Thus, we have two parameters to estimate, i.e., $\beta_0$ and $\beta_1$. Let $b_0$ and $b_1$ be estimates of $\beta_0$ and $\beta_1$. First, we write $e_i = y_i - b_0 - b_1 x_i$. Second, we compute the sample averages of $e_i$ and $x_i e_i$ and set them to zero:

$$\frac{1}{n}\sum_{i=1}^{n} e_i = 0 \;\Rightarrow\; \frac{1}{n}\sum_{i=1}^{n} (y_i - b_0 - b_1 x_i) = 0 \;\Rightarrow\; \sum_{i=1}^{n} (y_i - b_0 - b_1 x_i) = 0 \qquad (4)$$

$$\frac{1}{n}\sum_{i=1}^{n} e_i x_i = 0 \;\Rightarrow\; \frac{1}{n}\sum_{i=1}^{n} (y_i - b_0 - b_1 x_i)x_i = 0 \;\Rightarrow\; \sum_{i=1}^{n} (y_i - b_0 - b_1 x_i)x_i = 0 \qquad (5)$$

Thus, the two equations (4) and (5) may be used to solve for the two unknowns $b_0$ and $b_1$.

This approach is called the method of moments because the estimation is based on matching the sample moments (sample averages) with the population moments ($E(\cdot)$).

[3] Note that, given $E(\epsilon) = 0$, $E(X\epsilon) = 0$ implies $Cov(X, \epsilon) = 0$.

1.2 The method of ordinary least squares

Another view is to find the line that best fits the data. In the linear model (2), we would like to choose $b_0$ and $b_1$ so that the error e is minimized:

$$e = Y - b_0 - b_1 X$$

When we have n observations of $(x_i, y_i)$ (and hence $e_i$), we will naturally have some positive $e_i$ and some negative $e_i$. An operational procedure to minimize e is to choose $b_0$ and $b_1$ such that the sum of squared errors is minimized:

$$S(b_0, b_1) = \sum_{i=1}^{n} e_i^2 = \sum_{i=1}^{n} (y_i - b_0 - b_1 x_i)^2$$

Minimizing $S(b_0, b_1)$ with respect to $b_0$ and $b_1$ yields the following two first-order conditions:

$$\frac{\partial S(b_0, b_1)}{\partial b_0} = \sum_{i=1}^{n} 2(y_i - b_0 - b_1 x_i)(-1) = 0 \;\Rightarrow\; \sum_{i=1}^{n} (y_i - b_0 - b_1 x_i) = 0 \qquad (6)$$

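$$\frac{\partial S(b_0, b_1)}{\partial b_1} = \sum_{i=1}^{n} 2(y_i - b_0 - b_1 x_i)(-x_i) = 0 \;\Rightarrow\; \sum_{i=1}^{n} (y_i - b_0 - b_1 x_i)x_i = 0 \qquad (7)$$

Note that these two conditions (6) and (7) are the same as the two conditions (4) and (5) obtained with the method of moments.

2 The convenience of matrix notation

The use of matrices greatly simplifies our analysis. Our model $Y = \beta_0 + \beta_1 X + \epsilon$ may be rewritten in matrix notation:

$$Y = \begin{pmatrix} 1 & X \end{pmatrix} \begin{pmatrix} \beta_0 \\ \beta_1 \end{pmatrix} + \epsilon = Z\beta + \epsilon$$

Premultiplying by $Z'$, we have

$$Z'Y = Z'Z\beta + Z'\epsilon$$

The condition used to estimate $\beta$ is $E(Z'\epsilon) = 0$, which stacks the two moment conditions $E(\epsilon) = 0$ and $E(X\epsilon) = 0$. Hence

$$E(Z'Y) = E(Z'Z)\beta + E(Z'\epsilon)$$

$$E(Z'Y) = E(Z'Z)\beta$$

$$\beta = [E(Z'Z)]^{-1} E(Z'Y)$$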
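As a scalar counterpart to the matrix expression, conditions (6) and (7) can also be solved by hand, since they are two linear equations in $b_0$ and $b_1$. The lesson does not spell out this step; the derivation below is the standard one, with $\bar{x}$ and $\bar{y}$ denoting the sample means of $x_i$ and $y_i$, and it assumes the $x_i$ are not all equal.

```latex
% From (6): divide by n and solve for b_0.
\sum_{i=1}^{n} (y_i - b_0 - b_1 x_i) = 0
  \;\Rightarrow\; b_0 = \bar{y} - b_1 \bar{x}

% Substitute b_0 into (7).
\sum_{i=1}^{n} \bigl[(y_i - \bar{y}) - b_1 (x_i - \bar{x})\bigr] x_i = 0
  \;\Rightarrow\;
  b_1 = \frac{\sum_{i=1}^{n} (y_i - \bar{y})\, x_i}{\sum_{i=1}^{n} (x_i - \bar{x})\, x_i}

% Because \sum (y_i - \bar{y}) \bar{x} = 0 and \sum (x_i - \bar{x}) \bar{x} = 0,
% the same ratio can be written in centered form.
b_1 = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}{\sum_{i=1}^{n} (x_i - \bar{x})^2},
\qquad
b_0 = \bar{y} - b_1 \bar{x}
```

With no explanatory variable the second condition drops out and $b_0 = \bar{y}$, which is the result of Example 4.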

Suppose we have a sample of n observations $(y_i, x_i)$, $i = 1, \dots, n$. We have

$$\begin{pmatrix} y_1 \\ y_2 \\ \vdots \\ y_n \end{pmatrix} = \begin{pmatrix} 1 & x_1 \\ 1 & x_2 \\ \vdots & \vdots \\ 1 & x_n \end{pmatrix} \begin{pmatrix} b_0 \\ b_1 \end{pmatrix} + \begin{pmatrix} e_1 \\ e_2 \\ \vdots \\ e_n \end{pmatrix}$$

or in compact form

$$Y = Zb + e$$

Premultiplying by $Z'$, we have

$$Z'Y = Z'Zb + Z'e$$

$$Z'Y = Z'Zb$$

$$b = (Z'Z)^{-1} Z'Y \qquad (8)$$

where $Z'e = 0$ because $Z'e$ is the sample analog of $E(Z'\epsilon)$, which is assumed to equal zero in the model.

3 Properties of the OLS estimator

For convenience, we use matrix notation to discuss the properties of the OLS estimator.

3.1 Unbiasedness

Recall the definition of unbiasedness.

Definition 2 (Unbiasedness): An estimator $\theta = \theta(x_1, x_2, \dots, x_n)$ for a population parameter $\beta$ is called unbiased if

$$E(\theta) = \beta$$
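Before turning to the properties of b, equation (8) can be made concrete for the two-parameter case, where $Z'Z$ is a 2x2 matrix that can be inverted by hand. The sketch below is illustrative only: the function name `ols_matrix` and the four data points are hypothetical, and plain Python is used instead of a matrix library to keep every step visible.

```python
def ols_matrix(x, y):
    """Compute b = (Z'Z)^{-1} Z'Y for Z = [1, x] with explicit 2x2 algebra."""
    n = len(x)
    sx = sum(x)
    sxx = sum(xi * xi for xi in x)
    sy = sum(y)
    sxy = sum(xi * yi for xi, yi in zip(x, y))
    # Z'Z = [[n, sx], [sx, sxx]] and Z'Y = [sy, sxy].
    det = n * sxx - sx * sx
    if det == 0:
        raise ValueError("Z'Z is singular: x needs at least two distinct values")
    # The inverse of [[a, b], [c, d]] is [[d, -b], [-c, a]] / (ad - bc).
    b0 = (sxx * sy - sx * sxy) / det
    b1 = (n * sxy - sx * sy) / det
    return b0, b1


# Hypothetical data, not from the lesson.
x = [1.0, 2.0, 3.0, 4.0]
y = [5.1, 6.9, 9.2, 10.8]

b0, b1 = ols_matrix(x, y)
residuals = [yi - b0 - b1 * xi for xi, yi in zip(x, y)]

# Z'e = 0: the residuals sum to zero and are orthogonal to x.
print(b0, b1, sum(residuals), sum(e * xi for e, xi in zip(residuals, x)))
```

The last line checks the sample moment conditions: up to floating-point rounding, both the sum of the residuals and the sum of $x_i e_i$ should be zero.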

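Applying Definition 2, b is an unbiased estimator of $\beta$ if $E(b) = \beta$. In the following discussion, it is convenient to assume that the $x_i$ are fixed and known.

$$\begin{aligned}
E(b) &= E((Z'Z)^{-1}Z'Y) \\
&= E((Z'Z)^{-1}Z'(Z\beta + \epsilon)) \\
&= E((Z'Z)^{-1}Z'Z\beta) + E((Z'Z)^{-1}Z'\epsilon) \\
&= E((Z'Z)^{-1}Z'Z)\beta + E[(Z'Z)^{-1}Z'E(\epsilon \mid Z)] \\
&= \beta
\end{aligned}$$

Thus, b is unbiased if $E(\epsilon \mid Z) = 0$.

3.2 The estimators are normally distributed

Note that b is a ratio of sample means. If we assume that the $x_i$ are fixed and known, then b is a weighted average of the $y_i$. Thus, for samples with more than 30 observations, the Central Limit Theorem may be applied to conclude that b is approximately normally distributed.

In showing unbiasedness, we have computed the mean of b. It remains to find the variance of b. Since $\beta$ is a constant, $V(\beta) = 0$ and the covariance term below vanishes. The derivation also uses $E(\epsilon\epsilon') = \sigma^2 I$, i.e., the errors have the same variance and are uncorrelated across observations.

$$\begin{aligned}
V(b) &= V((Z'Z)^{-1}Z'Y) \\
&= V((Z'Z)^{-1}Z'(Z\beta + \epsilon)) \\
&= V((Z'Z)^{-1}Z'Z\beta) + V((Z'Z)^{-1}Z'\epsilon) + 2\,Cov((Z'Z)^{-1}Z'Z\beta,\ (Z'Z)^{-1}Z'\epsilon) \\
&= V(\beta) + V((Z'Z)^{-1}Z'\epsilon) + 2\,Cov(\beta,\ (Z'Z)^{-1}Z'\epsilon) \\
&= V((Z'Z)^{-1}Z'\epsilon) \\
&= E[((Z'Z)^{-1}Z'\epsilon)((Z'Z)^{-1}Z'\epsilon)'] \\
&= E[(Z'Z)^{-1}Z'\epsilon\epsilon'Z(Z'Z)^{-1}] \\
&= (Z'Z)^{-1}Z'E(\epsilon\epsilon')Z(Z'Z)^{-1} \\
&= (Z'Z)^{-1}Z'(\sigma^2 I)Z(Z'Z)^{-1} \\
&= \sigma^2 (Z'Z)^{-1}Z'Z(Z'Z)^{-1} \\
&= \sigma^2 (Z'Z)^{-1}
\end{aligned}$$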
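The final expression, $\sigma^2 (Z'Z)^{-1}$, is easy to evaluate in the two-parameter case. The following sketch treats $\sigma^2$ as known, which is an assumption made purely for illustration; the function name `ols_covariance` and the input values are hypothetical.

```python
def ols_covariance(x, sigma2):
    """Return sigma2 * (Z'Z)^{-1} for Z = [1, x] as a 2x2 nested list.

    Element [0][0] is V(b0), [1][1] is V(b1), and the off-diagonal
    elements are C(b0, b1). sigma2 is the (assumed known) error variance.
    """
    n = len(x)
    sx = sum(x)
    sxx = sum(xi * xi for xi in x)
    det = n * sxx - sx * sx  # determinant of Z'Z
    if det == 0:
        raise ValueError("Z'Z is singular: x needs at least two distinct values")
    scale = sigma2 / det
    return [[scale * sxx, -scale * sx],
            [-scale * sx, scale * n]]


# Hypothetical design points and error variance.
cov = ols_covariance([1.0, 2.0, 3.0, 4.0], sigma2=4.0)
print(cov)
```

Note that the matrix depends on the data only through the $x_i$: spreading the $x_i$ further apart increases the determinant of $Z'Z$ and so reduces $V(b_1)$.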

Combining the mean and the variance derived above, b is approximately normally distributed:

$$b \overset{A}{\sim} N(\beta,\ \sigma^2 (Z'Z)^{-1})$$

Usually $\sigma^2$ is unknown and has to be estimated based on $e_i = y_i - b_0 - b_1 x_i$:

$$S^2 = \frac{\sum_{i=1}^{n} e_i^2}{n - 2}$$

The square root S is also called the standard error of the estimate. Why do we have a denominator of $(n - 2)$ instead of $(n - 1)$ as in the usual estimate of the population variance? It is because $b_0$ and $b_1$ have to be estimated from the data, whereas the definition of the population variance $\sigma^2$ treats the corresponding two numbers as fixed. Hence, $(n - 2)$ reflects the loss of two degrees of freedom.

3.3 BLUE

The OLS estimator is also known to be the Best Linear Unbiased Estimator. Best because $V(b)$ is the smallest among all linear unbiased estimators of $\beta$. Linear because b is a linear function of the observations $y_i$. Unbiased because $b_0$ and $b_1$ are unbiased estimators of $\beta_0$ and $\beta_1$.

4 Inference

Knowledge of the distribution of b allows us to do various kinds of inference. Confidence intervals for $\beta$ and hypothesis tests about $\beta$ are straightforward. Let

$$b = \begin{pmatrix} b_0 \\ b_1 \end{pmatrix} \sim N\left( \begin{pmatrix} \beta_0 \\ \beta_1 \end{pmatrix},\ \begin{pmatrix} V(b_0) & C(b_0, b_1) \\ C(b_0, b_1) & V(b_1) \end{pmatrix} \right) \qquad (9)$$

4.1 Testing individual parameters

Often, we are interested in testing whether an individual population parameter is different from zero at the 5% level of significance. That is, $H_0: \beta_1 = 0$ versus $H_1: \beta_1 \neq 0$. The joint distribution of $b_0$ and $b_1$ shown in (9) implies that $b_1 \sim N(\beta_1, V(b_1))$. Hence we will reject the null if

$$\frac{b_1 - 0}{\sqrt{V(b_1)}} > 1.96 \quad \text{or} \quad \frac{b_1 - 0}{\sqrt{V(b_1)}} < -1.96$$

Sometimes, we would like to test whether an individual population parameter is different from one at the 5% level of significance. That is, $H_0: \beta_1 = 1$ versus $H_1: \beta_1 \neq 1$. Again $b_1 \sim N(\beta_1, V(b_1))$, so we will reject the null if

$$\frac{b_1 - 1}{\sqrt{V(b_1)}} > 1.96 \quad \text{or} \quad \frac{b_1 - 1}{\sqrt{V(b_1)}} < -1.96$$

Testing hypotheses about $\beta_0$ is similar.

4.2 Testing a set of parameters

Suppose we are interested in testing whether the two population parameters are unequal at the 5% level of significance. That is, $H_0: \beta_1 - \beta_0 = 0$ versus $H_1: \beta_1 - \beta_0 \neq 0$. The joint distribution of $b_0$ and $b_1$ shown in (9) implies that

$$b_1 - b_0 \sim N(\beta_1 - \beta_0,\ V(b_1) + V(b_0) - 2C(b_0, b_1))$$

Hence we will reject the null if

$$\frac{(b_1 - b_0) - 0}{\sqrt{V(b_1) + V(b_0) - 2C(b_0, b_1)}} > 1.96 \quad \text{or} \quad \frac{(b_1 - b_0) - 0}{\sqrt{V(b_1) + V(b_0) - 2C(b_0, b_1)}} < -1.96$$

5 How good is the model?

5.1 Goodness of fit

How well does the model fit the data? A model fits the data better when the implied $e_i$ are small. Because the OLS estimators result from minimizing the sum of squared errors given $x_i$ and $y_i$, the estimator b gives the best fit for a given x. However, there are alternative models that use a different x as the explanatory variable. One would want a common measure to tell which explanatory variable yields the best fit.

Note that our aim is to predict y given x. Without x, we would use the sample mean of y as a prediction of y. In this case we have a sum of squared errors

$$SST = \sum_{i=1}^{n} (y_i - \bar{y})^2$$

With x, we use the sample conditional mean of y (i.e., $b_0 + b_1 x$) as a prediction of y. We have a sum of squared errors

$$SSE = \sum_{i=1}^{n} (y_i - (b_0 + b_1 x_i))^2$$

It can be shown that

$$SST = SSE + SSR$$

where SSR is the regression sum of squares

$$SSR = \sum_{i=1}^{n} ((b_0 + b_1 x_i) - \bar{y})^2$$

A natural measure of goodness of fit is

$$R^2 = \frac{SSR}{SST} = 1 - \frac{SSE}{SST}$$

$R^2$ measures the percentage of the total sum of squared errors that can be explained by the explanatory variable(s) in a regression framework. Note that $R^2$ lies between 0 and 1. A higher $R^2$ means the explanatory variable (x) is better at predicting y.

$R^2 = 0$: the explanatory variable x is useless in predicting y.
$R^2 = 1$: the explanatory variable x predicts y perfectly.

$R^2$ is also known as the coefficient of determination. A relative of $R^2$ is the correlation coefficient r.

Definition 3 (Correlation coefficient): Suppose we have a sample of n observations $(x_i, y_i)$, $i = 1, 2, \dots, n$. The correlation coefficient (r) is a measure of the strength of the linear relationship between two variables x and y.

$$r = \frac{\frac{1}{n-1}\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}{\sqrt{\frac{1}{n-1}\sum_{i=1}^{n} (x_i - \bar{x})^2}\ \sqrt{\frac{1}{n-1}\sum_{i=1}^{n} (y_i - \bar{y})^2}} = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}{\sqrt{\sum_{i=1}^{n} (x_i - \bar{x})^2 \sum_{i=1}^{n} (y_i - \bar{y})^2}}$$
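The sums of squares, $R^2$ and r defined above can be computed together in one pass. The sketch below is not from the lesson: the function name `goodness_of_fit` and the data are hypothetical, and the slope and intercept are obtained from the standard closed-form solution of the two first-order conditions, $b_1 = \sum (x_i - \bar{x})(y_i - \bar{y}) / \sum (x_i - \bar{x})^2$ and $b_0 = \bar{y} - b_1 \bar{x}$.

```python
import math


def goodness_of_fit(x, y):
    """Fit y = b0 + b1 x by OLS and return SST, SSE, SSR, R^2 and r."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    syy = sum((yi - y_bar) ** 2 for yi in y)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))

    b1 = sxy / sxx             # closed-form OLS slope
    b0 = y_bar - b1 * x_bar    # closed-form OLS intercept
    fitted = [b0 + b1 * xi for xi in x]

    sst = syy                                              # total sum of squares
    sse = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted)) # error sum of squares
    ssr = sum((fi - y_bar) ** 2 for fi in fitted)          # regression sum of squares
    r = sxy / math.sqrt(sxx * syy)                         # correlation coefficient
    return {"SST": sst, "SSE": sse, "SSR": ssr, "R2": ssr / sst, "r": r}


# Hypothetical data, not from the lesson.
fit = goodness_of_fit([1.0, 2.0, 3.0, 4.0], [5.1, 6.9, 9.2, 10.8])
print(fit)
```

On any data with at least two distinct x values and some variation in y, the output should satisfy SST = SSE + SSR and, for this one-regressor model, $R^2 = r^2$, up to rounding.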

The correlation coefficient r can range from -1.00 to 1.00. Values of -1.00 or 1.00 indicate perfect and strong correlation. Values close to 0.0 indicate weak correlation. Negative values indicate an inverse relationship and positive values indicate a direct relationship.

It can be shown that, in the simple linear model, the coefficient of determination ($R^2$) is the square of the correlation coefficient ($r^2$). Note that $R^2$ is more general and is valid for models with more than one explanatory variable, but the correlation coefficient applies only to two variables.

5.2 Validity of assumptions

The linearity assumption

We have assumed that $Y = \beta_0 + \beta_1 X + \epsilon$. Sometimes, economic theory or the data suggest that the model may not be linear. For example, in the human capital example, it is often assumed that $Y = \beta_0 + \beta_1 X + \beta_2 X^2 + \epsilon$, where Y is log monthly earnings instead of monthly earnings.

How do we know whether linearity is a satisfactory assumption? We often check with a scatter plot of the data y against x. If the plot suggests non-linearity, we will have to revise our model.

Same variance for all observations: homoskedasticity

The observations $(x_i, y_i)$, $i = 1, \dots, n$ are assumed to be drawn from the same population

$$Y \sim N(E(Y \mid X), \sigma^2)$$

so that the variance $\sigma^2$ is the same for every observation. When this assumption is not correct, we need to adjust our estimation and inference.

The assumption may generally be checked by plotting the residuals ($e_i$) against x. If the residuals exhibit some pattern, we will try to transform the model or the data. To transform the data, we can define a new dependent variable such as $\log(y)$ or $y^2$, etc. To adjust the model, one may add higher-order terms (i.e., squared terms, cubic terms) to allow for nonlinearity.
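As a closing sketch that ties the estimation of Section 1 to the tests of Section 4.1, the code below estimates the slope, replaces $\sigma^2$ by $S^2 = \sum e_i^2 / (n - 2)$, and forms the test statistic $(b_1 - \beta_1^0)/\sqrt{V(b_1)}$. Everything specific here is an assumption made for illustration: the function name `slope_z_statistic`, the 36 hypothetical observations, and the use of the closed-form slope $b_1 = \sum (x_i - \bar{x})(y_i - \bar{y}) / \sum (x_i - \bar{x})^2$. The variance $V(b_1) = S^2 / \sum (x_i - \bar{x})^2$ is the lower-right element of $S^2 (Z'Z)^{-1}$.

```python
import math


def slope_z_statistic(x, y, beta1_null=0.0):
    """Return (b1, estimated V(b1), test statistic) for H0: beta1 = beta1_null."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))

    b1 = sxy / sxx
    b0 = y_bar - b1 * x_bar
    sse = sum((yi - b0 - b1 * xi) ** 2 for xi, yi in zip(x, y))

    s2 = sse / (n - 2)   # estimate of sigma^2 with n - 2 degrees of freedom
    var_b1 = s2 / sxx    # lower-right element of s2 * (Z'Z)^{-1}
    z = (b1 - beta1_null) / math.sqrt(var_b1)
    return b1, var_b1, z


# Hypothetical data: 4 observations at each X = 1..9 around the line 3 + 2X.
x = [float(v) for v in range(1, 10) for _ in range(4)]
noise = [-0.3, 0.1, -0.1, 0.3] * 9
y = [3.0 + 2.0 * xi + ei for xi, ei in zip(x, noise)]

b1, var_b1, z_zero = slope_z_statistic(x, y, beta1_null=0.0)
_, _, z_two = slope_z_statistic(x, y, beta1_null=2.0)

# Reject H0 at the 5% level when the statistic is above 1.96 or below -1.96.
print(b1, var_b1, abs(z_zero) > 1.96, abs(z_two) > 1.96)
```

With these made-up data the hypothesis $\beta_1 = 0$ is rejected while $\beta_1 = 2$ is not, as one would expect for points scattered around a line with slope 2. The 1.96 cutoff relies on the normal approximation of Section 3.2, so it is only appropriate for reasonably large samples.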

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio

More information

(all terms are scalars).the minimization is clearer in sum notation:

(all terms are scalars).the minimization is clearer in sum notation: 7 Multiple liear regressio: with predictors) Depedet data set: y i i = 1, oe predictad, predictors x i,k i = 1,, k = 1, ' The forecast equatio is ŷ i = b + Use matrix otatio: k =1 b k x ik Y = y 1 y 1

More information

ECON 3150/4150, Spring term Lecture 3

ECON 3150/4150, Spring term Lecture 3 Itroductio Fidig the best fit by regressio Residuals ad R-sq Regressio ad causality Summary ad ext step ECON 3150/4150, Sprig term 2014. Lecture 3 Ragar Nymoe Uiversity of Oslo 21 Jauary 2014 1 / 30 Itroductio

More information

Regression, Inference, and Model Building

Regression, Inference, and Model Building Regressio, Iferece, ad Model Buildig Scatter Plots ad Correlatio Correlatio coefficiet, r -1 r 1 If r is positive, the the scatter plot has a positive slope ad variables are said to have a positive relatioship

More information

Simple Linear Regression

Simple Linear Regression Simple Liear Regressio 1. Model ad Parameter Estimatio (a) Suppose our data cosist of a collectio of pairs (x i, y i ), where x i is a observed value of variable X ad y i is the correspodig observatio

More information

Efficient GMM LECTURE 12 GMM II

Efficient GMM LECTURE 12 GMM II DECEMBER 1 010 LECTURE 1 II Efficiet The estimator depeds o the choice of the weight matrix A. The efficiet estimator is the oe that has the smallest asymptotic variace amog all estimators defied by differet

More information

11 Correlation and Regression

11 Correlation and Regression 11 Correlatio Regressio 11.1 Multivariate Data Ofte we look at data where several variables are recorded for the same idividuals or samplig uits. For example, at a coastal weather statio, we might record

More information

Response Variable denoted by y it is the variable that is to be predicted measure of the outcome of an experiment also called the dependent variable

Response Variable denoted by y it is the variable that is to be predicted measure of the outcome of an experiment also called the dependent variable Statistics Chapter 4 Correlatio ad Regressio If we have two (or more) variables we are usually iterested i the relatioship betwee the variables. Associatio betwee Variables Two variables are associated

More information

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio

More information

Simple Regression Model

Simple Regression Model Simple Regressio Model 1. The Model y i 0 1 x i u i where y i depedet variable x i idepedet variable u i disturbace/error term i 1,..., Eg: y wage (measured i 1976 dollars per hr) x educatio (measured

More information

Output Analysis (2, Chapters 10 &11 Law)

Output Analysis (2, Chapters 10 &11 Law) B. Maddah ENMG 6 Simulatio Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should be doe

More information

Linear Regression Models

Linear Regression Models Liear Regressio Models Dr. Joh Mellor-Crummey Departmet of Computer Sciece Rice Uiversity johmc@cs.rice.edu COMP 528 Lecture 9 15 February 2005 Goals for Today Uderstad how to Use scatter diagrams to ispect

More information

S Y Y = ΣY 2 n. Using the above expressions, the correlation coefficient is. r = SXX S Y Y

S Y Y = ΣY 2 n. Using the above expressions, the correlation coefficient is. r = SXX S Y Y 1 Sociology 405/805 Revised February 4, 004 Summary of Formulae for Bivariate Regressio ad Correlatio Let X be a idepedet variable ad Y a depedet variable, with observatios for each of the values of these

More information

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample. Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

Frequentist Inference

Frequentist Inference Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for

More information

Statistical Properties of OLS estimators

Statistical Properties of OLS estimators 1 Statistical Properties of OLS estimators Liear Model: Y i = β 0 + β 1 X i + u i OLS estimators: β 0 = Y β 1X β 1 = Best Liear Ubiased Estimator (BLUE) Liear Estimator: β 0 ad β 1 are liear fuctio of

More information

TAMS24: Notations and Formulas

TAMS24: Notations and Formulas TAMS4: Notatios ad Formulas Basic otatios ad defiitios X: radom variable stokastiska variabel Mea Vätevärde: µ = X = by Xiagfeg Yag kpx k, if X is discrete, xf Xxdx, if X is cotiuous Variace Varias: =

More information

Algebra of Least Squares

Algebra of Least Squares October 19, 2018 Algebra of Least Squares Geometry of Least Squares Recall that out data is like a table [Y X] where Y collects observatios o the depedet variable Y ad X collects observatios o the k-dimesioal

More information

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise)

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise) Lecture 22: Review for Exam 2 Basic Model Assumptios (without Gaussia Noise) We model oe cotiuous respose variable Y, as a liear fuctio of p umerical predictors, plus oise: Y = β 0 + β X +... β p X p +

More information

3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N.

3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N. 3/3/04 CDS M Phil Old Least Squares (OLS) Vijayamohaa Pillai N CDS M Phil Vijayamoha CDS M Phil Vijayamoha Types of Relatioships Oly oe idepedet variable, Relatioship betwee ad is Liear relatioships Curviliear

More information

MA Advanced Econometrics: Properties of Least Squares Estimators

MA Advanced Econometrics: Properties of Least Squares Estimators MA Advaced Ecoometrics: Properties of Least Squares Estimators Karl Whela School of Ecoomics, UCD February 5, 20 Karl Whela UCD Least Squares Estimators February 5, 20 / 5 Part I Least Squares: Some Fiite-Sample

More information

Chapters 5 and 13: REGRESSION AND CORRELATION. Univariate data: x, Bivariate data (x,y).

Chapters 5 and 13: REGRESSION AND CORRELATION. Univariate data: x, Bivariate data (x,y). Chapters 5 ad 13: REGREION AND CORRELATION (ectios 5.5 ad 13.5 are omitted) Uivariate data: x, Bivariate data (x,y). Example: x: umber of years studets studied paish y: score o a proficiecy test For each

More information

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen) Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................

More information

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n. Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator

More information

Statistics 20: Final Exam Solutions Summer Session 2007

Statistics 20: Final Exam Solutions Summer Session 2007 1. 20 poits Testig for Diabetes. Statistics 20: Fial Exam Solutios Summer Sessio 2007 (a) 3 poits Give estimates for the sesitivity of Test I ad of Test II. Solutio: 156 patiets out of total 223 patiets

More information

Estimation for Complete Data

Estimation for Complete Data Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of

More information

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9 BIOS 4110: Itroductio to Biostatistics Brehey Lab #9 The Cetral Limit Theorem is very importat i the realm of statistics, ad today's lab will explore the applicatio of it i both categorical ad cotiuous

More information

Sample Size Determination (Two or More Samples)

Sample Size Determination (Two or More Samples) Sample Sie Determiatio (Two or More Samples) STATGRAPHICS Rev. 963 Summary... Data Iput... Aalysis Summary... 5 Power Curve... 5 Calculatios... 6 Summary This procedure determies a suitable sample sie

More information

Linear Regression Demystified

Linear Regression Demystified Liear Regressio Demystified Liear regressio is a importat subject i statistics. I elemetary statistics courses, formulae related to liear regressio are ofte stated without derivatio. This ote iteds to

More information

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4
