( )( ) [ ] [ ] ( ) 1 = [ ] = ( ) 1. H = X X X X is called the hat matrix ( it puts the hats on the Y s) and is of order n n H = X X X X.

Size: px
Start display at page:

Download "( )( ) [ ] [ ] ( ) 1 = [ ] = ( ) 1. H = X X X X is called the hat matrix ( it puts the hats on the Y s) and is of order n n H = X X X X."

Transcription

1 ( ) ( ) where ( ) 1 ˆ β = X X X X β + ε = β + Aε A = X X 1 X [ ] E ˆ β β AE ε β so ˆ = + = β s unbased ( )( ) [ ] ˆ Cov β = E ˆ β β ˆ β β = E Aεε A AE ε ε A Aσ IA = σ AA = σ X X = [ ] = ( ) 1 Ftted values are gven by Yˆ = X ˆ β = X X X X Y = HY ( ) 1 ( ) 1 H = X X X X s called the hat matrx ( t puts the hats on the Y s) and s of order n n The error sum of squares, SSE, s Mn S = Y Y ˆ β X Y + ˆ β X X ˆ β ( ) 1 = Y Y ˆ β X Y + ˆ β X X X X X Y = Y Y ˆ β X Y The estmate of σ s based on ths Normal model Addng the assumptons that the error varables are..d. wth ε ~ N(0,σ ), then Y has a MVN (multvarate normal) dstrbuton wth mean Xβ and covarance matrx) σ I. ˆ β = X X X Y s also MVN {wth mean β and It follows (usng MVN theory) that ( ) 1 σ X X }. covarance matrx ( ) 1 The maxmum lkelhood method produces the same estmates of the β s as least-squares and an estmate of σ whch s agan a multple of SSE as before. ( β ) 1 In fact we use ˆ σ = ˆ n k 1 Y Y X Y In addton we can show that ˆ β β ese ( ˆ β ) and c s the th dagonal element of ( X X ) Resduals ( ˆ ) ( ˆ ) SSE = ˆ ε ˆ ε = Y X β Y X β Note that ˆε = Y Yˆ = Y HY = ( I H )Y 14 ( ˆ β ) ~ t where ese = c ˆ σ n k 1 It follows that Cov[ ˆ ε ] = σ ( I H ) and V [ ˆ ε ] = σ ( 1 h ) where h s the th dagonal element of ( ) 1 H = X X X X.

2 The resduals have dfferent varances. h s a measure of the leverage/nfluence of the th observaton and ts value depends explctly on the predctors for that observaton. [ ] as h V εˆ We can defne the standardsed resdual as ˆ ε ese ˆ ε ˆ ε ( ) ( ) = 1 h The standardsed resduals do have the same varance (= 1). Checkng for ths s a recommended part of model dagnostcs. An observaton wth a large standardsed resdual has an unusual response. (Related dea: Cook s dstances for dentfyng nfluental observatons). ˆ σ Illustraton of multple lnear regresson (wth explanatory varables) Model: Y = β 0 + β 1 x 1 + β x + ε or E[Y x ] = β 0 + β 1 x 1 + β x ; ε ~ N(0,σ ) Data: n = 7 y x 1 x R Data are n a data frame called llus, wth varables y, x1, and x pars(llus) gves us a matrx plot of scatterplots for pars of varables see over Y = X = X X = X X 1 = ( ) 140. X Y = ɵ β = ( X X ) X Y =

3 y x1 x >> Ftted model y ˆ = x x >> Estmate of error varance σ Y Y = ɵ X Y = ɵ = = 1 β σ ( ) >> Standard errors/covarances/dstrbutons of estmators ˆ ˆ Estmate of Cov β = σ ( X X ) = ese( ˆ β ) = ese( ˆ β ) = 0.53 ese( ˆ β ) = 0.01 So: 0 1 ˆ β ˆ ~ (, ) ˆ 0 ~ N ( β 0,0.5685) β1 N β1 β ~ N( β, ) >> Testng the β's Testng H: β 0 = 0 we have t = 0.138/ = 0.9 on 4 d.f. Smlarly for β 1 : t = 3.55 and for β : t = 0.8 Now P( t 4 >.776) = 0.05 so ˆβ 1 s sgnfcantly dfferent from 0". We conclude that β 1 0, whch suggests that x 1 may be a useful predctor of the response. 16

4 >> 95% CIs for parameters β 0 : ± ( ).e ±.086.e. (.30, 1.87) β 1 : ± ( ).e ± e. (0.196, 1.60) β : ± ( ).e ± e. ( 0.041, 0.076) CI for σ (based on the χ 4 dstrbuton of ˆ 4 σ / σ ) s {( )/11.14, ( )/0.4844}.e. (0.030, 0.695) Usng R attach(llus) summary(y) Mn. 1st Qu. Medan Mean 3rd Qu. Max summary(x1) Mn. 1st Qu. Medan Mean 3rd Qu. Max summary(x) Mn. 1st Qu. Medan Mean 3rd Qu. Max mod = lm(y~x1+x) summary(mod) Resduals: Coeffcents: Estmate Std. Error t value Pr(> t ) (Intercept) x * x Sgnf. codes: 0 `***' `**' 0.01 `*' 0.05 `.' 0.1 ` ' 1 Resdual standard error: 0.90 on 4 degrees of freedom Multple R-Squared: 0.833, Adjusted R-squared: F-statstc: on and 4 DF, p-value: summary.aov(mod) Df Sum Sq Mean Sq F value Pr(>F) x * x Resduals Sgnf. codes: 0 `***' `**' 0.01 `*' 0.05 `.' 0.1 ` ' 1 mod$ft mod$res

5 >> Ftted values v responses plot(y,mod$ft,pch=8) y >> Resduals v x and v ftted values plot(x1,mod$res,pch=8) plot(x,mod$res,pch=8) plot(mod$ft,mod$res,pch=8) mod$ft x1 mod$res mod$res mod$res x mod$ft 18

6 >> Usng the "hat" matrx H = X(X X) -1 X H = ˆ and then ftted values Y = H Y = (see ftted values above) >> Standardsed resduals ɵ σ = ɵε 1 = = h 11 = h 11 = ese(ɵε 1 ) = ( ) 1/ = sres ɵε 1 = / = ɵε 7 = = h 77 = h 77 = ese(ɵε 7 ) = ( ) 1/ = sres ɵε 7 = /0.137 = Smlarly for the others, as shown below: Row FITS1 RESI1 HI1 SRES Interpretaton of ftted models - health warnng The nterpretaton of ftted models nvolvng more than one explanatory varable has to be approached wth care. We must recognse that when we examne the mportance/effect of an explanatory varable, we are examnng ts mportance/effect n the presence of any other varables already n the model. The coeffcents/parameters and assocated standard errors and P values do not tell the whole story and can sometmes be msleadng. 19

7 We need to wegh up all the evdence all the plots and summary statstcs, ncludng the contrbutons to the total sum of squares and the overall coeffcent of determnaton. Returnng to the data of the Illustraton above (secton ), let us examne the fts of models nvolvng only one explanatory varable, x 1 or x. Frst x 1 on ts own mod10 = lm(y ~ x1) summary(mod10) Resduals: Coeffcents: Estmate Std. Error t value Pr(> t ) (Intercept) x ** Sgnf. codes: 0 `***' `**' 0.01 `*' 0.05 `.' 0.1 ` ' 1 Resdual standard error: on 5 degrees of freedom Multple R-Squared: , Adjusted R-squared: F-statstc: 0.59 on 1 and 5 DF, p-value: summary.aov(mod10) Df Sum Sq Mean Sq F value Pr(>F) x ** Resduals Sgnf. codes: 0 `***' `**' 0.01 `*' 0.05 `.' 0.1 ` ' 1 The total varaton n the responses s S yy =.0171; varable x 1 explans of ths total (80.5%) and the coeffcent assocated wth t (0.99) s hghly sgnfcant (sgnfcantly dfferent from 0) t has a very small P value (0.006, whch s < 1%). The varaton explaned by x 1 (1.630) s the same as n model above, whch was the ft of y on x 1 and x (n that order). Returnng to the ft n model, we see that, n the presence of x 1, x explans only a further of the total varaton. Together the two varables explan 83.3% of the total varaton (not much more than the 80.5% explaned by x 1 alone). In the presence of x 1, we gan lttle by ncludng x. Now x on ts own mod11 = lm(y~x) summary(mod11) Resduals: Coeffcents: Estmate Std. Error t value Pr(> t ) (Intercept) x Resdual standard error: on 5 degrees of freedom Multple R-Squared: , Adjusted R-squared: F-statstc:.1 on 1 and 5 DF, p-value: summary.aov(mod11) Df Sum Sq Mean Sq F value Pr(>F) x Resduals

8 The total varaton n the responses s S yy =.0171 ; varable x explans only of ths total (30.6%) and the coeffcent assocated wth t (0.051) s not sgnfcant (not sgnfcantly dfferent from 0) t has a szeable P value (0.197). Back to fttng both varables: what happens s we ft x frst, then x 1? mod1 = lm(y~x+x1) summary(mod1) Resduals: Coeffcents: Estmate Std. Error t value Pr(> t ) (Intercept) x x * Sgnf. codes: 0 `***' `**' 0.01 `*' 0.05 `.' 0.1 ` ' 1 Resdual standard error: 0.90 on 4 degrees of freedom Multple R-Squared: 0.833, Adjusted R-squared: F-statstc: on and 4 DF, p-value: summary.aov(mod1) Df Sum Sq Mean Sq F value Pr(>F) x x * Resduals Sgnf. codes: 0 `***' `**' 0.01 `*' 0.05 `.' 0.1 ` ' 1 The varaton explaned by x n model 11, the ft of x alone, (0.618) s the same as n model 1, the ft of y on x and x 1 (n that order). Returnng to model 1, we see that, n the presence of x, x 1 explans a further of the total varaton. Together the two varables explan 83.3% of the total varaton (much more than the 30.6% explaned by x alone). In the presence of x, we gan a lot by ncludng x 1. It s clearer and smpler f we ft the varable whch explans most of the varaton n the responses frst, then the varable of next mportance and so on. The followng pages contan supplementary materal on aspects of statstcal modellng 1

9 Extra materal on statstcal modellng A further llustraton models wth qualtatve explanatory varables (factors) Data: n = pars (x, y ) where y s the response; the data arse under two dfferent sets of condtons (type = 0 or 1) and are presented below sorted by x wthn type. Row y x type Dstngushng the two types y x y We model the responses frst gnorng the varable type. R Data are n data frame called llus3, wth varables y, x, and type attach(llus3) mod3 = lm(y~x) plot(x,y,pch=8) ablne(mod3) x1

10 y summary(mod3) Resduals: Mn 1Q Medan 3Q Max Coeffcents: Estmate Std. Error t value Pr(> t ) (Intercept) *** x e-08 *** Sgnf. codes: 0 `***' `**' 0.01 `*' 0.05 `.' 0.1 ` ' 1 Resdual standard error: on 0 degrees of freedom Multple R-Squared: , Adjusted R-squared: F-statstc: 69.4 on 1 and 0 DF, p-value: 6.01e-08 summary.aov(mod3) Df Sum Sq Mean Sq F value Pr(>F) x e-08 *** Resduals Sgnf. codes: 0 `***' `**' 0.01 `*' 0.05 `.' 0.1 ` ' 1 x mod3$ft mod3$res

11 Ftted values v responses plot(y,mod3$ft,pch=8) Resduals v ftted y values plot(mod3$ft,mod3$res,pch=8) mod3$res mod3$ft We now model the responses usng a model whch ncludes the qualtatve varable type, whch was declared as a factor when the data frame was set up [> type = factor(c( rep(0,14),rep(1,8)))]. mod4 = lm(y~x+type) mod3$ft y x1

12 summary(mod4) Resduals: Mn 1Q Medan 3Q Max Coeffcents: Estmate Std. Error t value Pr(> t ) (Intercept) e-05 *** x e-11 *** type e-06 *** Sgnf. codes: 0 `***' `**' 0.01 `*' 0.05 `.' 0.1 ` ' 1 Resdual standard error: on 19 degrees of freedom Multple R-Squared: 0.95, Adjusted R-squared: F-statstc: on and 19 DF, p-value:.001e-11 summary.aov(mod4) Df Sum Sq Mean Sq F value Pr(>F) x e-11 *** type e-06 *** Resduals Sgnf. codes: 0 `***' `**' 0.01 `*' 0.05 `.' 0.1 ` ' 1 Interpretng the output: The ft s yˆ = x ( f type ) so e.g. observaton 1: x =.4, type = 1, y ˆ = ( ) = observaton 0: x = 9.1, type =, y ˆ = ( ) = mod4$ft

13 mod4$res Ftted values v responses plot(y,mod4$ft,pch=8) mod4$ft y Resduals v x and v ftted values plot(x,mod4$res,pch=8) > plot(mod4$ft,mod4$res,pch=8) mod4$res mod4$res x mod4$ft The total varaton n the responses s S yy = ; varable x explans of ths total (77.6%) and the coeffcent assocated wth t (0.6090) s hghly sgnfcant (sgnfcantly dfferent from 0) t has a neglgble P value. 6

14 In the presence of x, the varable type explans a further of the total varaton and ts coeffcent s also hghly sgnfcant. Together the two varables explan 9.5% of the total varaton. In the presence of x, we gan much by ncludng type. * * * * * Fnally we extend the prevous model (mod4) by allowng for an nteracton between the explanatory varables x and type. An nteracton exsts between two explanatory varables when the effect of one on a response varable s dfferent at dfferent values/levels of the other. For example consder the effect of polcyholder s age and gender on a response varable clam rate. If the effect of age on clam rate s dfferent for males and females, then there s an nteracton between age and gender. mod5 = lm(y ~ x * type) summary(mod5) Resduals: Mn 1Q Medan 3Q Max Coeffcents: Estmate Std. Error t value Pr(> t ) (Intercept) e-05 *** x e-10 *** type x:type Sgnf. codes: 0 `***' `**' 0.01 `*' 0.05 `.' 0.1 ` ' 1 Resdual standard error: 0.68 on 18 degrees of freedom Multple R-Squared: 0.956, Adjusted R-squared: F-statstc: 74.6 on 3 and 18 DF, p-value:.388e-10 summary.aov(mod5) Df Sum Sq Mean Sq F value Pr(>F) x e-11 *** type e-05 *** x:type Resduals Sgnf. codes: 0 `***' `**' 0.01 `*' 0.05 `.' 0.1 ` ' 1 Comment: the nteracton appears to have added nothng the coeffcent of determnaton s effectvely unchanged compared to the prevous model. We also note that the extra parameter value s small and s not sgnfcant. In ths partcular case, an nteracton term s not helpful - ncludng t has smply confused the ssue. In a case where an nteracton term does mprove the ft and the coeffcent s sgnfcant, then both varables and the nteracton between them should be ncluded n the model Comparng full and reduced models We may want to compare a model wth a restrcted verson of tself: Full model wth k predctors: Y = β 0 + β 1 x β m x m + β m+1 x m β k x k + ε Reduced model wth m predctors: Y = β 0 + β 1 x β m x m + ε We can wrte the restrcton as H 0 : β m+1 = β m+ = = β k = 0 The full model ncludes an addtonal k m predctors. 7

15 Test s based on the relatve reducton n the resdual/unexplaned varaton SSE (.e. the ncrease n the regresson/explaned SSR) resultng from ncludng the extra k m predctors n the model. Under H 0 : ( reduced full ) SSE SSE /( k m) ~ F SSE /( n k 1) full k m, n k 1 If the F value s large we reject H 0 and conclude that the extra predctors should be ncluded n the model. Ref: Illustraton n sectons and : n = 7, k =, m = 1 Full model: Y = β 0 + β 1 x 1 + β x + ε SSE full = on 4 df ( predctors) Reduced model A: Y = β 0 + β 1 x 1 + ε (1 predctor).e. H 0 : β = 0 Fttng the regresson of y on x 1 alone gves SSE reduced = on 5 df So F = [( )/(1)]/[( /4)] = 0.7 on 1,4 df P value s very hgh, so the restrcton (H 0 ) can stand. The predctor x adds nothng of value to the model. Reduced model B: Y = β 0 + β x + ε (1 predctor).e. H 0 : β 1 = 0 Fttng the regresson of y on x alone gves SSE reduced = on 5 df So F = ( )/( /4) = 1.6 on 1,4 df P value < 0.05 so we reject the restrcton: we should nclude the predctor x 1. Notes: In the full model, explaned varaton SSR = Predctor x 1 explans followed by x wth a further Specfyng the predctors n reverse order, x explans followed by x 1 wth a further x 1 has a t value of 3.55 wth P value 0.04 x has a t value of only 0.8 wth P value R = Fttng x 1 only: R = ; fttng x only: R = Partal correlaton The sgnfcance of an ndvdual predctor can be found from the t rato of the correspondng β parameter. Another approach s to examne the partal correlaton of response and predctor, whch s the correlaton of response and predctor wth dependence on all other predctors removed. Consder the model Y = β 0 + β 1 x+ β z + ε wth data (y, x, z ), = 1,, n. We consder the drect effect of x on the response varable Y we fnd the partal correlaton of y and x, wth dependence on z removed, gven the symbol r yx z : remove the lnear effect of z from y: regress y on z call the resduals from the ft e yz remove the lnear effect of z from x: regress x on z call the resduals from the ft e xz We can now nvestgate the relatonshp between y and x (wth dependence on z removed) by regressng e yz on e xz. 8

16 And the ordnary correlaton coeffcent between e yz and e xz s the requred partal correlaton coeffcent of y and x wth dependence on z removed,.e. r yx z = correlaton coeffcent between e yz and e xz The partal correlaton coeffcent r yx z s related to the ordnary correlaton coeffcents as follows: r yx z = r yx r r yz xz ( 1 r yz )( 1 r xz ) (Note: n ths formula x and y are nterchangeable - as of course they must be) Partal correlaton coeffcents between response and potental new predctors to be ncluded n the model (wth the effect of all exstng predctors n the model removed) are also used n stepwse regresson methods. Ref: Illustraton n sectons and agan : n = 7, k =, m = 1 Partal correlaton of y and x 1 : y on x : r y = ; y = x ; resduals from the ft e y x 1 on x : r 1 = ; x 1 = x ; resduals from the ft e 1 Partal correlaton coeffcent of y and x 1 = correlaton coeffcent of e y and e 1 = Ordnary correlaton coeffcent of y and x 1 = Partal correlaton of y and x : y on x 1 : r y1 = ; y = x 1 ; resduals from the ft e y1 x on x 1 : r 1 = ; x = x 1 ; resduals from the ft e 1 Partal correlaton coeffcent of y and x = correlaton coeffcent of e y1 and e 1 = Ordnary correlaton coeffcent of y and x = Factors/dummy varables The relatonshp between the response Y and predctors x may be dfferent for values of some category varable - for example: ncome wth x and gender yeld wth x and nvestment type sales wth x and country We can ncorporate a category varable n a regresson analyss by declarng t as a factor or by explctly usng one or more ndcator varables, whch n ths context are often referred to as dummy varables. In secton we ncorporated the varable type by declarng t as a factor wth two levels. We could alternatvely have ncorporated t by explctly ntroducng a dummy numercal varable: Regresson of Y on x and type (= 0 or 1) E[Y type = 1] = β 0 + β 1 x E[Y type = ] = β 0 + β 1 x + β β s the addtonal ntercept for type = 1 Y = β 0 + β 1 x + (β type) + ε 9

17 Suppose the category varable s country, wth 3 values (Scotland, England, Wales). An alternatve to declarng country as a factor wth 3 levels s to ntroduce dummy varables: dv1 (= 1 for Scotland, 0 otherwse) dv (= 1 for England, 0 otherwse) In ths approach, to regress Y on x and country, we can use the model: Y = β 0 + β 1 x + (β dv1) + (β 3 dv) + ε E[Y Wales] = β 0 + β 1 x E[Y Scotland] = (β 0 + β ) + β 1 x E[Y England] = (β 0 + β 3 ) + β 1 x β s the addtonal ntercept for Scotland β 3 s the addtonal ntercept for England These examples are both common slope (wth dfferng ntercepts) models. There are also common ntercept (wth dfferng slopes) models Multcollnearty Consder the model Y = β 0 + β 1 x 1 + β x + ε Suppose the data ponts for our predctors (x 1,x ) le on a straght lne (r = ± 1). Then the model s not of full rank and we cannot ft t (try t and see ). If r s close to 1, the estmaton process becomes unstable (small changes n errors have bg effects) the standard errors of estmaton are hgh. We can predct the response for values close to the lne whch the data (x 1,x ) are close to, but not elsewhere. We cannot separate the effects of one predctor from the other and so we cannot sensbly nvestgate the effect of x 1 or x alone. Ths stuaton s descrbed as one of multcollnearty of course ts serousness s a matter of degree. Some predctors are related n subtle (not very obvous) ways ths can be a real problem. Some statstcal software packages provde warnngs when hgh correlatons among predctors are present Multple correlaton Multple correlaton measures the strength of the relatonshp between the response Y and the set of predctors. There are two approaches, both of whch we have met before. () correlaton coeffcent between the responses and the ftted responses () coeffcent of determnaton, gven by R ( yˆ y ) ( ) SSR = = SST y y In stepwse regresson we ntroduce the predctors one by one. Each addtonal predctor ntroduced nto the model wll add an amount ( 0) to the overall SSR, reducng SSE by a correspondng amount *. We generally choose the predctor to ntroduce next as that one whch wll ncrease R the most. * note: the degrees of freedom wll change 30

18 Heteroscedastcty In many stuatons n economcs and fnance, the specfcaton that the model be homoscedastc s napproprate, and so we nvestgate models n whch the assumpton that the error varance s constant s dropped. Instead we specfy a heteroscedastc model wth V[ε ] = σ. In hgh varance sectons of the data the resduals are lkely to be greater than n low varance sectons, and so OLS wll place more weght on the observatons n the former sectons, ensurng a good ft there. The OLS estmators are stll unbased and consstent, but they are not mnmum varance. In the model wth one predctor Y = a + bx + ε, V[ε ] = σ we fnd bˆ = S /, ˆ xy Sxx = cy E b = b, and Var b = ( ) x x σ ˆ Sxx The basc technque used to take account of heteroscedastcty s weghted least squares (WLS) n whch, for the model above, we mnmse whch leads to * ( ) Y a b x σ * * * * ˆ x y x x y y b = where x =, y = x σ σ We can acheve ths by scalng/weghtng the orgnal data and then usng OLS. In general then: Consder the model Y = β 0 + β 1 x β k x k + ε and the transformed model Y * = β 0 ' + β 1 ' x 1 * + + β k 'x k * + ε * where Y * = Y /σ, β ' = β /σ, x j * = x j /σ, j = 1,,, k, ε * = ε /σ The transformed model s homoscedastc: V[ε *] = 1 OLS on the transformed model s equvalent to WLS on the orgnal model Potental drawback do we need to know the values of the σ? Actually we can manage wth a knowledge of only the relatve szes of the error varances we may ndeed have some relevant nformaton whch wll help us to assess the relatve szes, but t s stll askng a lot. One mportant specal case s to assume that the error varance s related to one of the predctors n partcular that V[ε ] x j for some j. Suppose V[ε ] = cx. We then take Y * = Y /x, x j * = x j /x, j = 1,,, k, ε * = ε /x and ft the model Y /x = β 0 /x + β 1 x 1 /x + β + β 3 x 3 /x + + β k x k /x + ε /x Ths s also a homoscedastc model: V[ε *] = c. 31

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

Chapter 15 - Multiple Regression

Chapter 15 - Multiple Regression Chapter - Multple Regresson Chapter - Multple Regresson Multple Regresson Model The equaton that descrbes how the dependent varable y s related to the ndependent varables x, x,... x p and an error term

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

Statistics for Business and Economics

Statistics for Business and Economics Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an

More information

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students. PPOL 59-3 Problem Set Exercses n Smple Regresson Due n class /8/7 In ths problem set, you are asked to compute varous statstcs by hand to gve you a better sense of the mechancs of the Pearson correlaton

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

Biostatistics 360 F&t Tests and Intervals in Regression 1

Biostatistics 360 F&t Tests and Intervals in Regression 1 Bostatstcs 360 F&t Tests and Intervals n Regresson ORIGIN Model: Y = X + Corrected Sums of Squares: X X bar where: s the y ntercept of the regresson lne (translaton) s the slope of the regresson lne (scalng

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε Chapter 3 Secton 3.1 Model Assumptons: Multple Regresson Model Predcton Equaton Std. Devaton of Error Correlaton Matrx Smple Lnear Regresson: 1.) Lnearty.) Constant Varance 3.) Independent Errors 4.) Normalty

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 1 Chapters 14, 15 & 16 Professor Ahmad, Ph.D. Department of Management Revsed August 005 Chapter 14 Formulas Smple Lnear Regresson Model: y =

More information

a. (All your answers should be in the letter!

a. (All your answers should be in the letter! Econ 301 Blkent Unversty Taskn Econometrcs Department of Economcs Md Term Exam I November 8, 015 Name For each hypothess testng n the exam complete the followng steps: Indcate the test statstc, ts crtcal

More information

January Examinations 2015

January Examinations 2015 24/5 Canddates Only January Examnatons 25 DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR STUDENT CANDIDATE NO.. Department Module Code Module Ttle Exam Duraton (n words)

More information

Chapter 15 Student Lecture Notes 15-1

Chapter 15 Student Lecture Notes 15-1 Chapter 15 Student Lecture Notes 15-1 Basc Busness Statstcs (9 th Edton) Chapter 15 Multple Regresson Model Buldng 004 Prentce-Hall, Inc. Chap 15-1 Chapter Topcs The Quadratc Regresson Model Usng Transformatons

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION MTH352/MH3510 Regression Analysis

NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION MTH352/MH3510 Regression Analysis NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION 014-015 MTH35/MH3510 Regresson Analyss December 014 TIME ALLOWED: HOURS INSTRUCTIONS TO CANDIDATES 1. Ths examnaton paper contans FOUR (4) questons

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

Statistics MINITAB - Lab 2

Statistics MINITAB - Lab 2 Statstcs 20080 MINITAB - Lab 2 1. Smple Lnear Regresson In smple lnear regresson we attempt to model a lnear relatonshp between two varables wth a straght lne and make statstcal nferences concernng that

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 14 Multple Regresson Models 1999 Prentce-Hall, Inc. Chap. 14-1 Chapter Topcs The Multple Regresson Model Contrbuton of Indvdual Independent Varables

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

Lecture 2: Prelude to the big shrink

Lecture 2: Prelude to the big shrink Lecture 2: Prelude to the bg shrnk Last tme A slght detour wth vsualzaton tools (hey, t was the frst day... why not start out wth somethng pretty to look at?) Then, we consdered a smple 120a-style regresson

More information

β0 + β1xi. You are interested in estimating the unknown parameters β

β0 + β1xi. You are interested in estimating the unknown parameters β Ordnary Least Squares (OLS): Smple Lnear Regresson (SLR) Analytcs The SLR Setup Sample Statstcs Ordnary Least Squares (OLS): FOCs and SOCs Back to OLS and Sample Statstcs Predctons (and Resduals) wth OLS

More information

Linear Regression Analysis: Terminology and Notation

Linear Regression Analysis: Terminology and Notation ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented

More information

28. SIMPLE LINEAR REGRESSION III

28. SIMPLE LINEAR REGRESSION III 8. SIMPLE LINEAR REGRESSION III Ftted Values and Resduals US Domestc Beers: Calores vs. % Alcohol To each observed x, there corresponds a y-value on the ftted lne, y ˆ = βˆ + βˆ x. The are called ftted

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise. Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed

More information

Chapter 14 Simple Linear Regression

Chapter 14 Simple Linear Regression Chapter 4 Smple Lnear Regresson Chapter 4 - Smple Lnear Regresson Manageral decsons often are based on the relatonshp between two or more varables. Regresson analss can be used to develop an equaton showng

More information

18. SIMPLE LINEAR REGRESSION III

18. SIMPLE LINEAR REGRESSION III 8. SIMPLE LINEAR REGRESSION III US Domestc Beers: Calores vs. % Alcohol Ftted Values and Resduals To each observed x, there corresponds a y-value on the ftted lne, y ˆ ˆ = α + x. The are called ftted values.

More information

Correlation and Regression

Correlation and Regression Correlaton and Regresson otes prepared by Pamela Peterson Drake Index Basc terms and concepts... Smple regresson...5 Multple Regresson...3 Regresson termnology...0 Regresson formulas... Basc terms and

More information

Diagnostics in Poisson Regression. Models - Residual Analysis

Diagnostics in Poisson Regression. Models - Residual Analysis Dagnostcs n Posson Regresson Models - Resdual Analyss 1 Outlne Dagnostcs n Posson Regresson Models - Resdual Analyss Example 3: Recall of Stressful Events contnued 2 Resdual Analyss Resduals represent

More information

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9 Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson

More information

Econ107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10)

Econ107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10) I. Defnton and Problems Econ7 Appled Econometrcs Topc 9: Heteroskedastcty (Studenmund, Chapter ) We now relax another classcal assumpton. Ths s a problem that arses often wth cross sectons of ndvduals,

More information

The Ordinary Least Squares (OLS) Estimator

The Ordinary Least Squares (OLS) Estimator The Ordnary Least Squares (OLS) Estmator 1 Regresson Analyss Regresson Analyss: a statstcal technque for nvestgatng and modelng the relatonshp between varables. Applcatons: Engneerng, the physcal and chemcal

More information

Learning Objectives for Chapter 11

Learning Objectives for Chapter 11 Chapter : Lnear Regresson and Correlaton Methods Hldebrand, Ott and Gray Basc Statstcal Ideas for Managers Second Edton Learnng Objectves for Chapter Usng the scatterplot n regresson analyss Usng the method

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

Introduction to Regression

Introduction to Regression Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

Chapter 3. Two-Variable Regression Model: The Problem of Estimation

Chapter 3. Two-Variable Regression Model: The Problem of Estimation Chapter 3. Two-Varable Regresson Model: The Problem of Estmaton Ordnary Least Squares Method (OLS) Recall that, PRF: Y = β 1 + β X + u Thus, snce PRF s not drectly observable, t s estmated by SRF; that

More information

e i is a random error

e i is a random error Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where + β + β e for,..., and are observable varables e s a random error How can an estmaton rule be constructed for the unknown

More information

[ ] λ λ λ. Multicollinearity. multicollinearity Ragnar Frisch (1934) perfect exact. collinearity. multicollinearity. exact

[ ] λ λ λ. Multicollinearity. multicollinearity Ragnar Frisch (1934) perfect exact. collinearity. multicollinearity. exact Multcollnearty multcollnearty Ragnar Frsch (934 perfect exact collnearty multcollnearty K exact λ λ λ K K x+ x+ + x 0 0.. λ, λ, λk 0 0.. x perfect ntercorrelated λ λ λ x+ x+ + KxK + v 0 0.. v 3 y β + β

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 31 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 6. Rdge regresson The OLSE s the best lnear unbased

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

Basically, if you have a dummy dependent variable you will be estimating a probability.

Basically, if you have a dummy dependent variable you will be estimating a probability. ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy

More information

Chap 10: Diagnostics, p384

Chap 10: Diagnostics, p384 Chap 10: Dagnostcs, p384 Multcollnearty 10.5 p406 Defnton Multcollnearty exsts when two or more ndependent varables used n regresson are moderately or hghly correlated. - when multcollnearty exsts, regresson

More information

Polynomial Regression Models

Polynomial Regression Models LINEAR REGRESSION ANALYSIS MODULE XII Lecture - 6 Polynomal Regresson Models Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Test of sgnfcance To test the sgnfcance

More information

Chapter 14 Simple Linear Regression Page 1. Introduction to regression analysis 14-2

Chapter 14 Simple Linear Regression Page 1. Introduction to regression analysis 14-2 Chapter 4 Smple Lnear Regresson Page. Introducton to regresson analyss 4- The Regresson Equaton. Lnear Functons 4-4 3. Estmaton and nterpretaton of model parameters 4-6 4. Inference on the model parameters

More information

Now we relax this assumption and allow that the error variance depends on the independent variables, i.e., heteroskedasticity

Now we relax this assumption and allow that the error variance depends on the independent variables, i.e., heteroskedasticity ECON 48 / WH Hong Heteroskedastcty. Consequences of Heteroskedastcty for OLS Assumpton MLR. 5: Homoskedastcty var ( u x ) = σ Now we relax ths assumpton and allow that the error varance depends on the

More information

Chapter 9: Statistical Inference and the Relationship between Two Variables

Chapter 9: Statistical Inference and the Relationship between Two Variables Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

Lecture 3 Stat102, Spring 2007

Lecture 3 Stat102, Spring 2007 Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture

More information

ANOVA. The Observations y ij

ANOVA. The Observations y ij ANOVA Stands for ANalyss Of VArance But t s a test of dfferences n means The dea: The Observatons y j Treatment group = 1 = 2 = k y 11 y 21 y k,1 y 12 y 22 y k,2 y 1, n1 y 2, n2 y k, nk means: m 1 m 2

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction ECONOMICS 35* -- NOTE 7 ECON 35* -- NOTE 7 Interval Estmaton n the Classcal Normal Lnear Regresson Model Ths note outlnes the basc elements of nterval estmaton n the Classcal Normal Lnear Regresson Model

More information

Lecture 4 Hypothesis Testing

Lecture 4 Hypothesis Testing Lecture 4 Hypothess Testng We may wsh to test pror hypotheses about the coeffcents we estmate. We can use the estmates to test whether the data rejects our hypothess. An example mght be that we wsh to

More information

β0 + β1xi. You are interested in estimating the unknown parameters β

β0 + β1xi. You are interested in estimating the unknown parameters β Revsed: v3 Ordnar Least Squares (OLS): Smple Lnear Regresson (SLR) Analtcs The SLR Setup Sample Statstcs Ordnar Least Squares (OLS): FOCs and SOCs Back to OLS and Sample Statstcs Predctons (and Resduals)

More information

SIMPLE LINEAR REGRESSION

SIMPLE LINEAR REGRESSION Smple Lnear Regresson and Correlaton Introducton Prevousl, our attenton has been focused on one varable whch we desgnated b x. Frequentl, t s desrable to learn somethng about the relatonshp between two

More information

STATISTICS QUESTIONS. Step by Step Solutions.

STATISTICS QUESTIONS. Step by Step Solutions. STATISTICS QUESTIONS Step by Step Solutons www.mathcracker.com 9//016 Problem 1: A researcher s nterested n the effects of famly sze on delnquency for a group of offenders and examnes famles wth one to

More information

Introduction to Generalized Linear Models

Introduction to Generalized Linear Models INTRODUCTION TO STATISTICAL MODELLING TRINITY 00 Introducton to Generalzed Lnear Models I. Motvaton In ths lecture we extend the deas of lnear regresson to the more general dea of a generalzed lnear model

More information

β0 + β1xi and want to estimate the unknown

β0 + β1xi and want to estimate the unknown SLR Models Estmaton Those OLS Estmates Estmators (e ante) v. estmates (e post) The Smple Lnear Regresson (SLR) Condtons -4 An Asde: The Populaton Regresson Functon B and B are Lnear Estmators (condtonal

More information

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE)

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE) ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE) June 7, 016 15:30 Frst famly name: Name: DNI/ID: Moble: Second famly Name: GECO/GADE: Instructor: E-mal: Queston 1 A B C Blank Queston A B C Blank Queston

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrcs of Panel Data Jakub Mućk Meetng # 8 Jakub Mućk Econometrcs of Panel Data Meetng # 8 1 / 17 Outlne 1 Heterogenety n the slope coeffcents 2 Seemngly Unrelated Regresson (SUR) 3 Swamy s random

More information

Topic 7: Analysis of Variance

Topic 7: Analysis of Variance Topc 7: Analyss of Varance Outlne Parttonng sums of squares Breakdown the degrees of freedom Expected mean squares (EMS) F test ANOVA table General lnear test Pearson Correlaton / R 2 Analyss of Varance

More information

REGRESSION ANALYSIS II- MULTICOLLINEARITY

REGRESSION ANALYSIS II- MULTICOLLINEARITY REGRESSION ANALYSIS II- MULTICOLLINEARITY QUESTION 1 Departments of Open Unversty of Cyprus A and B consst of na = 35 and nb = 30 students respectvely. The students of department A acheved an average test

More information

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis Resource Allocaton and Decson Analss (ECON 800) Sprng 04 Foundatons of Regresson Analss Readng: Regresson Analss (ECON 800 Coursepak, Page 3) Defntons and Concepts: Regresson Analss statstcal technques

More information

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva Econ 39 - Statstcal Propertes of the OLS estmator Sanjaya DeSlva September, 008 1 Overvew Recall that the true regresson model s Y = β 0 + β 1 X + u (1) Applyng the OLS method to a sample of data, we estmate

More information

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes 25/6 Canddates Only January Examnatons 26 Student Number: Desk Number:...... DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR Department Module Code Module Ttle Exam Duraton

More information

F8: Heteroscedasticity

F8: Heteroscedasticity F8: Heteroscedastcty Feng L Department of Statstcs, Stockholm Unversty What s so-called heteroscedastcty In a lnear regresson model, we assume the error term has a normal dstrbuton wth mean zero and varance

More information

The SAS program I used to obtain the analyses for my answers is given below.

The SAS program I used to obtain the analyses for my answers is given below. Homework 1 Answer sheet Page 1 The SAS program I used to obtan the analyses for my answers s gven below. dm'log;clear;output;clear'; *************************************************************; *** EXST7034

More information

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data Lab : TWO-LEVEL NORMAL MODELS wth school chldren popularty data Purpose: Introduce basc two-level models for normally dstrbuted responses usng STATA. In partcular, we dscuss Random ntercept models wthout

More information

Interpreting Slope Coefficients in Multiple Linear Regression Models: An Example

Interpreting Slope Coefficients in Multiple Linear Regression Models: An Example CONOMICS 5* -- Introducton to NOT CON 5* -- Introducton to NOT : Multple Lnear Regresson Models Interpretng Slope Coeffcents n Multple Lnear Regresson Models: An xample Consder the followng smple lnear

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Exerments-I MODULE III LECTURE - 2 EXPERIMENTAL DESIGN MODELS Dr. Shalabh Deartment of Mathematcs and Statstcs Indan Insttute of Technology Kanur 2 We consder the models

More information

Economics 130. Lecture 4 Simple Linear Regression Continued

Economics 130. Lecture 4 Simple Linear Regression Continued Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do

More information

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution Department of Statstcs Unversty of Toronto STA35HS / HS Desgn and Analyss of Experments Term Test - Wnter - Soluton February, Last Name: Frst Name: Student Number: Instructons: Tme: hours. Ads: a non-programmable

More information

First Year Examination Department of Statistics, University of Florida

First Year Examination Department of Statistics, University of Florida Frst Year Examnaton Department of Statstcs, Unversty of Florda May 7, 010, 8:00 am - 1:00 noon Instructons: 1. You have four hours to answer questons n ths examnaton.. You must show your work to receve

More information

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the Chapter 11 Student Lecture Notes 11-1 Lnear regresson Wenl lu Dept. Health statstcs School of publc health Tanjn medcal unversty 1 Regresson Models 1. Answer What Is the Relatonshp Between the Varables?.

More information

Introduction to Dummy Variable Regressors. 1. An Example of Dummy Variable Regressors

Introduction to Dummy Variable Regressors. 1. An Example of Dummy Variable Regressors ECONOMICS 5* -- Introducton to Dummy Varable Regressors ECON 5* -- Introducton to NOTE Introducton to Dummy Varable Regressors. An Example of Dummy Varable Regressors A model of North Amercan car prces

More information

Module Contact: Dr Susan Long, ECO Copyright of the University of East Anglia Version 1

Module Contact: Dr Susan Long, ECO Copyright of the University of East Anglia Version 1 UNIVERSITY OF EAST ANGLIA School of Economcs Man Seres PG Examnaton 016-17 ECONOMETRIC METHODS ECO-7000A Tme allowed: hours Answer ALL FOUR Questons. Queston 1 carres a weght of 5%; Queston carres 0%;

More information

Dummy variables in multiple variable regression model

Dummy variables in multiple variable regression model WESS Econometrcs (Handout ) Dummy varables n multple varable regresson model. Addtve dummy varables In the prevous handout we consdered the followng regresson model: y x 2x2 k xk,, 2,, n and we nterpreted

More information

Outline. Zero Conditional mean. I. Motivation. 3. Multiple Regression Analysis: Estimation. Read Wooldridge (2013), Chapter 3.

Outline. Zero Conditional mean. I. Motivation. 3. Multiple Regression Analysis: Estimation. Read Wooldridge (2013), Chapter 3. Outlne 3. Multple Regresson Analyss: Estmaton I. Motvaton II. Mechancs and Interpretaton of OLS Read Wooldrdge (013), Chapter 3. III. Expected Values of the OLS IV. Varances of the OLS V. The Gauss Markov

More information

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U) Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of

More information

Chapter 7 Generalized and Weighted Least Squares Estimation. In this method, the deviation between the observed and expected values of

Chapter 7 Generalized and Weighted Least Squares Estimation. In this method, the deviation between the observed and expected values of Chapter 7 Generalzed and Weghted Least Squares Estmaton The usual lnear regresson model assumes that all the random error components are dentcally and ndependently dstrbuted wth constant varance. When

More information

Chapter 12 Analysis of Covariance

Chapter 12 Analysis of Covariance Chapter Analyss of Covarance Any scentfc experment s performed to know somethng that s unknown about a group of treatments and to test certan hypothess about the correspondng treatment effect When varablty

More information

where I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X).

where I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X). 11.4.1 Estmaton of Multple Regresson Coeffcents In multple lnear regresson, we essentally solve n equatons for the p unnown parameters. hus n must e equal to or greater than p and n practce n should e

More information

UNIVERSITY OF TORONTO. Faculty of Arts and Science JUNE EXAMINATIONS STA 302 H1F / STA 1001 H1F Duration - 3 hours Aids Allowed: Calculator

UNIVERSITY OF TORONTO. Faculty of Arts and Science JUNE EXAMINATIONS STA 302 H1F / STA 1001 H1F Duration - 3 hours Aids Allowed: Calculator UNIVERSITY OF TORONTO Faculty of Arts and Scence JUNE EXAMINATIONS 008 STA 30 HF / STA 00 HF Duraton - 3 hours Ads Allowed: Calculator LAST NAME: FIRST NAME: STUDENT NUMBER: Enrolled n (Crcle one): STA30

More information

Regression Analysis. Regression Analysis

Regression Analysis. Regression Analysis Regresson Analyss Smple Regresson Multvarate Regresson Stepwse Regresson Replcaton and Predcton Error 1 Regresson Analyss In general, we "ft" a model by mnmzng a metrc that represents the error. n mn (y

More information

Continuous vs. Discrete Goods

Continuous vs. Discrete Goods CE 651 Transportaton Economcs Charsma Choudhury Lecture 3-4 Analyss of Demand Contnuous vs. Dscrete Goods Contnuous Goods Dscrete Goods x auto 1 Indfference u curves 3 u u 1 x 1 0 1 bus Outlne Data Modelng

More information

STAT 511 FINAL EXAM NAME Spring 2001

STAT 511 FINAL EXAM NAME Spring 2001 STAT 5 FINAL EXAM NAME Sprng Instructons: Ths s a closed book exam. No notes or books are allowed. ou may use a calculator but you are not allowed to store notes or formulas n the calculator. Please wrte

More information

PubH 7405: REGRESSION ANALYSIS. SLR: INFERENCES, Part II

PubH 7405: REGRESSION ANALYSIS. SLR: INFERENCES, Part II PubH 7405: REGRESSION ANALSIS SLR: INFERENCES, Part II We cover te topc of nference n two sessons; te frst sesson focused on nferences concernng te slope and te ntercept; ts s a contnuaton on estmatng

More information

Scatter Plot x

Scatter Plot x Construct a scatter plot usng excel for the gven data. Determne whether there s a postve lnear correlaton, negatve lnear correlaton, or no lnear correlaton. Complete the table and fnd the correlaton coeffcent

More information

Problem of Estimation. Ordinary Least Squares (OLS) Ordinary Least Squares Method. Basic Econometrics in Transportation. Bivariate Regression Analysis

Problem of Estimation. Ordinary Least Squares (OLS) Ordinary Least Squares Method. Basic Econometrics in Transportation. Bivariate Regression Analysis 1/60 Problem of Estmaton Basc Econometrcs n Transportaton Bvarate Regresson Analyss Amr Samm Cvl Engneerng Department Sharf Unversty of Technology Ordnary Least Squares (OLS) Maxmum Lkelhood (ML) Generally,

More information