Topic 9. Regression and Correlation

Size: px
Start display at page:

Download "Topic 9. Regression and Correlation"

Transcription

1 BE54W Regresso ad Correlato Page of 43 Topc 9 Regresso ad Correlato Topc. Defto of the Lear Regresso Model... Estmato The Aalyss of Varace Table. 4. Assumptos for the Straght Le Regresso. 5. Hypothess Testg Cofdece Iterval Estmato Itroducto to Correlato.. 8. Hypothess Test for Correlato

2 BE54W Regresso ad Correlato Page of 43. Defto of the Lear Regresso Model I the last ut, topc 8, the settg was that of two categorcal (dscrete) varables, such as smokg ad low brth weght, ad the use of ch-square tests of assocato ad homogeety. I ths ut, topc 9, our focus s the settg of two cotuous varables, such as age ad weght. Ths topc s a troducto to smple lear regresso ad correlato. Lear Regresso Lear regresso models the mea µ of oe radom varable as a lear fucto of oe or more other varables that are treated as fxed. The estmato ad hypothess testg volved are extesos of deas ad techques that we have already see. I lear regresso, we observe a outcome or depedet varable Y at several levels of the depedet or predctor varable X (there may be more tha oe predctor X as see later). A lear regresso model assumes that the values of the predctor X have bee fxed advace of observg Y. However, ths s ot always the realty. Ofte Y ad X are observed jotly ad are both radom varables. Correlato Correlato cosders the assocato of two radom varables. The techques of estmato ad hypothess testg are the same for lear regresso ad correlato aalyses. Explorg the relatoshp begs wth fttg a le to the pots. We develop the lear regresso model aalyss for a smple example volvg oe predctor ad oe outcome.

3 BE54W Regresso ad Correlato Page 3 of 43 Example. Source: Klebaum, Kupper, ad Muller 988 Avalable are pars of observatos of age ad weght for = chcke embryos. WT=Y AGE=X LOGWT=Z We ll use a famlar otato The data are pars of (X, Y ) where X=AGE ad Y=WT (X, Y ) = (6,.9) (X, Y ) = (6,.8) ad equvaletly, pars of (X, Y ) where X=AGE ad Y=LOGWT (X, Y ) = (6, -.538) (X, Y ) = (6,.449) Though smple, t helps to be clear the research questo Does weght chage wth age? I the laguage of aalyss of varace we are askg the followg: Ca the varablty weght be explaed, to a sgfcat extet, by varatos age? What s a good fuctoal form that relates age to weght?

4 BE54W Regresso ad Correlato Page 4 of 43 We beg wth a plot of X=AGE versus Y=WT Scatter Plot of WT vs AGE WT We check ad lear about the followg: The average ad meda of X The rage ad patter of varablty X The average ad meda of Y The rage ad patter of varablty Y The ature of the relatoshp betwee X ad Y The stregth of the relatoshp betwee X ad Y The detfcato of ay pots that mght be fluetal AGE For these data: The plot suggests a relatoshp betwee AGE ad WT A straght le mght ft well, but aother model mght be better We have adequate rages of values for both AGE ad WT There are o outlers

5 BE54W Regresso ad Correlato Page 5 of 43 We mght have gotte ay of a varety of plots. y.5 No relatoshp betwee X ad Y x 8 y 6 Lear relatoshp betwee X ad Y x 5 y3 5 No-lear relatoshp betwee X ad Y x

6 BE54W Regresso ad Correlato Page 6 of 43 y x Note the arrow potg to the outlyg pot Ft of a lear model wll yeld estmated slope that s spurously o-zero. y Note the arrow potg to the outlyg pot Ft of a lear model wll yeld a estmated slope that s spurously ear zero x y Note the arrow potg to the outlyg pot Ft of a lear model wll yeld a estmated slope that s spurously hgh x

7 BE54W Regresso ad Correlato Page 7 of 43 The bowl shape of our scatter plot suggests that perhaps a better model relates the logarthm of WT to AGE:.5 Scatter Plot of LOGWT vs AGE -. LOGWT We ll vestgate two models. AGE ) WT = β + β AGE ) LOGWT = β + β AGE

8 BE54W Regresso ad Correlato Page 8 of 43 Recall what you mght have leared a old math class about the equato of a le You mght recall, too, a feel for the slope Slope > Slope = Slope <

9 BE54W Regresso ad Correlato Page 9 of 43 Populato Y Defto of the Straght Le Model Y = β + β X Sample = β + β X + ε Y = β + β X + e Y = β + β X s the relatoshp the populato. It s measured wth error. ε = measuremet error β, β, ad e are our guesses of β, β ad ε e = resdual We do NOT kow the value of β or β or ε We do have values of β, β ad e The values of β, β ad e are obtaed by the method of least squares estmato. To see f β β ad β β we perform regresso dagostcs. Note Ths s ot dscussed ths course; see PubHlth 64, Itermedate Bostatstcs A lttle otato, sorry! Y = the outcome or depedet varable X = the predctor or depedet varable µ Y = The expected value of Y for all persos the populato µ Y X=x = The expected value of Y for the sub-populato for whom X=x σ Y σ Y X=x = Varablty of Y amog all persos the populato = Varablty of Y for the sub-populato for whom X=x

10 BE54W Regresso ad Correlato Page of 43. Estmato We wll use the method of least squares to obta guesses of β ad β. Goal From the may possble les through the scatter of pots, choose the oe le that s closest to the data. What do we mea by Close? We d lke the vertcal dstace betwee each observed Y ad ts correspodg ftted Y to be as small as possble. It s ot possble to choose β ad β so that t mmzes dy Y ad mmzes dvdually Y Y d ad mmzes dvdually d. Y Y So, stead, we choose β ad β that mmzes ther total c h e j Y Y = Y β + β X = =

11 BE54W Regresso ad Correlato Page of 43 A pcture gves a feel for the fact that may les are possble ad that we seek the closest the sese of vertcal dstaces beg as small as possble c h e j The expresso to be mmzed, Y Y = Y β + β X has a varety of ames: = = resdual sum of squares sum of squares about the regresso le sum of squares due error (SSE) σ YX

12 BE54W Regresso ad Correlato Page of 43 For the calculus lover, A lttle calculus yelds the soluto for the guesses β ad β c h e j Cosder SSE = Y Y = Y β + β X = = Step #: Dfferetate wth respect to β Set dervatve equal to ad solve. Step #: Dfferetate wth respect to β Set dervatve equal to, sert β ad solve. For the o-calculus lover, here are the estmates of β ad β β s the slope Estmate s deoted ˆβ or b β s the tercept Estmate s deoted ˆβ or b

13 BE54W Regresso ad Correlato Page 3 of 43 Some very helpful prelmary calculatos ( ) Sxx = X-X = X NX ( ) Syy = Y-Y = Y NY xy ( ) S = X-X (Y-Y) = XY NXY Note - These expressos make use of a specal otato called the summato otato. The captol S dcates summato. I S xy, the frst subscrpt x s sayg (x-x). The secod subscrpt y s sayg (y-y). S xy = ( ) X-X (Y-Y) S subscrpt x subscrpt y b gb g b g X X Y Y Slope c ov XY, = β = = X X v ar( X ) = a f Sxy ˆ β = S xx Itercept β = Y β X Predcto of Y Ŷ= ˆ β + ˆ β X =b + bx

14 BE54W Regresso ad Correlato Page 4 of 43 Do these estmates make sese? b gb g b g X X Y Y Slope c ov XY, = β = = X X v ar( X ) = a f The lear movemet Y wth lear movemet X s measured relatve to the varablty X. β = says: Wth a ut chage X, overall there s a 5-5 chace that Y creases versus decreases β says: Wth a ut crease X, Y creases also ( β > ) or Y decreases ( β < ). Itercept β = Y β X If the lear model s correct, or, f the true model does ot have a lear compoet, we obta β = ad β = Y as our best guess of a ukow Y

15 BE54W Regresso ad Correlato Page 5 of 43 Illustrato SAS Code. data temp; put wt age logwt; label wt="weght, Y" age="age, X" logwt="log(weght),y"; cards; ; ru; qut; proc reg data=temp smple; /* opto smple produces smple descrptves */ ttle "Regresso of Y=Weght o X=Age"; model wt=age; ru; qut; Partal lstg of output... Parameter Estmates Parameter Stadard Varable Label DF Estmate Error t Value Pr > t Itercept Itercept age Age, X Aotated Parameter Estmates Parameter Stadard Varable Label DF Estmate Error t Value Pr > t Itercept Itercept = tercept = β age Age, X.357 = slope = β The ftted le s therefore WT = * AGE

16 BE54W Regresso ad Correlato Page 6 of 43 Let s overlay the ftted le o our scatterplot. 3. Scatter Plot of WT vs AGE.4.8 WT AGE As we mght have guessed, the straght le model may ot be the best choce. The bowl shape of the scatter plot does have a lear compoet, however. Wthout the plot, we mght have beleved the straght le ft s okay.

17 BE54W Regresso ad Correlato Page 7 of 43 Let s try a straght le model ft to Y=LOGWT versus X=AGE. Partal lstg of output... Parameter Estmates Parameter Stadard Varable Label DF Estmate Error t Value Pr > t Itercept Itercept <. age Age, X <. Aotated Parameter Estmates Parameter Stadard Varable Label DF Estmate Error t Value Pr > t Itercept Itercept = tercept = β <. Age Age, X.9589 = slope = β <. Thus, the ftted le s LOGWT = *AGE Now the scatterplot wth the overlay of the ftted le looks much better. Further dscusso wll cosder Scatter Plot the model of LOGWT that relates vs AGEY=LOGWT to X=AGE LOGWT AGE

18 BE54W Regresso ad Correlato Page 8 of 43 Predcto of Weght from Heght Source: Dxo ad Massey (969) Now You Try Idvdual Heght (X) Weght (Y) It helps to do the prelmary calculatos X= X = 49,68 Y=4.667 Y = 46, X Y 9,38 = xx S = Syy = 5, Sxy =

19 BE54W Regresso ad Correlato Page 9 of 43 Slope ˆ S β = S xy xx ˆ β = = Itercept β = Y β X ˆ β (5.9)( = =

20 BE54W Regresso ad Correlato Page of The Aalyss of Varace Table I Topc, Summarzg Data, we leared that the umerator of the sample varace of the Y data s ( ) Y Y. I regresso settgs where Y s the outcome varable, ths = Y Y s apprecated as the total varace of the Y s. As we wll = same quatty ( ) see, other ames for ths are total sum of squares, total, corrected, ad SSY. (Note corrected has to do wth subtractg the mea before squarg.) A aalyss of varace table s all about parttog the total varace of the Y s (corrected) to two compoets:. Due resdual (the dvdual Y about the dvdual predcto Ŷ). Due regresso (the predcto Ŷ about the overall mea Y) Asde - Why are we terested such a partto? We d lke to kow f, wth the data, there exsts the suggesto of a lear relatoshp ( sgal ) that ca be dscered from chace varablty ( ose ) ) the leftover varablty of the observed Y about the predcted Ŷ ( ose ) ) the explaed varablty of the predcted Ŷ about the overall mea Y ( sgal ) Here s the partto (Note Look closely ad you ll see that both sdes are the same) ( Y ) ( ˆ ) ( ˆ Y = Y Y + Y Y) Some algebra (ot show) reveals a ce partto of the total varablty. ( Y Y) = ( Y Yˆ) + ( Yˆ Y) Total Sum of Squares = Due Error Sum of Squares + Due Model Sum of Squares

21 BE54W Regresso ad Correlato Page of 43 A closer look Total Sum of Squares = Due Model Sum of Squares + Due Error Sum of Squares b Y Y c Y Y Y Y c b g c h c h Y Y = Y Y + Y Y = = = g= devato of Y from Y that s to be explaed h = due model, sgal, systematc, due regresso h = due error, ose, or resdual I the world of regresso aalyses, we seek to expla the total varablty Y Y What happes whe β? What happes whe β =? b = g : A straght le relatoshp s helpful A straght le relatoshp s ot helpful Best guess s Y = β + β X Best guess s Y = β = Y Due model s LARGE because c h β β Y Y = ( + X Y) Due error s early the TOTAL because cy Y h= ey j b g β = Y Y = Y β X + β X Y = β b X X Due error has to be small g due(model) a f a f due error wll be large due model a f due error Due regresso has to be small wll be small

22 BE54W Regresso ad Correlato Page of 43 How to Partto the Total Varace. The total or total, corrected refers to the varablty of Y about Y b = Y Y g s called the total sum of squares Degrees of freedom = df = (-) Dvso of the total sum of squares by ts df yelds the total mea square. The resdual or due error refers to the varablty of Y about Y Y Y = c h s called the resdual sum of squares Degrees of freedom = df = (-) Dvso of the resdual sum of squares by ts df yelds the resdual mea square. 3. The regresso or due model refers to the varablty of Y about Y c h b g = = Y Y = β X X s called the regresso sum of squares Degrees of freedom = df = Dvso of the regresso sum of squares by ts df yelds the regresso mea square or model mea square. Ths s a example of a varace compoet. Source df Sum of Squares Mea Square Regresso SSR = Y d Y = SSR/ Error (-) SSE = dy Y = SSE/(-) Total, corrected (-) SST = Y Yg Ht Mea square = (Sum of squares)/(df) b =

23 BE54W Regresso ad Correlato Page 3 of 43 Be careful! The questo we may ask from a aalyss of varace table s a lmted oe. Does the ft of the straght le model expla a sgfcat porto of the varablty of the dvdual Y about Y? Is ths better tha usg Y aloe? We are NOT askg: Is the choce of the straght le model correct? Would aother fuctoal form be a better choce? We ll use a hypothess test approach ad the method of proof by cotradcto. We beg wth a ull hypothess that says β = ( o lear relatoshp ) Evaluato wll focus o the comparso of the due regresso mea square to the resdual mea square Recall that we reasoed the followg: If β The due(regresso)/due(resdual) wll be LARGE If β = The due(regresso)/due(resdual) wll be SMALL Our p-value calculato wll aswer the questo: If β = truly, what are the chaces of obtag a value of due(regresso)/due(resdual) as larger or larger tha that observed? To calculate chaces we eed a probablty model. So far, we have ot eeded oe.

24 BE54W Regresso ad Correlato Page 4 of Assumptos for a Straght Le Regresso Aalyss I performg least squares estmato, we dd ot use a probablty model. We were dog geometry. Hypothess testg requres some assumptos ad a probablty model. Assumptos The separate observatos Y, Y,, Y are depedet. The values of the predctor varable X are fxed ad measured wthout error. For each value of the predctor varable X=x, the dstrbuto of values of Y follows a ormal dstrbuto wth mea equal to µ Y X=x ad commo varace equal to σ Y x. The separate meas µ Y X=x le o a straght le; that s µ Y X=x = β + β X Schematcally, here s what the stuato looks lke (courtesy: Sta Lemeshow)

25 BE54W Regresso ad Correlato Page 5 of 43 Wth these assumptos, we ca assess the sgfcace of the varace explaed by the model. F msq(model) = wth df =, (-) msq(resdual) β = β Due model MSR has expected value σ Y X Due resdual MSE has expected value σ Y X Due model MSR has expected value σ Y X + β X X b = Due resdual MSE has expected value σ Y X g F = (MSR)/MSE wll be close to F = (MSR)/MSE wll be LARGER tha We obta the aalyss of varace table for the model of Y=LOGWT to X=AGE: The followg s SAS wth aotatos red. Aalyss of Varace Sum of Mea Source DF Squares Square F Value Pr > F = MSQ(Regresso)/MSQ(Resdual) Model <. Error Corrected Total 4.85 Root MSE.87 R-Square.9983 = SSQ(regresso)/SSQ(total) Depedet Mea Adj R-Sq.998 = R adjusted for ad # predctors Coeff Var

26 BE54W Regresso ad Correlato Page 6 of 43 Ths output correspods to the followg. Source Df Sum of Squares Mea Square Regresso SSR = Y d Y = 4.63 SSR/ = 4.63 = Error (-) = 9 SSE = dy Y =.75 SSE/(-) = 7/838E-4 = Total, corrected (-) = SST = Y Yg = b = Other formato ths output: R-SQUARED = Sum of squares regresso/sum of squares total s the proporto of the total that we have bee able to expla wth the ft of the straght le model. - Be careful! As predctors are added to the model, R-SQUARED ca oly crease. Evetually, we eed to adjust ths measure to take ths to accout. See ADJUSTED R-SQUARED. We also get a overall F test of the ull hypothess that the smple lear model does ot expla sgfcatly more varablty LOGWT tha the average LOGWT. F = MSQ (Regresso)/MSQ (Resdual) = 4.63/.7838 = wth df =, 9 Acheved sgfcace <.. Reject H O. Coclude that the ftted le s a sgfcat mprovemet over the average LOGWT.

27 BE54W Regresso ad Correlato Page 7 of Hypothess Testg Straght Le Model: Y = β + β X ) Overall F-Test ) Test of slope 3) Test of tercept ) Overall F-Test Research Questo: Does the ftted model, the Y expla sgfcatly more of the total varablty of the Y about Y tha does Y? Assumptos: As before. H O ad H A : H H O A : β = : β Test Statstc: F = msq( regreso) msq( resdual) df =,( ) Evaluato rule: Whe the ull hypothess s true, the value of F should be close to. Alteratvely, whe β, the value of F wll be LARGER tha. Thus, our p-value calculato aswers: What are the chaces of obtag our value of the F or oe that s larger f we beleve the ull hypothess that β =?

28 BE54W Regresso ad Correlato Page 8 of 43 Calculatos: For our data, we obta p-value = L NM pr F msq(mod el) pr F msq( resdual).. β = QP = <<, ( ),9 O Evaluate: Uder the ull hypothess that β =, the chaces of obtag a value of F that s so far away from the expected value of, wth a value of , s less tha chace,. Ths s a very small lkelhood! Iterpret: We have leared that, at least, the ftted straght le model does a much better job of explag the varablty LOGWT tha a model that allows oly for the average LOGWT. later (BE64, Itermedate Bostatstcs), we ll see that the aalyss does ot stop here

29 BE54W Regresso ad Correlato Page 9 of 43 ) Test of the Slope, β Some terestg otes! - The overall F test ad the test of the slope are equvalet. - The test of the slope uses a t-score approach to hypothess testg - It ca be show that { t-score for slope } = { overall F } Research Questo: Is the slope β =? Assumptos: As before. H O ad H A : H H O A : β = : β Test Statstc: To compute the t-score, we eed a estmate of the stadard error of β L O d β b g X X = SE = msq( resdual) NM QP

30 BE54W Regresso ad Correlato Page 3 of 43 Our t-score s therefore: t df score= L NM = ( ) a f a f a f O L QP = dafo NM dqp observed expected β se expected se β We ca fd ths formato our output: The followg s SAS wth aotatos red. Parameter Estmates Parameter Stadard Varable Label DF Estmate Error t Value Pr > t = Estmate/Error Itercept Itercept <. age Age, X =.9589/.68 <. Recall what we mea by a t-score: T=73.38 says the estmated slope s estmated to be stadard error uts away from ts expected value of zero. Check that { t-score } = { Overall F }: [ ] = whch s close. Evaluato rule: Whe the ull hypothess s true, the value of t should be close to zero. Alteratvely, whe β, the value of t wll be DIFFERENT from. Here, our p-value calculato aswers: What are the chaces of obtag our value of the t or oe that s more far away from f we beleve the ull hypothess that β =?

31 BE54W Regresso ad Correlato Page 3 of 43 Calculatos: For our data, we obta p-value = L NM O P Q pr t β pr t ( ) 9 se P =. <<. β d Evaluate: Uder the ull hypothess that β =, the chaces of obtag a t-score value that s or more stadard error uts away from the expected value of s less tha chace,. Iterpret: The ferece s the same as that for the overall F test. The ftted straght le model does a much better job of explag the varablty LOGWT tha the sample mea.

32 BE54W Regresso ad Correlato Page 3 of 43 3) Test of the Itercept, β Ths pertas to whether or ot the straght le relatoshp passes through the org. It s rarely of terest. Research Questo: Is the tercept β =? Assumptos: As before. H O ad H A : H H O A : β = : β Test Statstc: To compute the t-score for the tercept, we eed a estmate of the stadard error of β L NM SE d β = msq( resdual) + b = X X X O g QP

33 BE54W Regresso ad Correlato Page 33 of 43 Our t-score s therefore: t df score= L NM = ( ) a f a f a f O L QP = dafo NM dqp observed expected β se expected se β We ca fd ths formato our output: The followg s SAS wth aotatos red. Parameter Estmates Parameter Stadard Varable Label DF Estmate Error t Value Pr > t = Estmate/Error Itercept Itercept = /.364 <. age Age, X <. Ths t= says the estmated tercept s estmated to be stadard error uts away from ts expected value of zero. Evaluato rule: Whe the ull hypothess s true, the value of t should be close to zero. Alteratvely, whe β, the value of t wll be DIFFERENT from. Our p-value calculato aswers: What are the chaces of obtag our value of the t or oe that s more far away from f we beleve the ull hypothess that β =?

34 BE54W Regresso ad Correlato Page 34 of 43 Calculatos: p-value = ˆ pr β t( ) = pr[ t ] <<. seˆ ( ˆ β ) Evaluate: Uder the ull hypothess that β =, the chaces of obtag a t-score value that s or more stadard error uts away from the expected value of s less tha chace,. Iterpret: The ferece s that the straght le relatoshp betwee Y=LOGWT ad X=AGE does ot pass through the org.

35 BE54W Regresso ad Correlato Page 35 of Cofdece Iterval Estmato Straght Le Model: Y = β + β X Recall (Topc 6, Estmato) that there are 3 elemets of a cofdece terval: ) Best sgle guess (estmate) ) Stadard error of the best sgle guess (SE[estmate]) 3) Cofdece coeffcet These wll be percetles from the t dstrbuto wth df=(-) For a 95% cofdece terval, ths wll be a 97.5 th percetle For a (-α)% cofdece terval, ths wll be a (-α/) th percetle. The geerc form of a cofdece terval s the Geerc Form of Cofdece Iterval Straght Le Model: Y = β + β X Lower lmt = ( Estmate ) - ( cofdece coeffcet )*SE( estmate ) Upper lmt = ( Estmate ) + ( cofdece coeffcet )*SE( estmate ) We mght wat cofdece terval estmates of the followg 4 parameters: () Slope () Itercept (3) Mea of subset of populato for whom X=x (4) Idvdual respose for perso for whom X=x

36 BE54W Regresso ad Correlato Page 36 of 43 ) SLOPE estmate = β d se b = msq(resdual) b = X X g ) INTERCEPT estmate = β d se b L NM = msq(resdual) + b = X X X O g QP 3) MEAN at X=x estmate = Y = β + β x X= x L NM se = msq(resdual) + 4) INDIVIDUAL wth X=x estmate = Y = β + β x X= x bx b = L NM X se = msq(resdual) + + X X b b = g O g QP x X X X g O g QP

37 BE54W Regresso ad Correlato Page 37 of 43 Illustrato for the model whch fts Y=LOGWT to X=AGE. Recall that we obtaed the followg ft: Parameter Estmates Parameter Stadard Varable Label DF Estmate Error t Value Pr > t Itercept Itercept <. age Age, X <. 95% Cofdece Iterval for the Slope, β ) Best sgle guess (estmate) = ˆ β =.9589 ) Stadard error of the best sgle guess (SE[estmate]) = ( ) se ˆ β =.68 3) Cofdece coeffcet = 97.5 th percetle of Studet t = t df 95% Cofdece Iterval for Slope β = Estmate ± ( cofdece coeffcet )*SE =.9589 ± (.6)(.68) = (.898,.9). 975, = 9 = 6. 95% Cofdece Iterval for the Itercept, β ) Best sgle guess (estmate) = ˆ β =.6895 ) Stadard error of the best sgle guess (SE[estmate]) = ( ) se ˆ β =.364 3) Cofdece coeffcet = 97.5 th percetle of Studet t = t df 95% Cofdece Iterval for Slope β = Estmate ± ( cofdece coeffcet )*SE = ± (.6)(.364) = (-.7585,-.6). 975, = 9 = 6.

38 BE54W Regresso ad Correlato Page 38 of 43 Code. Cofdece Itervals for Predctos proc reg data=temp alpha=.5; /* alpha=.5 s type I error */ ttle "Regresso of Y=Weght o X=Age"; model wt=age/cl clm; /*cl for dvdual, clm for mea */ ru; qut; Output. Output Statstcs Depedet Predcted Std Error Obs Varable Value Mea Predct 95% CL Mea 95% CL Predct Resdual

39 BE54W Regresso ad Correlato Page 39 of 43 Defto of Correlato 7. Itroducto to Correlato A correlato coeffcet s a measure of the assocato betwee two pared radom varables (e.g. heght ad weght). The Pearso product momet correlato, partcular, s a measure of the stregth of the straght le relatoshp betwee the two radom varables. Aother correlato measure (ot dscussed here) s the Spearma correlato. It s a measure of the stregth of the mootoe creasg (or decreasg) relatoshp betwee the two radom varables. The Spearma correlato s a o-parametrc (meag model free) measure. It s troduced PubHlth 64, Itermedate Bostatstcs. Formula for the Pearso Product Momet Correlato ρ The populato parameter desgato s rho, wrtte as ρ The estmate of ρ, based o formato a sample s represeted usg r. Some prelmares: () Suppose we are terested the correlato betwee X ad Y () ˆ cov(x,y) = = (x x)(y y) (-) = S xy (-) Ths s the covarace(x,y) (3) (4) ˆ var(x) = ˆ var(y) = = = (x x) (-) (y y) (-) Sxx = (-) = S yy (-) ad smlarly

40 BE54W Regresso ad Correlato Page 4 of 43 Formula for Estmate of Pearso Product Momet Correlato from a Sample ˆ ρ = r = ˆ cov(x,y) var(x)var(y) ˆ ˆ = S xy S S xx yy If you absolutely have to do t by had, a equvalet (more calculator fredly formula) s ˆ ρ = r = = xy x y = = x y = = x y = = The correlato r ca take o values betwee ad oly Thus, the correlato coeffcet s sad to be dmesoless t s depedet of the uts of x or y. Sg of the correlato coeffcet (postve or egatve) = Sg of the estmated slope ˆβ.

41 BE54W Regresso ad Correlato Page 4 of 43 There s a relatoshp betwee the slope of the straght le, ˆβ, ad the estmated correlato r. Relatoshp betwee slope ˆβ ad the sample correlato r Because ˆ S xy β = ad Sxx r = S xy S S xx yy A lttle algebra reveals that r = S S xx yy ˆ β Thus, beware!!! It s possble to have a very large (postve or egatve) r mght accompayg a very o-zero slope, asmuch as - A very large r mght reflect a very large S xx, all other thgs equal - A very large r mght reflect a very small S yy, all other thgs equal.

42 BE54W Regresso ad Correlato Page 4 of Hypothess Test of Correlato The ull hypothess of zero correlato s equvalet to the ull hypothess of zero slope. Research Questo: Is the correlato ρ =? Is the slope β =? Assumptos: As before. H O ad H A : H H O A : ρ = : ρ Test Statstc: A lttle algebra (ot show) yelds a very ce formula for the t-score that we eed. r (-) t score= r df = ( ) We ca fd ths formato our output. Recall the frst example ad the model of Y=LOGWT to X=AGE: The Pearso Correlato, r, s the R-squared the output. Root MSE.87 R-Square.9983 Depedet Mea Adj R-Sq.998 Coeff Var Pearso Correlato, r =.9983 =.999

43 BE54W Regresso ad Correlato Page 43 of 43 Substtuto to the formula for the t-score yelds r (-) t score= = = = 7.69 r Note: The value.999 the umerator s r= R =.9983 =.999 Ths s very close to the value of the t-score that was obtaed for testg the ull hypothess of zero slope. The dscrepacy s probably roudg error. I dd the calculatos o my calculator usg 4 sgfcat dgts. SAS probably used more sgfcat dgts - cb.

Unit 9 Regression and Correlation

Unit 9 Regression and Correlation PubHlth 54 - Fall 4 Regresso ad Correlato Page of 44 Ut 9 Regresso ad Correlato Assume that a statstcal model such as a lear model s a good frst start oly - Gerald va Belle Is hgher blood pressure the

More information

STA302/1001-Fall 2008 Midterm Test October 21, 2008

STA302/1001-Fall 2008 Midterm Test October 21, 2008 STA3/-Fall 8 Mdterm Test October, 8 Last Name: Frst Name: Studet Number: Erolled (Crcle oe) STA3 STA INSTRUCTIONS Tme allowed: hour 45 mutes Ads allowed: A o-programmable calculator A table of values from

More information

Simple Linear Regression

Simple Linear Regression Statstcal Methods I (EST 75) Page 139 Smple Lear Regresso Smple regresso applcatos are used to ft a model descrbg a lear relatoshp betwee two varables. The aspects of least squares regresso ad correlato

More information

b. There appears to be a positive relationship between X and Y; that is, as X increases, so does Y.

b. There appears to be a positive relationship between X and Y; that is, as X increases, so does Y. .46. a. The frst varable (X) s the frst umber the par ad s plotted o the horzotal axs, whle the secod varable (Y) s the secod umber the par ad s plotted o the vertcal axs. The scatterplot s show the fgure

More information

Multiple Regression. More than 2 variables! Grade on Final. Multiple Regression 11/21/2012. Exam 2 Grades. Exam 2 Re-grades

Multiple Regression. More than 2 variables! Grade on Final. Multiple Regression 11/21/2012. Exam 2 Grades. Exam 2 Re-grades STAT 101 Dr. Kar Lock Morga 11/20/12 Exam 2 Grades Multple Regresso SECTIONS 9.2, 10.1, 10.2 Multple explaatory varables (10.1) Parttog varablty R 2, ANOVA (9.2) Codtos resdual plot (10.2) Trasformatos

More information

ENGI 3423 Simple Linear Regression Page 12-01

ENGI 3423 Simple Linear Regression Page 12-01 ENGI 343 mple Lear Regresso Page - mple Lear Regresso ometmes a expermet s set up where the expermeter has cotrol over the values of oe or more varables X ad measures the resultg values of aother varable

More information

Chapter 13 Student Lecture Notes 13-1

Chapter 13 Student Lecture Notes 13-1 Chapter 3 Studet Lecture Notes 3- Basc Busess Statstcs (9 th Edto) Chapter 3 Smple Lear Regresso 4 Pretce-Hall, Ic. Chap 3- Chapter Topcs Types of Regresso Models Determg the Smple Lear Regresso Equato

More information

12.2 Estimating Model parameters Assumptions: ox and y are related according to the simple linear regression model

12.2 Estimating Model parameters Assumptions: ox and y are related according to the simple linear regression model 1. Estmatg Model parameters Assumptos: ox ad y are related accordg to the smple lear regresso model (The lear regresso model s the model that says that x ad y are related a lear fasho, but the observed

More information

Statistics MINITAB - Lab 5

Statistics MINITAB - Lab 5 Statstcs 10010 MINITAB - Lab 5 PART I: The Correlato Coeffcet Qute ofte statstcs we are preseted wth data that suggests that a lear relatoshp exsts betwee two varables. For example the plot below s of

More information

Lecture Notes Types of economic variables

Lecture Notes Types of economic variables Lecture Notes 3 1. Types of ecoomc varables () Cotuous varable takes o a cotuum the sample space, such as all pots o a le or all real umbers Example: GDP, Polluto cocetrato, etc. () Dscrete varables fte

More information

Probability and. Lecture 13: and Correlation

Probability and. Lecture 13: and Correlation 933 Probablty ad Statstcs for Software ad Kowledge Egeers Lecture 3: Smple Lear Regresso ad Correlato Mocha Soptkamo, Ph.D. Outle The Smple Lear Regresso Model (.) Fttg the Regresso Le (.) The Aalyss of

More information

Lecture 7. Confidence Intervals and Hypothesis Tests in the Simple CLR Model

Lecture 7. Confidence Intervals and Hypothesis Tests in the Simple CLR Model Lecture 7. Cofdece Itervals ad Hypothess Tests the Smple CLR Model I lecture 6 we troduced the Classcal Lear Regresso (CLR) model that s the radom expermet of whch the data Y,,, K, are the outcomes. The

More information

Multiple Linear Regression Analysis

Multiple Linear Regression Analysis LINEA EGESSION ANALYSIS MODULE III Lecture - 4 Multple Lear egresso Aalyss Dr. Shalabh Departmet of Mathematcs ad Statstcs Ida Isttute of Techology Kapur Cofdece terval estmato The cofdece tervals multple

More information

Lecture 8: Linear Regression

Lecture 8: Linear Regression Lecture 8: Lear egresso May 4, GENOME 56, Sprg Goals Develop basc cocepts of lear regresso from a probablstc framework Estmatg parameters ad hypothess testg wth lear models Lear regresso Su I Lee, CSE

More information

Chapter Business Statistics: A First Course Fifth Edition. Learning Objectives. Correlation vs. Regression. In this chapter, you learn:

Chapter Business Statistics: A First Course Fifth Edition. Learning Objectives. Correlation vs. Regression. In this chapter, you learn: Chapter 3 3- Busess Statstcs: A Frst Course Ffth Edto Chapter 2 Correlato ad Smple Lear Regresso Busess Statstcs: A Frst Course, 5e 29 Pretce-Hall, Ic. Chap 2- Learg Objectves I ths chapter, you lear:

More information

Regresso What s a Model? 1. Ofte Descrbe Relatoshp betwee Varables 2. Types - Determstc Models (o radomess) - Probablstc Models (wth radomess) EPI 809/Sprg 2008 9 Determstc Models 1. Hypothesze

More information

Objectives of Multiple Regression

Objectives of Multiple Regression Obectves of Multple Regresso Establsh the lear equato that best predcts values of a depedet varable Y usg more tha oe eplaator varable from a large set of potetal predctors {,,... k }. Fd that subset of

More information

Ordinary Least Squares Regression. Simple Regression. Algebra and Assumptions.

Ordinary Least Squares Regression. Simple Regression. Algebra and Assumptions. Ordary Least Squares egresso. Smple egresso. Algebra ad Assumptos. I ths part of the course we are gog to study a techque for aalysg the lear relatoshp betwee two varables Y ad X. We have pars of observatos

More information

Linear Regression with One Regressor

Linear Regression with One Regressor Lear Regresso wth Oe Regressor AIM QA.7. Expla how regresso aalyss ecoometrcs measures the relatoshp betwee depedet ad depedet varables. A regresso aalyss has the goal of measurg how chages oe varable,

More information

Example: Multiple linear regression. Least squares regression. Repetition: Simple linear regression. Tron Anders Moger

Example: Multiple linear regression. Least squares regression. Repetition: Simple linear regression. Tron Anders Moger Example: Multple lear regresso 5000,00 4000,00 Tro Aders Moger 0.0.007 brthweght 3000,00 000,00 000,00 0,00 50,00 00,00 50,00 00,00 50,00 weght pouds Repetto: Smple lear regresso We defe a model Y = β0

More information

The equation is sometimes presented in form Y = a + b x. This is reasonable, but it s not the notation we use.

The equation is sometimes presented in form Y = a + b x. This is reasonable, but it s not the notation we use. INTRODUCTORY NOTE ON LINEAR REGREION We have data of the form (x y ) (x y ) (x y ) These wll most ofte be preseted to us as two colum of a spreadsheet As the topc develops we wll see both upper case ad

More information

ESS Line Fitting

ESS Line Fitting ESS 5 014 17. Le Fttg A very commo problem data aalyss s lookg for relatoshpetwee dfferet parameters ad fttg les or surfaces to data. The smplest example s fttg a straght le ad we wll dscuss that here

More information

residual. (Note that usually in descriptions of regression analysis, upper-case

residual. (Note that usually in descriptions of regression analysis, upper-case Regresso Aalyss Regresso aalyss fts or derves a model that descres the varato of a respose (or depedet ) varale as a fucto of oe or more predctor (or depedet ) varales. The geeral regresso model s oe of

More information

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS Exam: ECON430 Statstcs Date of exam: Frday, December 8, 07 Grades are gve: Jauary 4, 08 Tme for exam: 0900 am 00 oo The problem set covers 5 pages Resources allowed:

More information

Statistics: Unlocking the Power of Data Lock 5

Statistics: Unlocking the Power of Data Lock 5 STAT 0 Dr. Kar Lock Morga Exam 2 Grades: I- Class Multple Regresso SECTIONS 9.2, 0., 0.2 Multple explaatory varables (0.) Parttog varablty R 2, ANOVA (9.2) Codtos resdual plot (0.2) Exam 2 Re- grades Re-

More information

Simple Linear Regression

Simple Linear Regression Correlato ad Smple Lear Regresso Berl Che Departmet of Computer Scece & Iformato Egeerg Natoal Tawa Normal Uversty Referece:. W. Navd. Statstcs for Egeerg ad Scetsts. Chapter 7 (7.-7.3) & Teachg Materal

More information

Summary of the lecture in Biostatistics

Summary of the lecture in Biostatistics Summary of the lecture Bostatstcs Probablty Desty Fucto For a cotuos radom varable, a probablty desty fucto s a fucto such that: 0 dx a b) b a dx A probablty desty fucto provdes a smple descrpto of the

More information

Unit 2. Regression and Correlation

Unit 2. Regression and Correlation PubHlth 640 - Sprg 0. Regresso ad Correlato Page of 80 Ut. Regresso ad Correlato Do t let us quarrel, the Whte Quee sad a axous toe. What s the cause of lghtg? The cause of lghtg, Alce sad very decdedly,

More information

Mean is only appropriate for interval or ratio scales, not ordinal or nominal.

Mean is only appropriate for interval or ratio scales, not ordinal or nominal. Mea Same as ordary average Sum all the data values ad dvde by the sample sze. x = ( x + x +... + x Usg summato otato, we wrte ths as x = x = x = = ) x Mea s oly approprate for terval or rato scales, ot

More information

Chapter 8. Inferences about More Than Two Population Central Values

Chapter 8. Inferences about More Than Two Population Central Values Chapter 8. Ifereces about More Tha Two Populato Cetral Values Case tudy: Effect of Tmg of the Treatmet of Port-We tas wth Lasers ) To vestgate whether treatmet at a youg age would yeld better results tha

More information

CLASS NOTES. for. PBAF 528: Quantitative Methods II SPRING Instructor: Jean Swanson. Daniel J. Evans School of Public Affairs

CLASS NOTES. for. PBAF 528: Quantitative Methods II SPRING Instructor: Jean Swanson. Daniel J. Evans School of Public Affairs CLASS NOTES for PBAF 58: Quattatve Methods II SPRING 005 Istructor: Jea Swaso Dael J. Evas School of Publc Affars Uversty of Washgto Ackowledgemet: The structor wshes to thak Rachel Klet, Assstat Professor,

More information

Econometric Methods. Review of Estimation

Econometric Methods. Review of Estimation Ecoometrc Methods Revew of Estmato Estmatg the populato mea Radom samplg Pot ad terval estmators Lear estmators Ubased estmators Lear Ubased Estmators (LUEs) Effcecy (mmum varace) ad Best Lear Ubased Estmators

More information

Midterm Exam 1, section 1 (Solution) Thursday, February hour, 15 minutes

Midterm Exam 1, section 1 (Solution) Thursday, February hour, 15 minutes coometrcs, CON Sa Fracsco State Uversty Mchael Bar Sprg 5 Mdterm am, secto Soluto Thursday, February 6 hour, 5 mutes Name: Istructos. Ths s closed book, closed otes eam.. No calculators of ay kd are allowed..

More information

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE THE ROYAL STATISTICAL SOCIETY 00 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER I STATISTICAL THEORY The Socety provdes these solutos to assst caddates preparg for the examatos future years ad for the

More information

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. x, where. = y - ˆ " 1

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. x, where. = y - ˆ  1 STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS Recall Assumpto E(Y x) η 0 + η x (lear codtoal mea fucto) Data (x, y ), (x 2, y 2 ),, (x, y ) Least squares estmator ˆ E (Y x) ˆ " 0 + ˆ " x, where ˆ

More information

: At least two means differ SST

: At least two means differ SST Formula Card for Eam 3 STA33 ANOVA F-Test: Completely Radomzed Desg ( total umber of observatos, k = Number of treatmets,& T = total for treatmet ) Step : Epress the Clam Step : The ypotheses: :... 0 A

More information

Chapter 14 Logistic Regression Models

Chapter 14 Logistic Regression Models Chapter 4 Logstc Regresso Models I the lear regresso model X β + ε, there are two types of varables explaatory varables X, X,, X k ad study varable y These varables ca be measured o a cotuous scale as

More information

Lecture 3. Sampling, sampling distributions, and parameter estimation

Lecture 3. Sampling, sampling distributions, and parameter estimation Lecture 3 Samplg, samplg dstrbutos, ad parameter estmato Samplg Defto Populato s defed as the collecto of all the possble observatos of terest. The collecto of observatos we take from the populato s called

More information

Midterm Exam 1, section 2 (Solution) Thursday, February hour, 15 minutes

Midterm Exam 1, section 2 (Solution) Thursday, February hour, 15 minutes coometrcs, CON Sa Fracsco State Uverst Mchael Bar Sprg 5 Mdterm xam, secto Soluto Thursda, Februar 6 hour, 5 mutes Name: Istructos. Ths s closed book, closed otes exam.. No calculators of a kd are allowed..

More information

Simple Linear Regression - Scalar Form

Simple Linear Regression - Scalar Form Smple Lear Regresso - Scalar Form Q.. Model Y X,..., p..a. Derve the ormal equatos that mmze Q. p..b. Solve for the ordary least squares estmators, p..c. Derve E, V, E, V, COV, p..d. Derve the mea ad varace

More information

ECON 482 / WH Hong The Simple Regression Model 1. Definition of the Simple Regression Model

ECON 482 / WH Hong The Simple Regression Model 1. Definition of the Simple Regression Model ECON 48 / WH Hog The Smple Regresso Model. Defto of the Smple Regresso Model Smple Regresso Model Expla varable y terms of varable x y = β + β x+ u y : depedet varable, explaed varable, respose varable,

More information

Statistics. Correlational. Dr. Ayman Eldeib. Simple Linear Regression and Correlation. SBE 304: Linear Regression & Correlation 1/3/2018

Statistics. Correlational. Dr. Ayman Eldeib. Simple Linear Regression and Correlation. SBE 304: Linear Regression & Correlation 1/3/2018 /3/08 Sstems & Bomedcal Egeerg Departmet SBE 304: Bo-Statstcs Smple Lear Regresso ad Correlato Dr. Ama Eldeb Fall 07 Descrptve Orgasg, summarsg & descrbg data Statstcs Correlatoal Relatoshps Iferetal Geeralsg

More information

Applied Statistics and Probability for Engineers, 5 th edition February 23, b) y ˆ = (85) =

Applied Statistics and Probability for Engineers, 5 th edition February 23, b) y ˆ = (85) = Appled Statstcs ad Probablty for Egeers, 5 th edto February 3, y.8.7.6.5.4.3.. -5 5 5 x b) y ˆ.3999 +.46(85).6836 c) y ˆ.3999 +.46(9).744 d) ˆ.46-3 a) Regresso Aalyss: Ratg Pots versus Meters per Att The

More information

REVIEW OF SIMPLE LINEAR REGRESSION SIMPLE LINEAR REGRESSION

REVIEW OF SIMPLE LINEAR REGRESSION SIMPLE LINEAR REGRESSION REVIEW OF SIMPLE LINEAR REGRESSION SIMPLE LINEAR REGRESSION I lear regreo, we coder the frequecy dtrbuto of oe varable (Y) at each of everal level of a ecod varable (X). Y kow a the depedet varable. The

More information

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS Postpoed exam: ECON430 Statstcs Date of exam: Jauary 0, 0 Tme for exam: 09:00 a.m. :00 oo The problem set covers 5 pages Resources allowed: All wrtte ad prted

More information

Simple Linear Regression and Correlation.

Simple Linear Regression and Correlation. Smple Lear Regresso ad Correlato. Correspods to Chapter 0 Tamhae ad Dulop Sldes prepared b Elzabeth Newto (MIT) wth some sldes b Jacquele Telford (Johs Hopks Uverst) Smple lear regresso aalss estmates

More information

Lecture 1: Introduction to Regression

Lecture 1: Introduction to Regression Lecture : Itroducto to Regresso A Eample: Eplag State Homcde Rates What kds of varables mght we use to epla/predct state homcde rates? Let s cosder just oe predctor for ow: povert Igore omtted varables,

More information

STA 105-M BASIC STATISTICS (This is a multiple choice paper.)

STA 105-M BASIC STATISTICS (This is a multiple choice paper.) DCDM BUSINESS SCHOOL September Mock Eamatos STA 0-M BASIC STATISTICS (Ths s a multple choce paper.) Tme: hours 0 mutes INSTRUCTIONS TO CANDIDATES Do ot ope ths questo paper utl you have bee told to do

More information

Chapter Two. An Introduction to Regression ( )

Chapter Two. An Introduction to Regression ( ) ubject: A Itroducto to Regresso Frst tage Chapter Two A Itroducto to Regresso (018-019) 1 pg. ubject: A Itroducto to Regresso Frst tage A Itroducto to Regresso Regresso aalss s a statstcal tool for the

More information

CHAPTER VI Statistical Analysis of Experimental Data

CHAPTER VI Statistical Analysis of Experimental Data Chapter VI Statstcal Aalyss of Expermetal Data CHAPTER VI Statstcal Aalyss of Expermetal Data Measuremets do ot lead to a uque value. Ths s a result of the multtude of errors (maly radom errors) that ca

More information

Multiple Choice Test. Chapter Adequacy of Models for Regression

Multiple Choice Test. Chapter Adequacy of Models for Regression Multple Choce Test Chapter 06.0 Adequac of Models for Regresso. For a lear regresso model to be cosdered adequate, the percetage of scaled resduals that eed to be the rage [-,] s greater tha or equal to

More information

Functions of Random Variables

Functions of Random Variables Fuctos of Radom Varables Chapter Fve Fuctos of Radom Varables 5. Itroducto A geeral egeerg aalyss model s show Fg. 5.. The model output (respose) cotas the performaces of a system or product, such as weght,

More information

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #1

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #1 STA 08 Appled Lear Models: Regresso Aalyss Sprg 0 Soluto for Homework #. Let Y the dollar cost per year, X the umber of vsts per year. The the mathematcal relato betwee X ad Y s: Y 300 + X. Ths s a fuctoal

More information

Lecture 1 Review of Fundamental Statistical Concepts

Lecture 1 Review of Fundamental Statistical Concepts Lecture Revew of Fudametal Statstcal Cocepts Measures of Cetral Tedecy ad Dsperso A word about otato for ths class: Idvduals a populato are desgated, where the dex rages from to N, ad N s the total umber

More information

4. Standard Regression Model and Spatial Dependence Tests

4. Standard Regression Model and Spatial Dependence Tests 4. Stadard Regresso Model ad Spatal Depedece Tests Stadard regresso aalss fals the presece of spatal effects. I case of spatal depedeces ad/or spatal heterogeet a stadard regresso model wll be msspecfed.

More information

{ }{ ( )} (, ) = ( ) ( ) ( ) Chapter 14 Exercises in Sampling Theory. Exercise 1 (Simple random sampling): Solution:

{ }{ ( )} (, ) = ( ) ( ) ( ) Chapter 14 Exercises in Sampling Theory. Exercise 1 (Simple random sampling): Solution: Chapter 4 Exercses Samplg Theory Exercse (Smple radom samplg: Let there be two correlated radom varables X ad A sample of sze s draw from a populato by smple radom samplg wthout replacemet The observed

More information

Correlation and Simple Linear Regression

Correlation and Simple Linear Regression Correlato ad Smple Lear Regresso Berl Che Departmet of Computer Scece & Iformato Egeerg Natoal Tawa Normal Uverst Referece:. W. Navd. Statstcs for Egeerg ad Scetsts. Chapter 7 (7.-7.3) & Teachg Materal

More information

Investigation of Partially Conditional RP Model with Response Error. Ed Stanek

Investigation of Partially Conditional RP Model with Response Error. Ed Stanek Partally Codtoal Radom Permutato Model 7- vestgato of Partally Codtoal RP Model wth Respose Error TRODUCTO Ed Staek We explore the predctor that wll result a smple radom sample wth respose error whe a

More information

Chapter 2 Simple Linear Regression

Chapter 2 Simple Linear Regression Chapter Smple Lear Regresso. Itroducto ad Least Squares Estmates Regresso aalyss s a method for vestgatg the fuctoal relatoshp amog varables. I ths chapter we cosder problems volvg modelg the relatoshp

More information

1. The weight of six Golden Retrievers is 66, 61, 70, 67, 92 and 66 pounds. The weight of six Labrador Retrievers is 54, 60, 72, 78, 84 and 67.

1. The weight of six Golden Retrievers is 66, 61, 70, 67, 92 and 66 pounds. The weight of six Labrador Retrievers is 54, 60, 72, 78, 84 and 67. Ecoomcs 3 Itroducto to Ecoometrcs Sprg 004 Professor Dobk Name Studet ID Frst Mdterm Exam You must aswer all the questos. The exam s closed book ad closed otes. You may use your calculators but please

More information

ε. Therefore, the estimate

ε. Therefore, the estimate Suggested Aswers, Problem Set 3 ECON 333 Da Hugerma. Ths s ot a very good dea. We kow from the secod FOC problem b) that ( ) SSE / = y x x = ( ) Whch ca be reduced to read y x x = ε x = ( ) The OLS model

More information

Lecture 2: Linear Least Squares Regression

Lecture 2: Linear Least Squares Regression Lecture : Lear Least Squares Regresso Dave Armstrog UW Mlwaukee February 8, 016 Is the Relatoshp Lear? lbrary(car) data(davs) d 150) Davs$weght[d]

More information

Handout #8. X\Y f(x) 0 1/16 1/ / /16 3/ / /16 3/16 0 3/ /16 1/16 1/8 g(y) 1/16 1/4 3/8 1/4 1/16 1

Handout #8. X\Y f(x) 0 1/16 1/ / /16 3/ / /16 3/16 0 3/ /16 1/16 1/8 g(y) 1/16 1/4 3/8 1/4 1/16 1 Hadout #8 Ttle: Foudatos of Ecoometrcs Course: Eco 367 Fall/05 Istructor: Dr. I-Mg Chu Lear Regresso Model So far we have focused mostly o the study of a sgle radom varable, ts correspodg theoretcal dstrbuto,

More information

Lecture 1: Introduction to Regression

Lecture 1: Introduction to Regression Lecture : Itroducto to Regresso A Eample: Eplag State Homcde Rates What kds of varables mght we use to epla/predct state homcde rates? Let s cosder just oe predctor for ow: povert Igore omtted varables,

More information

Simple Linear Regression and Correlation. Applied Statistics and Probability for Engineers. Chapter 11 Simple Linear Regression and Correlation

Simple Linear Regression and Correlation. Applied Statistics and Probability for Engineers. Chapter 11 Simple Linear Regression and Correlation 4//6 Appled Statstcs ad Probablty for Egeers Sth Edto Douglas C. Motgomery George C. Ruger Chapter Smple Lear Regresso ad Correlato CHAPTER OUTLINE Smple Lear Regresso ad Correlato - Emprcal Models -8

More information

Logistic regression (continued)

Logistic regression (continued) STAT562 page 138 Logstc regresso (cotued) Suppose we ow cosder more complex models to descrbe the relatoshp betwee a categorcal respose varable (Y) that takes o two (2) possble outcomes ad a set of p explaatory

More information

Continuous Distributions

Continuous Distributions 7//3 Cotuous Dstrbutos Radom Varables of the Cotuous Type Desty Curve Percet Desty fucto, f (x) A smooth curve that ft the dstrbuto 3 4 5 6 7 8 9 Test scores Desty Curve Percet Probablty Desty Fucto, f

More information

Chapter 11 The Analysis of Variance

Chapter 11 The Analysis of Variance Chapter The Aalyss of Varace. Oe Factor Aalyss of Varace. Radomzed Bloc Desgs (ot for ths course) NIPRL . Oe Factor Aalyss of Varace.. Oe Factor Layouts (/4) Suppose that a expermeter s terested populatos

More information

Multivariate Transformation of Variables and Maximum Likelihood Estimation

Multivariate Transformation of Variables and Maximum Likelihood Estimation Marquette Uversty Multvarate Trasformato of Varables ad Maxmum Lkelhood Estmato Dael B. Rowe, Ph.D. Assocate Professor Departmet of Mathematcs, Statstcs, ad Computer Scece Copyrght 03 by Marquette Uversty

More information

Chapter 3 Sampling For Proportions and Percentages

Chapter 3 Sampling For Proportions and Percentages Chapter 3 Samplg For Proportos ad Percetages I may stuatos, the characterstc uder study o whch the observatos are collected are qualtatve ature For example, the resposes of customers may marketg surveys

More information

Chapter 13, Part A Analysis of Variance and Experimental Design. Introduction to Analysis of Variance. Introduction to Analysis of Variance

Chapter 13, Part A Analysis of Variance and Experimental Design. Introduction to Analysis of Variance. Introduction to Analysis of Variance Chapter, Part A Aalyss of Varace ad Epermetal Desg Itroducto to Aalyss of Varace Aalyss of Varace: Testg for the Equalty of Populato Meas Multple Comparso Procedures Itroducto to Aalyss of Varace Aalyss

More information

Module 7. Lecture 7: Statistical parameter estimation

Module 7. Lecture 7: Statistical parameter estimation Lecture 7: Statstcal parameter estmato Parameter Estmato Methods of Parameter Estmato 1) Method of Matchg Pots ) Method of Momets 3) Mamum Lkelhood method Populato Parameter Sample Parameter Ubased estmato

More information

STK4011 and STK9011 Autumn 2016

STK4011 and STK9011 Autumn 2016 STK4 ad STK9 Autum 6 Pot estmato Covers (most of the followg materal from chapter 7: Secto 7.: pages 3-3 Secto 7..: pages 3-33 Secto 7..: pages 35-3 Secto 7..3: pages 34-35 Secto 7.3.: pages 33-33 Secto

More information

"It is the mark of a truly intelligent person to be moved by statistics." George Bernard Shaw

It is the mark of a truly intelligent person to be moved by statistics. George Bernard Shaw Chapter 0 Chapter 0 Lear Regresso ad Correlato "It s the mark of a truly tellget perso to be moved by statstcs." George Berard Shaw Source: https://www.google.com.ph/search?q=house+ad+car+pctures&bw=366&bh=667&tbm

More information

ENGI 4421 Propagation of Error Page 8-01

ENGI 4421 Propagation of Error Page 8-01 ENGI 441 Propagato of Error Page 8-01 Propagato of Error [Navd Chapter 3; ot Devore] Ay realstc measuremet procedure cotas error. Ay calculatos based o that measuremet wll therefore also cota a error.

More information

Chapter Statistics Background of Regression Analysis

Chapter Statistics Background of Regression Analysis Chapter 06.0 Statstcs Backgroud of Regresso Aalyss After readg ths chapter, you should be able to:. revew the statstcs backgroud eeded for learg regresso, ad. kow a bref hstory of regresso. Revew of Statstcal

More information

Special Instructions / Useful Data

Special Instructions / Useful Data JAM 6 Set of all real umbers P A..d. B, p Posso Specal Istructos / Useful Data x,, :,,, x x Probablty of a evet A Idepedetly ad detcally dstrbuted Bomal dstrbuto wth parameters ad p Posso dstrbuto wth

More information

Simulation Output Analysis

Simulation Output Analysis Smulato Output Aalyss Summary Examples Parameter Estmato Sample Mea ad Varace Pot ad Iterval Estmato ermatg ad o-ermatg Smulato Mea Square Errors Example: Sgle Server Queueg System x(t) S 4 S 4 S 3 S 5

More information

Chapter 4 Multiple Random Variables

Chapter 4 Multiple Random Variables Revew for the prevous lecture: Theorems ad Examples: How to obta the pmf (pdf) of U = g (, Y) ad V = g (, Y) Chapter 4 Multple Radom Varables Chapter 44 Herarchcal Models ad Mxture Dstrbutos Examples:

More information

Chapter 5 Properties of a Random Sample

Chapter 5 Properties of a Random Sample Lecture 6 o BST 63: Statstcal Theory I Ku Zhag, /0/008 Revew for the prevous lecture Cocepts: t-dstrbuto, F-dstrbuto Theorems: Dstrbutos of sample mea ad sample varace, relatoshp betwee sample mea ad sample

More information

Homework Solution (#5)

Homework Solution (#5) Homework Soluto (# Chapter : #6,, 8(b, 3, 4, 44, 49, 3, 9 ad 7 Chapter. Smple Lear Regresso ad Correlato.6 (6 th edto 7, old edto Page 9 Rafall volume ( vs Ruoff volume ( : 9 8 7 6 4 3 : a. Yes, the scatter-plot

More information

Sum Mean n

Sum Mean n tatstcal Methods I (EXT 75) Page 147 ummary data Itermedate Calculatos X = 83 Y = 8 X = 51 Y = 368 Mea of X = X = 5.1875 Mea of Y = Y = 14.5 XY = 1348 = 16 Correcto factors ad Corrected values (ums of

More information

Reaction Time VS. Drug Percentage Subject Amount of Drug Times % Reaction Time in Seconds 1 Mary John Carl Sara William 5 4

Reaction Time VS. Drug Percentage Subject Amount of Drug Times % Reaction Time in Seconds 1 Mary John Carl Sara William 5 4 CHAPTER Smple Lear Regreo EXAMPLE A expermet volvg fve ubject coducted to determe the relatohp betwee the percetage of a certa drug the bloodtream ad the legth of tme t take the ubject to react to a tmulu.

More information

University of Belgrade. Faculty of Mathematics. Master thesis Regression and Correlation

University of Belgrade. Faculty of Mathematics. Master thesis Regression and Correlation Uversty of Belgrade Vrtual Lbrary of Faculty of Mathematcs - Uversty of Belgrade Faculty of Mathematcs Master thess Regresso ad Correlato The caddate Supervsor Karma Ibrahm Soufya Vesa Jevremovć Jue 014

More information

QA 622 AUTUMN QUARTER ACADEMIC YEAR

QA 622 AUTUMN QUARTER ACADEMIC YEAR QA 6 AUTUMN QUARTER 6-7 ACADEMIC YEAR Istructor: Offce: James J. Cochra 7A CAB Telephoe: (38) 57-3445 Hours: e-mal: URL: TR 8: a.m. - oo W 8: a.m. - : a.m. or by appotmet jcochra@cab.latech.edu http://www.cab.latech.edu/publc/fac-staff/

More information

ENGI 4421 Joint Probability Distributions Page Joint Probability Distributions [Navidi sections 2.5 and 2.6; Devore sections

ENGI 4421 Joint Probability Distributions Page Joint Probability Distributions [Navidi sections 2.5 and 2.6; Devore sections ENGI 441 Jot Probablty Dstrbutos Page 7-01 Jot Probablty Dstrbutos [Navd sectos.5 ad.6; Devore sectos 5.1-5.] The jot probablty mass fucto of two dscrete radom quattes, s, P ad p x y x y The margal probablty

More information

( ) = ( ) ( ) Chapter 13 Asymptotic Theory and Stochastic Regressors. Stochastic regressors model

( ) = ( ) ( ) Chapter 13 Asymptotic Theory and Stochastic Regressors. Stochastic regressors model Chapter 3 Asmptotc Theor ad Stochastc Regressors The ature of eplaator varable s assumed to be o-stochastc or fed repeated samples a regresso aalss Such a assumpto s approprate for those epermets whch

More information

UNIVERSITY OF EAST ANGLIA. Main Series UG Examination

UNIVERSITY OF EAST ANGLIA. Main Series UG Examination UNIVERSITY OF EAST ANGLIA School of Ecoomcs Ma Seres UG Examato 03-4 INTRODUCTORY MATHEMATICS AND STATISTICS FOR ECONOMISTS ECO-400Y Tme allowed: 3 hours Aswer ALL questos from both Sectos. Aswer EACH

More information

MEASURES OF DISPERSION

MEASURES OF DISPERSION MEASURES OF DISPERSION Measure of Cetral Tedecy: Measures of Cetral Tedecy ad Dsperso ) Mathematcal Average: a) Arthmetc mea (A.M.) b) Geometrc mea (G.M.) c) Harmoc mea (H.M.) ) Averages of Posto: a) Meda

More information

Example. Row Hydrogen Carbon

Example. Row Hydrogen Carbon SMAM 39 Least Squares Example. Heatg ad combusto aalyses were performed order to study the composto of moo rocks collected by Apollo 4 ad 5 crews. Recorded c ad c of the Mtab output are the determatos

More information

C. Statistics. X = n geometric the n th root of the product of numerical data ln X GM = or ln GM = X 2. X n X 1

C. Statistics. X = n geometric the n th root of the product of numerical data ln X GM = or ln GM = X 2. X n X 1 C. Statstcs a. Descrbe the stages the desg of a clcal tral, takg to accout the: research questos ad hypothess, lterature revew, statstcal advce, choce of study protocol, ethcal ssues, data collecto ad

More information

ECONOMETRIC THEORY. MODULE VIII Lecture - 26 Heteroskedasticity

ECONOMETRIC THEORY. MODULE VIII Lecture - 26 Heteroskedasticity ECONOMETRIC THEORY MODULE VIII Lecture - 6 Heteroskedastcty Dr. Shalabh Departmet of Mathematcs ad Statstcs Ida Isttute of Techology Kapur . Breusch Paga test Ths test ca be appled whe the replcated data

More information

Chapter 3 Multiple Linear Regression Model

Chapter 3 Multiple Linear Regression Model Chapter 3 Multple Lear Regresso Model We cosder the problem of regresso whe study varable depeds o more tha oe explaatory or depedet varables, called as multple lear regresso model. Ths model geeralzes

More information

Point Estimation: definition of estimators

Point Estimation: definition of estimators Pot Estmato: defto of estmators Pot estmator: ay fucto W (X,..., X ) of a data sample. The exercse of pot estmato s to use partcular fuctos of the data order to estmate certa ukow populato parameters.

More information

TESTS BASED ON MAXIMUM LIKELIHOOD

TESTS BASED ON MAXIMUM LIKELIHOOD ESE 5 Toy E. Smth. The Basc Example. TESTS BASED ON MAXIMUM LIKELIHOOD To llustrate the propertes of maxmum lkelhood estmates ad tests, we cosder the smplest possble case of estmatg the mea of the ormal

More information

Module 7: Probability and Statistics

Module 7: Probability and Statistics Lecture 4: Goodess of ft tests. Itroducto Module 7: Probablty ad Statstcs I the prevous two lectures, the cocepts, steps ad applcatos of Hypotheses testg were dscussed. Hypotheses testg may be used to

More information

Chapter 2 Supplemental Text Material

Chapter 2 Supplemental Text Material -. Models for the Data ad the t-test Chapter upplemetal Text Materal The model preseted the text, equato (-3) s more properl called a meas model. ce the mea s a locato parameter, ths tpe of model s also

More information

THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA

THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA THE ROYAL STATISTICAL SOCIETY EXAMINATIONS SOLUTIONS GRADUATE DIPLOMA PAPER II STATISTICAL THEORY & METHODS The Socety provdes these solutos to assst caddates preparg for the examatos future years ad for

More information

r y Simple Linear Regression How To Study Relation Between Two Quantitative Variables? Scatter Plot Pearson s Sample Correlation Correlation

r y Simple Linear Regression How To Study Relation Between Two Quantitative Variables? Scatter Plot Pearson s Sample Correlation Correlation Maatee Klled Correlato & Regreo How To Study Relato Betwee Two Quattatve Varable? Smple Lear Regreo 6.11 A Smple Regreo Problem 1 I there relato betwee umber of power boat the area ad umber of maatee klled?

More information

Lecture 3 Probability review (cont d)

Lecture 3 Probability review (cont d) STATS 00: Itroducto to Statstcal Iferece Autum 06 Lecture 3 Probablty revew (cot d) 3. Jot dstrbutos If radom varables X,..., X k are depedet, the ther dstrbuto may be specfed by specfyg the dvdual dstrbuto

More information