Simple Linear Regression. Copyright 2012 Pearson Education, Inc. Publishing as Prentice Hall. Chapter Chapter. β0 β1. β β = 1. a. b.

Size: px
Start display at page:

Download "Simple Linear Regression. Copyright 2012 Pearson Education, Inc. Publishing as Prentice Hall. Chapter Chapter. β0 β1. β β = 1. a. b."

Transcription

1 Chapter 3 3- Simple Liear Regressio Chapter 3 3. a. b. c. d. 3.3 a. Usig the techique explaied i Exercise 3.: b. () = β + β β β 6 = + () 4 () = β + β β β 6 = + () β = β y= + x = β = 4 y= 4+ x β = c. = β + β() β = 6 = β + β( ) β = 4 y= + 4x d. 4 = β + β() 7 β β(3) = + β = 4 y= 4 x β = 3.5 Slope (β ) y-itercept (β ) a. 3 b. c. 3 d. 5 e. 4 Copyright Pearso Educatio, Ic. Publishig as Pretice all.

2 3- Simple Liear Regressio 3.7 a. To compute ˆ β ad ˆ β, we first costruct the followig table: x y xy x y x = y = xy = x = y = 36 The, ( x) () SSxx = x = = 5 ( x)( y) () SSxy = xy = = 5 ( y) ( ) SS yy = y = 36 = 6 5 y x y = = = x = = = 5 5 Thus, the least squares estimates of β ad β are: ˆ SSxy = = =. SSxx ˆ = y ˆ βx = (.)() = β β ad the equatio of the least squares predictio lie is yˆ =. x. b. 3.9 a. Yes, a positive wide scattered bad sice the year 4. b. Positive slope which implies that the older composers had a tedecy to produce a certai pitch more times. c. The lie is based o the sample data. Copyright Pearso Educatio, Ic. Publishig as Pretice all.

3 Chapter a. No, there does ot appear to ay tred for cooperatio use versus the average payoff. b. No, there does ot appear to ay tred for defective use versus the average payoff. c. Yes, there appears to be somewhat of a liear relatioship for average payoff ad puishmet use. d. Negative relatioship; the more puishmet use, the average payoff decreases. e. Yes, wiers ted to puish less tha o-wiers. 3.3 a. Some prelimiary calculatios are: xi i= xi i= = 667 = 645 yi i= xy i i i= = 35.8 = 4 = x i y i i= i= ( 667)( 35.8) = = = SSxy xi yi i= SSxx b xi i= x i i = ( 667) = = 645 = SS ˆ xy = = = SS xx ˆ ˆ (.3769 b = y b x = ) = The least squares lie is yˆ = x. b. b ˆ = 6.5 Sice x = is ot i the observed rage, b has o iterpretatio other tha beig the y-itercept. b ˆ =.3. For each additioal icrease of part per millio of pecti, the mea sweetess idex is estimated to decrease by.3. c. y ˆ = 6.5.3( 3) = Copyright Pearso Educatio, Ic. Publishig as Pretice all.

4 3-4 Simple Liear Regressio 3.5 Some prelimiary calculatios are: x 3.7 y = = = y 79 x = = = x y 79(3.7) SSxy = xy = = ( x) 79 SSxx = x = 5, = ˆ SSxy = = = SSxx 756 β ˆ ˆ o = y βx = (.64957) = β The estimated regressio lie is yˆ = x; sice x = is osesical, o practical iterpretatio of ˆ β =.574; for each oe-positio icrease i order, estimated recall proportio icreases by ˆ β = a. SSE.9 s =.33 = 9 = b. s =.33 =.769 = =. =. of their 3.9 We expect that 95% of the actual etropy values (y) will fall withi d s ( ) predicted values; i.e. y ±.. 3. a. y= β+ βx+ ε b. yˆ = x c. Assumptio : The mea of the probability distributio of ε is. Assumptio : The variace of the probability distributio of ε is costat for all settigs of the idepedet variable x. Assumptio 3: The probability distributio of ε if ormal. Assumptio 4: The errors associated with ay two differet observatios are idepedet. d. s = 635. yˆ± s yˆ± 635. yˆ± 7.4. e. ( ) Copyright Pearso Educatio, Ic. Publishig as Pretice all.

5 Chapter a. : β = a : β b Test statistic: t = ˆ =.857 = 6.7 s SS xx Rejectio regio: α =.5, df = = 4, t.5 =.776 Reject if t <.776 or t >.776 Coclusio: Reject. There is sufficiet evidece (at α =.5 ) to idicate that x cotributes iformatio for the predictio of y usig a liear model. b. : β= a : β b Test statistic: ˆ t = =. = 5. s SSxx.733 Rejectio regio: α =.5, df = = 3, t.5 = 3.8 Reject if t < 3.8 or t > 3.8 Coclusio: Reject. There is sufficiet evidece (at α =.5 ) to idicate that x provides iformatio for predictig y usig a liear model. 3.5 Some prelimiary calculatios are: y i i = ( 35.8) SS yy = yi = = i= 4 SSE = SS b ˆ SS = = yy xy ( )( ) SSE s = = =.4639 s bˆ s.4639 = = =.96 SS xx For cofidece level.9, α =. ad α =. =.5. From Table, Appedix D with df = = 4 =, t =.77. The cofidece iterval is:.5 bˆ.5 b ˆ ± t s.3 ± ,.8 ( ) ( ) Copyright Pearso Educatio, Ic. Publishig as Pretice all.

6 3-6 Simple Liear Regressio We are 9% cofidet that the chage i the mea sweetess idex for each oe uit chage i the pecti is betwee.39 ad a. To determie if body plus head rotatio ad active head movemet are positively liearly related, we test: β : = a : β > The test statistic is bˆ.88 t = = = 6.9 s ˆ.4 b The rejectio regio requires α =.5 i each tail of the t distributio with df = = 39 = 37. From Table, Appedix D, t The rejectio regio is t >.687. Sice the observed value of the test statistic falls i the rejectio regio ( t = 6.86 >.687, is rejected. There is sufficiet evidece to idicate that the two variables are positively liearly related at α =.5. b. For cofidece level.9, α =. ad α =. =.5. From Table, Appedix D, with df = = 39 = 37, t The cofidece iterval is: b ˆ ± t.5sˆ.88 ±.687 (.4 ).88 ±.4 (.64,. ) b We are 9% cofidet that the true value of β is betwee.64 ad.. c. Because the iterval i part b cotais the value, there is o evidece that the true slope of the lie differs from. 3.9 Usig the calculatios from Exercise 3.5 ad these calculatios: ( ) y 3.7 SSyy = y = = ( xy ) ( )( ) SSE = SS ˆ yy β SS = = SSE s = = = s= s = = To determie if there is a liear tred betwee the proportio of ames recalled ad positio, we test: : β = a : β The test statistic is bˆ bˆ.64 t = = = =.86 s bˆ s SSxx Copyright Pearso Educatio, Ic. Publishig as Pretice all.

7 Chapter The rejectio regio requires α =. =.5 i each tail of the t distributio. From Table, Appedix D, with df = = 44 = 4, t The rejectio regio is t <.576 or t >.576. Sice the observed test statistic falls i the rejectio regio ( t =.86 >.576 ), is rejected. There is sufficiet evidece to idicate the proportio of ames recalled is liearly related to positio at α = The MINITAB pritout below shows that there is a sigificat liear relatioship betwee empathic cocer ad brai activity sice the p-value for empathy of.9 / =.45 is less tha a sigificace level of.5. Thus, the researchers were correct i their theory that β >. The regressio equatio is ACTIVITY = EMPATY Predictor Coef SE Coef T P Costat EMPATY S =.684 R-Sq = 39.4% R-Sq(adj) = 35.% 3.33 a. b ˆ =.55 b ˆ =. b. Yes, there is sufficiet evidece to show that there is a positive liear relatioship betwee elevatio ad sluggig percetage sice the p-value for the elevatio is.8 / =.4 which is smaller tha the sigificace level of.. c..65 Scatterplot of SLUGPCT vs ELEVATION SLUGPCT ELEVATION Dever s elevatio is very high. d. b ˆ =.554 b ˆ =. There is sufficiet evidece to show that there is ot a positive liear relatioship betwee elevatio ad sluggig percetage sice the p-value for the elevatio is.33 / =.66 which is larger tha the sigificace level of.. The thi air theory is false. Copyright Pearso Educatio, Ic. Publishig as Pretice all.

8 3-8 Simple Liear Regressio 3.35 a. From Exercise 3.6, SS xx = 7.5, SS yy = 4 ad SS xy = 5 SSxy 5 r = = =.9583 SS SS xx yy ( )( ) From Exercise 3.8, SSE =.435 SSyy SSE r = = =.983. SS 4 yy There is a strog positive correlatio betwee x ad y. We ca explai 9.83% of the variatio i the sample y s usig the liear model with x. b. I Exercise 3.7, SS xx =, SS yy = 6 ad SS xy = SSxy r = = = SSxxSS yy ( 6) = ˆ = 6. =.6. I Exercise 3.7, SSE SS β SS ( )( ) SSyy SSE 6.6 r = = =.9. SS 6 yy yy xy There is a strog positive liear correlatio betwee x ad y. We ca explai 9% of the variatio i the sample y s usig the liear model with x We would expect the GPA of a college studet to be correlated to his/her I.Q. As the I.Q. score icreases, we would expect the GPA to icrease. Thus, the correlatio would be positive a. y= β+ βx+ ε b. Moderate positive liear relatioship betwee RMP ad SET ratigs. c. Positive slope sice the correlatio is positive. r d. Sice r =.68, = 46 the test statistic t value is t = = = 9.97 r.68 So, the correspodig P-value is <. <.5 ad has to be rejected. e. ( ) r =.68 = % of the sum of squares of deviatio of the sample SET ratigs about their mea ca be explaied by the RMP ratigs as a liear predictor. Copyright Pearso Educatio, Ic. Publishig as Pretice all.

9 Chapter a. Piao: r =.447 Because this value if ear.5, there is s slight positive liear relatioship betwee recogitio exposure time ad goodess of view for piao. Bech: r =.57 Because this value is extremely close to, there is a extremely weak egative liear relatioship betwee recogitio exposure time ad goodess of view for bech. Motorbike: r =.69 Because this value is ear.5, there is a moderate positive liear relatioship betwee recogitio exposure time ad goodess of view for motorbike. Armchair: r =.94 Because this value is fairly close to, there is a weak positive liear relatioship betwee recogitio exposure time ad goodess of view for armchair. Teapot: r =.949 Because this value is very close to, there is a strog positive liear relatioship betwee recogitio exposure time ad goodess of view for teapot. b. Piao: ( ) r =.447 = % of the total sample variability aroud the sample mea recogitio exposure time is explaied by the liear relatioship betwee the recogitio exposure time ad the goodess of view for piao. Bech: ( ) r =.57 =.3.3% of the total sample variability aroud the sample mea recogitio exposure time is explaied by the liear relatioship betwee the recogitio exposure time ad the goodess of view for bech. Motorbike: ( ) r =.69 = % of the total sample variability aroud the sample mea recogitio exposure time is explaied by the liear relatioship betwee the recogitio exposure time ad the goodess of view for motorbike. Armchair: ( ) r =.94 = % of the total sample variability aroud the sample mea recogitio exposure time is explaied by the liear relatioship betwee the recogitio exposure time ad the goodess of view for armchair. Teapot: ( ) r =.949 = % of the total sample variability aroud the sample mea recogitio exposure time is explaied by the liear relatioship betwee the recogitio exposure time ad the goodess of view for teapot. Copyright Pearso Educatio, Ic. Publishig as Pretice all.

10 3- Simple Liear Regressio c. The test is: : β= a : β Followig are the values of α ad t α that correspod to df = = 5 = 3. α t α/ Piao: t =.4.69 <.4 <.5, p.5 For levels of sigificace greater tha α =.5, ca be rejected. There is sufficiet evidece to idicate that there is a liear relatioship betwee goodess of view ad recogitio exposure time for piao for α >.5. Bech: t =.7.7 <.39, p >. is ot rejected. There is isufficiet evidece to idicate that there is a liear relatioship betwee goodess of view ad recogitio exposure time for bech for α >.. Motorbike: t = > 3.767, p <. ca be rejected for α.. There is sufficiet evidece to idicate that there is a liear relatioship betwee goodess of view ad recogitio exposure time for motorbike for α.. Armchair: t = <.47 <.77, p.5 caot be rejected for levels of sigificace α <.5. There is isufficiet evidece to idicate that there is a liear relatioship betwee goodess of view ad recogitio exposure time for armchair for α <.5. Teapot: t = > 3.767, p <. ca be rejected for α.. There is sufficiet evidece to idicate that there is a liear relatioship betwee goodess of view ad recogitio exposure time for teapot for α a. Sice the p-value of.7 is smaller tha a sigificace level of.5, we ca coclude that there is o sigificat relatioship betwee baselie ad follow-up physical activity for obese youg adults; fail to reject : ρ = at.5. Copyright Pearso Educatio, Ic. Publishig as Pretice all.

11 Chapter 3 3- b Scatterplot of Baselie vs Follow-up 65. Baselie Follow-up 6 7 c. ( ) r =.5 =.5, thus 5% of the variability aroud the sample mea for the total of follow-up umber of movemets is explaied by the baselie total umber of movemets for the obese adults. d. Sice the correlatio value itself is close to zero ad the p-value of.66 is smaller tha.95 correspodig to a sigificace level of.5 we ca coclude that there is o sigificat relatioship betwee baselie ad follow-up physical activity for ormal weight youg adults; fail to reject : ρ = at α =.5. e. Scatterplot of baselie vs follow-up 5 4 baselie follow-up 5 6 f. ( ) r =. =.44, thus.44% of the variability aroud the sample mea for the total of follow-up umber of movemets is explaied by the baselie total umber of movemets for the ormal weight youg adults i a simple liear regressio model SSxy 48.5 r = = =.57 SS SS (8.75) xx yy Because r is moderately small, there is a rather weak positive liear relatioship betwee blood lactate cocetratio ad perceived recovery. ( ) r =.57 = % of the sample variace of blood lactate cocetratio aroud the sample mea is explaied by the liear relatioship betwee blood lactate cocetratio ad perceived recovery. Copyright Pearso Educatio, Ic. Publishig as Pretice all.

12 3- Simple Liear Regressio 3.47 a. This relatioship will have a egative correlatio sice the researchers claim a iverse relatioship. r b. Solvig t = r for r usig the smallest value of t that leads to a statistically sigificat t result gives: r =. So if t =.645 leads to a rejectio of t + : r =, the (.645) r = =.8 ad thus r =.736 =.895 sice r is egative. (.645) SS a. ˆ xy b = = = SSxx 4.77 SSE = SS b ˆ SS = = 4.6 yy xy SSE 4.55 s =.3. = = ( ) b. For x yˆ ( ) =.5, = =.6 The form of the 95% cofidece iterval is yˆ ± t s + α ( x x) SS xx For cofidece coefficiet.95, α =.5 ad α =.5 =.5. From Table, Appedix D, with df = = = 8, t.5 =.. The 95% cofidece iterval is: (.5.5).6 ± ± , ( ) We are 95% cofidet the mea value of y whe x =.5 is betwee.377 ad.83. c. For x yˆ ( ) =., = = 8.9. The 95% cofidece iterval is: (.5.5) 8.9 ± ± , ( ) We are 95% cofidet the mea value of y whe x =.5 is betwee 8.58 ad 9.9. d. For x yˆ ( ) = 3., = =.3. The 95% cofidece iterval is: ( 3..5).3 ± ±.39.98, ( ) Copyright Pearso Educatio, Ic. Publishig as Pretice all.

13 Chapter We are 95% cofidet the mea value of y whe x =.5 is betwee.98 ad.69. e. The width of the iterval i (b) is =.446 The width of the iterval i (c) is =.638 The width of the iterval i (d) is =.638 As the value of x moves away from x =.5, the cofidece iterval gets wider. f. The 95% predictio iterval is yˆ tα /s ± + + ( x x) SS xx ( 3..5).3± ±.46 (.54,3.346 ) a. Predictio iterval for y with x = b. If x x, p ( xp x) yˆ ± tα /s + + SS = = the the formula i part a reduces to y ( t ) iterval for E ( y ) with x = For x =.95% the cofidece iterval for E( y ) is ( 5.65,5.84 ). xx s ˆ ± α /, which is the cofidece 3.55 a. x =.87, SS xx = s =.8573, ad yˆ = 5..4 x. For x= 5, yˆ = 5..4( 5) = 3.5. For cofidece coefficiet.9, α =. ad α =. =.5. From Table, Appedix D, with df = = 3 =, t.5 =.7. The 9% cofidece iterval is: ( xp x) ( ) ( 5.87) yˆ ± tα /s + 3.5±.7 (.8573) + SSxx ± ,3.85. We are 9% cofidet that the mea mass of all spills with a elapsed time of 5 miutes is betwee 3.7 ad b. For cofidece coefficiet.9, α =. ad α =. =.5. From Table, Appedix D, with df = = 3 =, t.5 =.7. The 9% cofidece iterval is: ( xp x) ( 5.87) yˆ tα /s (.8573) SSxx ± + + ± + + ( ) 3.5±.54., 5.. We are 9% cofidet that the mass of a sigle spill with a elapsed time of 5 miutes is betwee. ad 5.. Copyright Pearso Educatio, Ic. Publishig as Pretice all.

14 3-4 Simple Liear Regressio 3.57 The MINTAB results are preseted below: The regressio equatio is SDArea = NumberSD Predictor Coef SE Coef T P Costat NumberSD S = R-Sq = 38.7% R-Sq(adj) = 37.4% Aalysis of Variace Source DF SS MS F P Regressio Residual Error Total Uusual Observatios Obs NumberSD SDArea Fit SE Fit Residual St Resid R X X X R deotes a observatio with a large stadardized residual. X deotes a observatio whose X value gives it large leverage. Predicted Values for New Observatios New Obs Fit SE Fit 95% CI 95% PI (.7, 459.9) (-53.7, 535.3) Values of Predictors for New Observatios New Obs NumberSD 35 Based o the pritout, we are 95% cofidet that the total area of structurally deficiet bridges i a state is betwee (egative umber of bridges does ot make sese) to bridges; (,535.3 ) A scatterplot is give below to check visually for a liear relatioship: 7 Scatterplot of EATRATE vs RPM EATRA TE RPM The MINITAB pritout idicates that the measure of RPM is a sigificat liear predictor of the heat rate ad ca reject : β = ; t=.69 sice the p-value =. is less tha ay sigificace level. Thus the liear equatio is yˆ = x where 7.% of the sample variace i the heat rate is explaied by the RPM. s = 734; a 95% PI for y whe..33,3.58 x = is ( ) Copyright Pearso Educatio, Ic. Publishig as Pretice all.

15 Chapter The regressio equatio is EATRATE = RPM Predictor Coef SE Coef T P Costat RPM S = 86.7 R-Sq = 7.% R-Sq(adj) = 7.8% 3.6 A scatterplot of the umber of hours worked per week ad work/life balace scale score for MBA alumi is preseted below: 8 Scatterplot of WLB-SCORE vs OURS 7 6 WLB-SCORE OURS 8 As you ca see, the data is wildly spread aroud the straight-lie. A MINITAB aalysis was coducted ad the results are as follows: The regressio equatio is WLB-SCORE = OURS Predictor Coef SE Coef T P Costat OURS S =.845 R-Sq = 7.% R-Sq(adj) = 7.% Notice that whe testig : β =, p value=, thus we reject the ull hypothesis ad coclude that the umber of hours worked is sigificat as a useful predictor of the work/life balace score ad results i a regressio equatio of y= x. Also otice that oly 7%, R =.7 of the variability aroud the sample mea for work/life score. There could be other factors that could explai the variability or more data could be take. s = 4.6; ad a 95% PI for y whe x = 5 is (.,69.3 ). Thus, eve though the regressio lie is sigificat it should be used cautiously due to the variability of the data a. The results of the prelimiary calculatios are provided below: = 5, x = 3, xy = 78, y = 589 Substitutig ito the formula for ˆ β, we have ˆ xy 78 β = = = x 3 squares lie is yˆ = x. ad the least Copyright Pearso Educatio, Ic. Publishig as Pretice all.

16 3-6 Simple Liear Regressio = ˆ = =.8667 b. SSE y β xy ( )( ) SSE.8667 s = 3.67 = 5 = s= s = 3.67 =.7935 c. : β= a : β < ˆ β Test statistic: t = = = 8.3 s/ x.7935 / 3 Rejectio regio: α =.5, df = = 4, t.5 =.3 Reject if t <.3 Coclusio: Reject. Yes, there is sufficiet evidece at α =.5 to idicate that x ad y are egatively liearly related. d. A 95% cofidece iterval for β is ˆ β ± t s/ x.5 where t.5 =.776 is based o = 4 df. Substitutig, we have: ± ±.99 (.76, ). 3 =, = ˆ = 9.67 = e. Whe x yˆ β x ( )( ) p x p A 95% cofidece iterval for E( y ) is yˆ ± t.5s x 9.67 ±.776(.7935) 9.67 ±.99 (.76, ). 3 f. From part (e), we predict y ˆ = 9.67 whe x p =. A 95% predictio iterval for y is x p yˆ ± t.5s + x ( ) ( ) 9.67 ± ± , Copyright Pearso Educatio, Ic. Publishig as Pretice all.

17 Chapter a. Prelimiary calculatios yield: = 8, x = 59.75, xy = 3.5, y =738 The, ˆ xy 3.5 β = = = 5.364, ad the least squares lie is yˆ = x. x b. : β = a : β ˆ β Test statistic: t = s/ x where SSE y ˆ βxy 738 (5.364)(3.5) s= s = = = =.64 7 Substitutig, we have t = = / Rejectio regio: α =., df = = 7, tα / = t.5 =.895 Reject if t >.895 or t <.895 Coclusio: Reject at α =.. Yes, there is sufficiet evidece to idicate that the straight-lie model through the origi is useful for predictig decrease i pulse rate y. c. We wat to predict the decrease i pulse rate y correspodig to a drug dosage of x p = 3.5 cubic cetimeters. First, we obtai the poit estimate: yˆ = ˆ β x= = 8.77 ( ) A 99% predictio iterval for y is = 7 df. xp.5 yˆ ± t s + x where t.5 = is based o Substitutio yields: (3.5) ( ) ( ) 8.77 ± ± , Therefore, we predict the decrease i pulse rate correspodig to a dosage of 3.5 c.c. to fall betwee.47 ad 5.7 beats/miute with 99% cofidece. Copyright Pearso Educatio, Ic. Publishig as Pretice all.

18 3-8 Simple Liear Regressio 3.67 a. Prelimiary calculatios yield: =, x =93354, xy = , y = The, ˆ xy β = = = 5.8, x y ˆ = 5.8x. ad the least squares predictio equatio is b. : β = a : β ˆ β Test statistic: t = s/ x where ( y ˆ βxy) SSE s= s = = ( ) = = Substitutig, we have 5.8 t = = / Rejectio regio: α =., df =, 9, t.5 = 3.5 Reject if t < 3.5 or t > 3.5 Coclusio: Reject. Yes, there is sufficiet evidece at α =. to idicate x, populatio i service area, cotributes iformatio for the predictio of y, residetial electric customers i service area. c. We eed the followig additioal iformatio: x = 486, y=97, SS = , SS = , SS 3898 xx xy yy = ˆ β = 47.7, ˆ β = , SSE = s = , s = The least squares predictio equatio is yˆ = x. : β = a : β ˆ β Test statistic: 47.7 t = = = s / SS / xx Copyright Pearso Educatio, Ic. Publishig as Pretice all.

19 Chapter Rejectio regio: α =., df = = 8, t.5 = Reject if t < or t > Coclusio: Reject. Yes, there is sufficiet evidece at α =. to idicate that x cotributes iformatio for the predictio of y. d. Without ruig a formal test, we ca compare the two models. The value of s for the model y= βx+ ε is while the value of s for the model y= β+ βx+ ε is Sice the value of s is much smaller for the secod model, it appears that the secod model should be used. For a formal test, refer to part (d) of Exercise 3.6. : β = : β a Test statistic: ˆ β t = = = x 48.6 s SSxx Rejectio regio: α =., df = = 8, t.5 = Reject if t < or t > Coclusio: Reject. There is sufficiet evidece at α =. to idicate that β should be icluded i the model a. The straight-lie model is y= β+ β x+ ε. Based o the theory, we would expect the metal level to decrease as the distace icreases. Thus, the slope should be egative. b. No. As the distace from the plat icreases, the arseic cocetratio of does ot seem to be clearly supported by the give scatterplot. 3.7 Usig MINITAB, the scattergram of the data is:.5 Scatterplot of Time vs Vehicles.4.3 Time Vehicles 4 6 Copyright Pearso Educatio, Ic. Publishig as Pretice all.

20 3- Simple Liear Regressio Some prelimiary calculatios: x = y =.37 xy = 3.86 x =4 y =.7 x y (.37) SSxy = xy = 3.86 =.9 5 ( x) SSxx = x = 4 = 8 5 x x = = = 8 5 y.37 y = = = ˆ SS xy.9 β = = = SSxx 8 ˆ = y ˆ βx = () 8 = β The least squares lie is yˆ = x. I calculatig we fid that t = 7.43, so we ca reject : β =. r.8; s.4 x = 8.85, ( ) = = 95% PI for y whe ( ) r % of sample variatio i ESLR score ca be explaied by x (SG, SR, or ER score) i liear model as give below: a..% of the liear relatioship betwee the ESLR scores were explaied by the Spaish grammar scores. b. 9.9% of the liear relatioship betwee the ESLR scores were explaied by the Spaish readig scores. c. 7.8% of the liear relatioship betwee the ESLR scores were explaied by the Eglish grammar scores. Copyright Pearso Educatio, Ic. Publishig as Pretice all.

21 Chapter a. A scattergram of the data is: From the graph, it appears that as est box tit occupacy icreases the umber of flycatchers killed also icreases. b. ˆ β = 3.5. Sice x = is ot i the observed rage, ˆ β has o iterpretatio other tha the y-itercept. ˆ β =.8. For each additioal est box tit occupacy, the mea umber of flycatchers killed is estimated to icrease by.8. yˆ = x; t= 4., Reject r s : β = ; =.57; = a. b. Some prelimiary calculatios are: x = 53 x = y = 59. y = 57. xy =736.7 ( x) ( 53) SSxx = x = = x y (53)(59.) SSxy = xy = = Copyright Pearso Educatio, Ic. Publishig as Pretice all.

22 3- Simple Liear Regressio ( y) 59. SSyy = y = 57. = x 53 x = = = y 59. y = = = ˆ SSxy SSxx β = = = ˆ β ˆ = y βx = 78.5 The least squares lie is yˆ = x. c. The least squares lie is plotted o the graph above. d. SSE = SS ˆ yy β SS xy = ( )( ) = SSE s =.6396 = 7 = s =.6396 =.65 To determie if temperature is a useful liear predictor of chage i free eergy, we test: : β= a : β ˆ β Test statistic:.39 t = = =.3 s / SSxx.65 / Rejectio regio: α =., = 7 = 5, t.5 = 4.3 Reject if t < 4.3 or t > 4.3 Coclusio: Do ot reject. There is isufficiet evidece at α =. to idicate that temperature is a useful liear predictor of chage i free eergy. e. It looks like observatio #5 is a outlier. f. x = 75 x = xy = y = 5.7 y = ( x)( y) (75)(5.7) SSxy = xy = = ( x) (75) SSxx = x = = Copyright Pearso Educatio, Ic. Publishig as Pretice all.

23 Chapter x 75 x = = = 9 6 y 5.7 y = = = ˆ SSxy 36. β = = = SS 8.5 xx ˆ β = y ˆ β x = 8.45 ( ) 9 = ( ) The ew least squares lie is yˆ = x. ( y) (5.7) SSyy = y = = SSE=SS - ˆ β SS = ( )( 36.) = yy xy SSE s =.696 = 6 = s =.696 =.68 To determie if temperature is a useful predictor of chage i free eergy, we test: : β = : β a ˆ β Test statistic:.4497 t = = = 5.35 s / SS.68 / 8.5 xx Rejectio regio: α =., df = = 4, t.5 = 4.64 Reject if t < 4.64 or t > 4.64 Coclusio: Reject. Yes, there is sufficiet evidece to idicate temperature is a useful predictor of chage i free eergy at α = a. A straight lie model is y= β+ β x+ ε. b. The researcher hypothesized that therapists with more years of formal dace traiig will report a higher perceived success rate i cotherapy relatioships. This idicates that β >. c..6. r = Because this value is fairly close to, there is a weak egative liear relatioship betwee years of formal traiig ad reported success rate. Copyright Pearso Educatio, Ic. Publishig as Pretice all.

24 3-4 Simple Liear Regressio d. To determie if there is a positive liear relatioship betwee years of formal traiig ad reported success rate, we test: : β = : > a β The test statistic is r.6 t = = = 3. ( r ) / ( ) (.6 ) / (36 ) ( ) The rejectio regio requires α =.5 i the upper tail of the t distributio with df = =36 = 34. From Table, Appedix D, t The rejectio regio is t>.645. Sice the observed value of the test statistic does ot fall i the rejectio regio ( t = 3. <.645 ), so, is ot rejected. There is isufficiet evidece to idicate that there is a positive liear relatioship betwee years of formal traiig ad perceived success rates at α = a. : ρ = : ρ > a The test statistic is r.6 3 t = = = 3.5 r.6 The rejectio regio requires α =. i the upper tail of the t distributio with df = = 3 = 8. From Table i Appedix D, t The rejectio regio is t >.358. Sice the observed value of the test statistic falls i the rejectio regio ( t = 3.5 >.358 ), is rejected. We agree that mother ad daughter loeliess scores were positively correlated at α =.. b. Usig the rejectio regio from (a), the rejectio regio is t >.358 whe testig for positive correlatio ad t <.358 whe testig for egative correlatio. We will begi by testig the ext strogest correlatio, r =.. The test statistic is: r. 3 t = = =.43 r (.) Sice -.43 <.358, there is sufficiet evidece to idicate mother umber of frieds ad daughter loeliess scores are egatively correlated at α =.. We ext test r =.9. The test statistic is: r.9 3 t = = =.9 r.9 Sice.9 <.358, there is isufficiet evidece to reject :. ρ = All other correlatio values will lead to the same coclusio at.. α = Copyright Pearso Educatio, Ic. Publishig as Pretice all.

25 Chapter c. Both a mother's loeliess ad a daughter's loeliess may be caused by other factors such as a abset husbad/father, livig locatios, etc. d. I part (b), we were testig oly for liear correlatios. It may be that some of the variables are correlated, but ot liearly correlated a. = 5 SSxx = ˆ β = x = SSxy = ˆ β = 4.67 y =694 SSyy = SSE = x =53.43 x = s = y = y = s = xy = The least squares lie is yˆ = x. Sice we wat to determie if there is a iverse liear relatioship betwee raise ad ratig, we test: β : = a : β> ˆ β Test statistic: 783 t = = =.98 s / SSxx / Rejectio regio: α =.5, df = = 3, t.5 =.77 Reject if t <.77. Coclusio: No, do ot reject at α =.5. There is isufficiet evidece of a iverse liear relatioship betwee raise ad ratig. b. = 4 SSxx = ˆ β = x = 4.9 SSxy = ˆ β = y =55 SSyy = SSE = x =34.53 x = 3.74 s = y = y = s = xy = The least squares lie is yˆ = x. Copyright Pearso Educatio, Ic. Publishig as Pretice all.

26 3-6 Simple Liear Regressio Sice we wat to determie if there is a iverse liear relatioship betwee raise ad ratig, we test: β : = a : β< ˆ β Test statistic: 3887 t = = =.9 s / SSxx / Rejectio regio: α =.5, df = =, t.5 =.78 Reject if t <.78. Coclusio: Reject at α =.5. Yes, there is sufficiet evidece of a iverse liear relatioship betwee raise ad ratig. c. Sice we oly kow the relatioship betwee raise, y, ad ratig, x, withi the observed rage of values of ratig, it would be very risky to use this least squares lie for the data i the table. The observed rage of values for ratig, x, is.5 to 4.. We should ot predict outside the observed rage of (.5 4.). d. As stated i the problem, typically oly those with axes to grid respod to surveys. Thus, the reported average ratigs may be quite differet tha the average ratigs of the faculty as a whole. With such a low respose rate, the chace that the sample is represetative of the etire populatio is probably quite small. The results should be viewed with cautio. e. Because of the small respose rate ad the probable biased resposes, the claim of the UFF should be viewed with skepticism. Copyright Pearso Educatio, Ic. Publishig as Pretice all.

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions Chapter 9 Slide Ifereces from Two Samples 9- Overview 9- Ifereces about Two Proportios 9- Ifereces about Two Meas: Idepedet Samples 9-4 Ifereces about Matched Pairs 9-5 Comparig Variatio i Two Samples

More information

Regression, Inference, and Model Building

Regression, Inference, and Model Building Regressio, Iferece, ad Model Buildig Scatter Plots ad Correlatio Correlatio coefficiet, r -1 r 1 If r is positive, the the scatter plot has a positive slope ad variables are said to have a positive relatioship

More information

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018 HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018 We are resposible for 2 types of hypothesis tests that produce ifereces about the ukow populatio mea, µ, each of which has 3 possible

More information

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised Questio 1. (Topics 1-3) A populatio cosists of all the members of a group about which you wat to draw a coclusio (Greek letters (μ, σ, Ν) are used) A sample is the portio of the populatio selected for

More information

- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion

- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion 1 Chapter 7 ad 8 Review for Exam Chapter 7 Estimates ad Sample Sizes 2 Defiitio Cofidece Iterval (or Iterval Estimate) a rage (or a iterval) of values used to estimate the true value of the populatio parameter

More information

Response Variable denoted by y it is the variable that is to be predicted measure of the outcome of an experiment also called the dependent variable

Response Variable denoted by y it is the variable that is to be predicted measure of the outcome of an experiment also called the dependent variable Statistics Chapter 4 Correlatio ad Regressio If we have two (or more) variables we are usually iterested i the relatioship betwee the variables. Associatio betwee Variables Two variables are associated

More information

Linear Regression Models

Linear Regression Models Liear Regressio Models Dr. Joh Mellor-Crummey Departmet of Computer Sciece Rice Uiversity johmc@cs.rice.edu COMP 528 Lecture 9 15 February 2005 Goals for Today Uderstad how to Use scatter diagrams to ispect

More information

Chapter 13, Part A Analysis of Variance and Experimental Design

Chapter 13, Part A Analysis of Variance and Experimental Design Slides Prepared by JOHN S. LOUCKS St. Edward s Uiversity Slide 1 Chapter 13, Part A Aalysis of Variace ad Eperimetal Desig Itroductio to Aalysis of Variace Aalysis of Variace: Testig for the Equality of

More information

Chapters 5 and 13: REGRESSION AND CORRELATION. Univariate data: x, Bivariate data (x,y).

Chapters 5 and 13: REGRESSION AND CORRELATION. Univariate data: x, Bivariate data (x,y). Chapters 5 ad 13: REGREION AND CORRELATION (ectios 5.5 ad 13.5 are omitted) Uivariate data: x, Bivariate data (x,y). Example: x: umber of years studets studied paish y: score o a proficiecy test For each

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL/MAY 2009 EXAMINATIONS ECO220Y1Y PART 1 OF 2 SOLUTIONS

UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL/MAY 2009 EXAMINATIONS ECO220Y1Y PART 1 OF 2 SOLUTIONS PART of UNIVERSITY OF TORONTO Faculty of Arts ad Sciece APRIL/MAY 009 EAMINATIONS ECO0YY PART OF () The sample media is greater tha the sample mea whe there is. (B) () A radom variable is ormally distributed

More information

Worksheet 23 ( ) Introduction to Simple Linear Regression (continued)

Worksheet 23 ( ) Introduction to Simple Linear Regression (continued) Worksheet 3 ( 11.5-11.8) Itroductio to Simple Liear Regressio (cotiued) This worksheet is a cotiuatio of Discussio Sheet 3; please complete that discussio sheet first if you have ot already doe so. This

More information

Final Examination Solutions 17/6/2010

Final Examination Solutions 17/6/2010 The Islamic Uiversity of Gaza Faculty of Commerce epartmet of Ecoomics ad Political Scieces A Itroductio to Statistics Course (ECOE 30) Sprig Semester 009-00 Fial Eamiatio Solutios 7/6/00 Name: I: Istructor:

More information

II. Descriptive Statistics D. Linear Correlation and Regression. 1. Linear Correlation

II. Descriptive Statistics D. Linear Correlation and Regression. 1. Linear Correlation II. Descriptive Statistics D. Liear Correlatio ad Regressio I this sectio Liear Correlatio Cause ad Effect Liear Regressio 1. Liear Correlatio Quatifyig Liear Correlatio The Pearso product-momet correlatio

More information

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced

More information

Dr. Maddah ENMG 617 EM Statistics 11/26/12. Multiple Regression (2) (Chapter 15, Hines)

Dr. Maddah ENMG 617 EM Statistics 11/26/12. Multiple Regression (2) (Chapter 15, Hines) Dr Maddah NMG 617 M Statistics 11/6/1 Multiple egressio () (Chapter 15, Hies) Test for sigificace of regressio This is a test to determie whether there is a liear relatioship betwee the depedet variable

More information

Statistics. Chapter 10 Two-Sample Tests. Copyright 2013 Pearson Education, Inc. publishing as Prentice Hall. Chap 10-1

Statistics. Chapter 10 Two-Sample Tests. Copyright 2013 Pearson Education, Inc. publishing as Prentice Hall. Chap 10-1 Statistics Chapter 0 Two-Sample Tests Copyright 03 Pearso Educatio, Ic. publishig as Pretice Hall Chap 0- Learig Objectives I this chapter, you lear How to use hypothesis testig for comparig the differece

More information

Stat 139 Homework 7 Solutions, Fall 2015

Stat 139 Homework 7 Solutions, Fall 2015 Stat 139 Homework 7 Solutios, Fall 2015 Problem 1. I class we leared that the classical simple liear regressio model assumes the followig distributio of resposes: Y i = β 0 + β 1 X i + ɛ i, i = 1,...,,

More information

Simple Linear Regression

Simple Linear Regression Simple Liear Regressio 1. Model ad Parameter Estimatio (a) Suppose our data cosist of a collectio of pairs (x i, y i ), where x i is a observed value of variable X ad y i is the correspodig observatio

More information

Common Large/Small Sample Tests 1/55

Common Large/Small Sample Tests 1/55 Commo Large/Small Sample Tests 1/55 Test of Hypothesis for the Mea (σ Kow) Covert sample result ( x) to a z value Hypothesis Tests for µ Cosider the test H :μ = μ H 1 :μ > μ σ Kow (Assume the populatio

More information

11 Correlation and Regression

11 Correlation and Regression 11 Correlatio Regressio 11.1 Multivariate Data Ofte we look at data where several variables are recorded for the same idividuals or samplig uits. For example, at a coastal weather statio, we might record

More information

ST 305: Exam 3 ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ X. σ Y. σ n.

ST 305: Exam 3 ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ X. σ Y. σ n. ST 305: Exam 3 By hadig i this completed exam, I state that I have either give or received assistace from aother perso durig the exam period. I have used o resources other tha the exam itself ad the basic

More information

S Y Y = ΣY 2 n. Using the above expressions, the correlation coefficient is. r = SXX S Y Y

S Y Y = ΣY 2 n. Using the above expressions, the correlation coefficient is. r = SXX S Y Y 1 Sociology 405/805 Revised February 4, 004 Summary of Formulae for Bivariate Regressio ad Correlatio Let X be a idepedet variable ad Y a depedet variable, with observatios for each of the values of these

More information

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process. Iferetial Statistics ad Probability a Holistic Approach Iferece Process Chapter 8 Poit Estimatio ad Cofidece Itervals This Course Material by Maurice Geraghty is licesed uder a Creative Commos Attributio-ShareAlike

More information

Simple Regression. Acknowledgement. These slides are based on presentations created and copyrighted by Prof. Daniel Menasce (GMU) CS 700

Simple Regression. Acknowledgement. These slides are based on presentations created and copyrighted by Prof. Daniel Menasce (GMU) CS 700 Simple Regressio CS 7 Ackowledgemet These slides are based o presetatios created ad copyrighted by Prof. Daiel Measce (GMU) Basics Purpose of regressio aalysis: predict the value of a depedet or respose

More information

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9 BIOS 4110: Itroductio to Biostatistics Brehey Lab #9 The Cetral Limit Theorem is very importat i the realm of statistics, ad today's lab will explore the applicatio of it i both categorical ad cotiuous

More information

STP 226 ELEMENTARY STATISTICS

STP 226 ELEMENTARY STATISTICS TP 6 TP 6 ELEMENTARY TATITIC CHAPTER 4 DECRIPTIVE MEAURE IN REGREION AND CORRELATION Liear Regressio ad correlatio allows us to examie the relatioship betwee two or more quatitative variables. 4.1 Liear

More information

3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N.

3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N. 3/3/04 CDS M Phil Old Least Squares (OLS) Vijayamohaa Pillai N CDS M Phil Vijayamoha CDS M Phil Vijayamoha Types of Relatioships Oly oe idepedet variable, Relatioship betwee ad is Liear relatioships Curviliear

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE IN STATISTICS, 017 MODULE 4 : Liear models Time allowed: Oe ad a half hours Cadidates should aswer THREE questios. Each questio carries

More information

Assessment and Modeling of Forests. FR 4218 Spring Assignment 1 Solutions

Assessment and Modeling of Forests. FR 4218 Spring Assignment 1 Solutions Assessmet ad Modelig of Forests FR 48 Sprig Assigmet Solutios. The first part of the questio asked that you calculate the average, stadard deviatio, coefficiet of variatio, ad 9% cofidece iterval of the

More information

Statistics 511 Additional Materials

Statistics 511 Additional Materials Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability

More information

(all terms are scalars).the minimization is clearer in sum notation:

(all terms are scalars).the minimization is clearer in sum notation: 7 Multiple liear regressio: with predictors) Depedet data set: y i i = 1, oe predictad, predictors x i,k i = 1,, k = 1, ' The forecast equatio is ŷ i = b + Use matrix otatio: k =1 b k x ik Y = y 1 y 1

More information

LESSON 20: HYPOTHESIS TESTING

LESSON 20: HYPOTHESIS TESTING LESSN 20: YPTESIS TESTING utlie ypothesis testig Tests for the mea Tests for the proportio 1 YPTESIS TESTING TE CNTEXT Example 1: supervisor of a productio lie wats to determie if the productio time of

More information

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Explorig Data: Distributios Look for overall patter (shape, ceter, spread) ad deviatios (outliers). Mea (use a calculator): x = x 1 + x 2 + +

More information

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis Sectio 9.2 Tests About a Populatio Proportio P H A N T O M S Parameters Hypothesis Assess Coditios Name the Test Test Statistic (Calculate) Obtai P value Make a decisio State coclusio Sectio 9.2 Tests

More information

Example: Find the SD of the set {x j } = {2, 4, 5, 8, 5, 11, 7}.

Example: Find the SD of the set {x j } = {2, 4, 5, 8, 5, 11, 7}. 1 (*) If a lot of the data is far from the mea, the may of the (x j x) 2 terms will be quite large, so the mea of these terms will be large ad the SD of the data will be large. (*) I particular, outliers

More information

SIMPLE LINEAR REGRESSION AND CORRELATION ANALYSIS

SIMPLE LINEAR REGRESSION AND CORRELATION ANALYSIS SIMPLE LINEAR REGRESSION AND CORRELATION ANALSIS INTRODUCTION There are lot of statistical ivestigatio to kow whether there is a relatioship amog variables Two aalyses: (1) regressio aalysis; () correlatio

More information

Linear Regression Analysis. Analysis of paired data and using a given value of one variable to predict the value of the other

Linear Regression Analysis. Analysis of paired data and using a given value of one variable to predict the value of the other Liear Regressio Aalysis Aalysis of paired data ad usig a give value of oe variable to predict the value of the other 5 5 15 15 1 1 5 5 1 3 4 5 6 7 8 1 3 4 5 6 7 8 Liear Regressio Aalysis E: The chirp rate

More information

Sample Size Determination (Two or More Samples)

Sample Size Determination (Two or More Samples) Sample Sie Determiatio (Two or More Samples) STATGRAPHICS Rev. 963 Summary... Data Iput... Aalysis Summary... 5 Power Curve... 5 Calculatios... 6 Summary This procedure determies a suitable sample sie

More information

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE TERRY SOO Abstract These otes are adapted from whe I taught Math 526 ad meat to give a quick itroductio to cofidece

More information

University of California, Los Angeles Department of Statistics. Simple regression analysis

University of California, Los Angeles Department of Statistics. Simple regression analysis Uiversity of Califoria, Los Ageles Departmet of Statistics Statistics 100C Istructor: Nicolas Christou Simple regressio aalysis Itroductio: Regressio aalysis is a statistical method aimig at discoverig

More information

Statistical Intervals for a Single Sample

Statistical Intervals for a Single Sample 3/5/06 Applied Statistics ad Probability for Egieers Sixth Editio Douglas C. Motgomery George C. Ruger Chapter 8 Statistical Itervals for a Sigle Sample 8 CHAPTER OUTLINE 8- Cofidece Iterval o the Mea

More information

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Aalysis ad Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasii/teachig.html Suhasii Subba Rao Review of testig: Example The admistrator of a ursig home wats to do a time ad motio

More information

Circle the single best answer for each multiple choice question. Your choice should be made clearly.

Circle the single best answer for each multiple choice question. Your choice should be made clearly. TEST #1 STA 4853 March 6, 2017 Name: Please read the followig directios. DO NOT TURN THE PAGE UNTIL INSTRUCTED TO DO SO Directios This exam is closed book ad closed otes. There are 32 multiple choice questios.

More information

Open book and notes. 120 minutes. Cover page and six pages of exam. No calculators.

Open book and notes. 120 minutes. Cover page and six pages of exam. No calculators. IE 330 Seat # Ope book ad otes 120 miutes Cover page ad six pages of exam No calculators Score Fial Exam (example) Schmeiser Ope book ad otes No calculator 120 miutes 1 True or false (for each, 2 poits

More information

Correlation. Two variables: Which test? Relationship Between Two Numerical Variables. Two variables: Which test? Contingency table Grouped bar graph

Correlation. Two variables: Which test? Relationship Between Two Numerical Variables. Two variables: Which test? Contingency table Grouped bar graph Correlatio Y Two variables: Which test? X Explaatory variable Respose variable Categorical Numerical Categorical Cotigecy table Cotigecy Logistic Grouped bar graph aalysis regressio Mosaic plot Numerical

More information

Introduction to Econometrics (3 rd Updated Edition) Solutions to Odd- Numbered End- of- Chapter Exercises: Chapter 4

Introduction to Econometrics (3 rd Updated Edition) Solutions to Odd- Numbered End- of- Chapter Exercises: Chapter 4 Itroductio to Ecoometrics (3 rd Updated Editio) by James H. Stock ad Mark W. Watso Solutios to Odd- Numbered Ed- of- Chapter Exercises: Chapter 4 (This versio August 7, 204) 205 Pearso Educatio, Ic. Stock/Watso

More information

Ismor Fischer, 1/11/

Ismor Fischer, 1/11/ Ismor Fischer, //04 7.4-7.4 Problems. I Problem 4.4/9, it was show that importat relatios exist betwee populatio meas, variaces, ad covariace. Specifically, we have the formulas that appear below left.

More information

Simple Linear Regression

Simple Linear Regression Chapter 2 Simple Liear Regressio 2.1 Simple liear model The simple liear regressio model shows how oe kow depedet variable is determied by a sigle explaatory variable (regressor). Is is writte as: Y i

More information

Confidence Intervals for the Population Proportion p

Confidence Intervals for the Population Proportion p Cofidece Itervals for the Populatio Proportio p The cocept of cofidece itervals for the populatio proportio p is the same as the oe for, the samplig distributio of the mea, x. The structure is idetical:

More information

Lecture 11 Simple Linear Regression

Lecture 11 Simple Linear Regression Lecture 11 Simple Liear Regressio Fall 2013 Prof. Yao Xie, yao.xie@isye.gatech.edu H. Milto Stewart School of Idustrial Systems & Egieerig Georgia Tech Midterm 2 mea: 91.2 media: 93.75 std: 6.5 2 Meddicorp

More information

Final Review. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech

Final Review. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech Fial Review Fall 2013 Prof. Yao Xie, yao.xie@isye.gatech.edu H. Milto Stewart School of Idustrial Systems & Egieerig Georgia Tech 1 Radom samplig model radom samples populatio radom samples: x 1,..., x

More information

Linear Regression Demystified

Linear Regression Demystified Liear Regressio Demystified Liear regressio is a importat subject i statistics. I elemetary statistics courses, formulae related to liear regressio are ofte stated without derivatio. This ote iteds to

More information

Correlation Regression

Correlation Regression Correlatio Regressio While correlatio methods measure the stregth of a liear relatioship betwee two variables, we might wish to go a little further: How much does oe variable chage for a give chage i aother

More information

STP 226 EXAMPLE EXAM #1

STP 226 EXAMPLE EXAM #1 STP 226 EXAMPLE EXAM #1 Istructor: Hoor Statemet: I have either give or received iformatio regardig this exam, ad I will ot do so util all exams have bee graded ad retured. PRINTED NAME: Siged Date: DIRECTIONS:

More information

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y. Testig Statistical Hypotheses Recall the study where we estimated the differece betwee mea systolic blood pressure levels of users of oral cotraceptives ad o-users, x - y. Such studies are sometimes viewed

More information

Introduction to Econometrics (3 rd Updated Edition) Solutions to Odd- Numbered End- of- Chapter Exercises: Chapter 3

Introduction to Econometrics (3 rd Updated Edition) Solutions to Odd- Numbered End- of- Chapter Exercises: Chapter 3 Itroductio to Ecoometrics (3 rd Updated Editio) by James H. Stock ad Mark W. Watso Solutios to Odd- Numbered Ed- of- Chapter Exercises: Chapter 3 (This versio August 17, 014) 015 Pearso Educatio, Ic. Stock/Watso

More information

INSTRUCTIONS (A) 1.22 (B) 0.74 (C) 4.93 (D) 1.18 (E) 2.43

INSTRUCTIONS (A) 1.22 (B) 0.74 (C) 4.93 (D) 1.18 (E) 2.43 PAPER NO.: 444, 445 PAGE NO.: Page 1 of 1 INSTRUCTIONS I. You have bee provided with: a) the examiatio paper i two parts (PART A ad PART B), b) a multiple choice aswer sheet (for PART A), c) selected formulae

More information

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ),

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ), Cofidece Iterval Estimatio Problems Suppose we have a populatio with some ukow parameter(s). Example: Normal(,) ad are parameters. We eed to draw coclusios (make ifereces) about the ukow parameters. We

More information

Mathematical Notation Math Introduction to Applied Statistics

Mathematical Notation Math Introduction to Applied Statistics Mathematical Notatio Math 113 - Itroductio to Applied Statistics Name : Use Word or WordPerfect to recreate the followig documets. Each article is worth 10 poits ad ca be prited ad give to the istructor

More information

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2 Chapter 8 Comparig Two Treatmets Iferece about Two Populatio Meas We wat to compare the meas of two populatios to see whether they differ. There are two situatios to cosider, as show i the followig examples:

More information

Describing the Relation between Two Variables

Describing the Relation between Two Variables Copyright 010 Pearso Educatio, Ic. Tables ad Formulas for Sulliva, Statistics: Iformed Decisios Usig Data 010 Pearso Educatio, Ic Chapter Orgaizig ad Summarizig Data Relative frequecy = frequecy sum of

More information

Math 140 Introductory Statistics

Math 140 Introductory Statistics 8.2 Testig a Proportio Math 1 Itroductory Statistics Professor B. Abrego Lecture 15 Sectios 8.2 People ofte make decisios with data by comparig the results from a sample to some predetermied stadard. These

More information

Chapter 12 Correlation

Chapter 12 Correlation Chapter Correlatio Correlatio is very similar to regressio with oe very importat differece. Regressio is used to explore the relatioship betwee a idepedet variable ad a depedet variable, whereas correlatio

More information

University of California, Los Angeles Department of Statistics. Hypothesis testing

University of California, Los Angeles Department of Statistics. Hypothesis testing Uiversity of Califoria, Los Ageles Departmet of Statistics Statistics 100B Elemets of a hypothesis test: Hypothesis testig Istructor: Nicolas Christou 1. Null hypothesis, H 0 (claim about µ, p, σ 2, µ

More information

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population A quick activity - Cetral Limit Theorem ad Proportios Lecture 21: Testig Proportios Statistics 10 Coli Rudel Flip a coi 30 times this is goig to get loud! Record the umber of heads you obtaied ad calculate

More information

1036: Probability & Statistics

1036: Probability & Statistics 036: Probability & Statistics Lecture 0 Oe- ad Two-Sample Tests of Hypotheses 0- Statistical Hypotheses Decisio based o experimetal evidece whether Coffee drikig icreases the risk of cacer i humas. A perso

More information

Chapter 8: Estimating with Confidence

Chapter 8: Estimating with Confidence Chapter 8: Estimatig with Cofidece Sectio 8.2 The Practice of Statistics, 4 th editio For AP* STARNES, YATES, MOORE Chapter 8 Estimatig with Cofidece 8.1 Cofidece Itervals: The Basics 8.2 8.3 Estimatig

More information

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Explorig Data: Distributios Look for overall patter (shape, ceter, spread) ad deviatios (outliers). Mea (use a calculator): x = x 1 + x 2 + +

More information

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01 ENGI 44 Cofidece Itervals (Two Samples) Page -0 Two Sample Cofidece Iterval for a Differece i Populatio Meas [Navidi sectios 5.4-5.7; Devore chapter 9] From the cetral limit theorem, we kow that, for sufficietly

More information

Topic 18: Composite Hypotheses

Topic 18: Composite Hypotheses Toc 18: November, 211 Simple hypotheses limit us to a decisio betwee oe of two possible states of ature. This limitatio does ot allow us, uder the procedures of hypothesis testig to address the basic questio:

More information

Chapter 1 (Definitions)

Chapter 1 (Definitions) FINAL EXAM REVIEW Chapter 1 (Defiitios) Qualitative: Nomial: Ordial: Quatitative: Ordial: Iterval: Ratio: Observatioal Study: Desiged Experimet: Samplig: Cluster: Stratified: Systematic: Coveiece: Simple

More information

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010 Pearso Educatio, Ic. Comparig Two Proportios Comparisos betwee two percetages are much more commo tha questios about isolated percetages. Ad they are more

More information

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times Sigificace level vs. cofidece level Agreemet of CI ad HT Lecture 13 - Tests of Proportios Sta102 / BME102 Coli Rudel October 15, 2014 Cofidece itervals ad hypothesis tests (almost) always agree, as log

More information

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise)

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise) Lecture 22: Review for Exam 2 Basic Model Assumptios (without Gaussia Noise) We model oe cotiuous respose variable Y, as a liear fuctio of p umerical predictors, plus oise: Y = β 0 + β X +... β p X p +

More information

First, note that the LS residuals are orthogonal to the regressors. X Xb X y = 0 ( normal equations ; (k 1) ) So,

First, note that the LS residuals are orthogonal to the regressors. X Xb X y = 0 ( normal equations ; (k 1) ) So, 0 2. OLS Part II The OLS residuals are orthogoal to the regressors. If the model icludes a itercept, the orthogoality of the residuals ad regressors gives rise to three results, which have limited practical

More information

UNIT 11 MULTIPLE LINEAR REGRESSION

UNIT 11 MULTIPLE LINEAR REGRESSION UNIT MULTIPLE LINEAR REGRESSION Structure. Itroductio release relies Obectives. Multiple Liear Regressio Model.3 Estimatio of Model Parameters Use of Matrix Notatio Properties of Least Squares Estimates.4

More information

Confidence Interval for one population mean or one population proportion, continued. 1. Sample size estimation based on the large sample C.I.

Confidence Interval for one population mean or one population proportion, continued. 1. Sample size estimation based on the large sample C.I. Cofidece Iterval for oe populatio mea or oe populatio proportio, cotiued 1. ample size estimatio based o the large sample C.I. for p ˆ(1 ˆ) ˆ(1 ˆ) From the iterval ˆ p p Z p ˆ, p Z p p L legh of your 100(1

More information

Chapter 5: Hypothesis testing

Chapter 5: Hypothesis testing Slide 5. Chapter 5: Hypothesis testig Hypothesis testig is about makig decisios Is a hypothesis true or false? Are wome paid less, o average, tha me? Barrow, Statistics for Ecoomics, Accoutig ad Busiess

More information

Solutions to Odd Numbered End of Chapter Exercises: Chapter 4

Solutions to Odd Numbered End of Chapter Exercises: Chapter 4 Itroductio to Ecoometrics (3 rd Updated Editio) by James H. Stock ad Mark W. Watso Solutios to Odd Numbered Ed of Chapter Exercises: Chapter 4 (This versio July 2, 24) Stock/Watso - Itroductio to Ecoometrics

More information

Successful HE applicants. Information sheet A Number of applicants. Gender Applicants Accepts Applicants Accepts. Age. Domicile

Successful HE applicants. Information sheet A Number of applicants. Gender Applicants Accepts Applicants Accepts. Age. Domicile Successful HE applicats Sigificace tests use data from samples to test hypotheses. You will use data o successful applicatios for courses i higher educatio to aswer questios about proportios, for example,

More information

Stat 200 -Testing Summary Page 1

Stat 200 -Testing Summary Page 1 Stat 00 -Testig Summary Page 1 Mathematicias are like Frechme; whatever you say to them, they traslate it ito their ow laguage ad forthwith it is somethig etirely differet Goethe 1 Large Sample Cofidece

More information

Chapter 22: What is a Test of Significance?

Chapter 22: What is a Test of Significance? Chapter 22: What is a Test of Sigificace? Thought Questio Assume that the statemet If it s Saturday, the it s the weeked is true. followig statemets will also be true? Which of the If it s the weeked,

More information

Chapter If n is odd, the median is the exact middle number If n is even, the median is the average of the two middle numbers

Chapter If n is odd, the median is the exact middle number If n is even, the median is the average of the two middle numbers Chapter 4 4-1 orth Seattle Commuity College BUS10 Busiess Statistics Chapter 4 Descriptive Statistics Summary Defiitios Cetral tedecy: The extet to which the data values group aroud a cetral value. Variatio:

More information

Chapter 23: Inferences About Means

Chapter 23: Inferences About Means Chapter 23: Ifereces About Meas Eough Proportios! We ve spet the last two uits workig with proportios (or qualitative variables, at least) ow it s time to tur our attetios to quatitative variables. For

More information

Correlation and Covariance

Correlation and Covariance Correlatio ad Covariace Tom Ilveto FREC 9 What is Next? Correlatio ad Regressio Regressio We specify a depedet variable as a liear fuctio of oe or more idepedet variables, based o co-variace Regressio

More information

Algebra of Least Squares

Algebra of Least Squares October 19, 2018 Algebra of Least Squares Geometry of Least Squares Recall that out data is like a table [Y X] where Y collects observatios o the depedet variable Y ad X collects observatios o the k-dimesioal

More information

ECON 3150/4150, Spring term Lecture 3

ECON 3150/4150, Spring term Lecture 3 Itroductio Fidig the best fit by regressio Residuals ad R-sq Regressio ad causality Summary ad ext step ECON 3150/4150, Sprig term 2014. Lecture 3 Ragar Nymoe Uiversity of Oslo 21 Jauary 2014 1 / 30 Itroductio

More information

Regression. Correlation vs. regression. The parameters of linear regression. Regression assumes... Random sample. Y = α + β X.

Regression. Correlation vs. regression. The parameters of linear regression. Regression assumes... Random sample. Y = α + β X. Regressio Correlatio vs. regressio Predicts Y from X Liear regressio assumes that the relatioship betwee X ad Y ca be described by a lie Regressio assumes... Radom sample Y is ormally distributed with

More information

MATH/STAT 352: Lecture 15

MATH/STAT 352: Lecture 15 MATH/STAT 352: Lecture 15 Sectios 5.2 ad 5.3. Large sample CI for a proportio ad small sample CI for a mea. 1 5.2: Cofidece Iterval for a Proportio Estimatig proportio of successes i a biomial experimet

More information

MA238 Assignment 4 Solutions (part a)

MA238 Assignment 4 Solutions (part a) (i) Sigle sample tests. Questio. MA38 Assigmet 4 Solutios (part a) (a) (b) (c) H 0 : = 50 sq. ft H A : < 50 sq. ft H 0 : = 3 mpg H A : > 3 mpg H 0 : = 5 mm H A : 5mm Questio. (i) What are the ull ad alterative

More information

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The

More information

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ STATISTICAL INFERENCE INTRODUCTION Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I oesample testig, we essetially

More information

Frequentist Inference

Frequentist Inference Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for

More information

M1 for method for S xy. M1 for method for at least one of S xx or S yy. A1 for at least one of S xy, S xx, S yy correct. M1 for structure of r

M1 for method for S xy. M1 for method for at least one of S xx or S yy. A1 for at least one of S xy, S xx, S yy correct. M1 for structure of r Questio 1 (i) EITHER: 1 S xy = xy x y = 198.56 1 19.8 140.4 =.44 x x = 1411.66 1 19.8 = 15.657 1 S xx = y y = 1417.88 1 140.4 = 9.869 14 Sxy -.44 r = = SxxSyy 15.6579.869 = 0.76 1 S yy = 14 14 M1 for method

More information

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio

More information

CHAPTER 2. Mean This is the usual arithmetic mean or average and is equal to the sum of the measurements divided by number of measurements.

CHAPTER 2. Mean This is the usual arithmetic mean or average and is equal to the sum of the measurements divided by number of measurements. CHAPTER 2 umerical Measures Graphical method may ot always be sufficiet for describig data. You ca use the data to calculate a set of umbers that will covey a good metal picture of the frequecy distributio.

More information

Estimating the Population Mean - when a sample average is calculated we can create an interval centered on this average

Estimating the Population Mean - when a sample average is calculated we can create an interval centered on this average 6. Cofidece Iterval for the Populatio Mea p58 Estimatig the Populatio Mea - whe a sample average is calculated we ca create a iterval cetered o this average x-bar - at a predetermied level of cofidece

More information