QA 622 AUTUMN QUARTER ACADEMIC YEAR

Size: px
Start display at page:

Download "QA 622 AUTUMN QUARTER ACADEMIC YEAR"

Transcription

1 QA 6 AUTUMN QUARTER 6-7 ACADEMIC YEAR Istructor: Offce: James J. Cochra 7A CAB Telephoe: (38) Hours: e-mal: URL: TR 8: a.m. - oo W 8: a.m. - : a.m. or by appotmet jcochra@cab.latech.edu Homes/Jcochra Requred Textbook: Draper & Smth, Appled Regresso Aalyss, Wley- Iterscece, 998 (3 rd edto)

2 Other Suggested Texts: Rudolf J. Freud & Ramo C. Lttell, SAS System for Regresso, SAS Isttute ad Joh Wley & Sos, 3 rd ed.,. Bowerma, Bruce L. & Rchard T. O'Coell, Lear Statstcal Models: A Appled Approach, Duxbury, d ed., 99. Rawlgs, Joh O., Sastry G. Patula, & Davd A. Dckey, Appled Regresso Aalyss: A Research Tool, Sprger-Verlag, st ed., 998. Kuter, Mchael H., Chrstopher J. Nachtshem, Joh Neter, & Wllam L, Appled Lear Statstcal Models, McGraw-Hll Irw, 5 th ed., 5. Cook, Des R. & Saford Wesberg, Appled Regresso Icludg Computg ad Graphcs, Wley-Iterscece Publshg, st ed., 999. Belsley, Davd A., Edw Kuh, & Roy E. Welsh, Regresso Dagostcs, Joh Wley & Sos, st ed., 98. Prerequstes: QA 39 (calculus & lear/matrx algebra) QA 43 (Itermedate Busess Statstcs) computer lteracy (ablty to lear ad use SAS) Gradg: Iterm Exam 5 pots Lterature Revew 5 pots Comprehesve Fal Exam pots 5 pots

3 A.Deftos I. Regresso Aalyss The Bascs. Respose Varable (also called the depedet varable, output varable, or y-varable) - ths s the pheomea or characterstc we wsh to estmate or predct. The depedet varable s usually deoted y.. Predctor Varable (also called a regressor, put varable, x-varables, or depedet varable) - ths s the pheomea used to estmate or predct the value of the depedet varable. Idepedet varables are usually deoted x j. 3. Fuctoal Relatoshp - The uque value of the depedet varable (usually deoted y) ca be precsely determed gve the value of the depedet varable (usually deoted x) ths s ofte referred to as a determstc relatoshp betwee x ad y. Classes of Fuctoal Relatoshps Betwee x & y y y y y=f(x) y=f(x) y=f(x) x x x Postve/Drect Relatoshp Negatve/Iverse Relatoshp No Relatoshp

4 4. Slope - Chage the value of the depedet varable that correspods to a oe-ut chage the depedet varable. The (ukow) populato slope s usually deoted β, whle the sample-estmated slope s usually deoted b. 5. Y-Itercept - Value of the depedet varable whe the depedet varable s zero. The (ukow) populato Y-tercept s usually deoted β, whle the sample-estmated Y-tercept s usually deoted b. 6. Scatter Dagram - graphcal smultaeous presetato of the values of two varables o a Cartesa coordate system Suppose you had the followg two observatos o varables x ad y: x y The resultg graphcal dsplay would look lke ths: y x The relatoshp betwee x ad y s (by defto) lear.

5 7. Lear Relatoshp - The slope s costat at all values of the depedet varable. For the prevous example we could fd the le o whch both pots le by solvg the equato y = B + B x smultaeously for our two observatos: 5 = B + (3)B -[6 = B + (4)B ] -= -B B = so 5 = B + (3)() B = 5 3 = y The resultg le s y = + x x Such a le betwee two pots wll always be lear ad fuctoal (why?). What happes f we troduce a thrd pot (3.5, 5.5) to our data set? Sce the ew observato (3.5, 5.5) satsfes our orgal formula (falls o the le through pots (3, 5) ad (4, 6)) y = + x = = 5.5 we stll have a fuctoal lear relatoshp. y x

6 O the other had, what happes f we troduce a thrd pot (3.5, 4.5) to our data set? Sce the ew observato (3.5, 4.5) does ot satsfy our orgal formula (does ot falls o the le through pots (3, 5) ad (4, 6)) y = + x = = we o loger have a fuctoal lear relatoshp. y x 8. Stochastc (Statstcal) Relatoshp - The relatoshp(s) betwee values of the respose varable ad correspodg values of the predctor varable(s) s (are) ot determstc. Thus the value of y s estmated gve the value of x. The estmated value of the depedet varable s deoted y, ^ ad the populato slope ad y- tercept are usually deoted β ad β.

7 Two Notes Regardg Stochastc Relatoshps a. The uderlyg populato relatoshp s gve by y = β + β x + where s the varato y that s uattrbutable to x b. Sources of Varato Stochastc (Statstcal) Relatoshps - Errors Specfcato ot all varables assocated wth the depedet varable have bee accouted for - Errors Measuremet lack of accuracy assessg/quatfyg the values of ay varable. - Samplg Error totally radom varato, uattrbutable to ay source. 9. Regresso Aalyss - Statstcal methods for estmatg the relatoshp betwee the depedet varable ad the depedet varable. The estmates of β ad β are usually deoted b ad b.. Lear Regresso - Idcates that the relatoshp(s) betwee the depedet varable ad the depedet varable(s).. Smple Regresso - Idcates that the relatoshp s betwee the depedet varable ad a sgle depedet varable. Thus we would have y^ = b + b x as our estmate of the value of the depedet varable y whe x = x.

8 . Regresso Error - also called the resdual, ths s the dfferece betwee our estmate of the value of the depedet varable y whe x = x (.e., y) ^ ad the actual value of the depedet varable y whe x = x (.e., y ). The resdual for the th observato s usually deoted e, so we have that whch by substtuto s e = y y ^ e = y (b + b x ) 3. Ordary Least Squares (OLS) - Crtera for fttg a estmated regresso le to sample data whch the sum of the squared dffereces betwee actual ad estmated values of the depedet varable are mmzed,.e., whch s equvalet to ( ˆ ) m y - y = m e = that s, we wsh to fd the regresso le that mmzes the squared regresso errors we would commt usg the le to estmate the values of y for the values of x the followg sample data.

9 Graphcal Iterpretato of Ordary Least Squares (OLS) for a smple lear regresso we have y. x,y y - y ^ (total ^y y - y devato) y^ -y _ y (uexplaed devato) (explaed devato) ^ y = b + b x _ x x x Example: suppose we have collected the followg sample of 6 observatos o age ad come: AGE INCOME ( $,'s) Fd the estmated regresso le for the sample of sx observatos we have collected o age ad come: Whch s the depedet varable ad whch s the depedet varable for ths problem?

10 SCATTER DIAGRAM 7 INCOME ( $,'s) AGE Note that the pots follow a postve slope but do ot le o a straght le (.e., there appears to be a drect stochastc relatoshp betwee AGE ad INCOME) SCATTER DIAGRAM 7 INCOME ( $,'s) AGE How do we ft a le that mmzes the sums of the squared errors e ( ˆ = y - y )? What s the = = geometry of ths objectve?

11 B. Ordary Least Squares Estmate of the Regresso Le - Recall that our objectve s to ( ˆ ) m e,.e., m y - y = = whch by aother substtuto ca be rewrtte as ( ) m e = m y - b + b x a lttle dfferetal calculus ca be appled to ths expresso to derve the ormal equatos: = = Partally dfferetatg ths equato wrt b ad b yelds the followg two equatos: SSE = - (y - (b + b x )) = b = SSE = - (y - (b + b x ))x = b = These partally dfferetato (wrt b ad b ) yeld the followg two equatos (whch have bee rearraged so that b ad b are solated o the rght had sdes):

12 ad We ca solve these equatos (called the Normal Equatos) smultaeously to obta the equatos ecessary to produce the estmates b ad b. y = b + b x = = x y = b x + b x = = = x y = = ( x - x) ( y - y) x y - = = b = = ( ) x - x x = = x - = ad b = y - b x Example: suppose we have collected the followg sample of 6 observatos o age ad come: AGE INCOME ( $,'s) Fd the estmated regresso le for the sample of sx observatos we have collected o age ad come: Whch s the depedet varable ad whch s the depedet varable for ths problem?

13 SCATTER DIAGRAM 7 INCOME ( $,'s) AGE Note that the pots follow a postve slope but do ot le o a straght le (.e., there appears to be a drect stochastc relatoshp betwee AGE ad INCOME) our summary calculatos are: Aual Icome Respodet Age Years (x ) x y x $'s (y ) sums y so: = = x y = 57 y = 74 = = x = 43 x = 673

14 whch results ad b = x y = = x y - x = x - = = ( 43) ( 74) 57 - = = = b = y - bx = -.75 = so our estmated regresso equato s ŷ = x What would happe to these estmates f we chage the scale of measuremet for the depedet varable y? What f we chage the scale of measuremet for the depedet varable x?

15 Let s look at what happes to the sum of squared errors as we alter our estmates b ad b : Respose Surface of Squared Errors 35 3 Sum of Squared Errors 5 The sum of squared errors appears to be mmzed aroud -6.3 b -5.6 ad.9 b b b.9 If we refe our search ad lmt our scope, we get a clearer dea of what s happeg: Respose Surface of Squared Errors 9 8 Sum of Squared Errors The sum of squared errors appears to be mmzed aroud -6. b -5.9 ad.5 b b b.9.

16 If we further refe our search ad lmt our scope, we get a eve clearer dea of what s happeg: Respose Surface of Squared Errors Sum of Squared Errors 63 6 The sum of squared errors appears to be mmzed aroud b = 5.96 ad b = b b.75.5 Of course, a extreme refemet of our search ad lmtato of our scope provdes the clearest pcture: Respose Surface of Squared Errors Sum of Squared Errors The sum of squared errors appears to be mmzed aroud b = ad b = b b.7475

17 7 6 5 y = Note that the estmated regresso le does ) cut through the sample pots o the scatter dagram ad ) goes through the pot x,y. INCOME ( $,'s) 3 ( ) SCATTER DIAGRAM x = 4.5 y - t = AGE Note also that aother otato (commoly referred to as the Pocket Calculator otato) s commoly used: ( ) ( ) S = x x y y XY = = x y = = = ( ) S = x x XX = x = = x xy ( ) S = y y YY = y y

18 so that the formulas for b ad b are: S XY b = ad b = y - b x S XX We refer to each of the followg quattes wth specal ames: = x s the Ucorrected Sum of Squares of the X's = x s the Correcto for the Mea of the X's = x = x - s the Corrected Sum of Squares of the X's = x y s the Ucorrected Sum of x = = Products of the X's ad Y's y s the Correcto for the Mea of the X's ad Y's = x y xy - = = s the Corrected Sum of Products of the X's ad Y's

19 The formula for b b = y - b x mples that the regresso le goes through x,.e., ˆ f x = x the y = b + b x = ( y - b x ) + b x = y y We ca thk of ths problem aother way: + - ( ) ( ) m d + d e = + - s.t. y -b -bx +d -d =, =, K, + - d, d, =, K, Ths olear optmzato problem yelds the same soluto as the ormal equatos.

20 Note that crtera other tha least squares have bee suggested. Partcularly, least absolute devato (LAD) has bee cosdered: ˆ m e,.e., m y - y = = whch by aother substtuto ca be rewrtte as ( ) m e = m y - b + b x = = Why has t the least absolute devato crtero gaed greater acceptace? m = The least absolute devato as a fucto of b looks lke ths: e LAD estmate b of b We would lke to use dfferetal calculus to fd the LAD estmate of b, but the dervatve of the LAD fucto wrt b does ot exst at the mmum value of of b (the dervatve of the LAD fucto from the left ad from the rght dffer at of b ). Numercal methods must be used to fd the value of of b that mmzes the LAD fucto.

21 Furthermore, cosder ths stuato: y Ether of these les (ad ay others wth the same slope that le betwee) mmzes the LAD fucto (why?). Thus the LAD crtero does ot ecessarly yeld a uque regresso le. x Ths problem also be formulated as a olear optmzato problem: + - ( ) m d + d = + - s.t. y -b -bx +d -d =, =, K, + - d, d, =, K, e Aga, ths olear optmzato problem yelds the same soluto as the prevous approach.

22 Let s look at what happes to the sum of absolute errors as we alter our estmates b ad b : Respose Surface of Absolute Errors 5 Sum of Absolute Errors 5 The sum of absolute errors appears to be mmzed aroud -9. b -8. ad.5 b b b If we refe our search ad lmt our scope, we get a clearer dea of what s happeg: Respose Surface of Absolute Errors 7 6 Sum of Absolute Errors 5 4 The sum of absolute errors appears to be mmzed aroud -8.8 b -8.7 ad.59 b b b.54.45

23 If we further refe our search ad lmt our scope, we get a eve clearer dea of what s happeg: Respose Surface of Absolute Errors Sum of Absolute Errors The sum of absolute errors appears to be mmzed aroud b = ad b = b b.5875 Of course, a extreme refemet of our search ad lmtato of our scope provdes the clearest pcture: Respose Surface of Absolute Errors Sum of Absolute Errors The sum of squared errors appears to be mmzed aroud b = ad b = b b.59

24 Let s compare the estmated values of the respose ad the resduals from the two models: OLS model LAD model ŷ = x ŷ = x Respodet Age (x ) Aual Icome OLS LAD $'s (y ) ^ ^ y e y e E What smlartes do you otce? What dffereces do you otce? m D What s the crtero for ths approach? + - s.t. y - b - b x + d - d =, =, K, + - d, d D, =, K, + - d, d, =, K,

25 m D Or ths approach? + - s.t. y - b - b x + d - d =, =, K, + - d, d D, =, K, + - d, d, =, K, Ultmately, mmzato of squared errors (OLS) s the preferred crtero for fttg regresso equatos. C. Usg the Ordary Least Squares Estmate of the Regresso Le to Estmate Values of the Depedet Varable Y - Recall that our geerc estmated regresso equato s y ^ = b + b x so to estmate the value of the depedet varable y for a correspodg value of the depedet varable x, substtute x to the estmated regresso equato ad solve.

26 Example: Recall that our estmated regresso equato for the prevous problem s ŷ = x so to estmate the value of the depedet varable y for a correspodg value of the depedet varable x, substtute x to the estmated regresso equato ad solve. Thus, for our frst observato (x = 5) our regresso estmated value of the depedet varable come s yˆ = ( 5) = 5.9.e., we estmate that a twety-fve year-old wll ear a aual salary of $5,9 ths orgazato. We could estmate the value of the depedet varable y for each correspodg value of the depedet varable x our sample: Respodet Age Years Aual Icome Estmated Aual (x ) $'s (y ) Icome $'s ( ^y ) Σ Note that the sum of the regresso estmates s exactly equal to the sum of the observed values of the depedet varable y

27 Note that we ca use ths formato to calculate our error terms or resduals e : Respodet Age Years Aual Icome Estmated Aual (x ) $'s (y ) Icome $'s ( ^y ) Resduals e Σ Note that the sum of the resduals s zero wll ths always be so? Why or why ot? Also ote that we ca estmate the value of the depedet varable Y for ay value of the depedet varable X that s wth the rage of values for X our sample For example, what s the estmated value of come (the depedet varable Y) for a ffty year-old employee ths orgazato? The value of the depedet varable X (5) s wth the rage of values for X our sample (5-6), so ŷ = ( 5) = e., we estmate that a ffty year-old wll ear a aual salary of $57,78 ths orgazato. Why ca t we estmate estmate the value of the depedet varable Y for a value of the depedet varable X that s outsde the rage of values for X our sample?

28 We have o dea what the relatoshp betwee the depedet varable Y ad the depedet varable X s outsde the rage of values for x our sample SCATTER DIAGRAM INCOME ( $,'s) A attempt to do so s called extrapolato ths s oe of the dagers of usg regresso aalyss! AGE Note that the Y-tercept s frequetly the result of extrapolato uder such crcumstaces the Y-tercept has o real meag or terpretato! Oe possble remedy move the Y-axs to the ceter of the data (.e., ceter the data o X). How? Suppose we buld ths model: ˆ y = b + b x ' where x = x - x The Y-tercept ow represets the estmated value of the respose varable Y whe X s equal to ts (sample) mea ths s certaly ot a extrapolato. '

29 Subtractg the mea of X from every observed value of X essetally moves the X-axs so that the Y-axs tersects t at x stead of at. Y x-x x X x X =X-X whch (because we are usg x -x as the regressor) ad b = x y = = xy - x = x - = = = ( ) = ( x - x) = x - x y = ( x-x) b = y - b = y

30 Cosder the algebrac ramfcatos of ths model: ' y ˆ = b + bx = b + b ( x - x) = b + b x - b x ( ) = b - b x + b x ' = b + b x Ths mples a slght modfcato to the terpretato of the regresso estmate b : b estmated value of the respose varable Y whe the regressor varable X s equal to ts mea. Note also that the terpretato of b remas uchaged (why?) Aother possble remedy ceter the data (.e., elmate the Y-tercept). How? Suppose we buld ths model: ˆ' y = b + b x ' where y = y - y ad x = x - x ' ' The Y-tercept s ow zero - why?

31 Subtractg the meas of x ad y from every observed value of X ad Y essetally moves the X ad Y axes so that they Y-axs tersect at x, y stead of at,. Y Y =Y-Y y y-y y x x-x x X X =X-X whch (because we are usg x -x as the regressor ad y - y as the respose) b = x y = = xy - x = x - = = = ( x - x) ( x - y) = ( x - x) = (whch s detcal to our orgal equato for b ) ad b = y - bx =

32 Cosder the algebrac ramfcatos of ths model: ˆ' y = b + b x = b x ' where y = y - y ad x = x - x ' ' D.Assessg the Stregth of the Relatoshp betwee the Respose ad the Regressor(s). Parttog the Varace the Respose varable Y - Aga recall that our objectve s to = = ( ˆ ) m e = y - y Cosder the followg detty: ( ˆ ) ( ˆ ) ( ˆ ) y - y = y - y + y - y = e + y - y

33 Squarg ad summg both sdes yelds: ( y ) ( ˆ ) ( ˆ ) - y = y - y + y - y = = ( ˆ ) ( ˆ ) ( ˆ ) ( ˆ ) = y -y + y -y y -y + y -y = = = ( ˆ ) ( ˆ ) = y - y + y - y = = What happeed to ( y ˆ ) ( ˆ - y y - y )? = What happeed to ( y ˆ ) ( ˆ - y y - y )? = Frst ote that from the ormal equatos we have x y = b x + b x = = = The observe that ( ˆ ) x e = x y - y = x y - x yˆ = = = = = b x + b x - = = = x ( b + b x ) = b x + b x - b x + b x = = = = =

34 Now observe that = ˆy e ( ˆ ) = yˆ y - y = = ( + ) = b b x e = b e + b x e = = ( ) ( ) = b + b = So what happeed to ( y ˆ ) ( ˆ - y y - y )? = = ( y - y) ( y - yˆ ) ( ) ˆ ( ) = y y - y - y y - y = = = y e - yˆ e = = = - = Thus ( y ˆ ) ( ˆ - y y - y ) = =

35 So we have that: ( y ) ( ˆ ) ( ˆ - y = y - y + y - y ) = = = Whch we ca express as: Sum of Squares Sum of Squares Sum of Squares = + about the Mea due to Regresso about Regresso whch s also sometmes expressed as: Total Sums of Squares Regresso Sum = + of Squares Resdual Sum of Squares Ths s ofte referred to as a parttog of the varato the respose varable. The parttoed varato the respose varable s ofte represeted a Aalyss of Varace (ANOVA) table: Aalyss of Varace (ANOVA) Table: the Basc Splt of S YY Source of Varato Degrees of Freedom (df) Sum of Squares (SS) Mea Square (MS) Due to Regresso About Regresso (Resdual or Error) About the Mea (Corrected for y ) Note that - - ( ) = y - y = SYY ˆ ( y - y) MS Reg = ( y ˆ - y) = ( y - y) = MS Res

36 The prevous example yelds the followg parttog of the varato the respose varable: Aalyss of Varace (ANOVA) Table: the Basc Splt of S YY Source of Varato Degrees of Freedom (df) Sum of Squares (SS) Mea Square (MS) Due to Regresso About Regresso (Resdual or Error) About the Mea (Corrected for y ) The varato the respose varable ca be further parttoed to accout for varato explaed by each parameter estmate: Aalyss of Varace (ANOVA) Table: Icorporatg SS(b ) Source of Varato Degrees of Freedom (df) Sum of Squares (SS) Mea Square (MS) Due to b b About Regresso (Resdual or Error) Corrected Total Correcto Factor (due to b ) - - SS = ( yˆ - y) MS Reg b b = ( y ˆ - y) = ( y - y) = y = SS b = = y MS Res Total y =

37 The varato the respose varable s ofte placed Extra (Icremetal) Sums Of Squares Order: Aalyss of Varace (ANOVA) Table: Extra SS Order b Source of Varato b b Resdual Degrees of Freedom (df) - Sum of Squares (SS) y = SS b = = y b b ( ˆ ) SS = y - y = SS SS b b b Mea Square (MS) MS Reg MS Res Total y = We have ow measured the varato about the estmated regresso le. What f we wat to assess the stregth of the relatoshp betwee the respose varable Y ad the regressor varable X?. Coeffcet of Determato - measure the proporto of the total varato the observed values of the respose varable Y that s accouted for by the varato correspodg values of the regressor X. Usually deoted r for smple regresso ad R for multple regresso, ths s gve by ( ˆy - y) = r = = = ( y - y) = SS b b SS xy SS SS SS yy xx yy ad s the rato of SS due to regresso gve b to the Total SS Corrected for the Mea. y

38 Cosder our prevous example. We ca use our prevously developed equatos to fd the Coeffcet of Determato: ( ) ( ) S = x x y y XY = = xy xy = = = ( ) S = x x XX = x = = x ( ) S = y y YY = y y For our prevous example we could easly calculate the ecessary summary values ecessary for calculato of r tabular form: Aual Icome Respodet Age Years (x (x - )(y - ) (x - ) ) x y x $'s (y ) (y - y ) Σ x= 4.5 y= Note that SS xy = 6., SS xx = 83.5, ad SS yy =

39 By drect substtuto we have: SS 6. SS SS xy r = = =.688 xx yy ( ) ( ) Thus ths regresso explas approxmately 68.8% of the varato the observed values of the respose varable Y. Notes about the Coeffcet of Determato -. r. - larger values of r dcate relatvely stroger relatoshps - t s the square of the Pearso s Correlato Coeffcet betwee y ad b + b x (ad so betwee y ad y^ ) 3. Pearso s Correlato Coeffcet - measure of the relatve stregth ad drecto of the assocato betwee two cotuous varables. Usually deoted ρ whle ts sample estmate s deoted r ad s gve by r = xy SS SS xx xy SS Notes about the Coeffcet of Correlato - t s the rato of the covarace of X ad Y to the product of ther stadard devatos - -. r. - larger absolute values of r dcate stroger relatoshps - a egatve ^ value of r dcates a verse relatoshp whle a postve value of r dcates a drect relatoshp -r y,y = r y, b +b x = sg(b ) r for a lear regresso [where sg( ) returs the sg of the argumet] yy

40 Cosder our prevous example - aga we could easly calculate the ecessary summary values ecessary for calculato of r tabular form: Aual Icome Respodet Age Years (x (x - )(y - ) (x - ) ) x y x $'s (y ) (y - y ) Σ x= 4.5 y= Note that SS xy = 6., SS xx = 83.5, ad SS yy = By drect substtuto we have: SS 6. xy r = = =.896 xx yy ( = sg ( b ) r ) ( ) ( ) SS SS But what does ths mea?

41 Note also the terestg relatoshp betwee b ad r xy - Frst ote that: b = ( x - x) ( y - y) = = = ( x - x) = ( ) = = ( ) ( ) ( x - x) ( x - x) ( y - y) ( y - y) ( x - x) = = = = = y - y x - x y - y SS SS yy xy yy = = rxy SSxx SSxxSSyy SSxx xx xy SS yy SS SS SS Now ote that: = = ( ) ( ) x x - x = - s ( ) ( ) y y - y = - s, By drect substtuto we have: s y b = r sx xy so b s r xy scaled by the dsperso of the respose varable Y dvded by the dsperso of the regressor X.

42 E. Evaluatg the Ordary Least Squares Estmate of the Regresso Le. The F-Test - It ca be show that the Mea Square due to Regresso (MS Reg ) ad the Mea Square due to Resdual Varato (MS Res or s ) have the followg expected values: ( Reg ) = σ + β ( ) = ( ) = σ E MS x - x E s If the followg codtos are met: - the errors ε ~ N(, σ ) - the errors ε are depedet MS s the t ca be show that: Reg ( dfmsreg ) ( dfms ) Res σ σ ~ χ ~ χ wth df = df wth df = df MS Res MS Reg f β = Here we eed to recall that the rato of two depedet Ch-Square Statstcs wth degrees of freedom a ad a dvded by ther respectve degrees of freedom has a F- dstrbuto wth degrees of freedom a ad a.

43 Sce the two results from the prevous slde are depedet, f β = we have: ( dfmsreg ) df σ MS MSReg Reg ( dfms ) Res df σ Ths suggests a smple test of the ull hypothess: H : β = H : β Why s ths hypothess mportat? MSRes s = MS s Reg ~ F wth df = df, df MSReg MSRes Example: from our prevous problem we have: Aalyss of Varace (ANOVA) Table Source of Varato Degrees of Freedom (df) Sum of Squares (SS) Mea Square (MS) F-Rato Due to Regresso About Regresso (Resdual or Error) About the Mea (Corrected for y )

44 We could the fd a crtcal value of F for a gve level of sgfcace α ad df =, 4: α =. F.,,4 = 4.54 α =.5 F.5,,4 = 7.7 α =. F.,,4 =. We reject H f F > F α,,4, so we reject H at α =. OR We ca compare the p-value (provded by a computer prtout) to our chose value of α - f p-value < α the reject H. p-value =.4, so reject H f the chose α s less tha.4. Note that: - We ca exted ths test to multple regresso to test the ull hypothess H : β = β = β 3 = β m = H : at least oe β j Why s ths hypothess mportat? - Ths F-test s equvalet to a t-test (as we wll see shortly)

45 . Cofdece Itervals ad Hypothess Tests for β ad β : If the followg codtos are met: - the errors ε represet a radom varable wth a mea of (.e., E(ε ) = ) ad varace of σ (.e., V(ε ) = σ ) - the errors ε are ucorrelated, j so that cov(ε, ε j ) =. Ths results E(Y ) = β + β x V(Y ) = σ Y ad Y j are ucorrelated, j so that cov(y, Y j ) = Now we have that = b = = = = ( x - x) ( y - y) = ( x - x) ( ) ( ) = = = ( x - x) ( x - x) y - y ( x - x) = = = = = ( ) ( x - x) x - x y - x - x y x - x y ( x - x)

46 Now f the Y s are parwse ucorrelated ad the a s are costats, the varace of a fucto a = a Y + a Y + a 3 Y a Y s = ( ) ( ) V a = a V Y ad f V(Y ) = σ ( ) V a = σ a = Now recall that our prevous expresso for b a = = ( x - x) ( x - x) Reducto ow yelds ( ) V b = ( x - x) The stadard devato (stadard error) of b s the square root of the varace V(b ) ( ) = = σ sd b = = Usg the estmate s place of the ukow σ yelds the estmated stadard devato (stadard error) of b : s s est. sd ( b ) = = S xx x - x σ ( x - x) = ( ) σ S xx

47 If the followg codto s also met: - the errors ε ~ N(, σ ) the t ca be show that a ( - α)% cofdece terval s gve by: b ± t - α,- ( x - x) = ( ) = b ± t est. sd b - α,- s Ths suggests a smple test of the ull hypothess: H : β = β H : β β usg the test statstc b β t = est. sd b = s ( ) ( b β ) ( x - x) = = ( b β ) ( x - x) = s

48 We the compare t to t -, α our decso rule. I our prevous example, we have Respodet Age Years (x ) Aual Icome $'s (y ) (x - x) x Σ Thus our summary data are: s =.3697 ( ) = x - x = 83.5 = b =.748 so for H : β = we have t = = ( b β ) ( x ) - x = s ( ) ( ) =

49 Note that: - ths hypothess testg approach ca be adapted to oe-taled tests - ths hypothess testg approach ca be adapted to values of β other tha zero - smple lear regresso there s a relatoshp betwee the ths t-test ad the prevously dscussed F-test for H : β = we have : MSReg F = s = = = t = = ( ) ( ) b x - x y - y s ( ) b x - x s We ca costruct cofdece tervals ad hypothess tests for β much the same maer as we dd for β : - It ca be show that ( ) sd b = = ( ) Replacg the ukow σ wth ts sample estmate s yelds: = x x - x x = est. sd ( b ) = s ( x - x) = σ

50 A ( - α)% cofdece terval for β s the gve by: x = ± -,-α b t s x - x The test statstc for the t-test of the ull hypothess H : β = β s also gve by b β t = = est. sd ( b ) = H : β β ( ) b x = = ( ) x - x β s We the compare t to t -, α our decso rule. I our prevous example, we have Respodet Age Years (x ) Aual Icome $'s (y ) (x - x ) x Σ

51 Thus our summary data are: s =.3697 = = ( ) so for H : β = we have x - x = 83.5 x = 673 b = b β t = = = x ( ) (.3697 ) = s x - x = ( ) 3. Cofdece Itervals ad Hypothess Tests for ρ xy : Fsher s Approxmato (z trasformato) gves us (approxmately) ' + rxy - - z = l tah ( rxy ) ~ N tah ρ, - rxy - 3 So a approxmate ( α)% cofdece terval for ρ xy s gve by: α - + r ρ xy + xy l ± z α = l - ρ - rxy - xy - 3

52 The test statstc for the Null Hypothess s gve by H : ρ = ρ xy xy H : ρ ρ xy xy + r ρ xy + xy z = l l - 3 ρ - rxy - xy whch s the compared to the percetles of the N(, ) that correspod to the preselected level of sgfcace α. May SAS procedures are avalable for varous types of regresso: CALIS CATMOD GENMOD GLM LIFEREG LOESS LOGISTIC NLIN PLS PROBIT REG RSREG TPSPLINE TRANSREG

53 Geerc example usg a data set drectly: DATA ame of data set goes here; INPUT varable ames ad colum locatos go here; CARDS; data go here ; PROC???/optos; RUN; For our example data we could have: DATA salary; INPUT ame $ age come; CARDS; Jackso 5 Ross Brght Stadfer 6 65 Clark 4 64 Syder 33 3 ; TITLE Aalyss of Age/Icome Data ; PROC PRINT; VAR ame age come; RUN; PROC CORR; VAR age; WITH come; PROC REG; MODEL Icome=age; RUN;

54 Output for PROC PRINT: Aalyss of Age/Icome Data : Tuesday, September, Obs ame age come Jackso 5 Ross Brght Stadfe Clark Syder 33 3 Output for PROC CORR ad PROC REG: Aalyss of Age/Icome Data : Tuesday, September, The CORR Procedure Wth Varables: come By Varables: age Smple Statstcs Varable N Mea Std Dev Sum Mmum Maxmum come age Pearso Correlato Coeffcets, N = 6 Prob > r uder H: Rho= age come.896.4

55 Aalyss of Age/Icome Data 3 : Tuesday, September, The REG Procedure Model: MODEL Depedet Varable: come Aalyss of Varace Sum of Mea Source DF Squares Square F Value Pr > F Model Error Corrected Total Root MSE R-Square.6883 Depedet Mea Adj R-Sq.63 Coeff Var Parameter Estmates Parameter Stadard Varable DF Estmate Error t Value Pr > t Itercept age SOME questos you should be able to aswer: Suppose you have a collecto of observatos o a respose Y ad a predctor varable X.. How do you use OLS to estmate the slope ad tercept for a lear regresso whch the respose varable s Y ad the predctor varable s X? How do you use these results to produce a pot estmate of a value of Y for a gve value of X? How do you use these results to produce a terval estmate of a value of Y for a gve value of X? How do you use these results to produce a terval estmate of the mea value of Y for a gve value of X? How do you terpret the results of these procedures?. How do you evaluate the qualty of the ft of the regresso le to the sample data? How do you test hypotheses about the slope ad/or tercept? How do you produce a terval estmate for each of these parameters? How do you terpret the results of these procedures? 3. How do you use SAS or EXCEL to perform these procedures?

56 SOME questos you should be able to aswer (cotued): Suppose you have a collecto of observatos o a respose Y ad a predctor varable X. 4. How would the estmated slope ad tercept be affected f you multpled the values of the respose varable Y by a costat c? How would ths affect the coeffcet of determato r or the correlato coeffcet r xy? Why? 5. How would the estmated slope ad tercept be affected f you multpled the values of the predctor varable X by a costat c? How would ths affect the coeffcet of determato r or the correlato coeffcet r xy? Why? 6. How would the estmated slope ad tercept be affected f you added a costat c to the values of the respose varable Y? How would ths affect the coeffcet of determato r or the correlato coeffcet r xy? Why? SOME questos you should be able to aswer (cotued): Suppose you have a collecto of observatos o a respose Y ad a predctor varable X. 7. How would the estmated slope ad tercept be affected f you added a costat c to the values of the predctor varable X? How would ths affect the coeffcet of determato r or the correlato coeffcet r xy? Why? 8. How would the estmated slope ad tercept be affected f you terchaged the roles of the respose varable y ad the predctor varable X? How would ths affect the coeffcet of determato r or the correlato coeffcet r xy? Why? 9. Why s OLS preferred over LAD as a crtero for fttg regresso les?

57 SOME questos you should be able to aswer (cotued): Suppose you have a collecto of observatos o a respose Y ad a predctor varable X.. How are the ormal equatos for OLS derved? How are these equatos used to fd equatos for the estmated slope ad tercept?. Why do some aalysts ceter the predctor varable X? Why do some aalysts also ceter the respose varable Y? What are the ramfcatos of each of these modelg strateges?. How ad why s the varato the respose varable Y parttoed? 3. What assumpto(s) must be met whe performg ferece o regresso estmates?

Simple Linear Regression

Simple Linear Regression Statstcal Methods I (EST 75) Page 139 Smple Lear Regresso Smple regresso applcatos are used to ft a model descrbg a lear relatoshp betwee two varables. The aspects of least squares regresso ad correlato

More information

STA302/1001-Fall 2008 Midterm Test October 21, 2008

STA302/1001-Fall 2008 Midterm Test October 21, 2008 STA3/-Fall 8 Mdterm Test October, 8 Last Name: Frst Name: Studet Number: Erolled (Crcle oe) STA3 STA INSTRUCTIONS Tme allowed: hour 45 mutes Ads allowed: A o-programmable calculator A table of values from

More information

ENGI 3423 Simple Linear Regression Page 12-01

ENGI 3423 Simple Linear Regression Page 12-01 ENGI 343 mple Lear Regresso Page - mple Lear Regresso ometmes a expermet s set up where the expermeter has cotrol over the values of oe or more varables X ad measures the resultg values of aother varable

More information

Multiple Linear Regression Analysis

Multiple Linear Regression Analysis LINEA EGESSION ANALYSIS MODULE III Lecture - 4 Multple Lear egresso Aalyss Dr. Shalabh Departmet of Mathematcs ad Statstcs Ida Isttute of Techology Kapur Cofdece terval estmato The cofdece tervals multple

More information

Chapter 13 Student Lecture Notes 13-1

Chapter 13 Student Lecture Notes 13-1 Chapter 3 Studet Lecture Notes 3- Basc Busess Statstcs (9 th Edto) Chapter 3 Smple Lear Regresso 4 Pretce-Hall, Ic. Chap 3- Chapter Topcs Types of Regresso Models Determg the Smple Lear Regresso Equato

More information

Lecture 7. Confidence Intervals and Hypothesis Tests in the Simple CLR Model

Lecture 7. Confidence Intervals and Hypothesis Tests in the Simple CLR Model Lecture 7. Cofdece Itervals ad Hypothess Tests the Smple CLR Model I lecture 6 we troduced the Classcal Lear Regresso (CLR) model that s the radom expermet of whch the data Y,,, K, are the outcomes. The

More information

Chapter Business Statistics: A First Course Fifth Edition. Learning Objectives. Correlation vs. Regression. In this chapter, you learn:

Chapter Business Statistics: A First Course Fifth Edition. Learning Objectives. Correlation vs. Regression. In this chapter, you learn: Chapter 3 3- Busess Statstcs: A Frst Course Ffth Edto Chapter 2 Correlato ad Smple Lear Regresso Busess Statstcs: A Frst Course, 5e 29 Pretce-Hall, Ic. Chap 2- Learg Objectves I ths chapter, you lear:

More information

Probability and. Lecture 13: and Correlation

Probability and. Lecture 13: and Correlation 933 Probablty ad Statstcs for Software ad Kowledge Egeers Lecture 3: Smple Lear Regresso ad Correlato Mocha Soptkamo, Ph.D. Outle The Smple Lear Regresso Model (.) Fttg the Regresso Le (.) The Aalyss of

More information

12.2 Estimating Model parameters Assumptions: ox and y are related according to the simple linear regression model

12.2 Estimating Model parameters Assumptions: ox and y are related according to the simple linear regression model 1. Estmatg Model parameters Assumptos: ox ad y are related accordg to the smple lear regresso model (The lear regresso model s the model that says that x ad y are related a lear fasho, but the observed

More information

Multiple Regression. More than 2 variables! Grade on Final. Multiple Regression 11/21/2012. Exam 2 Grades. Exam 2 Re-grades

Multiple Regression. More than 2 variables! Grade on Final. Multiple Regression 11/21/2012. Exam 2 Grades. Exam 2 Re-grades STAT 101 Dr. Kar Lock Morga 11/20/12 Exam 2 Grades Multple Regresso SECTIONS 9.2, 10.1, 10.2 Multple explaatory varables (10.1) Parttog varablty R 2, ANOVA (9.2) Codtos resdual plot (10.2) Trasformatos

More information

Regresso What s a Model? 1. Ofte Descrbe Relatoshp betwee Varables 2. Types - Determstc Models (o radomess) - Probablstc Models (wth radomess) EPI 809/Sprg 2008 9 Determstc Models 1. Hypothesze

More information

Lecture Notes Types of economic variables

Lecture Notes Types of economic variables Lecture Notes 3 1. Types of ecoomc varables () Cotuous varable takes o a cotuum the sample space, such as all pots o a le or all real umbers Example: GDP, Polluto cocetrato, etc. () Dscrete varables fte

More information

Statistics MINITAB - Lab 5

Statistics MINITAB - Lab 5 Statstcs 10010 MINITAB - Lab 5 PART I: The Correlato Coeffcet Qute ofte statstcs we are preseted wth data that suggests that a lear relatoshp exsts betwee two varables. For example the plot below s of

More information

Summary of the lecture in Biostatistics

Summary of the lecture in Biostatistics Summary of the lecture Bostatstcs Probablty Desty Fucto For a cotuos radom varable, a probablty desty fucto s a fucto such that: 0 dx a b) b a dx A probablty desty fucto provdes a smple descrpto of the

More information

ECON 482 / WH Hong The Simple Regression Model 1. Definition of the Simple Regression Model

ECON 482 / WH Hong The Simple Regression Model 1. Definition of the Simple Regression Model ECON 48 / WH Hog The Smple Regresso Model. Defto of the Smple Regresso Model Smple Regresso Model Expla varable y terms of varable x y = β + β x+ u y : depedet varable, explaed varable, respose varable,

More information

Objectives of Multiple Regression

Objectives of Multiple Regression Obectves of Multple Regresso Establsh the lear equato that best predcts values of a depedet varable Y usg more tha oe eplaator varable from a large set of potetal predctors {,,... k }. Fd that subset of

More information

Simple Linear Regression

Simple Linear Regression Correlato ad Smple Lear Regresso Berl Che Departmet of Computer Scece & Iformato Egeerg Natoal Tawa Normal Uversty Referece:. W. Navd. Statstcs for Egeerg ad Scetsts. Chapter 7 (7.-7.3) & Teachg Materal

More information

residual. (Note that usually in descriptions of regression analysis, upper-case

residual. (Note that usually in descriptions of regression analysis, upper-case Regresso Aalyss Regresso aalyss fts or derves a model that descres the varato of a respose (or depedet ) varale as a fucto of oe or more predctor (or depedet ) varales. The geeral regresso model s oe of

More information

ESS Line Fitting

ESS Line Fitting ESS 5 014 17. Le Fttg A very commo problem data aalyss s lookg for relatoshpetwee dfferet parameters ad fttg les or surfaces to data. The smplest example s fttg a straght le ad we wll dscuss that here

More information

Lecture 8: Linear Regression

Lecture 8: Linear Regression Lecture 8: Lear egresso May 4, GENOME 56, Sprg Goals Develop basc cocepts of lear regresso from a probablstc framework Estmatg parameters ad hypothess testg wth lear models Lear regresso Su I Lee, CSE

More information

Econometric Methods. Review of Estimation

Econometric Methods. Review of Estimation Ecoometrc Methods Revew of Estmato Estmatg the populato mea Radom samplg Pot ad terval estmators Lear estmators Ubased estmators Lear Ubased Estmators (LUEs) Effcecy (mmum varace) ad Best Lear Ubased Estmators

More information

CLASS NOTES. for. PBAF 528: Quantitative Methods II SPRING Instructor: Jean Swanson. Daniel J. Evans School of Public Affairs

CLASS NOTES. for. PBAF 528: Quantitative Methods II SPRING Instructor: Jean Swanson. Daniel J. Evans School of Public Affairs CLASS NOTES for PBAF 58: Quattatve Methods II SPRING 005 Istructor: Jea Swaso Dael J. Evas School of Publc Affars Uversty of Washgto Ackowledgemet: The structor wshes to thak Rachel Klet, Assstat Professor,

More information

Ordinary Least Squares Regression. Simple Regression. Algebra and Assumptions.

Ordinary Least Squares Regression. Simple Regression. Algebra and Assumptions. Ordary Least Squares egresso. Smple egresso. Algebra ad Assumptos. I ths part of the course we are gog to study a techque for aalysg the lear relatoshp betwee two varables Y ad X. We have pars of observatos

More information

Linear Regression with One Regressor

Linear Regression with One Regressor Lear Regresso wth Oe Regressor AIM QA.7. Expla how regresso aalyss ecoometrcs measures the relatoshp betwee depedet ad depedet varables. A regresso aalyss has the goal of measurg how chages oe varable,

More information

The equation is sometimes presented in form Y = a + b x. This is reasonable, but it s not the notation we use.

The equation is sometimes presented in form Y = a + b x. This is reasonable, but it s not the notation we use. INTRODUCTORY NOTE ON LINEAR REGREION We have data of the form (x y ) (x y ) (x y ) These wll most ofte be preseted to us as two colum of a spreadsheet As the topc develops we wll see both upper case ad

More information

Statistics: Unlocking the Power of Data Lock 5

Statistics: Unlocking the Power of Data Lock 5 STAT 0 Dr. Kar Lock Morga Exam 2 Grades: I- Class Multple Regresso SECTIONS 9.2, 0., 0.2 Multple explaatory varables (0.) Parttog varablty R 2, ANOVA (9.2) Codtos resdual plot (0.2) Exam 2 Re- grades Re-

More information

4. Standard Regression Model and Spatial Dependence Tests

4. Standard Regression Model and Spatial Dependence Tests 4. Stadard Regresso Model ad Spatal Depedece Tests Stadard regresso aalss fals the presece of spatal effects. I case of spatal depedeces ad/or spatal heterogeet a stadard regresso model wll be msspecfed.

More information

Mean is only appropriate for interval or ratio scales, not ordinal or nominal.

Mean is only appropriate for interval or ratio scales, not ordinal or nominal. Mea Same as ordary average Sum all the data values ad dvde by the sample sze. x = ( x + x +... + x Usg summato otato, we wrte ths as x = x = x = = ) x Mea s oly approprate for terval or rato scales, ot

More information

Example: Multiple linear regression. Least squares regression. Repetition: Simple linear regression. Tron Anders Moger

Example: Multiple linear regression. Least squares regression. Repetition: Simple linear regression. Tron Anders Moger Example: Multple lear regresso 5000,00 4000,00 Tro Aders Moger 0.0.007 brthweght 3000,00 000,00 000,00 0,00 50,00 00,00 50,00 00,00 50,00 weght pouds Repetto: Smple lear regresso We defe a model Y = β0

More information

b. There appears to be a positive relationship between X and Y; that is, as X increases, so does Y.

b. There appears to be a positive relationship between X and Y; that is, as X increases, so does Y. .46. a. The frst varable (X) s the frst umber the par ad s plotted o the horzotal axs, whle the secod varable (Y) s the secod umber the par ad s plotted o the vertcal axs. The scatterplot s show the fgure

More information

Chapter 13, Part A Analysis of Variance and Experimental Design. Introduction to Analysis of Variance. Introduction to Analysis of Variance

Chapter 13, Part A Analysis of Variance and Experimental Design. Introduction to Analysis of Variance. Introduction to Analysis of Variance Chapter, Part A Aalyss of Varace ad Epermetal Desg Itroducto to Aalyss of Varace Aalyss of Varace: Testg for the Equalty of Populato Meas Multple Comparso Procedures Itroducto to Aalyss of Varace Aalyss

More information

TESTS BASED ON MAXIMUM LIKELIHOOD

TESTS BASED ON MAXIMUM LIKELIHOOD ESE 5 Toy E. Smth. The Basc Example. TESTS BASED ON MAXIMUM LIKELIHOOD To llustrate the propertes of maxmum lkelhood estmates ad tests, we cosder the smplest possble case of estmatg the mea of the ormal

More information

Chapter 14 Logistic Regression Models

Chapter 14 Logistic Regression Models Chapter 4 Logstc Regresso Models I the lear regresso model X β + ε, there are two types of varables explaatory varables X, X,, X k ad study varable y These varables ca be measured o a cotuous scale as

More information

: At least two means differ SST

: At least two means differ SST Formula Card for Eam 3 STA33 ANOVA F-Test: Completely Radomzed Desg ( total umber of observatos, k = Number of treatmets,& T = total for treatmet ) Step : Epress the Clam Step : The ypotheses: :... 0 A

More information

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS Exam: ECON430 Statstcs Date of exam: Frday, December 8, 07 Grades are gve: Jauary 4, 08 Tme for exam: 0900 am 00 oo The problem set covers 5 pages Resources allowed:

More information

Chapter 5 Properties of a Random Sample

Chapter 5 Properties of a Random Sample Lecture 6 o BST 63: Statstcal Theory I Ku Zhag, /0/008 Revew for the prevous lecture Cocepts: t-dstrbuto, F-dstrbuto Theorems: Dstrbutos of sample mea ad sample varace, relatoshp betwee sample mea ad sample

More information

Simple Linear Regression and Correlation. Applied Statistics and Probability for Engineers. Chapter 11 Simple Linear Regression and Correlation

Simple Linear Regression and Correlation. Applied Statistics and Probability for Engineers. Chapter 11 Simple Linear Regression and Correlation 4//6 Appled Statstcs ad Probablty for Egeers Sth Edto Douglas C. Motgomery George C. Ruger Chapter Smple Lear Regresso ad Correlato CHAPTER OUTLINE Smple Lear Regresso ad Correlato - Emprcal Models -8

More information

Lecture 1 Review of Fundamental Statistical Concepts

Lecture 1 Review of Fundamental Statistical Concepts Lecture Revew of Fudametal Statstcal Cocepts Measures of Cetral Tedecy ad Dsperso A word about otato for ths class: Idvduals a populato are desgated, where the dex rages from to N, ad N s the total umber

More information

Chapter 8. Inferences about More Than Two Population Central Values

Chapter 8. Inferences about More Than Two Population Central Values Chapter 8. Ifereces about More Tha Two Populato Cetral Values Case tudy: Effect of Tmg of the Treatmet of Port-We tas wth Lasers ) To vestgate whether treatmet at a youg age would yeld better results tha

More information

Functions of Random Variables

Functions of Random Variables Fuctos of Radom Varables Chapter Fve Fuctos of Radom Varables 5. Itroducto A geeral egeerg aalyss model s show Fg. 5.. The model output (respose) cotas the performaces of a system or product, such as weght,

More information

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS Postpoed exam: ECON430 Statstcs Date of exam: Jauary 0, 0 Tme for exam: 09:00 a.m. :00 oo The problem set covers 5 pages Resources allowed: All wrtte ad prted

More information

ECONOMETRIC THEORY. MODULE VIII Lecture - 26 Heteroskedasticity

ECONOMETRIC THEORY. MODULE VIII Lecture - 26 Heteroskedasticity ECONOMETRIC THEORY MODULE VIII Lecture - 6 Heteroskedastcty Dr. Shalabh Departmet of Mathematcs ad Statstcs Ida Isttute of Techology Kapur . Breusch Paga test Ths test ca be appled whe the replcated data

More information

{ }{ ( )} (, ) = ( ) ( ) ( ) Chapter 14 Exercises in Sampling Theory. Exercise 1 (Simple random sampling): Solution:

{ }{ ( )} (, ) = ( ) ( ) ( ) Chapter 14 Exercises in Sampling Theory. Exercise 1 (Simple random sampling): Solution: Chapter 4 Exercses Samplg Theory Exercse (Smple radom samplg: Let there be two correlated radom varables X ad A sample of sze s draw from a populato by smple radom samplg wthout replacemet The observed

More information

Chapter Two. An Introduction to Regression ( )

Chapter Two. An Introduction to Regression ( ) ubject: A Itroducto to Regresso Frst tage Chapter Two A Itroducto to Regresso (018-019) 1 pg. ubject: A Itroducto to Regresso Frst tage A Itroducto to Regresso Regresso aalss s a statstcal tool for the

More information

Lecture 3. Sampling, sampling distributions, and parameter estimation

Lecture 3. Sampling, sampling distributions, and parameter estimation Lecture 3 Samplg, samplg dstrbutos, ad parameter estmato Samplg Defto Populato s defed as the collecto of all the possble observatos of terest. The collecto of observatos we take from the populato s called

More information

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. x, where. = y - ˆ " 1

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. x, where. = y - ˆ  1 STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS Recall Assumpto E(Y x) η 0 + η x (lear codtoal mea fucto) Data (x, y ), (x 2, y 2 ),, (x, y ) Least squares estmator ˆ E (Y x) ˆ " 0 + ˆ " x, where ˆ

More information

Multiple Choice Test. Chapter Adequacy of Models for Regression

Multiple Choice Test. Chapter Adequacy of Models for Regression Multple Choce Test Chapter 06.0 Adequac of Models for Regresso. For a lear regresso model to be cosdered adequate, the percetage of scaled resduals that eed to be the rage [-,] s greater tha or equal to

More information

Midterm Exam 1, section 2 (Solution) Thursday, February hour, 15 minutes

Midterm Exam 1, section 2 (Solution) Thursday, February hour, 15 minutes coometrcs, CON Sa Fracsco State Uverst Mchael Bar Sprg 5 Mdterm xam, secto Soluto Thursda, Februar 6 hour, 5 mutes Name: Istructos. Ths s closed book, closed otes exam.. No calculators of a kd are allowed..

More information

ENGI 4421 Propagation of Error Page 8-01

ENGI 4421 Propagation of Error Page 8-01 ENGI 441 Propagato of Error Page 8-01 Propagato of Error [Navd Chapter 3; ot Devore] Ay realstc measuremet procedure cotas error. Ay calculatos based o that measuremet wll therefore also cota a error.

More information

( ) = ( ) ( ) Chapter 13 Asymptotic Theory and Stochastic Regressors. Stochastic regressors model

( ) = ( ) ( ) Chapter 13 Asymptotic Theory and Stochastic Regressors. Stochastic regressors model Chapter 3 Asmptotc Theor ad Stochastc Regressors The ature of eplaator varable s assumed to be o-stochastc or fed repeated samples a regresso aalss Such a assumpto s approprate for those epermets whch

More information

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #1

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #1 STA 08 Appled Lear Models: Regresso Aalyss Sprg 0 Soluto for Homework #. Let Y the dollar cost per year, X the umber of vsts per year. The the mathematcal relato betwee X ad Y s: Y 300 + X. Ths s a fuctoal

More information

Correlation and Simple Linear Regression

Correlation and Simple Linear Regression Correlato ad Smple Lear Regresso Berl Che Departmet of Computer Scece & Iformato Egeerg Natoal Tawa Normal Uverst Referece:. W. Navd. Statstcs for Egeerg ad Scetsts. Chapter 7 (7.-7.3) & Teachg Materal

More information

Midterm Exam 1, section 1 (Solution) Thursday, February hour, 15 minutes

Midterm Exam 1, section 1 (Solution) Thursday, February hour, 15 minutes coometrcs, CON Sa Fracsco State Uversty Mchael Bar Sprg 5 Mdterm am, secto Soluto Thursday, February 6 hour, 5 mutes Name: Istructos. Ths s closed book, closed otes eam.. No calculators of ay kd are allowed..

More information

Econ 388 R. Butler 2016 rev Lecture 5 Multivariate 2 I. Partitioned Regression and Partial Regression Table 1: Projections everywhere

Econ 388 R. Butler 2016 rev Lecture 5 Multivariate 2 I. Partitioned Regression and Partial Regression Table 1: Projections everywhere Eco 388 R. Butler 06 rev Lecture 5 Multvarate I. Parttoed Regresso ad Partal Regresso Table : Projectos everywhere P = ( ) ad M = I ( ) ad s a vector of oes assocated wth the costat term Sample Model Regresso

More information

Investigation of Partially Conditional RP Model with Response Error. Ed Stanek

Investigation of Partially Conditional RP Model with Response Error. Ed Stanek Partally Codtoal Radom Permutato Model 7- vestgato of Partally Codtoal RP Model wth Respose Error TRODUCTO Ed Staek We explore the predctor that wll result a smple radom sample wth respose error whe a

More information

Simple Linear Regression - Scalar Form

Simple Linear Regression - Scalar Form Smple Lear Regresso - Scalar Form Q.. Model Y X,..., p..a. Derve the ormal equatos that mmze Q. p..b. Solve for the ordary least squares estmators, p..c. Derve E, V, E, V, COV, p..d. Derve the mea ad varace

More information

Class 13,14 June 17, 19, 2015

Class 13,14 June 17, 19, 2015 Class 3,4 Jue 7, 9, 05 Pla for Class3,4:. Samplg dstrbuto of sample mea. The Cetral Lmt Theorem (CLT). Cofdece terval for ukow mea.. Samplg Dstrbuto for Sample mea. Methods used are based o CLT ( Cetral

More information

Simple Linear Regression and Correlation.

Simple Linear Regression and Correlation. Smple Lear Regresso ad Correlato. Correspods to Chapter 0 Tamhae ad Dulop Sldes prepared b Elzabeth Newto (MIT) wth some sldes b Jacquele Telford (Johs Hopks Uverst) Smple lear regresso aalss estmates

More information

ε. Therefore, the estimate

ε. Therefore, the estimate Suggested Aswers, Problem Set 3 ECON 333 Da Hugerma. Ths s ot a very good dea. We kow from the secod FOC problem b) that ( ) SSE / = y x x = ( ) Whch ca be reduced to read y x x = ε x = ( ) The OLS model

More information

X ε ) = 0, or equivalently, lim

X ε ) = 0, or equivalently, lim Revew for the prevous lecture Cocepts: order statstcs Theorems: Dstrbutos of order statstcs Examples: How to get the dstrbuto of order statstcs Chapter 5 Propertes of a Radom Sample Secto 55 Covergece

More information

Chapter 3 Sampling For Proportions and Percentages

Chapter 3 Sampling For Proportions and Percentages Chapter 3 Samplg For Proportos ad Percetages I may stuatos, the characterstc uder study o whch the observatos are collected are qualtatve ature For example, the resposes of customers may marketg surveys

More information

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE THE ROYAL STATISTICAL SOCIETY 00 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER I STATISTICAL THEORY The Socety provdes these solutos to assst caddates preparg for the examatos future years ad for the

More information

Statistics. Correlational. Dr. Ayman Eldeib. Simple Linear Regression and Correlation. SBE 304: Linear Regression & Correlation 1/3/2018

Statistics. Correlational. Dr. Ayman Eldeib. Simple Linear Regression and Correlation. SBE 304: Linear Regression & Correlation 1/3/2018 /3/08 Sstems & Bomedcal Egeerg Departmet SBE 304: Bo-Statstcs Smple Lear Regresso ad Correlato Dr. Ama Eldeb Fall 07 Descrptve Orgasg, summarsg & descrbg data Statstcs Correlatoal Relatoshps Iferetal Geeralsg

More information

CHAPTER VI Statistical Analysis of Experimental Data

CHAPTER VI Statistical Analysis of Experimental Data Chapter VI Statstcal Aalyss of Expermetal Data CHAPTER VI Statstcal Aalyss of Expermetal Data Measuremets do ot lead to a uque value. Ths s a result of the multtude of errors (maly radom errors) that ca

More information

LINEAR REGRESSION ANALYSIS

LINEAR REGRESSION ANALYSIS LINEAR REGRESSION ANALYSIS MODULE V Lecture - Correctg Model Iadequaces Through Trasformato ad Weghtg Dr. Shalabh Departmet of Mathematcs ad Statstcs Ida Isttute of Techology Kapur Aalytcal methods for

More information

REVIEW OF SIMPLE LINEAR REGRESSION SIMPLE LINEAR REGRESSION

REVIEW OF SIMPLE LINEAR REGRESSION SIMPLE LINEAR REGRESSION REVIEW OF SIMPLE LINEAR REGRESSION SIMPLE LINEAR REGRESSION I lear regreo, we coder the frequecy dtrbuto of oe varable (Y) at each of everal level of a ecod varable (X). Y kow a the depedet varable. The

More information

Handout #8. X\Y f(x) 0 1/16 1/ / /16 3/ / /16 3/16 0 3/ /16 1/16 1/8 g(y) 1/16 1/4 3/8 1/4 1/16 1

Handout #8. X\Y f(x) 0 1/16 1/ / /16 3/ / /16 3/16 0 3/ /16 1/16 1/8 g(y) 1/16 1/4 3/8 1/4 1/16 1 Hadout #8 Ttle: Foudatos of Ecoometrcs Course: Eco 367 Fall/05 Istructor: Dr. I-Mg Chu Lear Regresso Model So far we have focused mostly o the study of a sgle radom varable, ts correspodg theoretcal dstrbuto,

More information

Example. Row Hydrogen Carbon

Example. Row Hydrogen Carbon SMAM 39 Least Squares Example. Heatg ad combusto aalyses were performed order to study the composto of moo rocks collected by Apollo 4 ad 5 crews. Recorded c ad c of the Mtab output are the determatos

More information

STA 105-M BASIC STATISTICS (This is a multiple choice paper.)

STA 105-M BASIC STATISTICS (This is a multiple choice paper.) DCDM BUSINESS SCHOOL September Mock Eamatos STA 0-M BASIC STATISTICS (Ths s a multple choce paper.) Tme: hours 0 mutes INSTRUCTIONS TO CANDIDATES Do ot ope ths questo paper utl you have bee told to do

More information

Lecture Notes 2. The ability to manipulate matrices is critical in economics.

Lecture Notes 2. The ability to manipulate matrices is critical in economics. Lecture Notes. Revew of Matrces he ablt to mapulate matrces s crtcal ecoomcs.. Matr a rectagular arra of umbers, parameters, or varables placed rows ad colums. Matrces are assocated wth lear equatos. lemets

More information

MEASURES OF DISPERSION

MEASURES OF DISPERSION MEASURES OF DISPERSION Measure of Cetral Tedecy: Measures of Cetral Tedecy ad Dsperso ) Mathematcal Average: a) Arthmetc mea (A.M.) b) Geometrc mea (G.M.) c) Harmoc mea (H.M.) ) Averages of Posto: a) Meda

More information

Lecture 2: Linear Least Squares Regression

Lecture 2: Linear Least Squares Regression Lecture : Lear Least Squares Regresso Dave Armstrog UW Mlwaukee February 8, 016 Is the Relatoshp Lear? lbrary(car) data(davs) d 150) Davs$weght[d]

More information

Lecture 1: Introduction to Regression

Lecture 1: Introduction to Regression Lecture : Itroducto to Regresso A Eample: Eplag State Homcde Rates What kds of varables mght we use to epla/predct state homcde rates? Let s cosder just oe predctor for ow: povert Igore omtted varables,

More information

ENGI 4421 Joint Probability Distributions Page Joint Probability Distributions [Navidi sections 2.5 and 2.6; Devore sections

ENGI 4421 Joint Probability Distributions Page Joint Probability Distributions [Navidi sections 2.5 and 2.6; Devore sections ENGI 441 Jot Probablty Dstrbutos Page 7-01 Jot Probablty Dstrbutos [Navd sectos.5 ad.6; Devore sectos 5.1-5.] The jot probablty mass fucto of two dscrete radom quattes, s, P ad p x y x y The margal probablty

More information

CHAPTER 4 RADICAL EXPRESSIONS

CHAPTER 4 RADICAL EXPRESSIONS 6 CHAPTER RADICAL EXPRESSIONS. The th Root of a Real Number A real umber a s called the th root of a real umber b f Thus, for example: s a square root of sce. s also a square root of sce ( ). s a cube

More information

Multivariate Transformation of Variables and Maximum Likelihood Estimation

Multivariate Transformation of Variables and Maximum Likelihood Estimation Marquette Uversty Multvarate Trasformato of Varables ad Maxmum Lkelhood Estmato Dael B. Rowe, Ph.D. Assocate Professor Departmet of Mathematcs, Statstcs, ad Computer Scece Copyrght 03 by Marquette Uversty

More information

Analyzing Two-Dimensional Data. Analyzing Two-Dimensional Data

Analyzing Two-Dimensional Data. Analyzing Two-Dimensional Data /7/06 Aalzg Two-Dmesoal Data The most commo aaltcal measuremets volve the determato of a ukow cocetrato based o the respose of a aaltcal procedure (usuall strumetal). Such a measuremet requres calbrato,

More information

Reaction Time VS. Drug Percentage Subject Amount of Drug Times % Reaction Time in Seconds 1 Mary John Carl Sara William 5 4

Reaction Time VS. Drug Percentage Subject Amount of Drug Times % Reaction Time in Seconds 1 Mary John Carl Sara William 5 4 CHAPTER Smple Lear Regreo EXAMPLE A expermet volvg fve ubject coducted to determe the relatohp betwee the percetage of a certa drug the bloodtream ad the legth of tme t take the ubject to react to a tmulu.

More information

Previous lecture. Lecture 8. Learning outcomes of this lecture. Today. Statistical test and Scales of measurement. Correlation

Previous lecture. Lecture 8. Learning outcomes of this lecture. Today. Statistical test and Scales of measurement. Correlation Lecture 8 Emprcal Research Methods I434 Quattatve Data aalss II Relatos Prevous lecture Idea behd hpothess testg Is the dfferece betwee two samples a reflecto of the dfferece of two dfferet populatos or

More information

Chapter 11 The Analysis of Variance

Chapter 11 The Analysis of Variance Chapter The Aalyss of Varace. Oe Factor Aalyss of Varace. Radomzed Bloc Desgs (ot for ths course) NIPRL . Oe Factor Aalyss of Varace.. Oe Factor Layouts (/4) Suppose that a expermeter s terested populatos

More information

Third handout: On the Gini Index

Third handout: On the Gini Index Thrd hadout: O the dex Corrado, a tala statstca, proposed (, 9, 96) to measure absolute equalt va the mea dfferece whch s defed as ( / ) where refers to the total umber of dvduals socet. Assume that. The

More information

X X X E[ ] E X E X. is the ()m n where the ( i,)th. j element is the mean of the ( i,)th., then

X X X E[ ] E X E X. is the ()m n where the ( i,)th. j element is the mean of the ( i,)th., then Secto 5 Vectors of Radom Varables Whe workg wth several radom varables,,..., to arrage them vector form x, t s ofte coveet We ca the make use of matrx algebra to help us orgaze ad mapulate large umbers

More information

Special Instructions / Useful Data

Special Instructions / Useful Data JAM 6 Set of all real umbers P A..d. B, p Posso Specal Istructos / Useful Data x,, :,,, x x Probablty of a evet A Idepedetly ad detcally dstrbuted Bomal dstrbuto wth parameters ad p Posso dstrbuto wth

More information

Applied Statistics and Probability for Engineers, 5 th edition February 23, b) y ˆ = (85) =

Applied Statistics and Probability for Engineers, 5 th edition February 23, b) y ˆ = (85) = Appled Statstcs ad Probablty for Egeers, 5 th edto February 3, y.8.7.6.5.4.3.. -5 5 5 x b) y ˆ.3999 +.46(85).6836 c) y ˆ.3999 +.46(9).744 d) ˆ.46-3 a) Regresso Aalyss: Ratg Pots versus Meters per Att The

More information

Chapter 2 Supplemental Text Material

Chapter 2 Supplemental Text Material -. Models for the Data ad the t-test Chapter upplemetal Text Materal The model preseted the text, equato (-3) s more properl called a meas model. ce the mea s a locato parameter, ths tpe of model s also

More information

Simulation Output Analysis

Simulation Output Analysis Smulato Output Aalyss Summary Examples Parameter Estmato Sample Mea ad Varace Pot ad Iterval Estmato ermatg ad o-ermatg Smulato Mea Square Errors Example: Sgle Server Queueg System x(t) S 4 S 4 S 3 S 5

More information

Module 7: Probability and Statistics

Module 7: Probability and Statistics Lecture 4: Goodess of ft tests. Itroducto Module 7: Probablty ad Statstcs I the prevous two lectures, the cocepts, steps ad applcatos of Hypotheses testg were dscussed. Hypotheses testg may be used to

More information

Chapter 2 Simple Linear Regression

Chapter 2 Simple Linear Regression Chapter Smple Lear Regresso. Itroducto ad Least Squares Estmates Regresso aalyss s a method for vestgatg the fuctoal relatoshp amog varables. I ths chapter we cosder problems volvg modelg the relatoshp

More information

Regression. Linear Regression. A Simple Data Display. A Batch of Data. The Mean is 220. A Value of 474. STAT Handout Module 15 1 st of June 2009

Regression. Linear Regression. A Simple Data Display. A Batch of Data. The Mean is 220. A Value of 474. STAT Handout Module 15 1 st of June 2009 STAT Hadout Module 5 st of Jue 9 Lear Regresso Regresso Joh D. Sork, M.D. Ph.D. Baltmore VA Medcal Ceter GRCC ad Uversty of Marylad School of Medce Claude D. Pepper Older Amercas Idepedece Ceter Reducg

More information

THE ROYAL STATISTICAL SOCIETY 2016 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE MODULE 5

THE ROYAL STATISTICAL SOCIETY 2016 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE MODULE 5 THE ROYAL STATISTICAL SOCIETY 06 EAMINATIONS SOLUTIONS HIGHER CERTIFICATE MODULE 5 The Socety s provdg these solutos to assst cadtes preparg for the examatos 07. The solutos are teded as learg ads ad should

More information

Chapter 3 Multiple Linear Regression Model

Chapter 3 Multiple Linear Regression Model Chapter 3 Multple Lear Regresso Model We cosder the problem of regresso whe study varable depeds o more tha oe explaatory or depedet varables, called as multple lear regresso model. Ths model geeralzes

More information

Lecture 1: Introduction to Regression

Lecture 1: Introduction to Regression Lecture : Itroducto to Regresso A Eample: Eplag State Homcde Rates What kds of varables mght we use to epla/predct state homcde rates? Let s cosder just oe predctor for ow: povert Igore omtted varables,

More information

STK4011 and STK9011 Autumn 2016

STK4011 and STK9011 Autumn 2016 STK4 ad STK9 Autum 6 Pot estmato Covers (most of the followg materal from chapter 7: Secto 7.: pages 3-3 Secto 7..: pages 3-33 Secto 7..: pages 35-3 Secto 7..3: pages 34-35 Secto 7.3.: pages 33-33 Secto

More information

best estimate (mean) for X uncertainty or error in the measurement (systematic, random or statistical) best

best estimate (mean) for X uncertainty or error in the measurement (systematic, random or statistical) best Error Aalyss Preamble Wheever a measuremet s made, the result followg from that measuremet s always subject to ucertaty The ucertaty ca be reduced by makg several measuremets of the same quatty or by mprovg

More information

Assignment 5/MATH 247/Winter Due: Friday, February 19 in class (!) (answers will be posted right after class)

Assignment 5/MATH 247/Winter Due: Friday, February 19 in class (!) (answers will be posted right after class) Assgmet 5/MATH 7/Wter 00 Due: Frday, February 9 class (!) (aswers wll be posted rght after class) As usual, there are peces of text, before the questos [], [], themselves. Recall: For the quadratc form

More information

Topic 9. Regression and Correlation

Topic 9. Regression and Correlation BE54W Regresso ad Correlato Page of 43 Topc 9 Regresso ad Correlato Topc. Defto of the Lear Regresso Model... Estmato.... 3. The Aalyss of Varace Table. 4. Assumptos for the Straght Le Regresso. 5. Hypothess

More information

Sum Mean n

Sum Mean n tatstcal Methods I (EXT 75) Page 147 ummary data Itermedate Calculatos X = 83 Y = 8 X = 51 Y = 368 Mea of X = X = 5.1875 Mea of Y = Y = 14.5 XY = 1348 = 16 Correcto factors ad Corrected values (ums of

More information

Lecture 3 Probability review (cont d)

Lecture 3 Probability review (cont d) STATS 00: Itroducto to Statstcal Iferece Autum 06 Lecture 3 Probablty revew (cot d) 3. Jot dstrbutos If radom varables X,..., X k are depedet, the ther dstrbuto may be specfed by specfyg the dvdual dstrbuto

More information

THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA

THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA THE ROYAL STATISTICAL SOCIETY EXAMINATIONS SOLUTIONS GRADUATE DIPLOMA PAPER II STATISTICAL THEORY & METHODS The Socety provdes these solutos to assst caddates preparg for the examatos future years ad for

More information

"It is the mark of a truly intelligent person to be moved by statistics." George Bernard Shaw

It is the mark of a truly intelligent person to be moved by statistics. George Bernard Shaw Chapter 0 Chapter 0 Lear Regresso ad Correlato "It s the mark of a truly tellget perso to be moved by statstcs." George Berard Shaw Source: https://www.google.com.ph/search?q=house+ad+car+pctures&bw=366&bh=667&tbm

More information