Correlation and Regression

Size: px
Start display at page:

Download "Correlation and Regression"

Transcription

1 Correlaton and Regresson otes prepared by Pamela Peterson Drake Index Basc terms and concepts... Smple regresson...5 Multple Regresson...3 Regresson termnology...0 Regresson formulas... Basc terms and concepts. A scatter plot s a graphcal representaton of the relaton between two or more varables. In the scatter plot of two varables x and y, each pont on the plot s an x-y par.. We use regresson and correlaton to descrbe the varaton n one or more varables. A. The varaton s the sum of the squared devatons of a varable. Varaton ( x-x) B. The varaton s the numerator of the varance of a sample: ( x-x) Varance - C. Both the varaton and the varance are measures of the dsperson of a sample. 3. The covarance between two random varables s a statstcal measure of the degree to whch the two varables move together. A. The covarance captures how one varable s dfferent from ts mean as the other varable s dfferent from ts mean. B. A postve covarance ndcates that the varables tend to move together; a negatve covarance ndcates that the varables tend to move n opposte drectons. C. The covarance s calculated as the rato of the covaraton to the sample sze less one: Covarance (x -x)(y-y) - where s the sample sze x s the th observaton on varable x, x s the mean of the varable x observatons, s the th observaton on varable y, and y Example: Home sale prces and square footage Home sales prces (vertcal axs) v. square footage for a sample of 34 home sales n September 005 n St. Luce County. Sales prce $800,000 $700,000 $600,000 $500,000 $400,000 $300,000 $00,000 $00,000 $ ,000,500,000,500 3,000 Square footage Regresson otes, Prepared by Pamela Peterson Drake of

2 y s the mean of the varable y observatons. D. The actual value of the covarance s not meanngful because t s affected by the scale of the two varables. That s why we calculate the correlaton coeffcent to make somethng nterpretable from the covarance nformaton. E. The correlaton coeffcent, r, s a measure of the strength of the relatonshp between or among varables. Calculaton: covarance betwen x and y r standard devaton standard devaton of x of y r (x-x) (y -y) ( -) (x-x) (y-y),n - - ote: Correlaton does not mply causaton. We may say that two varables X and Y are correlated, but that does not mean that X causes Y or that Y causes X they smply are related or assocated wth one another. Example : Calculatng the correlaton coeffcent Devaton of x Squared devaton of x Devaton of y Squared devaton of y Product of devatons Observaton x y x- x (x- x ) y- y (y- y ) (x- x )(y-y ) Sum , Calculatons: x 35/0 3.5 y 46 / s x / s y,34.4 / r 445/ (6.45)(6.33). The type of relatonshp s represented by the correlaton coeffcent: r + perfect postve correlaton + >r > 0 postve relatonshp Regresson otes, Prepared by Pamela Peterson Drake of

3 . r 0 no relatonshp 0 > r > negatve relatonshp r perfect negatve correlaton You can determne the degree of correlaton by lookng at the scatter graphs. If the relaton s upward there s postve correlaton. If the relaton downward there s negatve correlaton. Y 0 < r <.0 Y -.0 < r < 0 X X. The correlaton coeffcent s bound by and +. The closer the coeffcent to or +, the stronger s the correlaton. v. Wth the excepton of the extremes (that s, r.0 or r -), we cannot really talk about the strength of a relatonshp ndcated by the correlaton coeffcent wthout a statstcal test of sgnfcance. v. The hypotheses of nterest regardng the populaton correlaton, ρ, are: ull hypothess H 0 : ρ 0 In other words, there s no correlaton between the two varables Alternatve hypothess H a : ρ / 0 In other words, there s a correlaton between the two varables v. The test statstc s t-dstrbuted wth n- degrees of freedom: t r - - r Example, contnued In the prevous example, r t v. To make a decson, compare the calculated t-statstc wth the crtcal t-statstc for the approprate degrees of freedom and level of sgnfcance. Regresson otes, Prepared by Pamela Peterson Drake 3 of

4 Problem Suppose the correlaton coeffcent s 0. and the number of observatons s 3. What s the calculated test statstc? Is ths sgnfcant correlaton usng a 5% level of sgnfcance? Soluton Hypotheses: H 0 : ρ 0 H a : ρ Calculated t -statstc: t Degrees of freedom 3-3 The crtcal t-value for a 5% level of sgnfcance and 3 degrees of freedom s.04. Therefore, there s no sgnfcant correlaton (.803 falls between the two crtcal values of.04 and +.04). Problem Suppose the correlaton coeffcent s 0.80 and the number of observatons s 6. What s the calculated test statstc? Is ths sgnfcant correlaton usng a % level of sgnfcance? Soluton Hypotheses: H 0 : ρ 0 v. H a : ρ 0 Calculated t -statstc: t The crtcal t-value for a % level of sgnfcance and observatons s Therefore, the null hypothess s rejected and we conclude that there s sgnfcant correlaton. F. An outler s an extreme value of a varable. The outler may be qute large or small (where large and small are defned relatve to the rest of the sample).. An outler may affect the sample statstcs, such as a correlaton coeffcent. It s possble for an outler to affect the result, for example, such that we conclude that there s a sgnfcant relaton when n fact there s none or to conclude that there s no relaton when n fact there s a relaton.. The researcher must exercse judgment (and cauton) when decdng whether to nclude or exclude an observaton. G. Spurous correlaton s the appearance of a relatonshp when n fact there s no relaton. Outlers may result n spurous correlaton.. The correlaton coeffcent does not ndcate a causal relatonshp. Certan data tems may be hghly correlated, but not necessarly a result of a causal relatonshp.. A good example of a spurous correlaton s snowfall and stock prces n January. If we regress hstorcal stock prces on snowfall totals n Mnnesota, we would get a statstcally sgnfcant relatonshp especally for the month of January. Snce there s not an economc reason for ths relatonshp, ths would be an example of spurous correlaton. Regresson otes, Prepared by Pamela Peterson Drake 4 of

5 Smple regresson. Regresson s the analyss of the relaton between one varable and some other varable(s), assumng a lnear relaton. Also referred to as least squares regresson and ordnary least squares (OLS). A. The purpose s to explan the varaton n a varable (that s, how a varable dffers from t's mean value) usng the varaton n one or more other varables. B. Suppose we want to descrbe, explan, or predct why a varable dffers from ts mean. Let the th observaton on ths varable be represented as Y, and let n ndcate the number of observatons. The varaton n Y 's (what we want to explan) s: Varaton of Y SS Total ( y-y) C. The least squares prncple s that the regresson lne s determned by mnmzng the sum of the squares of the vertcal dstances between the actual Y values and the predcted values of Y. Y X A lne s ft through the XY ponts such that the sum of the squared resduals (that s, the sum of the squared the vertcal dstance between the observatons and the lne) s mnmzed.. The varables n a regresson relaton consst of dependent and ndependent varables. A. The dependent varable s the varable whose varaton s beng explaned by the other varable(s). Also referred to as the explaned varable, the endogenous varable, or the predcted varable. B. The ndependent varable s the varable whose varaton s used to explan that of the dependent varable. Also referred to as the explanatory varable, the exogenous varable, or the predctng varable. C. The parameters n a smple regresson equaton are the slope (b ) and the ntercept (b 0 ): y b 0 + b x + ε where y x b 0 b ε s the th observaton on the dependent varable, s the th observaton on the ndependent varable, s the ntercept. s the slope coeffcent, s the resdual for the th observaton. Regresson otes, Prepared by Pamela Peterson Drake 5 of

6 Y b0 0 b X D. The slope, b, s the change n Y for a gven oneunt change n X. The slope can be postve, negatve, or zero, calculated as: b cov(x, Y) var(x) (y y)(x (x x) x) Hnt: Thnk of the regresson lne as the average of the relatonshp between the ndependent varable(s) and the dependent varable. The resdual represents the dstance an observed value of the dependent varables (.e., Y) s away from the average relatonshp as depcted by the regresson lne. Suppose that: (y y)(x x),000 (x x) Then, ˆb A short -cut formula for the slope coeffcent: x y (y y)(x x) xy b (x x) x x Whether ths s truly a short-cut or not depends on the method of performng the calculatons: by hand, usng Mcrosoft Excel, or usng a calculator. E. The ntercept, b 0, s the lne s ntersecton wth the Y-axs at X0. The ntercept can be postve, negatve, or zero. The ntercept s calculated as: ˆb 0y-bx Regresson otes, Prepared by Pamela Peterson Drake 6 of

7 3. Lnear regresson assumes the followng: A. A lnear relatonshp exsts between dependent and ndependent varable. ote: f the relaton s not lnear, t may be possble to transform one or both varables so that there s a lnear relaton. B. The ndependent varable s uncorrelated wth the resduals; that s, the ndependent varable s not random. C. The expected value of the dsturbance term s zero; that s, E(ε )0 Example, contnued: D. There s a constant varance of the dsturbance term; that s, the dsturbance or resdual terms are all drawn from a dstrbuton wth an dentcal varance. In other words, the dsturbance terms are homoskedaststc. [A volaton of ths s referred to as heteroskedastcty.] E. The resduals are ndependently dstrbuted; that s, the resdual or dsturbance for one observaton s not correlated wth that of another observaton. [A volaton of ths s referred to as autocorrelaton.] F. The dsturbance term (a.k.a. resdual, a.k.a. error term) s normally dstrbuted. 4. The standard error of the estmate, SEE, (also referred to as the standard error of the resdual or standard error of the regresson, and often ndcated as s e ) s the standard devaton of predcted dependent varable values about the estmated regresson lne. 5. Standard error of the estmate (SEE) s SSResdual e Home sales prces (vertcal axs) v. square footage for a sample of 34 home sales n September 005 n St. Luce County. Sales prce $800,000 $600,000 $400,000 $00,000 $0 -$00,000 -$400,000 -,000 0,000,000 3,000 4,000 Square footage SEE ( y bˆ 0 bˆ x ) (y yˆ ) eˆ where SS Resdual s the sum of squared errors; ^ ndcates the predcted or estmated value of the varable or parameter; and ŷ I bˆ + ˆ x, s a pont on the regresson lne correspondng to a value of the 0 b ndependent varable, the x ; the expected value of y, gven the estmated mean relaton between x and y. Regresson otes, Prepared by Pamela Peterson Drake 7 of

8 A. The standard error of the estmate helps us gauge the "ft" of the regresson lne; that s, how well we have descrbed the varaton n the dependent varable.. The smaller the standard error, the better the ft... v. The standard error of the estmate s a measure of close the estmated values (usng the estmated regresson), the ŷ's, are to the actual values, the Y's. The ε s (a.k.a. the dsturbance terms; a.k.a. the resduals) are the vertcal dstance between the observed value of Y and that predcted by the equaton, the ŷ's. The ε s are n the same terms (unt of measure) as the Y s (e.g., dollars, pounds, bllons) 6. The coeffcent of determnaton, R, s the percentage of varaton n the dependent varable (varaton of Y 's or the sum of squares total, SST) explaned by the ndependent varable(s). A. The coeffcent of determnaton s calculated as: R Explaned varaton Total varaton Example, contnued Consder the followng observatons on X and Y: Observaton X Y Sum The estmated regresson lne s: y x and the resduals are calculated as: Observaton x y ŷ y- ŷ ε Total 0,83.63 Therefore, SS Resdaul / SEE Total varaton Unexplaned varaton SS SS Total SSResdual Total varaton SS SS Total Regresson B. An R of 0.49 ndcates that the ndependent varables explan 49% of the varaton n the dependent varable. Total Regresson otes, Prepared by Pamela Peterson Drake 8 of

9 Example, contnued Contnung the prevous regresson example, we can calculate t he R : Observaton x y (y- y) ŷ Y-ŷ ( ŷ - y) ε Total 46, ,83.63 R /, % or R (,83.63 /,34.40) % 7. A confdence nterval s the range of regresson coeffcent values for a gven value estmate of the coeffcent and a gven level of probablty. A. The confdence nterval for a regresson coeffcent bˆ s calculated as: or bˆ ± t c s bˆ bˆ t c s < b bˆ < bˆ + t c s bˆ where t c s the crtcal t-value for the selected confdence level. If there are 30 degrees of freedom and a 95% confdence level, t c s.04 [taken from a t-table]. B. The nterpretaton of the confdence nterval s that ths s an nterval that we beleve wll nclude the true parameter ( s n the case above) wth the specfed level of confdence. bˆ 8. As the standard error of the estmate (the varablty of the data about the regresson lne) rses, the confdence wdens. In other words, the more varable the data, the less confdent you wll be when you re usng the regresson model to estmate the coeffcent. 9. The standard error of the coeffcent s the square root of the rato of the varance of the regresson to the varaton n the ndependent varable: s bˆ n (x s e x) A. Hypothess testng: an ndvdual explanatory varable Regresson otes, Prepared by Pamela Peterson Drake 9 of

10 . To test the hypothess of the slope coeffcent (that s, to see whether the estmated slope s equal to a hypotheszed value, b 0, Ho: b b, we calculate a t-dstrbuted statstc: ˆb -b t b s ˆb. The test statstc s t dstrbuted wth k degrees of freedom (number of observatons (), less the number of ndependent varables (k), less one). B. If the t statstc s greater than the crtcal t value for the approprate degrees of freedom, (or less than the crtcal t value for a negatve slope) we can say that the slope coeffcent s dfferent from the hypotheszed value, b. C. If there s no relaton between the dependent and an ndependent varable, the slope coeffcent, b, would be zero. Y b0 0 ote: The formula for the standard error of the coeffcent has the varaton of the ndependent varable n the denomnator, not the varance. The varance varaton / n-. X b 0 A zero slope ndcates that there s no change n Y for a gven change n X A zero slope ndcates that there s no relatonshp between Y and X. D. To test whether an ndependent varable explans the varaton n the dependent varable, the hypothess that s tested s whether the slope s zero: Ho: b 0 versus the alternatve (what you conclude f you reject the null, Ho): Ha: b / 0 Ths alternatve hypothess s referred to as a two-sded hypothess. Ths means that we reject the null f the observed slope s dfferent from zero n ether drecton (postve or negatve). E. There are hypotheses n economcs that refer to the sgn of the relaton between the dependent and the ndependent varables. In ths case, the alternatve s drectonal (> or <) and the t-test s one-sded (uses only one tal of the t-dstrbuton). In the case of a one-sded alternatve, there s only one crtcal t-value. Regresson otes, Prepared by Pamela Peterson Drake 0 of

11 Example 3: Testng the sgnfcance of a slope coeffcent Suppose the estmated slope coeffcent s 0.78, the sample sze s 6, the standard error of the coeffcent s 0.3, and the level of sgnfcance s 5%. Is the slope dfference than zero? The calculated test statstc s: t b The crtcal t-values are ±.060: bˆ b s bˆ Reject H 0 Fal to reject H 0 Reject H 0 Therefore, we reject the null hypothess, concludng that the slope s dfferent from zero. 0. Interpretaton of coeffcents. A. The estmated ntercept s nterpreted as the value of the dependent varable (the Y) f the ndependent varable (the X) takes on a value of zero. B. The estmated slope coeffcent s nterpreted as the change n the dependent varable for a gven one-unt change n the ndependent varable. C. Any conclusons regardng the mportance of an ndependent varable n explanng a dependent varable requres determnng the statstcal sgnfcance f the slope coeffcent. Smply lookng at the magntude of the slope coeffcent does not address ths ssue of the mportance of the varable.. Forecastng s usng regresson nvolves makng predctons about the dependent varable based on average relatonshps observed n the estmated regresson. A. Predcted values are values of the dependent varable based on the estmated regresson coeffcents and a predcton about the values of the ndependent varables. B. For a smple regresson, the value of Y s predcted as: Example 4 Suppose you estmate a regresson model wth the followng estmates: ŷ X In addton, you have forecasted value for the ndependent varable, X 0. The forecasted value for y s 5.5: ŷ (0) Regresson otes, Prepared by Pamela Peterson Drake of

12 where ŷ x p ˆ + ˆ ŷ b 0 b x p s the predcted value of the dependent varable, and s the predcted value of the ndependent varable (nput).. An analyss of varance table (AOVA table) table s a summary of the explanaton of the varaton n the dependent varable. The basc form of the AOVA table s as follows: Source of varaton Degrees of freedom Sum of squares Mean square Regresson (explaned) Sum of squares regresson (SS Regresson ) Error (unexplaned) - Sum of squares resdual (SS Resdual ) Total - Sum of squares total (SS Total ) Mean square regresson SS Regresson Mean square error SS Resdual - Example 5 Source of Degrees of Sum of Mean varaton freedom squares square Regresson (explaned) Error (unexplaned) Total R 5, or 89.38% 5,650 SEE Regresson otes, Prepared by Pamela Peterson Drake of

13 Multple Regresson. Multple regresson s regresson analyss wth more than one ndependent varable. A. The concept of multple regresson s dentcal to that of smple regresson analyss except that two or more ndependent varables are used smultaneously to explan varatons n the dependent varable. B. In a multple regresson, the goal s to mnmze the sum of the squared errors. Each slope coeffcent s estmated whle holdng the other varables constant. y b 0 + b x + b x + b 3 x 3 + b 4 x 4. The ntercept n the regresson equaton has the same nterpretaton as t dd under the smple lnear case the ntercept s the value of the dependent varable when all ndependent varables are equal zero. 3. The slope coeffcent s the parameter that reflects the change n the dependent varable for a one unt change n the ndependent varable. A. The slope coeffcents (the betas) are descrbed as the movement n the dependent varable for a one unt change n the ndependent varable holdng all other ndependent varables constant. B. For ths reason, beta coeffcents n a multple lnear regresson are sometmes called partal betas or partal regresson coeffcents. 4. Regresson model: Y b 0 + b x + b x + ε We do not represent the multple regresson graphcally because t would requre graphs that are n more than two dmensons. A slope by any other name The slope coeffcent s the elastcty of the dependent varable wth respect to the ndependent varable. In other words, t s the frst dervatve of the dependent varable wth respect to the ndependent varable. where: b j x j s the slope coeffcent on the j th ndependent varable; and s the th observaton on the j th varable. A. The degrees of freedom for the test of a slope coeffcent are -k-, where n s the number of observatons n the sample and k s the number of ndependent varables. B. In multple regresson, the ndependent varables may be correlated wth one another, resultng n less relable estmates. Ths problem s referred to as multcollnearty. 5. A confdence nterval for a populaton regresson slope n a multple regresson s an nterval centered on the estmated slope: bˆ ± t c s or bˆ { bˆ } t c s < b bˆ < + t c s bˆ A. Ths s the same nterval usng n smple regresson for the nterval of a slope coeffcent. B. If ths nterval contans zero, we conclude that the slope s not statstcally dfferent from zero. bˆ Regresson otes, Prepared by Pamela Peterson Drake 3 of

14 6. The assumptons of the multple regresson model are as follows: A. A lnear relatonshp exsts between dependent and ndependent varables. B. The ndependent varables are uncorrelated wth the resduals; that s, the ndependent varable s not random. In addton, there s no exact lnear relaton between two or more ndependent varables. [ote: ths s modfed slghtly from the assumptons of the smple regresson model.] C. The expected value of the dsturbance term s zero; that s, E(ε )0 D. There s a constant varance of the dsturbance term; that s, the dsturbance or resdual terms are all drawn from a dstrbuton wth an dentcal varance. In other words, the dsturbance terms are homoskedaststc. [A volaton of ths s referred to as heteroskedastcty.] E. The resduals are ndependently dstrbuted; that s, the resdual or dsturbance for one observaton s not correlated wth that of another observaton. [A volaton of ths s referred to as autocorrelaton.] F. The dsturbance term (a.k.a. resdual, a.k.a. error term) s normally dstrbuted. G. The resdual (a.k.a. dsturbance term, a.k.a. error term) s what s not explaned by the ndependent varables. 7. In a regresson wth two ndependent varables, the resdual for the th observaton s: ε Y ( bˆ 0 + bˆ x + bˆ x ) 8. The standard error of the estmate (SEE) s the standard error of the resdual: s e eˆ t t SSE SEE k k 9. The degrees of freedom, df, are calculated as: number of number of df k (k + ) observato ns ndependen t varable s A. The degrees of freedom are the number of ndependent peces of nformaton that are used to estmate the regresson parameters. In calculatng the regresson parameters, we use the followng peces of nformaton: The mean of the dependent varable. The mean of each of the ndependent varables. B. Therefore, f the regresson s a smple regresson, we use the two degrees of freedom n estmatng the regresson lne. f the regresson s a multple regresson wth four ndependent varables, we use fve degrees of freedom n the estmaton of the regresson lne. Regresson otes, Prepared by Pamela Peterson Drake 4 of

15 0. Forecastng s usng regresson nvolves makng predctons about the dependent varable based on average relatonshps observed n the estmated regresson. A. Predcted values are values of the dependent varable based on the estmated regresson coeffcents and a predcton about the values of the ndependent varables. B. For a smple regresson, the value of y s predcted as: where ŷ bˆ xˆ yˆ bˆ ˆ + ˆ 0 + bxˆ b xˆ s the predcted value of the dependent varable, s the estmated parameter, and s the predcted value of the ndependent varable C. The better the ft of the regresson (that s, the smaller s SEE), the more confdent we are n our predctons. Example 6: Usng analyss of varance nformaton Suppose we estmate a multple regresson model that has fve ndependent varables usng a sample of 65 observatons. If the sum of squared resduals s 789, what s the standard error of the estmate? Soluton Gven: SEE SS Resdual k Cauton: The estmated ntercept and all the estmated slopes are used n the predcton of the dependent varable value, even f a slope s not statstcally sgnfcantly dfferent from zero. Example 7: Calculatng a forecasted value Suppose you estmate a regresson model wth the followng estmates: Y^ X 0. X +.5 X 3 In addton, you have forecasted values for the ndependent varables: X 0 X 0 X 3 50 What s the forecasted value of y? Soluton The forecasted value for Y s 90: Y^ (0) 0.0 (0) +.5 (50) The F-statstc s a measure of how well a set of ndependent varables, as a group, explan the varaton n the dependent varable. A. The F-statstc s calculated as: Regresson otes, Prepared by Pamela Peterson Drake 5 of

16 (y ˆ y) SSRegresson Mean squared regresson MSR k k F Mean squared error MSE SS Resdual (y ˆy) -k- k B. The F statstc can be formulated to test all ndependent varables as a group (the most common applcaton). For example, f there are four ndependent varables n the model, the hypotheses are: H 0 : b b b3 b 4 0 Ha: at least one b 0 C. The F-statstc can be formulated to test subsets of ndependent varables (to see whether they have ncremental explanatory power). For example f there are four ndependent varables n the model, a subset could be examned: H 0 : bb 40 Ha: b or b 4 0. The coeffcent of determnaton, R, s the percentage of varaton n the dependent varable explaned by the ndependent varables. R Explaned varaton Total varaton Total Unexplaned - varaton varaton Total varaton R (y ˆ y) (y y) 0 < R < A. By constructon, R ranges from 0 to.0 B. The adjusted-r s an alternatve to R : R ( R k ). The adjusted R s less than or equal to R ( equal to only when k)... Addng ndependent varables to the model wll ncrease R. Addng ndependent varables to the model may ncrease or decrease the adjusted-r (ote: adjusted-r can even be negatve). The adjusted R does not have the clean explanaton of explanatory power that the R has. 3. The purpose of the Analyss of Varance (AOVA) table s to attrbute the total varaton of the dependent varable to the regresson model (the regresson source n column ) and the resduals (the error source from column ). A. SS Total s the total varaton of Y about ts mean or average value (a.k.a. total sum of squares) and s computed as: Regresson otes, Prepared by Pamela Peterson Drake 6 of

17 SS (y -y) Total where y s the mean of Y. n B. SS Resdual (a.k.a. SSE) s the varablty that s unexplaned by the regresson and s computed as: n Resdual ˆ ˆ SS SSE (y-y) e where s Ŷ the value of the dependent varable usng the regresson equaton. C. SS Regresson (a.k.a. SS Explaned ) s the varablty that s explaned by the regresson equaton and s computed as SS Total SS Resdual. SS (y-y) ˆ Regresson D. MSE s the mean square error, or MSE SS Resdual / ( k - ) where k s the number of ndependent varables n the regresson. E. MSR s the mean square regresson, MSR SS Regresson / k Analyss of Varance Table (AOVA) Source df (Degrees of Freedom) SS (Sum of Squares) Mean Square (SS/df) Regresson k SS Regresson MSR Error -k- SS Resdual MSE Total - SS Total SS SS R - SS SS Regresson Resdual Total Total MSR F MSE 4. Dummy varables are qualtatve varables that take on a value of zero or one. A. Most ndependent varables represent a contnuous flow of values. However, sometmes the ndependent varable s of a bnary nature (t s ether O or OFF). B. These types of varables are called dummy varables and the data s assgned a value of "0" or "". In many cases, you apply the dummy varable concept to quantfy the mpact of a qualtatve varable. A dummy varable s a dchotomous varable; that s, t takes on a value of one or zero. Regresson otes, Prepared by Pamela Peterson Drake 7 of

18 C. Use one dummy varable less than the number of classes (e.g., f have three classes, use two dummy varables), otherwse you fall nto the dummy varable "trap" (perfect multcollnearty volatng assumpton []). D. An nteractve dummy varable s a dummy varable (0,) multpled by a varable to create a new varable. The slope on ths new varable tells us the ncremental slope. 5. Heteroskedastcty s the stuaton n whch the varance of the resduals s not constant across all observatons. A. An assumpton of the regresson methodology s that the sample s drawn from the same populaton, and that the varance of resduals s constant across observatons; n other words, the resduals are homoskedastc. B. Heteroskedastcty s a problem because the estmators do not have the smallest possble varance, and therefore the standard errors of the coeffcents would not be correct. 6. Autocorrelaton s the stuaton n whch the resdual terms are correlated wth one another. Ths occurs frequently n tme-seres analyss. A. Autocorrelaton usually appears n tme seres data. If last year s earnngs were hgh, ths means that ths year s earnngs may have a greater probablty of beng hgh than beng low. Ths s an example of postve autocorrelaton. When a good year s always followed by a bad year, ths s negatve autocorrelaton. B. Autocorrelaton s a problem because the estmators do not have the smallest possble varance and therefore the standard errors of the coeffcents would not be correct. 7. Multcollnearty s the problem of hgh correlaton between or among two or more ndependent varables. A. Multcollnearty s a problem because. The presence of multcollnearty can cause dstortons n the standard error and may lead to problems wth sgnfcance testng of ndvdual coeffcents, and. Estmates are senstve to changes n the sample observatons or the model specfcaton. B. If there s multcollnearty, we are more lkely to conclude a varable s not mportant. C. Multcollnearty s lkely present to some degree n most economc models. Perfect multcollnearty would prohbt us from estmatng the regresson parameters. The ssue then s really a one of degree. 8. The economc meanng of the results of a regresson estmaton focuses prmarly on the slope coeffcents. A. The slope coeffcents ndcate the change n the dependent varable for a one-unt change n the ndependent varable. Ths slope can than be nterpreted as an elastcty measure; that s, the change n one varable correspondng to a change n another varable. B. It s possble to have statstcal sgnfcance, yet not have economc sgnfcance (e.g., sgnfcant abnormal returns assocated wth an announcement, but these returns are not suffcent to cover transactons costs). Regresson otes, Prepared by Pamela Peterson Drake 8 of

19 To test the role of a sngle varable n explanng the varaton n the dependent varable test the role of all varables n explanng the varaton n the dependent varable estmate the change n the dependent varable for a oneunt change n the ndependent varable estmate the dependent varable f all of the ndependent varables take on a value of zero estmate the percentage of the dependent varable s varaton explaned by the ndependent varables forecast the value of the dependent varable gven the estmated values of the ndependent varable(s) use the t -statstc. the F-statstc. the slope coeffcent. the ntercept. the R. the regresson equaton, substtutng the estmated values of the ndependent varable(s) n the equaton. Regresson otes, Prepared by Pamela Peterson Drake 9 of

20 Regresson termnology Analyss of varance AOVA Autocorrelaton Coeffcent of determnaton Confdence nterval Correlaton coeffcent Covarance Covaraton Cross-sectonal Degrees of freedom Dependent varable Explaned varable Explanatory varable Forecast F-statstc Heteroskedastcty Homoskedastcty Independent varable Intercept Least squares regresson Mean square error Mean square regresson Multcollnearty Multple regresson egatve correlaton Ordnary least squares Perfect negatve correlaton Perfect postve correlaton Postve correlaton Predcted value R Regresson Resdual Scatterplot s e SEE Smple regresson Slope Slope coeffcent Spurous correlaton SS Resdual SS Regresson SS Total Standard error of the estmate Sum of squares error Sum of squares regresson Sum of squares total Tme-seres t-statstc Varance Varaton Regresson otes, Prepared by Pamela Peterson Drake 0 of

21 Regresson formulas Varances Varaton ( x x ) Varance ( x x) Covarance (x-x)(y-y) - Correlaton r (x,n x) (x x) (y y) ( ) (y y) t r - - r Regresson y b 0 + b x + ε y b 0 + b x + b x + b 3 x 3 + b 4 x 4 + ε b cov(x, Y) var(x) (y y)(x (x x) x) ˆb 0y-bx Tests and confdence ntervals ( y bˆ 0 bˆ x ) s e (y yˆ ) eˆ s bˆ n (x s e x) t b ˆb -b s ˆb bˆ t c s < b bˆ < bˆ + t c s bˆ (y ˆ y) SSRegresson Mean squared regresson MSR k k F Mean squared error MSE SS Resdual (y ˆy) -k- k Smple Regresson, prepared by Pamela Peterson Drake

22 Forecastng ŷb ˆ 0+bx ˆ p+bx ˆ p+bx ˆ ˆ 3p+...+bx Kp Analyss of Varance SS (y -y) Total n SS SS R - n Resdual ˆ ˆ Regresson SS SSE (y-y) e Regresson Resdual SS Total SSTotal (y ˆ y) (y y) (y ˆ y) SSRegresson Mean squared regresson MSR k k F Mean squared error MSE SS Resdual (y ˆy) -k- k SS (y-y) ˆ Smple Regresson, prepared by Pamela Peterson Drake

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

Statistics for Business and Economics

Statistics for Business and Economics Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 1 Chapters 14, 15 & 16 Professor Ahmad, Ph.D. Department of Management Revsed August 005 Chapter 14 Formulas Smple Lnear Regresson Model: y =

More information

Chapter 14 Simple Linear Regression

Chapter 14 Simple Linear Regression Chapter 4 Smple Lnear Regresson Chapter 4 - Smple Lnear Regresson Manageral decsons often are based on the relatonshp between two or more varables. Regresson analss can be used to develop an equaton showng

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9 Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 14 Multple Regresson Models 1999 Prentce-Hall, Inc. Chap. 14-1 Chapter Topcs The Multple Regresson Model Contrbuton of Indvdual Independent Varables

More information

Economics 130. Lecture 4 Simple Linear Regression Continued

Economics 130. Lecture 4 Simple Linear Regression Continued Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do

More information

18. SIMPLE LINEAR REGRESSION III

18. SIMPLE LINEAR REGRESSION III 8. SIMPLE LINEAR REGRESSION III US Domestc Beers: Calores vs. % Alcohol Ftted Values and Resduals To each observed x, there corresponds a y-value on the ftted lne, y ˆ ˆ = α + x. The are called ftted values.

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

Chapter 15 Student Lecture Notes 15-1

Chapter 15 Student Lecture Notes 15-1 Chapter 15 Student Lecture Notes 15-1 Basc Busness Statstcs (9 th Edton) Chapter 15 Multple Regresson Model Buldng 004 Prentce-Hall, Inc. Chap 15-1 Chapter Topcs The Quadratc Regresson Model Usng Transformatons

More information

Learning Objectives for Chapter 11

Learning Objectives for Chapter 11 Chapter : Lnear Regresson and Correlaton Methods Hldebrand, Ott and Gray Basc Statstcal Ideas for Managers Second Edton Learnng Objectves for Chapter Usng the scatterplot n regresson analyss Usng the method

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study

More information

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise. Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an

More information

Chapter 15 - Multiple Regression

Chapter 15 - Multiple Regression Chapter - Multple Regresson Chapter - Multple Regresson Multple Regresson Model The equaton that descrbes how the dependent varable y s related to the ndependent varables x, x,... x p and an error term

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

28. SIMPLE LINEAR REGRESSION III

28. SIMPLE LINEAR REGRESSION III 8. SIMPLE LINEAR REGRESSION III Ftted Values and Resduals US Domestc Beers: Calores vs. % Alcohol To each observed x, there corresponds a y-value on the ftted lne, y ˆ = βˆ + βˆ x. The are called ftted

More information

Chapter 14 Simple Linear Regression Page 1. Introduction to regression analysis 14-2

Chapter 14 Simple Linear Regression Page 1. Introduction to regression analysis 14-2 Chapter 4 Smple Lnear Regresson Page. Introducton to regresson analyss 4- The Regresson Equaton. Lnear Functons 4-4 3. Estmaton and nterpretaton of model parameters 4-6 4. Inference on the model parameters

More information

STATISTICS QUESTIONS. Step by Step Solutions.

STATISTICS QUESTIONS. Step by Step Solutions. STATISTICS QUESTIONS Step by Step Solutons www.mathcracker.com 9//016 Problem 1: A researcher s nterested n the effects of famly sze on delnquency for a group of offenders and examnes famles wth one to

More information

Chapter 9: Statistical Inference and the Relationship between Two Variables

Chapter 9: Statistical Inference and the Relationship between Two Variables Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,

More information

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students. PPOL 59-3 Problem Set Exercses n Smple Regresson Due n class /8/7 In ths problem set, you are asked to compute varous statstcs by hand to gve you a better sense of the mechancs of the Pearson correlaton

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

January Examinations 2015

January Examinations 2015 24/5 Canddates Only January Examnatons 25 DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR STUDENT CANDIDATE NO.. Department Module Code Module Ttle Exam Duraton (n words)

More information

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis Resource Allocaton and Decson Analss (ECON 800) Sprng 04 Foundatons of Regresson Analss Readng: Regresson Analss (ECON 800 Coursepak, Page 3) Defntons and Concepts: Regresson Analss statstcal technques

More information

Introduction to Regression

Introduction to Regression Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε Chapter 3 Secton 3.1 Model Assumptons: Multple Regresson Model Predcton Equaton Std. Devaton of Error Correlaton Matrx Smple Lnear Regresson: 1.) Lnearty.) Constant Varance 3.) Independent Errors 4.) Normalty

More information

e i is a random error

e i is a random error Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where + β + β e for,..., and are observable varables e s a random error How can an estmaton rule be constructed for the unknown

More information

Scatter Plot x

Scatter Plot x Construct a scatter plot usng excel for the gven data. Determne whether there s a postve lnear correlaton, negatve lnear correlaton, or no lnear correlaton. Complete the table and fnd the correlaton coeffcent

More information

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction ECONOMICS 35* -- NOTE 7 ECON 35* -- NOTE 7 Interval Estmaton n the Classcal Normal Lnear Regresson Model Ths note outlnes the basc elements of nterval estmaton n the Classcal Normal Lnear Regresson Model

More information

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes 25/6 Canddates Only January Examnatons 26 Student Number: Desk Number:...... DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR Department Module Code Module Ttle Exam Duraton

More information

Outline. Zero Conditional mean. I. Motivation. 3. Multiple Regression Analysis: Estimation. Read Wooldridge (2013), Chapter 3.

Outline. Zero Conditional mean. I. Motivation. 3. Multiple Regression Analysis: Estimation. Read Wooldridge (2013), Chapter 3. Outlne 3. Multple Regresson Analyss: Estmaton I. Motvaton II. Mechancs and Interpretaton of OLS Read Wooldrdge (013), Chapter 3. III. Expected Values of the OLS IV. Varances of the OLS V. The Gauss Markov

More information

Statistics II Final Exam 26/6/18

Statistics II Final Exam 26/6/18 Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

Chapter 3. Two-Variable Regression Model: The Problem of Estimation

Chapter 3. Two-Variable Regression Model: The Problem of Estimation Chapter 3. Two-Varable Regresson Model: The Problem of Estmaton Ordnary Least Squares Method (OLS) Recall that, PRF: Y = β 1 + β X + u Thus, snce PRF s not drectly observable, t s estmated by SRF; that

More information

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the Chapter 11 Student Lecture Notes 11-1 Lnear regresson Wenl lu Dept. Health statstcs School of publc health Tanjn medcal unversty 1 Regresson Models 1. Answer What Is the Relatonshp Between the Varables?.

More information

Statistics MINITAB - Lab 2

Statistics MINITAB - Lab 2 Statstcs 20080 MINITAB - Lab 2 1. Smple Lnear Regresson In smple lnear regresson we attempt to model a lnear relatonshp between two varables wth a straght lne and make statstcal nferences concernng that

More information

Midterm Examination. Regression and Forecasting Models

Midterm Examination. Regression and Forecasting Models IOMS Department Regresson and Forecastng Models Professor Wllam Greene Phone: 22.998.0876 Offce: KMC 7-90 Home page: people.stern.nyu.edu/wgreene Emal: wgreene@stern.nyu.edu Course web page: people.stern.nyu.edu/wgreene/regresson/outlne.htm

More information

The Ordinary Least Squares (OLS) Estimator

The Ordinary Least Squares (OLS) Estimator The Ordnary Least Squares (OLS) Estmator 1 Regresson Analyss Regresson Analyss: a statstcal technque for nvestgatng and modelng the relatonshp between varables. Applcatons: Engneerng, the physcal and chemcal

More information

where I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X).

where I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X). 11.4.1 Estmaton of Multple Regresson Coeffcents In multple lnear regresson, we essentally solve n equatons for the p unnown parameters. hus n must e equal to or greater than p and n practce n should e

More information

This column is a continuation of our previous column

This column is a continuation of our previous column Comparson of Goodness of Ft Statstcs for Lnear Regresson, Part II The authors contnue ther dscusson of the correlaton coeffcent n developng a calbraton for quanttatve analyss. Jerome Workman Jr. and Howard

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

SIMPLE LINEAR REGRESSION

SIMPLE LINEAR REGRESSION Smple Lnear Regresson and Correlaton Introducton Prevousl, our attenton has been focused on one varable whch we desgnated b x. Frequentl, t s desrable to learn somethng about the relatonshp between two

More information

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y) Secton 1.5 Correlaton In the prevous sectons, we looked at regresson and the value r was a measurement of how much of the varaton n y can be attrbuted to the lnear relatonshp between y and x. In ths secton,

More information

Basically, if you have a dummy dependent variable you will be estimating a probability.

Basically, if you have a dummy dependent variable you will be estimating a probability. ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy

More information

17 - LINEAR REGRESSION II

17 - LINEAR REGRESSION II Topc 7 Lnear Regresson II 7- Topc 7 - LINEAR REGRESSION II Testng and Estmaton Inferences about β Recall that we estmate Yˆ ˆ β + ˆ βx. 0 μ Y X x β0 + βx usng To estmate σ σ squared error Y X x ε s ε we

More information

NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION MTH352/MH3510 Regression Analysis

NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION MTH352/MH3510 Regression Analysis NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION 014-015 MTH35/MH3510 Regresson Analyss December 014 TIME ALLOWED: HOURS INSTRUCTIONS TO CANDIDATES 1. Ths examnaton paper contans FOUR (4) questons

More information

Regression. The Simple Linear Regression Model

Regression. The Simple Linear Regression Model Regresson Smple Lnear Regresson Model Least Squares Method Coeffcent of Determnaton Model Assumptons Testng for Sgnfcance Usng the Estmated Regresson Equaton for Estmaton and Predcton Resdual Analss: Valdatng

More information

7.1. Single classification analysis of variance (ANOVA) Why not use multiple 2-sample 2. When to use ANOVA

7.1. Single classification analysis of variance (ANOVA) Why not use multiple 2-sample 2. When to use ANOVA Sngle classfcaton analyss of varance (ANOVA) When to use ANOVA ANOVA models and parttonng sums of squares ANOVA: hypothess testng ANOVA: assumptons A non-parametrc alternatve: Kruskal-Walls ANOVA Power

More information

β0 + β1xi. You are interested in estimating the unknown parameters β

β0 + β1xi. You are interested in estimating the unknown parameters β Ordnary Least Squares (OLS): Smple Lnear Regresson (SLR) Analytcs The SLR Setup Sample Statstcs Ordnary Least Squares (OLS): FOCs and SOCs Back to OLS and Sample Statstcs Predctons (and Resduals) wth OLS

More information

Linear Regression Analysis: Terminology and Notation

Linear Regression Analysis: Terminology and Notation ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented

More information

β0 + β1xi and want to estimate the unknown

β0 + β1xi and want to estimate the unknown SLR Models Estmaton Those OLS Estmates Estmators (e ante) v. estmates (e post) The Smple Lnear Regresson (SLR) Condtons -4 An Asde: The Populaton Regresson Functon B and B are Lnear Estmators (condtonal

More information

CHAPTER 8. Exercise Solutions

CHAPTER 8. Exercise Solutions CHAPTER 8 Exercse Solutons 77 Chapter 8, Exercse Solutons, Prncples of Econometrcs, 3e 78 EXERCISE 8. When = N N N ( x x) ( x x) ( x x) = = = N = = = N N N ( x ) ( ) ( ) ( x x ) x x x x x = = = = Chapter

More information

Lecture 4 Hypothesis Testing

Lecture 4 Hypothesis Testing Lecture 4 Hypothess Testng We may wsh to test pror hypotheses about the coeffcents we estmate. We can use the estmates to test whether the data rejects our hypothess. An example mght be that we wsh to

More information

x yi In chapter 14, we want to perform inference (i.e. calculate confidence intervals and perform tests of significance) in this setting.

x yi In chapter 14, we want to perform inference (i.e. calculate confidence intervals and perform tests of significance) in this setting. The Practce of Statstcs, nd ed. Chapter 14 Inference for Regresson Introducton In chapter 3 we used a least-squares regresson lne (LSRL) to represent a lnear relatonshp etween two quanttatve explanator

More information

Reminder: Nested models. Lecture 9: Interactions, Quadratic terms and Splines. Effect Modification. Model 1

Reminder: Nested models. Lecture 9: Interactions, Quadratic terms and Splines. Effect Modification. Model 1 Lecture 9: Interactons, Quadratc terms and Splnes An Manchakul amancha@jhsph.edu 3 Aprl 7 Remnder: Nested models Parent model contans one set of varables Extended model adds one or more new varables to

More information

Properties of Least Squares

Properties of Least Squares Week 3 3.1 Smple Lnear Regresson Model 3. Propertes of Least Squares Estmators Y Y β 1 + β X + u weekly famly expendtures X weekly famly ncome For a gven level of x, the expected level of food expendtures

More information

Chapter 4: Regression With One Regressor

Chapter 4: Regression With One Regressor Chapter 4: Regresson Wth One Regressor Copyrght 2011 Pearson Addson-Wesley. All rghts reserved. 1-1 Outlne 1. Fttng a lne to data 2. The ordnary least squares (OLS) lne/regresson 3. Measures of ft 4. Populaton

More information

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition) Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes

More information

Answers Problem Set 2 Chem 314A Williamsen Spring 2000

Answers Problem Set 2 Chem 314A Williamsen Spring 2000 Answers Problem Set Chem 314A Wllamsen Sprng 000 1) Gve me the followng crtcal values from the statstcal tables. a) z-statstc,-sded test, 99.7% confdence lmt ±3 b) t-statstc (Case I), 1-sded test, 95%

More information

Biostatistics 360 F&t Tests and Intervals in Regression 1

Biostatistics 360 F&t Tests and Intervals in Regression 1 Bostatstcs 360 F&t Tests and Intervals n Regresson ORIGIN Model: Y = X + Corrected Sums of Squares: X X bar where: s the y ntercept of the regresson lne (translaton) s the slope of the regresson lne (scalng

More information

Polynomial Regression Models

Polynomial Regression Models LINEAR REGRESSION ANALYSIS MODULE XII Lecture - 6 Polynomal Regresson Models Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Test of sgnfcance To test the sgnfcance

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

Chapter 8 Multivariate Regression Analysis

Chapter 8 Multivariate Regression Analysis Chapter 8 Multvarate Regresson Analyss 8.3 Multple Regresson wth K Independent Varables 8.4 Sgnfcance tests of Parameters Populaton Regresson Model For K ndependent varables, the populaton regresson and

More information

Diagnostics in Poisson Regression. Models - Residual Analysis

Diagnostics in Poisson Regression. Models - Residual Analysis Dagnostcs n Posson Regresson Models - Resdual Analyss 1 Outlne Dagnostcs n Posson Regresson Models - Resdual Analyss Example 3: Recall of Stressful Events contnued 2 Resdual Analyss Resduals represent

More information

Continuous vs. Discrete Goods

Continuous vs. Discrete Goods CE 651 Transportaton Economcs Charsma Choudhury Lecture 3-4 Analyss of Demand Contnuous vs. Dscrete Goods Contnuous Goods Dscrete Goods x auto 1 Indfference u curves 3 u u 1 x 1 0 1 bus Outlne Data Modelng

More information

experimenteel en correlationeel onderzoek

experimenteel en correlationeel onderzoek expermenteel en correlatoneel onderzoek lecture 6: one-way analyss of varance Leary. Introducton to Behavoral Research Methods. pages 246 271 (chapters 10 and 11): conceptual statstcs Moore, McCabe, and

More information

Chapter 10. What is Regression Analysis? Simple Linear Regression Analysis. Examples

Chapter 10. What is Regression Analysis? Simple Linear Regression Analysis. Examples Chapter 10 Smple Lnear Regresson Analyss What s Regresson Analyss? A statstcal technque that descrbes the relatonshp between a dependent varable and one or more ndependent varables. Examples Consder the

More information

STAT 511 FINAL EXAM NAME Spring 2001

STAT 511 FINAL EXAM NAME Spring 2001 STAT 5 FINAL EXAM NAME Sprng Instructons: Ths s a closed book exam. No notes or books are allowed. ou may use a calculator but you are not allowed to store notes or formulas n the calculator. Please wrte

More information

Sociology 301. Bivariate Regression. Clarification. Regression. Liying Luo Last exam (Exam #4) is on May 17, in class.

Sociology 301. Bivariate Regression. Clarification. Regression. Liying Luo Last exam (Exam #4) is on May 17, in class. Socology 30 Bvarate Regresson Lyng Luo 04.28 Clarfcaton Last exam (Exam #4) s on May 7, n class. No exam n the fnal exam week (May 24). Regresson Regresson Analyss: the procedure for esjmajng and tesjng

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.

More information

LINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables

LINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables LINEAR REGRESSION ANALYSIS MODULE VIII Lecture - 7 Indcator Varables Dr. Shalabh Department of Maematcs and Statstcs Indan Insttute of Technology Kanpur Indcator varables versus quanttatve explanatory

More information

The SAS program I used to obtain the analyses for my answers is given below.

The SAS program I used to obtain the analyses for my answers is given below. Homework 1 Answer sheet Page 1 The SAS program I used to obtan the analyses for my answers s gven below. dm'log;clear;output;clear'; *************************************************************; *** EXST7034

More information

Module Contact: Dr Susan Long, ECO Copyright of the University of East Anglia Version 1

Module Contact: Dr Susan Long, ECO Copyright of the University of East Anglia Version 1 UNIVERSITY OF EAST ANGLIA School of Economcs Man Seres PG Examnaton 016-17 ECONOMETRIC METHODS ECO-7000A Tme allowed: hours Answer ALL FOUR Questons. Queston 1 carres a weght of 5%; Queston carres 0%;

More information

Statistics Chapter 4

Statistics Chapter 4 Statstcs Chapter 4 "There are three knds of les: les, damned les, and statstcs." Benjamn Dsrael, 1895 (Brtsh statesman) Gaussan Dstrbuton, 4-1 If a measurement s repeated many tmes a statstcal treatment

More information

Biostatistics. Chapter 11 Simple Linear Correlation and Regression. Jing Li

Biostatistics. Chapter 11 Simple Linear Correlation and Regression. Jing Li Bostatstcs Chapter 11 Smple Lnear Correlaton and Regresson Jng L jng.l@sjtu.edu.cn http://cbb.sjtu.edu.cn/~jngl/courses/2018fall/b372/ Dept of Bonformatcs & Bostatstcs, SJTU Recall eat chocolate Cell 175,

More information

β0 + β1xi. You are interested in estimating the unknown parameters β

β0 + β1xi. You are interested in estimating the unknown parameters β Revsed: v3 Ordnar Least Squares (OLS): Smple Lnear Regresson (SLR) Analtcs The SLR Setup Sample Statstcs Ordnar Least Squares (OLS): FOCs and SOCs Back to OLS and Sample Statstcs Predctons (and Resduals)

More information

Chapter 5: Hypothesis Tests, Confidence Intervals & Gauss-Markov Result

Chapter 5: Hypothesis Tests, Confidence Intervals & Gauss-Markov Result Chapter 5: Hypothess Tests, Confdence Intervals & Gauss-Markov Result 1-1 Outlne 1. The standard error of 2. Hypothess tests concernng β 1 3. Confdence ntervals for β 1 4. Regresson when X s bnary 5. Heteroskedastcty

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed

More information

Chapter 3 Describing Data Using Numerical Measures

Chapter 3 Describing Data Using Numerical Measures Chapter 3 Student Lecture Notes 3-1 Chapter 3 Descrbng Data Usng Numercal Measures Fall 2006 Fundamentals of Busness Statstcs 1 Chapter Goals To establsh the usefulness of summary measures of data. The

More information

Econ107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10)

Econ107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10) I. Defnton and Problems Econ7 Appled Econometrcs Topc 9: Heteroskedastcty (Studenmund, Chapter ) We now relax another classcal assumpton. Ths s a problem that arses often wth cross sectons of ndvduals,

More information

Lecture 3 Stat102, Spring 2007

Lecture 3 Stat102, Spring 2007 Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture

More information

a. (All your answers should be in the letter!

a. (All your answers should be in the letter! Econ 301 Blkent Unversty Taskn Econometrcs Department of Economcs Md Term Exam I November 8, 015 Name For each hypothess testng n the exam complete the followng steps: Indcate the test statstc, ts crtcal

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 31 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 6. Rdge regresson The OLSE s the best lnear unbased

More information

REGRESSION ANALYSIS II- MULTICOLLINEARITY

REGRESSION ANALYSIS II- MULTICOLLINEARITY REGRESSION ANALYSIS II- MULTICOLLINEARITY QUESTION 1 Departments of Open Unversty of Cyprus A and B consst of na = 35 and nb = 30 students respectvely. The students of department A acheved an average test

More information

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications Durban Watson for Testng the Lack-of-Ft of Polynomal Regresson Models wthout Replcatons Ruba A. Alyaf, Maha A. Omar, Abdullah A. Al-Shha ralyaf@ksu.edu.sa, maomar@ksu.edu.sa, aalshha@ksu.edu.sa Department

More information

A Comparative Study for Estimation Parameters in Panel Data Model

A Comparative Study for Estimation Parameters in Panel Data Model A Comparatve Study for Estmaton Parameters n Panel Data Model Ahmed H. Youssef and Mohamed R. Abonazel hs paper examnes the panel data models when the regresson coeffcents are fxed random and mxed and

More information