Linear correlation and linear regression

Size: px
Start display at page:

Download "Linear correlation and linear regression"

Transcription

1 Lnear correlaton and lnear regresson

2 Contnuous outcome (means) Outcome Varable Contnuous (e.g. pan scale, cogntve functon) Are the observatons ndependent or correlated? ndependent Ttest: compares means between two ndependent groups ANOVA: compares means between more than two ndependent groups Pearson s correlaton coeffcent (lnear correlaton): shows lnear correlaton between two contnuous varables Lnear regresson: multvarate regresson technque used when the outcome s contnuous; gves slopes correlated Pared ttest: compares means between two related groups (e.g., the same subjects before and after) Repeated-measures ANOVA: compares changes over tme n the means of two or more groups (repeated measurements) Mxed models/gee modelng: multvarate regresson technques to compare changes over tme between two or more groups; gves rate of change over tme Alternatves f the normalty assumpton s volated (and small sample sze): Non-parametrc statstcs Wlcoxon sgn-rank test: non-parametrc alternatve to the pared ttest Wlcoxon sum-rank test (=Mann-Whtney U test): nonparametrc alternatve to the ttest Kruskal-Walls test: nonparametrc alternatve to ANOVA Spearman rank correlaton coeffcent: non-parametrc alternatve to Pearson s correlaton coeffcent

3 Recall: Covarance n cov( x, y) = = 1 ( x X )( y Y ) n 1

4 Interpretng Covarance cov(x,y) > 0 X and Y are postvely correlated cov(x,y) < 0 X and Y are nversely correlated cov(x,y) = 0 X and Y are ndependent

5 Correlaton coeffcent Pearson s Correlaton Coeffcent s standardzed covarance (untless): r = cov arance( x, y) var x var y

6 Correlaton Measures the relatve strength of the lnear relatonshp between two varables Unt-less Ranges between 1 and 1 The closer to 1, the stronger the negatve lnear relatonshp The closer to 1, the stronger the postve lnear relatonshp The closer to 0, the weaker any postve lnear relatonshp

7 Scatter Plots of Data wth Varous Correlaton Coeffcents Y Y Y Y X X r = -1 r = -.6 r = 0 Y Y X X X r = +1 r = +.3 Slde from: Statstcs for M anagers Usng M crosoft Excel 4th Edton, 2004 Prentce-Hall r = 0 X

8 Lnear Correlaton Lnear relatonshps Curvlnear relatonshps Y Y X X Y Y X Slde from: Statstcs for Managers Usng Mcrosoft Excel 4th Edton, 2004 Prentce-Hall X

9 Lnear Correlaton Strong relatonshps Weak relatonshps Y Y X X Y Y X Slde from: Statstcs for Managers Usng Mcrosoft Excel 4th Edton, 2004 Prentce-Hall X

10 Lnear Correlaton Y No relatonshp X Y Slde from: Statstcs for Managers Usng Mcrosoft Excel 4th Edton, 2004 Prentce-Hall X

11 Calculatng by hand 1 ) ( 1 ) ( 1 ) )( ( var var ), ( cov ˆ = = = = = n y y n x x n y y x x y x y x arance r n n n

12 Smpler calculaton formula y x xy n n n n n n SS SS SS y y x x y y x x n y y n x x n y y x x r = = = = = = = = = ) ( ) ( ) )( ( 1 ) ( 1 ) ( 1 ) )( ( ˆ y x xy SS SS SS r= ˆ Numerator of covarance Numerators of varance

13 Dstrbuton of the correlaton coeffcent: SE( rˆ) = 1 r 2 n 2 The sample correlaton coeffcent follows a T-dstrbuton wth n-2 degrees of freedom (snce you have to estmate the standard error). *note, lke a proporton, the varance of the correlaton coeffcent depends on the correlaton coeffcent tself substtute n estmated r

14 Contnuous outcome (means) Outcome Varable Contnuous (e.g. pan scale, cogntve functon) Are the observatons ndependent or correlated? ndependent Ttest: compares means between two ndependent groups ANOVA: compares means between more than two ndependent groups Pearson s correlaton coeffcent (lnear correlaton): shows lnear correlaton between two contnuous varables Lnear regresson: multvarate regresson technque used when the outcome s contnuous; gves slopes correlated Pared ttest: compares means between two related groups (e.g., the same subjects before and after) Repeated-measures ANOVA: compares changes over tme n the means of two or more groups (repeated measurements) Mxed models/gee modelng: multvarate regresson technques to compare changes over tme between two or more groups; gves rate of change over tme Alternatves f the normalty assumpton s volated (and small sample sze): Non-parametrc statstcs Wlcoxon sgn-rank test: non-parametrc alternatve to the pared ttest Wlcoxon sum-rank test (=Mann-Whtney U test): nonparametrc alternatve to the ttest Kruskal-Walls test: nonparametrc alternatve to ANOVA Spearman rank correlaton coeffcent: non-parametrc alternatve to Pearson s correlaton coeffcent

15 Lnear regresson In correlaton, the two varables are treated as equals. In regresson, one varable s consdered ndependent (=predctor) varable (X) and the other the dependent (=outcome) varable Y.

16 What s Lnear? Remember ths: Y=mX+B? m B

17 What s Slope? A slope of 2 means that every 1-unt change n X yelds a 2-unt change n Y.

18 Predcton If you know somethng about X, ths knowledge helps you predct somethng about Y. (Sound famlar? sound lke condtonal probabltes?)

19 Regresson equaton Expected value of y at a gven level of x= E ( y / x ) =α+ β x

20 Predcted value for an ndvdual y = α + β*x + random error Fxed exactly on the lne Follows a normal dstrbuton

21 Assumptons (or the fne prnt) Lnear regresson assumes that 1. The relatonshp between X and Y s lnear 2. Y s dstrbuted normally at each value of X 3. The varance of Y at every value of X s the same (homogenety of varances) 4. The observatons are ndependent

22 The standard error of Y gven X s the average varablty around the regresson lne at any gven value of X. It s assumed to be equal at all values of X. S y/x S y/x S y/x S y/x S y/x S y/x

23 Regresson Pcture y C A ŷ = β x +α y A B B y C y n n 2 ( y y ) = = 1 = 1 A 2 B 2 C 2 SS total Total squared dstance of observatons from naïve mean of y Total varaton ( yˆ y ) = 1 SS reg Dstance from regresson lne to naïve mean of y Varablty due to x (regresson) 2 + n ( x yˆ y ) 2 *Least squares estmaton gave us the lne (β) that mnmzed C 2 R 2 =SS reg /SS total SS resdual Varance around the regresson lne Addtonal varablty not explaned by x what least squares method ams to mnmze

24 Recall example: cogntve functon and vtamn D Hypothetcal data loosely based on [1]; cross-sectonal study of 100 mddleaged and older European men. Cogntve functon s measured by the Dgt Symbol Substtuton Test (DSST). 1. Lee DM, Tajar A, Ulubaev A, et al. Assocaton between 25-hydroxyvtamn D levels and cogntve performance n mddle-aged and older European men. J Neurol Neurosurg Psychatry Jul;80(7):722-9.

25 Dstrbuton of vtamn D Mean= 63 nmol/l Standard devaton = 33 nmol/l

26 Dstrbuton of DSST Normally dstrbuted Mean = 28 ponts Standard devaton = 10 ponts

27 Four hypothetcal datasets I generated four hypothetcal datasets, wth ncreasng TRUE slopes (between vt D and DSST): ponts per 10 nmol/l 1.0 ponts per 10 nmol/l 1.5 ponts per 10 nmol/l

28 Dataset 1: no relatonshp

29 Dataset 2: weak relatonshp

30 Dataset 3: weak to moderate relatonshp

31 Dataset 4: moderate relatonshp

32 The Best ft lne Regresson equaton: E(Y ) = *vt D (n 10 nmol/l)

33 The Best ft lne Note how the lne s a lttle deceptve; t draws your eye, makng the relatonshp appear stronger than t really s! Regresson equaton: E(Y ) = *vt D (n 10 nmol/l)

34 The Best ft lne Regresson equaton: E(Y ) = *vt D (n 10 nmol/l)

35 The Best ft lne Regresson equaton: E(Y ) = *vt D (n 10 nmol/l) Note: all the lnes go through the pont (63, 28)!

36 Estmatng the ntercept and slope: least squares estmaton ** Least Squares Estmaton A lttle calculus. What are we tryng to estmate? β, the slope, from What s the constrant? We are tryng to mnmze the squared dstance (hence the least squares ) between the observatons themselves and the predcted values, or (also called the resduals, or left-over unexplaned varablty) Dfference = y (βx + α) Dfference 2 = (y (βx + α)) 2 Fnd the β that gves the mnmum sum of the squared dfferences. How do you maxmze a functon? Take the dervatve; set t equal to zero; and solve. Typcal max/mn problem from calculus. d dβ 2( n = 1 n = 1 ( y ( y ( βx x + βx + α)) )) = 0... ( y From here takes a lttle math trckery to solve for β αx = 2( n = 1 βx α)( x ))

37 Resultng formulas Slope (beta coeffcent) = βˆ = Cov( x, y) Var( x) Intercept= Calculate : α ˆ = y - β ˆx Regresson lne always goes through the pont: ( x, y)

38 Relatonshp wth correlaton rˆ= βˆ SD SD x y In correlaton, the two varables are treated as equals. In regresson, one varable s consdered ndependent (=predctor) varable (X) and the other the dependent (=outcome) varable Y.

39 Example: dataset 4 SDx = 33 nmol/l SDy= 10 ponts Cov(X,Y) = 163 ponts*nmol/l βˆ SS SS x y Beta = 163/33 2 = 0.15 ponts per nmol/l = 1.5 ponts per 10 nmol/l r = 163/(10*33) = 0.49 Or r = 0.15 * (33/10) = 0.49

40 Sgnfcance testng Slope Dstrbuton of slope ~ T n-2 (β,s.e.( )) βˆ H 0 : β 1 = 0 H 1 : β 1 0 T n-2 = (no lnear relatonshp) (lnear relatonshp does exst) ˆ β s. e.( 0 β ˆ)

41 Formula for the standard error of beta (you wll not have to calculate by hand!): n x y x x β α ˆ ˆ ˆ and ) ( where SS 1 2 x + = = = x x y x n SS s SS n y y s 2 / 1 2 ˆ 2 ) ˆ ( = = = β

42 Example: dataset 4 Standard error (beta) = 0.03 T 98 = 0.15/0.03 = 5, p< % Confdence nterval = 0.09 to 0.21

43 Resdual Analyss: check assumptons The resdual for observaton, e, s the dfference between ts observed and predcted value Check the assumptons of regresson by examnng the resduals Examne for lnearty assumpton Examne for constant varance for all levels of X (homoscedastcty) Evaluate normal dstrbuton assumpton Evaluate ndependence assumpton Graphcal Analyss of Resduals e Can plot resduals vs. X = Y Yˆ

44 Predcted values yˆ = x For Vtamn D = 95 nmol/l (or 9.5 n 10 nmol/l): ˆ = (9.5) = y 34

45 Resdual = observed - predcted X=95 nmol/l y = yˆ y = 34 yˆ = 14

46 Resdual Analyss for Lnearty Y Y x x resduals x resduals x Not Lnear Lnear Slde from: Statstcs for Managers Usng Mcrosoft Excel 4th Edton, 2004 Prentce-Hall

47 Resdual Analyss for Homoscedastcty Y Y x x resduals x resduals x Non-constant varance Constant varance Slde from: Statstcs for Managers Usng Mcrosoft Excel 4th Edton, 2004 Prentce-Hall

48 Resdual Analyss for Independence Not Independent Independent resduals X resduals X resduals X Slde from: Statstcs for Managers Usng Mcrosoft Excel 4th Edton, 2004 Prentce-Hall

49 Resdual plot, dataset 4

50 Multple lnear regresson What f age s a confounder here? Older men have lower vtamn D Older men have poorer cognton Adjust for age by puttng age n the model: DSST score = ntercept + slope 1 xvtamn D + slope 2 xage

51 2 predctors: age and vt D

52 Dfferent 3D vew

53 Ft a plane rather than a lne On the plane, the slope for vtamn D s the same at every age; thus, the slope for vtamn D represents the effect of vtamn D when age s held constant.

54 Equaton of the Best ft plane DSST score = xvtamn D (n 10 nmol/l) xage (n years) P-value for vtamn D >>.05 P-value for age <.0001 Thus, relatonshp wth vtamn D was due to confoundng by age!

55 Multple Lnear Regresson More than one predctor E(y)=α + β 1 *X + β 2 *W + β 3 *Z Each regresson coeffcent s the amount of change n the outcome varable that would be expected per one-unt change of the predctor, f all other varables n the model were held constant.

56 Functons of multvarate analyss: Control for confounders Test for nteractons between predctors (effect modfcaton) Improve predctons

57 A ttest s lnear regresson! Dvde vtamn D nto two groups: Insuffcent vtamn D (<50 nmol/l) Suffcent vtamn D (>=50 nmol/l), reference group We can evaluate these data wth a ttest or a lnear regresson = 7.5 T = = 3.46; p =

58 As a lnear regresson Intercept represents the mean value n the suffcent group. Slope represents the dfference n means between the groups. Dfference s sgnfcant. Parameter ````````````````Standard Varable Estmate Error t Value Pr > t Intercept <.0001 nsuff

59 ANOVA s lnear regresson! Dvde vtamn D nto three groups: Defcent (<25 nmol/l) Insuffcent (>=25 and <50 nmol/l) Suffcent (>=50 nmol/l), reference group DSST= α (=value for suffcent) + β nsuffcent *(1 f nsuffcent) + β 2 *(1 f defcent) Ths s called dummy codng where multple bnary varables are created to represent beng n each category (or not) of a categorcal varable

60 The pcture Suffcent vs. Insuffcent Suffcent vs. Defcent

61 Results Parameter Estmates Varable Parameter Standard DF Estmate Error t Value Pr > t Intercept <.0001 defcent nsuffcent Interpretaton: The defcent group has a mean DSST 9.87 ponts lower than the reference (suffcent) group. The nsuffcent group has a mean DSST 6.87 ponts lower than the reference (suffcent) group.

62 Other types of multvarate regresson Multple lnear regresson s for normally dstrbuted outcomes Logstc regresson s for bnary outcomes Cox proportonal hazards regresson s used when tme-to-event s the outcome

63 Common multvarate regresson models. Outcome (dependent varable) Example outcome varable Approprate multvarate regresson model Example equaton What do the coeffcents gve you? Contnuous Blood pressure Lnear regresson blood pressure (mmhg) = α + βsalt*salt consumpton (tsp/day) + βage*age (years) + βsmoker*ever smoker (yes=1/no=0) slopes tells you how much the outcome varable ncreases for every 1-unt ncrease n each predctor. Bnary Hgh blood pressure (yes/no) Logstc regresson ln (odds of hgh blood pressure) = α + βsalt*salt consumpton (tsp/day) + βage*age (years) + βsmoker*ever smoker (yes=1/no=0) odds ratos tells you how much the odds of the outcome ncrease for every 1-unt ncrease n each predctor. Tme-to-event Tme-todeath Cox regresson ln (rate of death) = α + βsalt*salt consumpton (tsp/day) + βage*age (years) + βsmoker*ever smoker (yes=1/no=0) hazard ratos tells you how much the rate of the outcome ncreases for every 1-unt ncrease n each predctor.

64 Multvarate regresson ptfalls Mult-collnearty Resdual confoundng Overfttng

65 Multcollnearty Multcollnearty arses when two varables that measure the same thng or smlar thngs (e.g., weght and BMI) are both ncluded n a multple regresson model; they wll, n effect, cancel each other out and generally destroy your model. Model buldng and dagnostcs are trcky busness!

66 Resdual confoundng You cannot completely wpe out confoundng smply by adjustng for varables n multple regresson unless varables are measured wth zero error (whch s usually mpossble). Example: meat eatng and mortalty

67 Men who eat a lot of meat are unhealther for many reasons! Snha R, Cross AJ, Graubard BI, Letzmann MF, Schatzkn A. Meat ntake and mortalty: a prospectve study of over half a mllon people. Arch Intern Med 2009;169:562-71

68 Mortalty rsks Snha R, Cross AJ, Graubard BI, Letzmann MF, Schatzkn A. Meat ntake and mortalty: a prospectve study of over half a mllon people. Arch Intern Med 2009;169:562-71

69 Overfttng In multvarate modelng, you can get hghly sgnfcant but meanngless results f you put too many predctors n the model. The model s ft perfectly to the qurks of your partcular sample, but has no predctve ablty n a new sample.

70 Overfttng: class data example I asked SAS to automatcally fnd predctors of optmsm n our class dataset. Here s the resultng lnear regresson model: Parameter Standard Varable Estmate Error Type II SS F Value Pr > F Intercept exercse sleep obama <.0001 Clnton mathlove Exercse, sleep, and hgh ratngs for Clnton are negatvely related to optmsm (hghly sgnfcant!) and hgh ratngs for Obama and hgh love of math are postvely related to optmsm (hghly sgnfcant!).

71 If somethng seems to good to be true Clnton, unvarate: Parameter Standard Varable Label DF Estmate Error t Value Pr > t Intercept Intercept Clnton Clnton Sleep, Unvarate: Parameter Standard Varable Label DF Estmate Error t Value Pr > t Intercept Intercept Exercse, Unvarate: sleep sleep Parameter Standard Varable Label DF Estmate Error t Value Pr > t Intercept Intercept <.0001 exercse exercse

72 More unvarate models Obama, Unvarate: Parameter Standard Varable Label DF Estmate Error t Value Pr > t Intercept Intercept obama obama Compare wth multvarate result; p<.0001 Love of Math, unvarate: Parameter Standard Varable Label DF Estmate Error t Value Pr > t Intercept Intercept mathlove mathlove Compare wth multvarate result; p=.0011

73 Overfttng Rule of thumb: You need at least 10 subjects for each addtonal predctor varable n the multvarate regresson model. Pure nose varables stll produce good R 2 values f the model s overftted. The dstrbuton of R 2 values from a seres of smulated regresson models contanng only nose varables. (Fgure 1 from: Babyak, MA. What You See May Not Be What You Get: A Bref, Nontechncal Introducton to Overfttng n Regresson-Type Models. Psychosomatc Medcne 66: (2004).)

74 Revew of statstcal tests The followng table gves the approprate choce of a statstcal test or measure of assocaton for varous types of data (outcome varables and predctor varables) by study desgn. e.g., blood pressure= pounds + age + treatment (1/0) Contnuous outcome Contnuous predctors Bnary predctor

75 Types of varables to be analyzed Predctor varable/s Outcome varable Cross-sectonal/case-control studes Statstcal procedure or measure of assocaton Bnary (two groups) Contnuous Bnary Ranks/ordnal T-test Wlcoxon rank-sum test Categorcal (>2 groups) Contnuous ANOVA Contnuous Contnuous Smple lnear regresson Multvarate (categorcal and Contnuous Multple lnear regresson contnuous) Categorcal Categorcal Ch-square test (or Fsher s exact) Bnary Bnary Odds rato, rsk rato Multvarate Bnary Logstc regresson Cohort Studes/Clncal Trals Bnary Bnary Rsk rato Categorcal Tme-to-event Kaplan-Meer/ log-rank test Multvarate Tme-to-event Cox-proportonal hazards regresson, hazard rato Categorcal Contnuous Repeated measures ANOVA Multvarate Contnuous Mxed models; GEE modelng

76 Alternatve summary: statstcs for varous types of outcome data Are the observatons ndependent or correlated? Outcome Varable ndependent correlated Assumptons Contnuous (e.g. pan scale, cogntve functon) Ttest ANOVA Lnear correlaton Lnear regresson Pared ttest Repeated-measures ANOVA Mxed models/gee modelng Outcome s normally dstrbuted (mportant for small samples). Outcome and predctor have a lnear relatonshp. Bnary or categorcal (e.g. fracture yes/no) Dfference n proportons Relatve rsks Ch-square test Logstc regresson McNemar s test Condtonal logstc regresson GEE modelng Ch-square test assumes suffcent numbers n each cell (>=5) Tme-to-event (e.g. tme to fracture) Kaplan-Meer statstcs Cox regresson n/a Cox regresson assumes proportonal hazards between groups

77 Contnuous outcome (means); HRP 259/HRP 262 Outcome Varable Contnuous (e.g. pan scale, cogntve functon) Are the observatons ndependent or correlated? ndependent Ttest: compares means between two ndependent groups ANOVA: compares means between more than two ndependent groups Pearson s correlaton coeffcent (lnear correlaton): shows lnear correlaton between two contnuous varables Lnear regresson: multvarate regresson technque used when the outcome s contnuous; gves slopes correlated Pared ttest: compares means between two related groups (e.g., the same subjects before and after) Repeated-measures ANOVA: compares changes over tme n the means of two or more groups (repeated measurements) Mxed models/gee modelng: multvarate regresson technques to compare changes over tme between two or more groups; gves rate of change over tme Alternatves f the normalty assumpton s volated (and small sample sze): Non-parametrc statstcs Wlcoxon sgn-rank test: non-parametrc alternatve to the pared ttest Wlcoxon sum-rank test (=Mann-Whtney U test): nonparametrc alternatve to the ttest Kruskal-Walls test: nonparametrc alternatve to ANOVA Spearman rank correlaton coeffcent: non-parametrc alternatve to Pearson s correlaton coeffcent

78 Bnary or categorcal outcomes (proportons); HRP 259/HRP 261 Outcome Varable Bnary or categorcal (e.g. fracture, yes/no) Are the observatons correlated? ndependent Ch-square test: compares proportons between two or more groups Relatve rsks: odds ratos or rsk ratos Logstc regresson: multvarate technque used when outcome s bnary; gves multvarate-adjusted odds ratos correlated McNemar s ch-square test: compares bnary outcome between correlated groups (e.g., before and after) Condtonal logstc regresson: multvarate regresson technque for a bnary outcome when groups are correlated (e.g., matched data) GEE modelng: multvarate regresson technque for a bnary outcome when groups are correlated (e.g., repeated measures) Alternatve to the chsquare test f sparse cells: Fsher s exact test: compares proportons between ndependent groups when there are sparse data (some cells <5). McNemar s exact test: compares proportons between correlated groups when there are sparse data (some cells <5).

79 Tme-to-event outcome (survval data); HRP 262 Outcome Varable Are the observaton groups ndependent or correlated? ndependent correlated Modfcatons to Cox regresson f proportonalhazards s volated: Tme-toevent (e.g., tme to fracture) Kaplan-Meer statstcs: estmates survval functons for each group (usually dsplayed graphcally); compares survval functons wth log-rank test n/a (already over tme) Tme-dependent predctors or tmedependent hazard ratos (trcky!) Cox regresson: Multvarate technque for tme-to-event data; gves multvarate-adjusted hazard ratos

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

Statistics for Business and Economics

Statistics for Business and Economics Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson

More information

Biostatistics. Chapter 11 Simple Linear Correlation and Regression. Jing Li

Biostatistics. Chapter 11 Simple Linear Correlation and Regression. Jing Li Bostatstcs Chapter 11 Smple Lnear Correlaton and Regresson Jng L jng.l@sjtu.edu.cn http://cbb.sjtu.edu.cn/~jngl/courses/2018fall/b372/ Dept of Bonformatcs & Bostatstcs, SJTU Recall eat chocolate Cell 175,

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students. PPOL 59-3 Problem Set Exercses n Smple Regresson Due n class /8/7 In ths problem set, you are asked to compute varous statstcs by hand to gve you a better sense of the mechancs of the Pearson correlaton

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

Chapter 9: Statistical Inference and the Relationship between Two Variables

Chapter 9: Statistical Inference and the Relationship between Two Variables Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study

More information

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε Chapter 3 Secton 3.1 Model Assumptons: Multple Regresson Model Predcton Equaton Std. Devaton of Error Correlaton Matrx Smple Lnear Regresson: 1.) Lnearty.) Constant Varance 3.) Independent Errors 4.) Normalty

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an

More information

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 1 Chapters 14, 15 & 16 Professor Ahmad, Ph.D. Department of Management Revsed August 005 Chapter 14 Formulas Smple Lnear Regresson Model: y =

More information

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the Chapter 11 Student Lecture Notes 11-1 Lnear regresson Wenl lu Dept. Health statstcs School of publc health Tanjn medcal unversty 1 Regresson Models 1. Answer What Is the Relatonshp Between the Varables?.

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

Chapter 15 - Multiple Regression

Chapter 15 - Multiple Regression Chapter - Multple Regresson Chapter - Multple Regresson Multple Regresson Model The equaton that descrbes how the dependent varable y s related to the ndependent varables x, x,... x p and an error term

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

Statistics MINITAB - Lab 2

Statistics MINITAB - Lab 2 Statstcs 20080 MINITAB - Lab 2 1. Smple Lnear Regresson In smple lnear regresson we attempt to model a lnear relatonshp between two varables wth a straght lne and make statstcal nferences concernng that

More information

Introduction to Regression

Introduction to Regression Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

Learning Objectives for Chapter 11

Learning Objectives for Chapter 11 Chapter : Lnear Regresson and Correlaton Methods Hldebrand, Ott and Gray Basc Statstcal Ideas for Managers Second Edton Learnng Objectves for Chapter Usng the scatterplot n regresson analyss Usng the method

More information

Chapter 15 Student Lecture Notes 15-1

Chapter 15 Student Lecture Notes 15-1 Chapter 15 Student Lecture Notes 15-1 Basc Busness Statstcs (9 th Edton) Chapter 15 Multple Regresson Model Buldng 004 Prentce-Hall, Inc. Chap 15-1 Chapter Topcs The Quadratc Regresson Model Usng Transformatons

More information

Correlation and Regression

Correlation and Regression Correlaton and Regresson otes prepared by Pamela Peterson Drake Index Basc terms and concepts... Smple regresson...5 Multple Regresson...3 Regresson termnology...0 Regresson formulas... Basc terms and

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise. Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 14 Multple Regresson Models 1999 Prentce-Hall, Inc. Chap. 14-1 Chapter Topcs The Multple Regresson Model Contrbuton of Indvdual Independent Varables

More information

SIMPLE LINEAR REGRESSION

SIMPLE LINEAR REGRESSION Smple Lnear Regresson and Correlaton Introducton Prevousl, our attenton has been focused on one varable whch we desgnated b x. Frequentl, t s desrable to learn somethng about the relatonshp between two

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

Chapter 14 Simple Linear Regression

Chapter 14 Simple Linear Regression Chapter 4 Smple Lnear Regresson Chapter 4 - Smple Lnear Regresson Manageral decsons often are based on the relatonshp between two or more varables. Regresson analss can be used to develop an equaton showng

More information

17 - LINEAR REGRESSION II

17 - LINEAR REGRESSION II Topc 7 Lnear Regresson II 7- Topc 7 - LINEAR REGRESSION II Testng and Estmaton Inferences about β Recall that we estmate Yˆ ˆ β + ˆ βx. 0 μ Y X x β0 + βx usng To estmate σ σ squared error Y X x ε s ε we

More information

The Ordinary Least Squares (OLS) Estimator

The Ordinary Least Squares (OLS) Estimator The Ordnary Least Squares (OLS) Estmator 1 Regresson Analyss Regresson Analyss: a statstcal technque for nvestgatng and modelng the relatonshp between varables. Applcatons: Engneerng, the physcal and chemcal

More information

28. SIMPLE LINEAR REGRESSION III

28. SIMPLE LINEAR REGRESSION III 8. SIMPLE LINEAR REGRESSION III Ftted Values and Resduals US Domestc Beers: Calores vs. % Alcohol To each observed x, there corresponds a y-value on the ftted lne, y ˆ = βˆ + βˆ x. The are called ftted

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed

More information

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation Econ 388 R. Butler 204 revsons Lecture 4 Dummy Dependent Varables I. Lnear Probablty Model: the Regresson model wth a dummy varables as the dependent varable assumpton, mplcaton regular multple regresson

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9 Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,

More information

β0 + β1xi. You are interested in estimating the unknown parameters β

β0 + β1xi. You are interested in estimating the unknown parameters β Ordnary Least Squares (OLS): Smple Lnear Regresson (SLR) Analytcs The SLR Setup Sample Statstcs Ordnary Least Squares (OLS): FOCs and SOCs Back to OLS and Sample Statstcs Predctons (and Resduals) wth OLS

More information

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva Econ 39 - Statstcal Propertes of the OLS estmator Sanjaya DeSlva September, 008 1 Overvew Recall that the true regresson model s Y = β 0 + β 1 X + u (1) Applyng the OLS method to a sample of data, we estmate

More information

Chapter 14 Simple Linear Regression Page 1. Introduction to regression analysis 14-2

Chapter 14 Simple Linear Regression Page 1. Introduction to regression analysis 14-2 Chapter 4 Smple Lnear Regresson Page. Introducton to regresson analyss 4- The Regresson Equaton. Lnear Functons 4-4 3. Estmaton and nterpretaton of model parameters 4-6 4. Inference on the model parameters

More information

Chapter 3. Two-Variable Regression Model: The Problem of Estimation

Chapter 3. Two-Variable Regression Model: The Problem of Estimation Chapter 3. Two-Varable Regresson Model: The Problem of Estmaton Ordnary Least Squares Method (OLS) Recall that, PRF: Y = β 1 + β X + u Thus, snce PRF s not drectly observable, t s estmated by SRF; that

More information

ANOVA. The Observations y ij

ANOVA. The Observations y ij ANOVA Stands for ANalyss Of VArance But t s a test of dfferences n means The dea: The Observatons y j Treatment group = 1 = 2 = k y 11 y 21 y k,1 y 12 y 22 y k,2 y 1, n1 y 2, n2 y k, nk means: m 1 m 2

More information

e i is a random error

e i is a random error Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where + β + β e for,..., and are observable varables e s a random error How can an estmaton rule be constructed for the unknown

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

STAT 405 BIOSTATISTICS (Fall 2016) Handout 15 Introduction to Logistic Regression

STAT 405 BIOSTATISTICS (Fall 2016) Handout 15 Introduction to Logistic Regression STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson Ths handout covers materal found n Secton 3.7 of your text. You may also want to revew regresson technques n Chapter. In ths handout,

More information

Basically, if you have a dummy dependent variable you will be estimating a probability.

Basically, if you have a dummy dependent variable you will be estimating a probability. ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy

More information

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y) Secton 1.5 Correlaton In the prevous sectons, we looked at regresson and the value r was a measurement of how much of the varaton n y can be attrbuted to the lnear relatonshp between y and x. In ths secton,

More information

x yi In chapter 14, we want to perform inference (i.e. calculate confidence intervals and perform tests of significance) in this setting.

x yi In chapter 14, we want to perform inference (i.e. calculate confidence intervals and perform tests of significance) in this setting. The Practce of Statstcs, nd ed. Chapter 14 Inference for Regresson Introducton In chapter 3 we used a least-squares regresson lne (LSRL) to represent a lnear relatonshp etween two quanttatve explanator

More information

18. SIMPLE LINEAR REGRESSION III

18. SIMPLE LINEAR REGRESSION III 8. SIMPLE LINEAR REGRESSION III US Domestc Beers: Calores vs. % Alcohol Ftted Values and Resduals To each observed x, there corresponds a y-value on the ftted lne, y ˆ ˆ = α + x. The are called ftted values.

More information

Topic 7: Analysis of Variance

Topic 7: Analysis of Variance Topc 7: Analyss of Varance Outlne Parttonng sums of squares Breakdown the degrees of freedom Expected mean squares (EMS) F test ANOVA table General lnear test Pearson Correlaton / R 2 Analyss of Varance

More information

where I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X).

where I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X). 11.4.1 Estmaton of Multple Regresson Coeffcents In multple lnear regresson, we essentally solve n equatons for the p unnown parameters. hus n must e equal to or greater than p and n practce n should e

More information

STATISTICS QUESTIONS. Step by Step Solutions.

STATISTICS QUESTIONS. Step by Step Solutions. STATISTICS QUESTIONS Step by Step Solutons www.mathcracker.com 9//016 Problem 1: A researcher s nterested n the effects of famly sze on delnquency for a group of offenders and examnes famles wth one to

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

Linear Regression Analysis: Terminology and Notation

Linear Regression Analysis: Terminology and Notation ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented

More information

Diagnostics in Poisson Regression. Models - Residual Analysis

Diagnostics in Poisson Regression. Models - Residual Analysis Dagnostcs n Posson Regresson Models - Resdual Analyss 1 Outlne Dagnostcs n Posson Regresson Models - Resdual Analyss Example 3: Recall of Stressful Events contnued 2 Resdual Analyss Resduals represent

More information

β0 + β1xi. You are interested in estimating the unknown parameters β

β0 + β1xi. You are interested in estimating the unknown parameters β Revsed: v3 Ordnar Least Squares (OLS): Smple Lnear Regresson (SLR) Analtcs The SLR Setup Sample Statstcs Ordnar Least Squares (OLS): FOCs and SOCs Back to OLS and Sample Statstcs Predctons (and Resduals)

More information

Regression Analysis. Regression Analysis

Regression Analysis. Regression Analysis Regresson Analyss Smple Regresson Multvarate Regresson Stepwse Regresson Replcaton and Predcton Error 1 Regresson Analyss In general, we "ft" a model by mnmzng a metrc that represents the error. n mn (y

More information

Reduced slides. Introduction to Analysis of Variance (ANOVA) Part 1. Single factor

Reduced slides. Introduction to Analysis of Variance (ANOVA) Part 1. Single factor Reduced sldes Introducton to Analss of Varance (ANOVA) Part 1 Sngle factor 1 The logc of Analss of Varance Is the varance explaned b the model >> than the resdual varance In regresson models Varance explaned

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data Lab : TWO-LEVEL NORMAL MODELS wth school chldren popularty data Purpose: Introduce basc two-level models for normally dstrbuted responses usng STATA. In partcular, we dscuss Random ntercept models wthout

More information

7.1. Single classification analysis of variance (ANOVA) Why not use multiple 2-sample 2. When to use ANOVA

7.1. Single classification analysis of variance (ANOVA) Why not use multiple 2-sample 2. When to use ANOVA Sngle classfcaton analyss of varance (ANOVA) When to use ANOVA ANOVA models and parttonng sums of squares ANOVA: hypothess testng ANOVA: assumptons A non-parametrc alternatve: Kruskal-Walls ANOVA Power

More information

Chapter 4: Regression With One Regressor

Chapter 4: Regression With One Regressor Chapter 4: Regresson Wth One Regressor Copyrght 2011 Pearson Addson-Wesley. All rghts reserved. 1-1 Outlne 1. Fttng a lne to data 2. The ordnary least squares (OLS) lne/regresson 3. Measures of ft 4. Populaton

More information

Midterm Examination. Regression and Forecasting Models

Midterm Examination. Regression and Forecasting Models IOMS Department Regresson and Forecastng Models Professor Wllam Greene Phone: 22.998.0876 Offce: KMC 7-90 Home page: people.stern.nyu.edu/wgreene Emal: wgreene@stern.nyu.edu Course web page: people.stern.nyu.edu/wgreene/regresson/outlne.htm

More information

Scatter Plot x

Scatter Plot x Construct a scatter plot usng excel for the gven data. Determne whether there s a postve lnear correlaton, negatve lnear correlaton, or no lnear correlaton. Complete the table and fnd the correlaton coeffcent

More information

Decision Analysis (part 2 of 2) Review Linear Regression

Decision Analysis (part 2 of 2) Review Linear Regression Harvard-MIT Dvson of Health Scences and Technology HST.951J: Medcal Decson Support, Fall 2005 Instructors: Professor Lucla Ohno-Machado and Professor Staal Vnterbo 6.873/HST.951 Medcal Decson Support Fall

More information

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis Resource Allocaton and Decson Analss (ECON 800) Sprng 04 Foundatons of Regresson Analss Readng: Regresson Analss (ECON 800 Coursepak, Page 3) Defntons and Concepts: Regresson Analss statstcal technques

More information

Outline. Zero Conditional mean. I. Motivation. 3. Multiple Regression Analysis: Estimation. Read Wooldridge (2013), Chapter 3.

Outline. Zero Conditional mean. I. Motivation. 3. Multiple Regression Analysis: Estimation. Read Wooldridge (2013), Chapter 3. Outlne 3. Multple Regresson Analyss: Estmaton I. Motvaton II. Mechancs and Interpretaton of OLS Read Wooldrdge (013), Chapter 3. III. Expected Values of the OLS IV. Varances of the OLS V. The Gauss Markov

More information

NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION MTH352/MH3510 Regression Analysis

NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION MTH352/MH3510 Regression Analysis NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION 014-015 MTH35/MH3510 Regresson Analyss December 014 TIME ALLOWED: HOURS INSTRUCTIONS TO CANDIDATES 1. Ths examnaton paper contans FOUR (4) questons

More information

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition) Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes

More information

Introduction to Analysis of Variance (ANOVA) Part 1

Introduction to Analysis of Variance (ANOVA) Part 1 Introducton to Analss of Varance (ANOVA) Part 1 Sngle factor The logc of Analss of Varance Is the varance explaned b the model >> than the resdual varance In regresson models Varance explaned b regresson

More information

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U) Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of

More information

β0 + β1xi and want to estimate the unknown

β0 + β1xi and want to estimate the unknown SLR Models Estmaton Those OLS Estmates Estmators (e ante) v. estmates (e post) The Smple Lnear Regresson (SLR) Condtons -4 An Asde: The Populaton Regresson Functon B and B are Lnear Estmators (condtonal

More information

Chapter 12 Analysis of Covariance

Chapter 12 Analysis of Covariance Chapter Analyss of Covarance Any scentfc experment s performed to know somethng that s unknown about a group of treatments and to test certan hypothess about the correspondng treatment effect When varablty

More information

Lab 4: Two-level Random Intercept Model

Lab 4: Two-level Random Intercept Model BIO 656 Lab4 009 Lab 4: Two-level Random Intercept Model Data: Peak expratory flow rate (pefr) measured twce, usng two dfferent nstruments, for 17 subjects. (from Chapter 1 of Multlevel and Longtudnal

More information

Economics 130. Lecture 4 Simple Linear Regression Continued

Economics 130. Lecture 4 Simple Linear Regression Continued Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

Chapter 7 Generalized and Weighted Least Squares Estimation. In this method, the deviation between the observed and expected values of

Chapter 7 Generalized and Weighted Least Squares Estimation. In this method, the deviation between the observed and expected values of Chapter 7 Generalzed and Weghted Least Squares Estmaton The usual lnear regresson model assumes that all the random error components are dentcally and ndependently dstrbuted wth constant varance. When

More information

[ ] λ λ λ. Multicollinearity. multicollinearity Ragnar Frisch (1934) perfect exact. collinearity. multicollinearity. exact

[ ] λ λ λ. Multicollinearity. multicollinearity Ragnar Frisch (1934) perfect exact. collinearity. multicollinearity. exact Multcollnearty multcollnearty Ragnar Frsch (934 perfect exact collnearty multcollnearty K exact λ λ λ K K x+ x+ + x 0 0.. λ, λ, λk 0 0.. x perfect ntercorrelated λ λ λ x+ x+ + KxK + v 0 0.. v 3 y β + β

More information

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction ECONOMICS 35* -- NOTE 7 ECON 35* -- NOTE 7 Interval Estmaton n the Classcal Normal Lnear Regresson Model Ths note outlnes the basc elements of nterval estmaton n the Classcal Normal Lnear Regresson Model

More information

Properties of Least Squares

Properties of Least Squares Week 3 3.1 Smple Lnear Regresson Model 3. Propertes of Least Squares Estmators Y Y β 1 + β X + u weekly famly expendtures X weekly famly ncome For a gven level of x, the expected level of food expendtures

More information

Unit 10: Simple Linear Regression and Correlation

Unit 10: Simple Linear Regression and Correlation Unt 10: Smple Lnear Regresson and Correlaton Statstcs 571: Statstcal Methods Ramón V. León 6/28/2004 Unt 10 - Stat 571 - Ramón V. León 1 Introductory Remarks Regresson analyss s a method for studyng the

More information

is the calculated value of the dependent variable at point i. The best parameters have values that minimize the squares of the errors

is the calculated value of the dependent variable at point i. The best parameters have values that minimize the squares of the errors Multple Lnear and Polynomal Regresson wth Statstcal Analyss Gven a set of data of measured (or observed) values of a dependent varable: y versus n ndependent varables x 1, x, x n, multple lnear regresson

More information

Activity #13: Simple Linear Regression. actgpa.sav; beer.sav;

Activity #13: Simple Linear Regression. actgpa.sav; beer.sav; ctvty #3: Smple Lnear Regresson Resources: actgpa.sav; beer.sav; http://mathworld.wolfram.com/leastfttng.html In the last actvty, we learned how to quantfy the strength of the lnear relatonshp between

More information

The SAS program I used to obtain the analyses for my answers is given below.

The SAS program I used to obtain the analyses for my answers is given below. Homework 1 Answer sheet Page 1 The SAS program I used to obtan the analyses for my answers s gven below. dm'log;clear;output;clear'; *************************************************************; *** EXST7034

More information

ANSWERS CHAPTER 9. TIO 9.2: If the values are the same, the difference is 0, therefore the null hypothesis cannot be rejected.

ANSWERS CHAPTER 9. TIO 9.2: If the values are the same, the difference is 0, therefore the null hypothesis cannot be rejected. ANSWERS CHAPTER 9 THINK IT OVER thnk t over TIO 9.: χ 2 k = ( f e ) = 0 e Breakng the equaton down: the test statstc for the ch-squared dstrbuton s equal to the sum over all categores of the expected frequency

More information

Linear Correlation. Many research issues are pursued with nonexperimental studies that seek to establish relationships among 2 or more variables

Linear Correlation. Many research issues are pursued with nonexperimental studies that seek to establish relationships among 2 or more variables Lnear Correlaton Many research ssues are pursued wth nonexpermental studes that seek to establsh relatonshps among or more varables E.g., correlates of ntellgence; relaton between SAT and GPA; relaton

More information

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models Computaton of Hgher Order Moments from Two Multnomal Overdsperson Lkelhood Models BY J. T. NEWCOMER, N. K. NEERCHAL Department of Mathematcs and Statstcs, Unversty of Maryland, Baltmore County, Baltmore,

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 31 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 6. Rdge regresson The OLSE s the best lnear unbased

More information

January Examinations 2015

January Examinations 2015 24/5 Canddates Only January Examnatons 25 DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR STUDENT CANDIDATE NO.. Department Module Code Module Ttle Exam Duraton (n words)

More information

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications Durban Watson for Testng the Lack-of-Ft of Polynomal Regresson Models wthout Replcatons Ruba A. Alyaf, Maha A. Omar, Abdullah A. Al-Shha ralyaf@ksu.edu.sa, maomar@ksu.edu.sa, aalshha@ksu.edu.sa Department

More information

Lecture 3 Stat102, Spring 2007

Lecture 3 Stat102, Spring 2007 Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture

More information

IV. Modeling a Mean: Simple Linear Regression

IV. Modeling a Mean: Simple Linear Regression IV. Modelng a Mean: Smple Lnear Regresson We have talked about nference for a sngle mean, for comparng two means, and for comparng several means. What f the mean of one varable depends on the value of

More information

Econ107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10)

Econ107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10) I. Defnton and Problems Econ7 Appled Econometrcs Topc 9: Heteroskedastcty (Studenmund, Chapter ) We now relax another classcal assumpton. Ths s a problem that arses often wth cross sectons of ndvduals,

More information

Chap 10: Diagnostics, p384

Chap 10: Diagnostics, p384 Chap 10: Dagnostcs, p384 Multcollnearty 10.5 p406 Defnton Multcollnearty exsts when two or more ndependent varables used n regresson are moderately or hghly correlated. - when multcollnearty exsts, regresson

More information

T E C O L O T E R E S E A R C H, I N C.

T E C O L O T E R E S E A R C H, I N C. T E C O L O T E R E S E A R C H, I N C. B rdg n g En g neern g a nd Econo mcs S nce 1973 THE MINIMUM-UNBIASED-PERCENTAGE ERROR (MUPE) METHOD IN CER DEVELOPMENT Thrd Jont Annual ISPA/SCEA Internatonal Conference

More information

Rockefeller College University at Albany

Rockefeller College University at Albany Rockefeller College Unverst at Alban PAD 705 Handout: Maxmum Lkelhood Estmaton Orgnal b Davd A. Wse John F. Kenned School of Government, Harvard Unverst Modfcatons b R. Karl Rethemeer Up to ths pont n

More information