Multiple Linear Regression and the General Linear Model

Size: px
Start display at page:

Download "Multiple Linear Regression and the General Linear Model"

Transcription

1 Multle Lnear Regresson and the General Lnear Model 1

2 Outlne 1. Introducton to Multle Lnear Regresson 2. Statstcal Inference 3. Tocs n Regresson Modelng 4. Examle 5. Varable Selecton Methods 6. Regresson Dagnostc and Strategy for Buldng a Model 2

3 1. Introducton to Multle Lnear Regresson Extendng smle lnear regresson to two or more regressors 3

4 Hstory Francs Galton coned the term regresson n hs bologcal research Karl Pearson and Udny Yule extended Galton s work to the statstcal context Legendre and Gauss develoed the method of least squares Ronald Fsher develoed the maxmum lkelhood method used n the related statstcal nference. 4

5 Hstory Adren-Mare Legendre (1752/9/18 - Carl Fredrch Francs Gauss Galton (1777/4/30 (1822/2/ /1/17). 1855/4/23). He develoed 1833/1/10). He s regarded the fundamentals as the founder of the of bass for Karl Pearson. least-squares In 1805, Bostatstcs. analyss he ublshed n 1795 at an the artcle age of eghteen. named Nouvelles méthodes our la In hs research he found that tall He ublshed détermnaton arents an usually artcle have called des orbtes des shorter Theora Motus Cororum Ronald Aylmer comètes. chldren; Coelestum and vce n versa. Sectonbus So the Concs Fsher. Solem Ambentum In ths artcle, he ntroduced Method heght of human n beng has the In 1821, of Least Squares to the world. Yes, tendency he ublshed to regress another to ts artcle mean. about least square he was the frst erson who Subsequently, analyss wth further coned develoment, the word called Theora ublshed artcle regardng to regresson combnatons for such observatonum They errorbus both develoed method of least squares, whch s the henomenon mnms obnoxae. and roblems. Ths artcle regresson theory ncludes Gauss Markov earlest after form of theorem regresson. Galton. Most content n ths age comes from Wkeda 5

6 Probablstc Model Y s the observed value of the random varable (r.v.) whch deends on k fxed redctor values accordng to the followng model: Y 0 x 1 1 x 2 2 x k k Here 0, 1,, k are unknown model arameters, and n s the number of observatons. The random error,, are assumed to be ndeendent r.v. s wth mean 0 and varance 2 Y Thus are ndeendent r.v. s wth mean and varance, where μ 2 0 1x1 2x2 k xk E Y 6

7 Fttng the model The least squares (LS) method s used to fnd a lne that fts the equaton Y 0 1x1 2x2 k xk, 1, Secfcally, LS rovdes estmates of the unknown model arameters, 0, 1,, k whch mnmzes,, the sum of squared dfference of the n Y x x k xk observed values,, and the corresondng onts on the lne wth the same x s y The LS can be found by takng artal dervatves of Q wth resect to the unknown arameters 2 0, 1,, k set of smultaneous lnear equatons. and settng them equal to 0. The result s a, n The resultng solutons, estmators of 0, 1,, k, resectvely. are the least squares (LS) Please note the LS method s non-arametrc. That s, no robablty dstrbuton assumtons on Y or ε are needed. 7

8 Goodness of Ft of the Model To evaluate the goodness of ft of the LS model, we use the resduals defned by y e y yˆ ( 1,2,, n) are the ftted values: ˆ An overall measure of the goodness of ft s the error sum of squares (SSE) mn Q SSE e n 1 A few other defnton smlar to those n smle lnear regresson: total sum of squares (SST): SST ( y y) 2 2 regresson sum of squares (SSR): SSR SST SSE 8

9 coeffcent of determnaton: R 2 SSR SST SSE 1 SST 2 0R 1 values closer to 1 reresent better fts addng redctor varables never decreases and generally ncreases multle correlaton coeffcent (ostve square root of ): R R 2 only ostve square root s used R s a measure of the strength of the assocaton between the redctors (x s) and the one resonse varable Y R 2 2 R 9

10 Multle Regresson Model n Matrx Notaton The multle regresson model can be reresented n a comact form usng matrx notaton Let: 1 2, n Y Y Y Y , n y y y y 1 2 n be the n x 1 vectors of the r.v. s, ther observed values, and random errors, resectvely for all n observatons Let: ' Y s ' y s ' s be the n x (k + 1) matrx of the values of the redctor varables for all n observatons (the frst column corresonds to the constant term ) 0 kn n k k x x x x x x X

11 Let: 0 1 k and ˆ 0 ˆ ˆ 1 ˆk be the (k + 1) x 1 vectors of unknown model arameters and ther LS estmates, resectvely The model can be rewrtten as: Y X The smultaneous lnear equatons whose solutons yelds the LS estmates: X ' X If the nverse of the matrx X ' y ˆ 1 ( X ' X ) ' X' X X y exsts, then the soluton s gven by: 11

12 2. Statstcal Inference 12

13 Determnng the statstcal sgnfcance of the redctor varables: For statstcal nferences, we need the assumton that d 2 ~ N 0, *..d. means ndeendent & dentcally dstrbuted We test the hyotheses: If we can t reject and varance H 0 j : B j 0 It s easly shown that each H 0 j : j 0 vs. H 1 j : j 0, then the corresondng regressor x j s not a sgnfcant redctor of y. 2 v jj, where matrx ˆ j s normal wth mean v jj j s the jth dagonal entry of the V (X' X) 1 13

14 Dervng a votal quantty for the nference on Recall ˆ 2 j j ~ N(, v jj ) The unbased estmator of the unknown error varance s gven by S 2 SSE n (k 1) e n (k 1) MSE d.o.f. 2 ( n ( k 1)) S SSE 2 We also know that W ~ 2 2 n ( k 1), and that S 2 and ˆ j are statstcally ndeendent. ˆ Wth j j Z ~ N(0,1), and by the defnton of the t-dstrbuton, v T jj Z ˆ j j W / n ( k 1) S v 14 2 jj ~ T j we obtan the votal quantty for the nference on n( k1) j 2

15 Dervaton of the Confdence Interval for P ˆ j j ( t / 2, n( k1) t / 2, n( k1) ) 1 S v jj j P ˆ ˆ ( k1) s v jj j j t / 2, n( k1 s v ) 1 ( j t / 2, n ) jj Thus the 100(1-α)% confdence nterval for j s: where SE(ˆ ) s j 15 v jj

16 Dervaton of the Hyothess Test for at the sgnfcance level α Hyotheses: H : 0 0 j H The test statstc s: a : 0 j The decson rule of the test s derved based on the Tye I error rate α. That s P (Reject H 0 H 0 s true) = P( T c) c t / 2, n ( k 1) T ˆ 0 H ~ S v j 0 n( k1) Therefore, we reject H 0 at the sgnfcance level α f and only f t, where s the observed value of 0 t / 2, n ( k t 1) 0 T0 0 jj 0 T j 16

17 Another Hyothess Test Now consder: H 0 : j 0 H a : j 0 for all 1 j k for at least one 1 j k MSR When H 0 s true, the test statstcs F0 ~ f k, n ( k 1) MSE An alternatve and equvalent way to make a decson for a statstcal test s through the -value, defned as: = P(observe a test statstc value at least as extreme as the one observed H0) At the sgnfcance level, we reject H 0 f and only f < 17

18 The General Hyothess Test Consder the full model: Y x x 2 2 x k k (=1,2, n) Y Now consder a artal model: 0 1x1 2x2 k m km, Hyotheses: H 0 : km1... k 0 vs. H a : j 0 for at least one k m 1 j k x (=1,2, n) Test statstc: F Reject H 0 when ( SSE SSE) / m km k 0 m, n( k1) SSEk /[ n ( k 1)] ~ F0 fm, n( k1), f 18

19 Estmatng and Predctng Future Observatons Let x * (x 0 *,x 1 *,...,x k * ) ' and let The votal quantty for * s T * * ˆ s x Vx * * ~ T n ( k 1) Usng ths votal quantty, we can derve a CI for the estmated mean : * * * ˆ t s x Vx n( k1), / 2 Addtonally, we can derve a redcton nterval (PI) to redct Y * : * * * Yˆ t s 1 x Vx n( k1), / 2 19

20 /* smle lnear regresson */ roc reg; model y = x; /* multle regresson */ roc reg; model y = x1 x2 x3; Regresson n SAS Here are some rnt otons for the model hrase: model y = x / nont; /* regresson wth no ntercet */ model y = x / ss1; /* rnt tye I sums of squares */ model y = x / ; /* rnt redcted values and resduals */ model y = x / r; /* oton lus resdual dagnostcs */ model y = x / clm; /* oton lus 95% CI for estmated mean */ model y = x / cl; /* oton lus 95% CI for redcted value */ model y = x / r cl clm; /* otons can be combned */ 20

21 Confdence and Predcton Band n SAS The CLM oton adds confdence lmts for the mean redcted values. The CLI oton adds confdence lmts for the ndvdual redcted values. roc sglot data=sashel.class; eg x=heght y=weght / CLM CLI; run; For nformaton about the SAS Samle Lbrary, see: htt://suort.sas.com/documenta ton/cdl/en/grstatroc/67909/htm L/default/vewer.htm#0jxq3ea4njt vnn1xnj4a7s37yz.htm 21

22 Regresson n SAS It s ossble to let SAS do the redctng of new observatons and/or estmatng of mean resonses. The way to do ths s to enter the x values (or x1,x2,x3 for multle regresson) you are nterested n durng the data nut ste, but ut a erod (.) for the unknown y value. data new; nut x y; datalnes; ; run; roc reg; model y = x / cl clm; 22

23 3. Tocs n Regresson Modelng 23

24 3.1 Multcollnearty Defnton. The redctor varables are lnearly deendent. Ths can cause serous numercal and statstcal dffcultes n fttng the regresson model unless extra redctor varables are deleted. 24

25 ^ How does the multcollnearty cause dffcultes? sthe solutonto the equaton ^ norder for to be unque and comutable. T T T X X X y, thus X X must be nvertable If the aroxmate multcollnearty haens: T X X 1. s nearly sngular, whch makes numercally unstable. Ths reflected n large changes n ther magntudes wth small changes n data. ^ V ( X T 1 X ) ^ 2. The matrx has very large elements. Therefore are large, whch makes statstcally nonsgnfcant. j ^ ) Var ( 2 v jj 25

26 Measures of Multcollnearty 1. The correlaton matrx R. Easy but can t reflect lnear relatonshs between more than two varables. 2. Determnant of R can be used as measurement of T sngularty of X X 3. Varance Inflaton Factors (VIF): the dagonal elements of R 1. VIF > 10 s regarded as unaccetable. 26

27 3.2 Polynomal Regresson A secal case of a lnear model: Problems: k y 0 1 x... k x x x 2,,, 1. The owers of x,.e., tend to be hghly correlated. 2. If k s large, the magntudes of these owers tend to vary over a rather wde range. x k So, set k<=3 f a good dea, and almost never use k>5. 27

28 1. Centerng the x-varable: Solutons * * * 0 1 k y ( x x)... ( x x)k 2. Effect: removng the non-essental multcollnearty n the data. 3. Further more, we can standardze the data by dvdng the standard devaton s of x: x x x s x 4. Effect: helng to allevate the second roblem. 5. Usng the frst few rncal comonents of the orgnal varables nstead of the orgnal varables. 28

29 3.3 Dummy Predctor Varables & The General Lnear Model How to handle the categorcal redctor varables? 1. If we have categores of an ordnal varable, such as the rognoss of a atent (oor, average, good), one can assgn numercal scores to the categores (oor=1, average=2, good=3). 29

30 2. If we have nomnal varable wth c>=2 categores. Use c-1 ndcator varables, x, called Dummy Varables,,, x 1 c 1 to code. x 1 for the th category, 1 c1 x 1 xc for the cth category. 30

31 Why don t we just use c ndcator varables: x, x,..., x 1 2 c? If we use that, there wll be a lnear deendency among them: x1 x2... x c 1 Ths wll cause multcollnearty. 31

32 Examle of the dummy varables For nstance, f we have four years of quarterly sale data of a certan brand of soda. How can we model the tme trend by fttng a multle regresson equaton? Soluton: We use quarter as a redctor varable x1. To model the seasonal trend, we use ndcator varables x2, x3, x4, for Wnter, Srng and Summer, resectvely. For Fall, all three equal zero. That means: Wnter- (1,0,0), Srng-(0,1,0), Summer-(0,0,1), Fall-(0,0,0). Then we have the model: Y 0 1x1 2x2 3x3 4x4, 1,,16 32

33 3. Once the dummy varables are ncluded, the resultng regresson model s referred to as a General Lnear Model. Ths term must be dfferentated from that of the Generalzed Lnear Model whch nclude the General Lnear Model as a secal case wth the dentty lnk functon: μ 0 1x1 2x2 k xk E Y The generalzed lnear model wll lnk the model arameters to the redctors through a lnk functon. For another examle, we recall the logt lnk n the logstc regresson. 33

34 4. Examle, Galton Here we revst the classc regresson towards Medocrty n Heredtary Stature by Francs Galton He erformed a smle regresson to redct offsrng heght based on the average arent heght htt:// Sloe of regresson lne was less than 1 showng that extremely tall arents had less extremely tall chldren At the tme, Galton dd not have multle regresson as a tool so he had to use other methods to account for the dfference between male and female heghts We can now erform multle regresson on arentoffsrng heght and use multle varables as redctors 34

35 Examle, Galton OUR MODEL: Y x x x Y = heght of chld x 1 = heght of father x 2 = heght of mother x 3 = gender of chld 35

36 Examle, Galton In matrx notaton: Y X ( X ' X ) 1 X ' y We fnd that: β 0 = β 1 = β 2 = β 3 =

37 Examle, Galton Y Chld

38 Examle, Galton X Father Mother Gender

39 Examle, Galton Imortant calculatons SSE SST r 2 MSE 1 ( y ( y y y) SSE SST SSE 4.64 n ( k 1) ) , y X ˆ Is the redcted heght of each chld gven a set of redctor varables 39

40 Examle, Galton Are these values sgnfcantly dfferent than zero? H o : β j = 0 H a : β j 0 Reject H 0j f j t j tn( k 1), /2 t894,0.025 SE( ) V j ( X ' X ) j SE( ) MSE * v jj 40

41 Examle, Galton β-estmate SE t Intercet * Father Heght Mother Heght * * Gender * * <0.05. We conclude that all β are sgnfcantly dfferent than zero. 41

42 Examle, Galton Testng the model as a whole: H o : β 0 = β 1 = β 2 = β 3 = 0 H a : The above s not true. Reject H 0 f [ ( 1)] F f f r 2 2 r n k 2 k(1 r ) SSE SST k, n( k 1), 3,894,.05 2 ( ) SSE y y SST y y 2 ( ) 11, Snce F = >2.615, we reject H o and conclude that our model redcts heght better than by chance. 42

43 Examle, Galton Makng Predctons Let s say George Clooney (71 nches) and Madonna (64 nches) would have a baby boy. Y * (71) (64) (1) % Predcton nterval: Y* t 894,.025 MSE *(1 x*' Vx*) Y* Y* t 894,.025 MSE *(1 x*' Vx*) ± 4.84 = (65.13, 74.81) 43

44 EXAMPLE, GALTON SAS code, data ste: htt:// Data Galton; Inut Famly Father Mother Gender $ Heght Kds; Datalnes; M F F F M M F F ; Run; 44

45 EXAMPLE, GALTON SAS code, roc REG ods grahcs on; data revse; set Galton; f Gender = 'F' then sex = 1.0; else sex = 0.0; run; roc reg data=revse; ttle "roc reg; Deendence of Chld Heghts on Parental Heghts"; model heght = father mother sex / vf colln; run; qut; 45

46 46

47 EXAMPLE, GALTON 47

48 EXAMPLE, GALTON 48

49 EXAMPLE, GALTON SAS code, roc GLM Alternatvely, one can use roc GLM rocedure that can ncororate the categorcal varable (sex) drectly va the class statement. Another added beneft s that SAS wll rovde an overall sgnfcance test for the categorcal varable. roc glm data=galton; Class gender; model heght = father mother gender; run; qut; 49

50 EXAMPLE, Galton 50

51 EXAMPLE, GALTON Dear Students, dd you notce any volaton of assumtons n the analyss of the Galton data? The answer s that we have volated the ndeendent observatons assumton as many kds were from the same famles! Ths, however, can be easly resolved wth more advanced statstcal models that wll nclude Famly as a random effect (random regressor.) 51

52 5. Varables Selecton Method A. Stewse Regresson 52

53 Varables selecton method (1) Why do we need to select the varables? (2) How do we select varables? * stewse regresson * best subset regresson 53

54 Stewse Regresson (-1)-varable model: P-varable model x x x Y, 1, 1 1, x x Y, 1 1 1,

55 0 0 Hyothesstest F - test Partal : 1 : 0 H H 55 2 / 1), ( 0 1), ( 1, 1 : ) ( : 1)] ( /[ ) /1 ( n n t t H reject SE t test statstc f n SSE SSE SSE F

56 Partal correlaton coeffcents )] ( [ )... ( )... ( )... ( x x yx x x yx r n r t F x x SSE x x SSE x x SSE SSE SSE SSE r x x yx 56

57 5. Varables selecton method A. Stewse Regresson: SAS Examle 57

58 Examle 11.5 (T&D g. 416), 11.9 (T&D g. 431) The followng table shows data on the heat evolved n calores durng the hardenng of cement on a er gram bass (y) along wth the ercentages of four ngredents: trcalcum alumnate (x1), trcalcum slcate (x2), tetracalcum alumno ferrte (x3), and dcalcum slcate (x4). No. X1 X2 X3 X4 Y Ref: T & D: "Statstcs and Data Analyss", by Tamhane and Dunlo, Pearson, 1999, 2nd edton, Pearson; ISBN:

59 SAS Program (stewse varable selecton s used) data examle115; nut x1 x2 x3 x4 y; datalnes; ; run; roc reg data=examle115; model y = x1 x2 x3 x4 /selecton=stewse; run; 59

60 Selected SAS outut The REG Procedure Model: MODEL1 Deendent Varable: y Stewse Selecton: Ste 4 Parameter Standard Varable Estmate Error Tye II SS F Value Pr > F Intercet <.0001 x <.0001 x <.0001 Bounds on condton number: ,

61 SAS Outut (cont) All varables left n the model are sgnfcant at the level. No other varable met the sgnfcance level for entry nto the model. Summary of Stewse Selecton Varable Varable Number Partal Model Ste Entered Removed Vars In R-Square R-Square C() F Value Pr > F 1 x x < x x

62 5. Varables selecton method B. Best Subsets Regresson 62

63 Best Subsets Regresson For the stewse regresson algorthm The fnal model s not guaranteed to be otmal n any secfed sense. In the best subsets regresson, subset of varables s chosen from the collecton of all subsets of k redctor varables) that otmzes a well-defned objectve crteron 63

64 Best Subsets Regresson In the stewse regresson, We get only one sngle fnal models. In the best subsets regresson, The nvestor could secfy a sze for the redctors for the model. 64

65 Best Subsets Regresson Otmalty Crtera r 2 -Crteron: r 2 Adjusted r 2 -Crteron: SSR 1 SST r 2 adj, SSE SST 1 MSE MST C -Crteron (recommended for ts ease of comutaton and ts ablty to judge the redctve ower of a model) n 1 [ E[ Yˆ ] E[ Y 2 ]] 1 2 The samle estmator, Mallows C -statstc, s gven by C SSE 2( 1) n ˆ 2 65

66 Best Subsets Regresson Algorthm Note that our roblem s to fnd the mnmum of a gven functon. Use the stewse subsets regresson algorthm and relace the artal F crteron wth other crteron such as C. Enumerate all ossble cases and fnd the mnmum of the crteron functons. Other ossblty? 66

67 Best Subsets Regresson & SAS roc reg data=examle115; run; model y = x1 x2 x3 x4 /selecton=adjrsq; For the selecton oton, SAS has mlemented 9 methods n total. For best subset method, we have the followng otons: Maxmum R 2 Imrovement (MAXR) Mnmum R 2 (MINR) Imrovement R 2 Selecton (RSQUARE) Adjusted R 2 Selecton (ADJRSQ) Mallows' C Selecton (CP) 67

68 6. Buldng A Multle Regresson Model Stes and Strategy 68

69 Modelng s an teratve rocess. Several cycles of the stes maybe needed before arrvng at the fnal model. The basc rocess conssts of seven stes 69

70 Get started and Follow the Stes Categorzaton by Usage Dvde the Data Collect the Data Exlore the Data Ft Canddate Models Select and Evaluate Select the Fnal Model 70

71 Lnear Regresson Assumtons Mean of Error Is 0 Varance of Error s Constant Probablty Dstrbuton of Error s Normal Errors are Indeendent 71

72 Resdual Plot for Functonal Form (Lnearty) Add X^2 Term Correct Secfcaton e e X X 72

73 Resdual Plot for Equal Varance SR Unequal Varance Correct Secfcaton SR X Fan-shaed. Standardzed resduals used tycally (resdual dvded by standard error of redcton) X 73

74 Resdual Plot for Indeendence Not Indeendent Correct Secfcaton SR SR X X 74

75 Questons? 75

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Exerments-I MODULE III LECTURE - 2 EXPERIMENTAL DESIGN MODELS Dr. Shalabh Deartment of Mathematcs and Statstcs Indan Insttute of Technology Kanur 2 We consder the models

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

Lecture 3 Stat102, Spring 2007

Lecture 3 Stat102, Spring 2007 Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Exerments-I MODULE II LECTURE - GENERAL LINEAR HYPOTHESIS AND ANALYSIS OF VARIANCE Dr. Shalabh Deartment of Mathematcs and Statstcs Indan Insttute of Technology Kanur 3.

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

Logistic regression with one predictor. STK4900/ Lecture 7. Program

Logistic regression with one predictor. STK4900/ Lecture 7. Program Logstc regresson wth one redctor STK49/99 - Lecture 7 Program. Logstc regresson wth one redctor 2. Maxmum lkelhood estmaton 3. Logstc regresson wth several redctors 4. Devance and lkelhood rato tests 5.

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson

More information

Dependent variable for case i with variance σ 2 g i. Number of distinct cases. Number of independent variables

Dependent variable for case i with variance σ 2 g i. Number of distinct cases. Number of independent variables REGRESSION Notaton Ths rocedure erforms multle lnear regresson wth fve methods for entry and removal of varables. It also rovdes extensve analyss of resdual and nfluental cases. Caseweght (CASEWEIGHT)

More information

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε Chapter 3 Secton 3.1 Model Assumptons: Multple Regresson Model Predcton Equaton Std. Devaton of Error Correlaton Matrx Smple Lnear Regresson: 1.) Lnearty.) Constant Varance 3.) Independent Errors 4.) Normalty

More information

Multiple Regression Analysis

Multiple Regression Analysis Multle Regresson Analss Roland Szlág Ph.D. Assocate rofessor Correlaton descres the strength of a relatonsh, the degree to whch one varale s lnearl related to another Regresson shows us how to determne

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an

More information

Economics 130. Lecture 4 Simple Linear Regression Continued

Economics 130. Lecture 4 Simple Linear Regression Continued Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do

More information

Chapter 15 Student Lecture Notes 15-1

Chapter 15 Student Lecture Notes 15-1 Chapter 15 Student Lecture Notes 15-1 Basc Busness Statstcs (9 th Edton) Chapter 15 Multple Regresson Model Buldng 004 Prentce-Hall, Inc. Chap 15-1 Chapter Topcs The Quadratc Regresson Model Usng Transformatons

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 14 Multple Regresson Models 1999 Prentce-Hall, Inc. Chap. 14-1 Chapter Topcs The Multple Regresson Model Contrbuton of Indvdual Independent Varables

More information

a. (All your answers should be in the letter!

a. (All your answers should be in the letter! Econ 301 Blkent Unversty Taskn Econometrcs Department of Economcs Md Term Exam I November 8, 015 Name For each hypothess testng n the exam complete the followng steps: Indcate the test statstc, ts crtcal

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

Naïve Bayes Classifier

Naïve Bayes Classifier 9/8/07 MIST.6060 Busness Intellgence and Data Mnng Naïve Bayes Classfer Termnology Predctors: the attrbutes (varables) whose values are used for redcton and classfcaton. Predctors are also called nut varables,

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

Statistics for Business and Economics

Statistics for Business and Economics Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear

More information

Biostatistics. Chapter 11 Simple Linear Correlation and Regression. Jing Li

Biostatistics. Chapter 11 Simple Linear Correlation and Regression. Jing Li Bostatstcs Chapter 11 Smple Lnear Correlaton and Regresson Jng L jng.l@sjtu.edu.cn http://cbb.sjtu.edu.cn/~jngl/courses/2018fall/b372/ Dept of Bonformatcs & Bostatstcs, SJTU Recall eat chocolate Cell 175,

More information

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 1 Chapters 14, 15 & 16 Professor Ahmad, Ph.D. Department of Management Revsed August 005 Chapter 14 Formulas Smple Lnear Regresson Model: y =

More information

where I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X).

where I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X). 11.4.1 Estmaton of Multple Regresson Coeffcents In multple lnear regresson, we essentally solve n equatons for the p unnown parameters. hus n must e equal to or greater than p and n practce n should e

More information

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U) Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of

More information

Chapter 15 - Multiple Regression

Chapter 15 - Multiple Regression Chapter - Multple Regresson Chapter - Multple Regresson Multple Regresson Model The equaton that descrbes how the dependent varable y s related to the ndependent varables x, x,... x p and an error term

More information

The Ordinary Least Squares (OLS) Estimator

The Ordinary Least Squares (OLS) Estimator The Ordnary Least Squares (OLS) Estmator 1 Regresson Analyss Regresson Analyss: a statstcal technque for nvestgatng and modelng the relatonshp between varables. Applcatons: Engneerng, the physcal and chemcal

More information

Learning Objectives for Chapter 11

Learning Objectives for Chapter 11 Chapter : Lnear Regresson and Correlaton Methods Hldebrand, Ott and Gray Basc Statstcal Ideas for Managers Second Edton Learnng Objectves for Chapter Usng the scatterplot n regresson analyss Usng the method

More information

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students. PPOL 59-3 Problem Set Exercses n Smple Regresson Due n class /8/7 In ths problem set, you are asked to compute varous statstcs by hand to gve you a better sense of the mechancs of the Pearson correlaton

More information

Confidence intervals for weighted polynomial calibrations

Confidence intervals for weighted polynomial calibrations Confdence ntervals for weghted olynomal calbratons Sergey Maltsev, Amersand Ltd., Moscow, Russa; ur Kalambet, Amersand Internatonal, Inc., Beachwood, OH e-mal: kalambet@amersand-ntl.com htt://www.chromandsec.com

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

x yi In chapter 14, we want to perform inference (i.e. calculate confidence intervals and perform tests of significance) in this setting.

x yi In chapter 14, we want to perform inference (i.e. calculate confidence intervals and perform tests of significance) in this setting. The Practce of Statstcs, nd ed. Chapter 14 Inference for Regresson Introducton In chapter 3 we used a least-squares regresson lne (LSRL) to represent a lnear relatonshp etween two quanttatve explanator

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction ECONOMICS 35* -- NOTE 7 ECON 35* -- NOTE 7 Interval Estmaton n the Classcal Normal Lnear Regresson Model Ths note outlnes the basc elements of nterval estmaton n the Classcal Normal Lnear Regresson Model

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

Reminder: Nested models. Lecture 9: Interactions, Quadratic terms and Splines. Effect Modification. Model 1

Reminder: Nested models. Lecture 9: Interactions, Quadratic terms and Splines. Effect Modification. Model 1 Lecture 9: Interactons, Quadratc terms and Splnes An Manchakul amancha@jhsph.edu 3 Aprl 7 Remnder: Nested models Parent model contans one set of varables Extended model adds one or more new varables to

More information

Chapter 3. Two-Variable Regression Model: The Problem of Estimation

Chapter 3. Two-Variable Regression Model: The Problem of Estimation Chapter 3. Two-Varable Regresson Model: The Problem of Estmaton Ordnary Least Squares Method (OLS) Recall that, PRF: Y = β 1 + β X + u Thus, snce PRF s not drectly observable, t s estmated by SRF; that

More information

18. SIMPLE LINEAR REGRESSION III

18. SIMPLE LINEAR REGRESSION III 8. SIMPLE LINEAR REGRESSION III US Domestc Beers: Calores vs. % Alcohol Ftted Values and Resduals To each observed x, there corresponds a y-value on the ftted lne, y ˆ ˆ = α + x. The are called ftted values.

More information

STATISTICS QUESTIONS. Step by Step Solutions.

STATISTICS QUESTIONS. Step by Step Solutions. STATISTICS QUESTIONS Step by Step Solutons www.mathcracker.com 9//016 Problem 1: A researcher s nterested n the effects of famly sze on delnquency for a group of offenders and examnes famles wth one to

More information

LINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables

LINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables LINEAR REGRESSION ANALYSIS MODULE VIII Lecture - 7 Indcator Varables Dr. Shalabh Department of Maematcs and Statstcs Indan Insttute of Technology Kanpur Indcator varables versus quanttatve explanatory

More information

Linear Approximation with Regularization and Moving Least Squares

Linear Approximation with Regularization and Moving Least Squares Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...

More information

The SAS program I used to obtain the analyses for my answers is given below.

The SAS program I used to obtain the analyses for my answers is given below. Homework 1 Answer sheet Page 1 The SAS program I used to obtan the analyses for my answers s gven below. dm'log;clear;output;clear'; *************************************************************; *** EXST7034

More information

Advanced Topics in Optimization. Piecewise Linear Approximation of a Nonlinear Function

Advanced Topics in Optimization. Piecewise Linear Approximation of a Nonlinear Function Advanced Tocs n Otmzaton Pecewse Lnear Aroxmaton of a Nonlnear Functon Otmzaton Methods: M8L Introducton and Objectves Introducton There exsts no general algorthm for nonlnear rogrammng due to ts rregular

More information

17 - LINEAR REGRESSION II

17 - LINEAR REGRESSION II Topc 7 Lnear Regresson II 7- Topc 7 - LINEAR REGRESSION II Testng and Estmaton Inferences about β Recall that we estmate Yˆ ˆ β + ˆ βx. 0 μ Y X x β0 + βx usng To estmate σ σ squared error Y X x ε s ε we

More information

Outline. Zero Conditional mean. I. Motivation. 3. Multiple Regression Analysis: Estimation. Read Wooldridge (2013), Chapter 3.

Outline. Zero Conditional mean. I. Motivation. 3. Multiple Regression Analysis: Estimation. Read Wooldridge (2013), Chapter 3. Outlne 3. Multple Regresson Analyss: Estmaton I. Motvaton II. Mechancs and Interpretaton of OLS Read Wooldrdge (013), Chapter 3. III. Expected Values of the OLS IV. Varances of the OLS V. The Gauss Markov

More information

Chapter 14 Simple Linear Regression Page 1. Introduction to regression analysis 14-2

Chapter 14 Simple Linear Regression Page 1. Introduction to regression analysis 14-2 Chapter 4 Smple Lnear Regresson Page. Introducton to regresson analyss 4- The Regresson Equaton. Lnear Functons 4-4 3. Estmaton and nterpretaton of model parameters 4-6 4. Inference on the model parameters

More information

Lecture 2: Prelude to the big shrink

Lecture 2: Prelude to the big shrink Lecture 2: Prelude to the bg shrnk Last tme A slght detour wth vsualzaton tools (hey, t was the frst day... why not start out wth somethng pretty to look at?) Then, we consdered a smple 120a-style regresson

More information

is the calculated value of the dependent variable at point i. The best parameters have values that minimize the squares of the errors

is the calculated value of the dependent variable at point i. The best parameters have values that minimize the squares of the errors Multple Lnear and Polynomal Regresson wth Statstcal Analyss Gven a set of data of measured (or observed) values of a dependent varable: y versus n ndependent varables x 1, x, x n, multple lnear regresson

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

Chapter 9: Statistical Inference and the Relationship between Two Variables

Chapter 9: Statistical Inference and the Relationship between Two Variables Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,

More information

28. SIMPLE LINEAR REGRESSION III

28. SIMPLE LINEAR REGRESSION III 8. SIMPLE LINEAR REGRESSION III Ftted Values and Resduals US Domestc Beers: Calores vs. % Alcohol To each observed x, there corresponds a y-value on the ftted lne, y ˆ = βˆ + βˆ x. The are called ftted

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 31 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 6. Rdge regresson The OLSE s the best lnear unbased

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

[ ] λ λ λ. Multicollinearity. multicollinearity Ragnar Frisch (1934) perfect exact. collinearity. multicollinearity. exact

[ ] λ λ λ. Multicollinearity. multicollinearity Ragnar Frisch (1934) perfect exact. collinearity. multicollinearity. exact Multcollnearty multcollnearty Ragnar Frsch (934 perfect exact collnearty multcollnearty K exact λ λ λ K K x+ x+ + x 0 0.. λ, λ, λk 0 0.. x perfect ntercorrelated λ λ λ x+ x+ + KxK + v 0 0.. v 3 y β + β

More information

NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION MTH352/MH3510 Regression Analysis

NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION MTH352/MH3510 Regression Analysis NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION 014-015 MTH35/MH3510 Regresson Analyss December 014 TIME ALLOWED: HOURS INSTRUCTIONS TO CANDIDATES 1. Ths examnaton paper contans FOUR (4) questons

More information

Introduction to Regression

Introduction to Regression Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

Comparing two Quantiles: the Burr Type X and Weibull Cases

Comparing two Quantiles: the Burr Type X and Weibull Cases IOSR Journal of Mathematcs (IOSR-JM) e-issn: 78-578, -ISSN: 39-765X. Volume, Issue 5 Ver. VII (Se. - Oct.06), PP 8-40 www.osrjournals.org Comarng two Quantles: the Burr Tye X and Webull Cases Mohammed

More information

Chapter 14 Simple Linear Regression

Chapter 14 Simple Linear Regression Chapter 4 Smple Lnear Regresson Chapter 4 - Smple Lnear Regresson Manageral decsons often are based on the relatonshp between two or more varables. Regresson analss can be used to develop an equaton showng

More information

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis Resource Allocaton and Decson Analss (ECON 800) Sprng 04 Foundatons of Regresson Analss Readng: Regresson Analss (ECON 800 Coursepak, Page 3) Defntons and Concepts: Regresson Analss statstcal technques

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours UNIVERSITY OF TORONTO Faculty of Arts and Scence December 005 Examnatons STA47HF/STA005HF Duraton - hours AIDS ALLOWED: (to be suppled by the student) Non-programmable calculator One handwrtten 8.5'' x

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed

More information

STAT 511 FINAL EXAM NAME Spring 2001

STAT 511 FINAL EXAM NAME Spring 2001 STAT 5 FINAL EXAM NAME Sprng Instructons: Ths s a closed book exam. No notes or books are allowed. ou may use a calculator but you are not allowed to store notes or formulas n the calculator. Please wrte

More information

Lecture 4 Hypothesis Testing

Lecture 4 Hypothesis Testing Lecture 4 Hypothess Testng We may wsh to test pror hypotheses about the coeffcents we estmate. We can use the estmates to test whether the data rejects our hypothess. An example mght be that we wsh to

More information

Unit 10: Simple Linear Regression and Correlation

Unit 10: Simple Linear Regression and Correlation Unt 10: Smple Lnear Regresson and Correlaton Statstcs 571: Statstcal Methods Ramón V. León 6/28/2004 Unt 10 - Stat 571 - Ramón V. León 1 Introductory Remarks Regresson analyss s a method for studyng the

More information

Fuzzy approach to solve multi-objective capacitated transportation problem

Fuzzy approach to solve multi-objective capacitated transportation problem Internatonal Journal of Bonformatcs Research, ISSN: 0975 087, Volume, Issue, 00, -0-4 Fuzzy aroach to solve mult-objectve caactated transortaton roblem Lohgaonkar M. H. and Bajaj V. H.* * Deartment of

More information

Linear Regression Analysis: Terminology and Notation

Linear Regression Analysis: Terminology and Notation ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented

More information

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9 Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,

More information

Chapter 4: Regression With One Regressor

Chapter 4: Regression With One Regressor Chapter 4: Regresson Wth One Regressor Copyrght 2011 Pearson Addson-Wesley. All rghts reserved. 1-1 Outlne 1. Fttng a lne to data 2. The ordnary least squares (OLS) lne/regresson 3. Measures of ft 4. Populaton

More information

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the Chapter 11 Student Lecture Notes 11-1 Lnear regresson Wenl lu Dept. Health statstcs School of publc health Tanjn medcal unversty 1 Regresson Models 1. Answer What Is the Relatonshp Between the Varables?.

More information

Topic 7: Analysis of Variance

Topic 7: Analysis of Variance Topc 7: Analyss of Varance Outlne Parttonng sums of squares Breakdown the degrees of freedom Expected mean squares (EMS) F test ANOVA table General lnear test Pearson Correlaton / R 2 Analyss of Varance

More information

Hidden Markov Model Cheat Sheet

Hidden Markov Model Cheat Sheet Hdden Markov Model Cheat Sheet (GIT ID: dc2f391536d67ed5847290d5250d4baae103487e) Ths document s a cheat sheet on Hdden Markov Models (HMMs). It resembles lecture notes, excet that t cuts to the chase

More information

Lecture Notes on Linear Regression

Lecture Notes on Linear Regression Lecture Notes on Lnear Regresson Feng L fl@sdueducn Shandong Unversty, Chna Lnear Regresson Problem In regresson problem, we am at predct a contnuous target value gven an nput feature vector We assume

More information

Chapter 7 Generalized and Weighted Least Squares Estimation. In this method, the deviation between the observed and expected values of

Chapter 7 Generalized and Weighted Least Squares Estimation. In this method, the deviation between the observed and expected values of Chapter 7 Generalzed and Weghted Least Squares Estmaton The usual lnear regresson model assumes that all the random error components are dentcally and ndependently dstrbuted wth constant varance. When

More information

Lectures - Week 4 Matrix norms, Conditioning, Vector Spaces, Linear Independence, Spanning sets and Basis, Null space and Range of a Matrix

Lectures - Week 4 Matrix norms, Conditioning, Vector Spaces, Linear Independence, Spanning sets and Basis, Null space and Range of a Matrix Lectures - Week 4 Matrx norms, Condtonng, Vector Spaces, Lnear Independence, Spannng sets and Bass, Null space and Range of a Matrx Matrx Norms Now we turn to assocatng a number to each matrx. We could

More information

STAT 3340 Assignment 1 solutions. 1. Find the equation of the line which passes through the points (1,1) and (4,5).

STAT 3340 Assignment 1 solutions. 1. Find the equation of the line which passes through the points (1,1) and (4,5). (out of 15 ponts) STAT 3340 Assgnment 1 solutons (10) (10) 1. Fnd the equaton of the lne whch passes through the ponts (1,1) and (4,5). β 1 = (5 1)/(4 1) = 4/3 equaton for the lne s y y 0 = β 1 (x x 0

More information

Lecture 16 Statistical Analysis in Biomaterials Research (Part II)

Lecture 16 Statistical Analysis in Biomaterials Research (Part II) 3.051J/0.340J 1 Lecture 16 Statstcal Analyss n Bomaterals Research (Part II) C. F Dstrbuton Allows comparson of varablty of behavor between populatons usng test of hypothess: σ x = σ x amed for Brtsh statstcan

More information

Lecture 10 Support Vector Machines II

Lecture 10 Support Vector Machines II Lecture 10 Support Vector Machnes II 22 February 2016 Taylor B. Arnold Yale Statstcs STAT 365/665 1/28 Notes: Problem 3 s posted and due ths upcomng Frday There was an early bug n the fake-test data; fxed

More information

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise. Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the

More information

Topic- 11 The Analysis of Variance

Topic- 11 The Analysis of Variance Topc- 11 The Analyss of Varance Expermental Desgn The samplng plan or expermental desgn determnes the way that a sample s selected. In an observatonal study, the expermenter observes data that already

More information

e i is a random error

e i is a random error Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where + β + β e for,..., and are observable varables e s a random error How can an estmaton rule be constructed for the unknown

More information

Composite Hypotheses testing

Composite Hypotheses testing Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.

More information

4.3 Poisson Regression

4.3 Poisson Regression of teratvely reweghted least squares regressons (the IRLS algorthm). We do wthout gvng further detals, but nstead focus on the practcal applcaton. > glm(survval~log(weght)+age, famly="bnomal", data=baby)

More information

Maximum Likelihood Estimation and Binary Dependent Variables

Maximum Likelihood Estimation and Binary Dependent Variables MLE and Bnary Deendent Varables Maxmum Lkelhood Estmaton and Bnary Deendent Varables. Startng wth a Smle Examle: Bernoull Trals Lets start wth a smle examle: Teams A and B lay one another 0 tmes; A wns

More information

JAB Chain. Long-tail claims development. ASTIN - September 2005 B.Verdier A. Klinger

JAB Chain. Long-tail claims development. ASTIN - September 2005 B.Verdier A. Klinger JAB Chan Long-tal clams development ASTIN - September 2005 B.Verder A. Klnger Outlne Chan Ladder : comments A frst soluton: Munch Chan Ladder JAB Chan Chan Ladder: Comments Black lne: average pad to ncurred

More information

Homework 10 Stat 547. Problem ) Z D!

Homework 10 Stat 547. Problem ) Z D! Homework 0 Stat 547 Problem 74 Notaton: h s the hazard rate for the aneulod grou, h s the hazard rate for the dlod grou (a Log-rank test s erformed: H 0 : h (t = h (t Sgnfcance level α = 005 Test statstc

More information

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition) Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes

More information

ANSWERS CHAPTER 9. TIO 9.2: If the values are the same, the difference is 0, therefore the null hypothesis cannot be rejected.

ANSWERS CHAPTER 9. TIO 9.2: If the values are the same, the difference is 0, therefore the null hypothesis cannot be rejected. ANSWERS CHAPTER 9 THINK IT OVER thnk t over TIO 9.: χ 2 k = ( f e ) = 0 e Breakng the equaton down: the test statstc for the ch-squared dstrbuton s equal to the sum over all categores of the expected frequency

More information

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva Econ 39 - Statstcal Propertes of the OLS estmator Sanjaya DeSlva September, 008 1 Overvew Recall that the true regresson model s Y = β 0 + β 1 X + u (1) Applyng the OLS method to a sample of data, we estmate

More information

Statistics Chapter 4

Statistics Chapter 4 Statstcs Chapter 4 "There are three knds of les: les, damned les, and statstcs." Benjamn Dsrael, 1895 (Brtsh statesman) Gaussan Dstrbuton, 4-1 If a measurement s repeated many tmes a statstcal treatment

More information

( ) 2 ( ) ( ) Problem Set 4 Suggested Solutions. Problem 1

( ) 2 ( ) ( ) Problem Set 4 Suggested Solutions. Problem 1 Problem Set 4 Suggested Solutons Problem (A) The market demand functon s the soluton to the followng utlty-maxmzaton roblem (UMP): The Lagrangean: ( x, x, x ) = + max U x, x, x x x x st.. x + x + x y x,

More information