PubH 7405: REGRESSION ANALYSIS INTRODUCTION TO POISSON REGRESSION

Size: px
Start display at page:

Download "PubH 7405: REGRESSION ANALYSIS INTRODUCTION TO POISSON REGRESSION"

Transcription

1 PubH 7405: REGRESSION ANALYSIS INTRODUCTION TO POISSON REGRESSION

2 PROTOTYPE EXAMPLE #1 We have data for 44 physcans workng n emergency medcne at a major hosptal system. The concern s about the number of complants each receved the prevous year. Why s t dfferent from physcan to physcan? Could t be explaned usng other factors? Data avalable consst of the number of patent vsts - and four covarates (the revenue, n dollars per hour; work load at the emergency servce, n hours; gender, Female/Male, and resdency tranng n emergency medcne, No/Yes.

3 Queston: Can we do Regresson usng the Normal Error Regresson Model? Possble Issue: The response, Number of Complants, s not on contnuous scale; t s a count of course, not normally dstrbuted.

4 PROTOTYPE EXAMPLE #2 Skn Cancer data for dfferent age groups n two metropoltan areas: Mnneapols-St. Paul and Dallas-Fort Worth: (1) Any age effect? If so, s t the same for the two ctes? (2) Any weather effect? (dfference between two ctes) If so, s t the same for all age groups? Skn Cancer Data Cty: Mnneapols-St. Paul Dallas-Ft. Worth Age Group Cases Populaton Cases Populaton , , , , , , , , , , , , , , , ,538

5 Questons are regresson-type questons (about man effects or margnal contrbutons and about possble effect modfcatons). & the same possble ssue: The Response or Dependent Varable, the Number of Skn Cancer Cases a count, s not on the contnuous scale and not normally dstrbuted.

6 In Regresson Analyss/Model, we usually mpose the condton that the Response Varable Y s on the contnuous scale maybe because of the popular Normal Error Model - not because Y s always on the contnuous scale. In a prevous lecture, the Dependent Varable of nterest was represented by an Bnary or Indcator Varable Y takng on values 0 and 1. The dstrbuton s Bernoull whch s a specal case of the Bnomal Dstrbuton. And we ntroduced Logstc Regresson.

7 We are now ntroducng a new form of regresson, the Posson Regresson, where the Response or Dependent Varable Y represents count data non-negatve ntegers.

8 RARE EVENTS It can be shown that the lmtng form of the bnomal dstrbuton, when n s ncreasngly large (n ) and π s ncreasngly small (π 0) whle θ = nπ (the mean) remans constant, s: Pr(X x θ e = x) = x! = p(x; θ) A random varable havng ths probablty functon s sad to have Posson Dstrbuton P(θ). For example, wth n = 48 and π =.05: b(x = 5; n, π) =.059 p(x = 5; θ) =.060 θ

9 The Posson Model s often used when the random varable X s supposed to represent the number of occurrences of some random event n an unt nterval of tme or space, or some volume of matter; numerous applcatons n health scences have been documented. For example, the number of vrus n a soluton, the number of defectve teeth per ndvdual, the number of focal lesons n vrology, the number of vctms of specfc dseases, the number of cancer deaths per household, the number of nfant deaths n certan localty durng a gven year, among others.

10 The mean and varance of the Posson Dstrbuton are: μ = θ σ 2 = θ (A very specal and strong characterstc where the varance s equal to the mean).

11 REGRESSION NEEDS The Posson model s often used when the random varable X s supposed to represent the number of occurrences of some random event n an nterval of tme or space, or some volume of matter and numerous applcatons n health scences have been documented. In some of these applcatons, one may be nterested n to see f the Posson-dstrbuted dependent varable Y can be predcted from or explaned by other varables. The other varables are called predctors, or explanatory or ndependent varables. For example, we may be nterested n the number of defectve teeth Y per ndvdual as a functon of gender and age of a chld, branch of toothpaste, and whether the famly has or does not have dental nsurance.

12 & refer to Example 1 (Number of complans) and Example 2 (Number of skn cancer cases).

13 POISSON REGRESSION MODEL When the dependent varable Y s assumed to follow a Posson dstrbuton wth mean θ; the Posson regresson model expresses ths mean as a functon of certan ndependent varables X 1, X 2,..., X k, n addton to the sze of the observaton unt from whch one obtaned the count of nterest. For example, f Y s the number of vrus n a soluton then the sze s the volume of the soluton; or f Y s the number of defectve teeth then the sze s the total number of teeth for that same ndvdual.

14 In our frame work, the dependent varable Y s assumed to follow a Posson dstrbuton; ts values y 's are avalable from n observaton unt whch s also characterzed by an ndependent varable X. For the observaton unt ( n), let s be the sze and x be the covarate value. The Posson regresson model assumes that the relatonshp between the mean of Y and the covarate X s descrbed by: E(Y ) = sλ(x ) = s exp(β 0 + β 1 x ) where λ(x ) s called the rsk of/for observaton unt (1 n).

15 The basc ratonale for usng the term rsk s the approxmaton of the Bnomal dstrbuton by the Posson dstrbuton. Recall that, when n goes to nfnty, π tends 0 whle θ = nπ remans constant, the bnomal dstrbuton B(n,π) can be approxmated by the Posson dstrbuton P(θ). The number n s the sze of the observaton unt; so the rato between the mean and the sze represents π (or λ(x) n the new model); that s the probablty or rsk (and, the rato of rsks s called the rsks rato or relatve rsk ).

16 Model wth Several Covarates Suppose we want to consder k covarates, X 1, X 2,, X k, smultaneously. The smple Posson regresson model of prevous secton can be easly generalzed and expressed as: E(Y ) = sλ(x1, x2,...,x = s exp(β + β x = s exp(β k j= 1 1 β k j ) + β x j ) 2 x β where λ(x j s) s called the rsk of/for observaton unt (1 n), x j s the value of the covarate X j measured from subject. 1 x k )

17 EXAMPLE The purpose of ths study was to examne the data for 44 physcans workng for an emergency at a major hosptal so as to determne whch of the followng four varables are related to the number of complants receved durng the prevous year. In addton to the number of complants, served as the dependent varable, data avalable consst of the number of vsts - whch serves as the sze for the observaton unt, the physcan - and four covarates. Table 6.2 presents the complete data set. For each of the 44 physcan there are two contnuous ndependent varables, the revenue (dollars per hour) and work load at the emergency servce (hours) and two bnary varables, gender (Female/Male) and resdency tranng n emergency servces (No/Yes).

18 No. of Vsts Complant Gender Resdency Revenue Hours Y F N M Y M N M N M N M Y M N M N F Y M N F N M N M N F N M Y M Y M N M Y M N M N F Y F Y F N M Y F N F Y M Y M N M Y F Y M N M N M N M N M Y F Y M N M N M Y M Y M N M Y F Y M

19 The nterpretaton or meanng of the Regresson Coeffcents could be seen as follows whch s smlar to the case of the normal Error Regresson Model.

20 MEASURE OF ASSOCIATION Consder the case of a bnary covarate X, say, representng an exposure (1 = exposed, 0 = not exposed). We have the followng: lnλ λ (exposed) λ (unexposed) β0 = β 0 + β = exp(β + lns 1 1 ) + lns f exposed(or x = 1) f unexposed (or x = 0) Ths quantty, represented by exp(β1), s the relatve rsk assocated wth the exposure.

21 Smlarly, we have for a contnuous covarate X and consder any value x of X, lnλ λ (X = x + 1) λ (X = x) = β = 0 β + β exp(β ) + β1x + lns f X = x (x + 1) + lns f X = x + 1 Ths quantty, represented by exp(β1), s the relatve rsk assocated wth one unt ncrease n the value of X, from x to (x+1).

22 ESTIMATION OF PARAMETERS Under the assumpton that Y s dstrbuted as Posson wth the above mean, the Lkelhood Functon s gven by: L(y; β) lnl(y; = β) n = 1 = [s { y lns lny! + y [β0 + β1x ] sexp[β 0 + β1x ]} λ(x )] y exp[ sλ(x y! from whch estmates of the two regresson coeffcents β 0 and β 1 can be obtaned by the Maxmum Lkelhood procedure. )]

23 EXAMPLE #1 Refer to the Emergency Servce data and suppose we want to nvestgate the relatonshp between the number of complants Y (adjusted for number of vsts) and resdency tranng X. It may be perceved that by havng tranng n the specalty a physcan would perform better and, therefore, less lkely to provoke complants. An applcaton of the smple Posson regresson analyss yelds: Varable Coeffcent St Error z-statstc p-value Intercept < No Resdency The result ndcates that the common percepton s almost true, that the relatonshp between the number of complants and no resdency tranng n emergency servce s margnally sgnfcant (p = ); the relatve rsk assocated wth no resdency tranng s: exp(.3041) = 1.36 Those wthout prevous tranng s 36 percent more lkely to receve the same number of complants as those who were traned n the specalty.

24 SAS SAMPLE a SAS program would nclude these nstructons: DATA EMERGENCY; INPUT VISITS CASES RESIDENCY; LN = LOG(VISITS); CARDS; (Data); PROC GENMOD DATA EMERGENCY; CLASS RESIDENCY; MODEL CASES = RESIDENCY/ DIST = POISSON LINK = LOG OFFSET = LN; where EMERGENCY s the name assgned to the data set, VISITS s the number of vsts, CASES s the number of complants (Y), and RESIDENCY (X) s the bnary covarate ndcatng whether the physcan receved resdency tranng n the specalty. The opton CLASS s used to declare that the covarate s categorcal.

25 Model wth Several Covarates Suppose we want to consder k covarates, X 1, X 2,, X k, smultaneously. The model s: E(Y ) = sλ(x1, x2,...,x = s exp(β + β x = s exp(β k j= 1 1 k β x j ) + β j ) 2 x β where λ(x j s) s called the rsk of/for observaton unt (1 n), x j s the value of the covarate X j measured from subject. L(y; β) lnl(y; = β) n = 1 = [s λ(x j y lns )] y exp[ s λ(x y lny!! + y [β 1 0 x j k + ) )] k j= 1 β j x j ] s exp[β 0 + k j= 1 β j x j ]

26 Also smlar to the smple regresson case, exp(β ) represents: () The Relatve Rsk assocated wth, say, an exposure f X s bnary (exposed or X =1 versus unexposed or X =0), or: () The Relatve Rsk due to one unt ncrease n the value of X f X s contnuous (X =x+1 versus X =x).

27 After estmates b s of regresson coeffcents β s and ther standard errors have been obtaned, a 95 percent confdence nterval for the log of the Relatve Rsk assocated wth the th factor s gven by: b ± SE(b ). These results are necessary n the effort to dentfy mportant rsk factors for the Posson outcome, the count.

28 Note: We form confdence ntervals of the regresson coeffcent before exponentatng (the endponts) to obtan relatve rsks.

29 Before such analyses are done, the problem and the data have to be examned carefully. If some of the varables are hghly correlated, then one or fewer of the correlated factors are lkely to be as good predctors as all of them; nformaton from other smlar studes also has to be ncorporated so as to drop some of these correlated explanatory varables. The uses of products, such as x 1 *x 2, and hgher power terms, such as x 12, may be necessary and can mprove the goodness-of-ft.

30 We are assumng a log-lnear regresson model n whch, for example, the Relatve Rsk due to one unt ncrease n the value of a contnuous X (X = x+1 versus X = x) s ndependent of x. Therefore, f ths lnearty seems to be volated, the ncorporaton of powers of X should be serously consdered. The use of products wll help n the nvestgaton of possble effect modfcatons.

31 There are no smple dagnostc tool, such as Scatter Dagram for Normal Error Regresson Model, for detectng lack of lnearty. One mght consder to nclude a quadratc term and see f t s sgnfcant.

32 And, fnally, the messy problem of mssng data; most packaged programs would delete a subject f one or more covarate values are mssng.

33 TESTING HYPOTHESES Once we have ft a multple Posson regresson model and obtaned estmates for the varous parameters of nterest, we want to answer questons about the contrbutons of varous factors to the predcton of the Posson-dstrbuted response varable. There are three types of such questons: () An overall test: taken collectvely, does the entre set of explanatory or ndependent varables contrbute sgnfcantly to the predcton of response? () Test for the value of a sngle factor: does the addton of one partcular varable of nterest add sgnfcantly to the predcton of response over and above that acheved by other ndependent varables? () Test for contrbuton of a group of varables: does the addton of a group of varables add sgnfcantly to the predcton of response over and above that acheved by other ndependent varables?

34 OVERALLL REGRESSION EFFECTS We frst consder the frst queston stated above concernng an overall test for a model contanng k factors. The null hypothess for ths test may be stated as: "all k ndependent varables consdered together do not explan the varaton n the response any more than the sze alone". In other words, H : β = β = = β = k 0

35 H : β = β =... = β = o 1 2 k 0 Snce: E(Y ) = sλ(x1, x2,..., x = s exp(β + β x = s exp(β k j= 1 If H 0 s true, the average/mean response has nothng to do wth the predctors. 1 β k j ) + β x j ) 2 x β k x k )

36 H0 : β1 = β2 = = βk = 0 Ths can be tested usng the Lkelhood Rato Ch-square test at k degrees of freedom: X 2 = 2{lnL k lnl 0 } where lnl k s the log lkelhood value for the model contanng all k covarates and lnl0 s the log lkelhood value for the model contanng only the ntercept. Computer packaged program, such as SAS PROC GENMOD, provdes these log lkelhood values but n separate runs.

37 If ths overall test s sgnfcant, t only means one or some of the regresson coeffcents (the slopes ) not equal to zero; t does not tell us whch coeffcents are not zero or whch factor or factors are related to the response. Ths s smlar to performng the oneway ANOVA F-test before checkng for parwse dfferences.

38 TESTING FOR SINGLE FACTOR The queston s: Does the addton of one partcular factor of nterest add sgnfcantly to the predcton of Dependent Varable over and above that acheved by other factors?. The Null Hypothess for ths test may stated as: "Factor X does not have any value added to the explan the varaton n Y-values over and above that acheved by other factors ". In other words, H : β = 0 0

39 The Null Hypothess s H Regardless of the number of varables n the model, one smple approach s usng b z = SE(b ) where b s the correspondng estmated regresson coeffcent for factor X and SE(b ) s the estmate of the standard error of b, both of whch are prnted by standard computer packaged programs such as SAS. In performng ths test, we refer the value of the z score to percentles of the standard normal dstrbuton; for example, we compare the absolute value of z to 1.96 for a two-sded test at the 5 percent level. Note: No use of t-dstrbuton here. : β 0 = 0

40 EXAMPLE #2 Refer to the data set on emergency servce; Usng all four covarates, we found that only the effect of work load (Hours) s sgnfcant at the 5 percent level. Varable Coeffcent St Error z-statstc p-value Intercept < No Resdency Female Revenue Hours

41 Gven a contnuous varable of nterest, one can ft a polynomal model and use ths type of test to check for lnearty. It can also be used to check for a sngle product representng an effect modfcaton. For example, to focus on the quadratc term of: E(Y ) = s λ(x ) = s exp(β 0 + β 1 Hour + β 2 Hour 2 )

42 TESTING FOR A GROUP OF VARABLES The queston s: Does the addton of a group of factors add sgnfcantly to the predcton of Y over and above that acheved by other factors? The Null Hypothess for ths test may stated as: "Factors {X +1, X +2,, X +m }, consdered together as a group, do not have any value added to the predcton of the Mean of Y that other factors are already ncluded n the model". In other words, H0 : β+ 1 = β+ 2 =... = β+ m = 0

43 Ths multple contrbuton test s often used to test whether a smlar group of varables, such as demographc characterstcs, s mportant for the predcton of the mean of Y; these varables have some trat n common. Another applcaton: collecton of powers and/or product terms. It s of nterest to assess powers & nteracton effects collectvely before consderng ndvdual nteracton terms n a model. It reduces the total number of tests & helps to provde better control of overall Type I error rates whch may be nflated due to multple testng.

44 The process can also be used to test for the contrbuton of a categorcal covarate whch s represented by several dummy varables.

45 EXAMPLE #3 In ths example, the dependent varable s the number of cases of skn cancer. Data nvolve only two covarates; age and locaton, both are categorcal. We use seven dummy varables to represent the eght age groups (wth 85+ age group beng the baselne) and one for locaton (wth Mnneapols-St. Paul as the baselne) Skn Cancer Data Cty: Mnneapols-St. Paul Dallas-Ft. Worth Age Group Cases Populaton Cases Populaton , , , , , , , , , , , , , , , ,538

46 SEQUENTIAL ADJUSTMENT In the type 3 analyss, we test the sgnfcance of the effect of each factor added to the model contanng all other factors lke n most common multple regresson analyses; that s to nvestgate the addtonal contrbuton of the factor to the explanaton of the dependent varable. Sometmes, however, we may be nterested n a herarchcal or sequental adjustment. For example, we focus on the quadratc term (adjusted) n addton to the regular term (unadjusted). Ths can be acheved usng PROC GENMOD by requestng the type 1 analyss opton

47 OVERDISPERSION The Posson s a very specal dstrbuton; ts mean µ and ts varance σ 2 are equal. If we use the varance-mean rato as a dsperson parameter then t s 1 n a standard Posson model, less than 1 n an under-dspersed model, and greater than 1 n an over-dspersed model. Over-dsperson s a common phenomenon n practce and t causes concerns because the mplcaton s serous; the analyss whch assumes the Posson model often under-estmates standard errors and, thus, wrongly nflates the level of sgnfcance.

48 MEASURING OVERDISPERSION After a Posson regresson model s ftted, dsperson s measured by the scaled devance or scaled Pearson ch-square; t s the devance or Pearson ch-square dvded by the degrees of freedom (number of observatons mnus number of parameters). The devance s defned as twce the dfference between the maxmum achevable log lkelhood and the log lkelhood at the maxmum lkelhood estmates of the regresson parameters.

49 EXAMPLE Refer to the data set on emergency servce wth all four covarates, we can see that both ndces are greater than 1 ndcatng an overdsperson. In ths example, we have a sample sze of 44 but fve degrees of freedom lost due to the estmaton of the fve regresson parameters, ncludng the ntercept. Crteron df Value Scaled Value Devance Pearson Ch-Square

50 FITTING OVERDISPERSED MODEL PROC GENMOD allows the specfcaton of a scale parameter to ft over-dspersed Posson regresson models. Instead of a varance equal to the mean, Var(Y) = µ t allows the varance functon to have a multplcatve over-dsperson factor ϕ (specfed by users): Var(Y) = ϕµ

51 EXAMPLE #4 Refer to the data set on emergency servce; Usng all four covarates, we have the followng results by fttng the regular Posson Model. Varable Coeffcent St Error z-statstc p-value Intercept < No Resdency Female Revenue Hours Crteron df Value Scaled Value Devance Pearson Ch-Square

52 The model could be ftted n the usual way, and the pont estmates of regresson coeffcent are not affected. The covarance matrx, however, s multpled by ϕ. There are two optons avalable for fttng over-dspersed models; the users can control ether the scaled devance (by specfyng DSCALE n the model statement) or the scaled Pearson ch-square (by specfyng PSCALE n the model statement). The value of the controlled ndex becomes 1; the value of the other s close to but may not be equal to 1.

53 Note: a SAS program would nclude ths nstructon: MODEL CASES = GENDER RESIDENCY REVENUE HOURS/ DIST = POISSON LINK = LOG OFFSET = LN(DSCALE);

54 NEW RESULTS Varable Coeffcent St Error z-statstc p-value Intercept < No Resdency Female Revenue Hours Crteron df Value Scaled Value Devance Pearson Ch-Square As compared to the results of the regular model, the pont estmates reman the same but the standard errors are larger; the effect of work load (Hours) s no longer sgnfcant at the 5 percent level.

55 STEPWISE REGRESSION In many applcatons, our major nterest s to dentfy mportant rsk factors. In other words, we wsh to dentfy from many avalable factors a small subset of factors that relate sgnfcantly to the outcome, e.g. the dsease under nvestgaton. In that dentfcaton process, of course, we wsh to avod a large Type I ( false postve ) error. In a regresson analyss, a Type I error corresponds to ncludng a predctor that has no real relatonshp to the outcome; such an ncluson can greatly confuse the nterpretaton of the regresson results. One popular procedure s Stepwse Regresson

56 Wth Posson Regresson, t s stll desrable and possble to perform stepwse regresson. Unfortunately, PROC GENMOD does not have an automatc opton; one has to run t many tmes and, at each step, choose to add or remove a varable manually.

57 Wth Posson Regresson, we lost these tools that we use wth NERM: R 2 and all coeffcents of partal determnaton (there are some substtutes but not as good) All graphs (Scatter dagram, all resdual plots, and Varable-added Plot) Least Squares method (but MLE s better) All other methods/tools are unchanged (test for sngle factors, stepwse, etc )

58 However, f Y s dstrbuted as Posson, try Y = Y; t both stablzes the varance and mproves normalty. Then you could get approxmate results usng PROC REG and the Normal Error Regresson Model (If Y s bnary, you would have no choce but Logstc Regresson).

59 Readngs & Exercses Readngs: A thorough readng of the text s secton 14.3 (pp ) s hghly recommended. Exercses: None

60 Due As Homework 22.1 Refer to dataset Emergency Servce, let Y = Number of Complants and four ndependent varables, X 1 = Gender, X 2 = Resdency, X 3 = Revenue, and X 4 = Hours; choose codng 0/1 for X1 and X2. a) Ft the Posson Regresson Model usng PROC GENMOD and confrm the results of Example #2. b) Usng the same data set, take the square root of Y and use as the new Dependent Varable and ft the Normal Error Regresson Model usng PROC REG; draw your conclusons on effects of covarates. c) Comment on the smlartes or dfferences between the two sets of results obtaned n (a) and (b).

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

Polynomial Regression Models

Polynomial Regression Models LINEAR REGRESSION ANALYSIS MODULE XII Lecture - 6 Polynomal Regresson Models Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Test of sgnfcance To test the sgnfcance

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

Chapter 15 - Multiple Regression

Chapter 15 - Multiple Regression Chapter - Multple Regresson Chapter - Multple Regresson Multple Regresson Model The equaton that descrbes how the dependent varable y s related to the ndependent varables x, x,... x p and an error term

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 14 Multple Regresson Models 1999 Prentce-Hall, Inc. Chap. 14-1 Chapter Topcs The Multple Regresson Model Contrbuton of Indvdual Independent Varables

More information

STAT 405 BIOSTATISTICS (Fall 2016) Handout 15 Introduction to Logistic Regression

STAT 405 BIOSTATISTICS (Fall 2016) Handout 15 Introduction to Logistic Regression STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson Ths handout covers materal found n Secton 3.7 of your text. You may also want to revew regresson technques n Chapter. In ths handout,

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition) Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students. PPOL 59-3 Problem Set Exercses n Smple Regresson Due n class /8/7 In ths problem set, you are asked to compute varous statstcs by hand to gve you a better sense of the mechancs of the Pearson correlaton

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

4.3 Poisson Regression

4.3 Poisson Regression of teratvely reweghted least squares regressons (the IRLS algorthm). We do wthout gvng further detals, but nstead focus on the practcal applcaton. > glm(survval~log(weght)+age, famly="bnomal", data=baby)

More information

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation Econ 388 R. Butler 204 revsons Lecture 4 Dummy Dependent Varables I. Lnear Probablty Model: the Regresson model wth a dummy varables as the dependent varable assumpton, mplcaton regular multple regresson

More information

Chapter 15 Student Lecture Notes 15-1

Chapter 15 Student Lecture Notes 15-1 Chapter 15 Student Lecture Notes 15-1 Basc Busness Statstcs (9 th Edton) Chapter 15 Multple Regresson Model Buldng 004 Prentce-Hall, Inc. Chap 15-1 Chapter Topcs The Quadratc Regresson Model Usng Transformatons

More information

Diagnostics in Poisson Regression. Models - Residual Analysis

Diagnostics in Poisson Regression. Models - Residual Analysis Dagnostcs n Posson Regresson Models - Resdual Analyss 1 Outlne Dagnostcs n Posson Regresson Models - Resdual Analyss Example 3: Recall of Stressful Events contnued 2 Resdual Analyss Resduals represent

More information

STATISTICS QUESTIONS. Step by Step Solutions.

STATISTICS QUESTIONS. Step by Step Solutions. STATISTICS QUESTIONS Step by Step Solutons www.mathcracker.com 9//016 Problem 1: A researcher s nterested n the effects of famly sze on delnquency for a group of offenders and examnes famles wth one to

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data Lab : TWO-LEVEL NORMAL MODELS wth school chldren popularty data Purpose: Introduce basc two-level models for normally dstrbuted responses usng STATA. In partcular, we dscuss Random ntercept models wthout

More information

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε Chapter 3 Secton 3.1 Model Assumptons: Multple Regresson Model Predcton Equaton Std. Devaton of Error Correlaton Matrx Smple Lnear Regresson: 1.) Lnearty.) Constant Varance 3.) Independent Errors 4.) Normalty

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U) Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of

More information

Statistics for Business and Economics

Statistics for Business and Economics Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear

More information

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y) Secton 1.5 Correlaton In the prevous sectons, we looked at regresson and the value r was a measurement of how much of the varaton n y can be attrbuted to the lnear relatonshp between y and x. In ths secton,

More information

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution Department of Statstcs Unversty of Toronto STA35HS / HS Desgn and Analyss of Experments Term Test - Wnter - Soluton February, Last Name: Frst Name: Student Number: Instructons: Tme: hours. Ads: a non-programmable

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson

More information

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9 Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed

More information

Statistics II Final Exam 26/6/18

Statistics II Final Exam 26/6/18 Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the

More information

January Examinations 2015

January Examinations 2015 24/5 Canddates Only January Examnatons 25 DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR STUDENT CANDIDATE NO.. Department Module Code Module Ttle Exam Duraton (n words)

More information

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 1 Chapters 14, 15 & 16 Professor Ahmad, Ph.D. Department of Management Revsed August 005 Chapter 14 Formulas Smple Lnear Regresson Model: y =

More information

Chapter 14: Logit and Probit Models for Categorical Response Variables

Chapter 14: Logit and Probit Models for Categorical Response Variables Chapter 4: Logt and Probt Models for Categorcal Response Varables Sect 4. Models for Dchotomous Data We wll dscuss only ths secton of Chap 4, whch s manly about Logstc Regresson, a specal case of the famly

More information

Chapter 9: Statistical Inference and the Relationship between Two Variables

Chapter 9: Statistical Inference and the Relationship between Two Variables Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,

More information

First Year Examination Department of Statistics, University of Florida

First Year Examination Department of Statistics, University of Florida Frst Year Examnaton Department of Statstcs, Unversty of Florda May 7, 010, 8:00 am - 1:00 noon Instructons: 1. You have four hours to answer questons n ths examnaton.. You must show your work to receve

More information

Economics 130. Lecture 4 Simple Linear Regression Continued

Economics 130. Lecture 4 Simple Linear Regression Continued Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

Basically, if you have a dummy dependent variable you will be estimating a probability.

Basically, if you have a dummy dependent variable you will be estimating a probability. ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy

More information

Properties of Least Squares

Properties of Least Squares Week 3 3.1 Smple Lnear Regresson Model 3. Propertes of Least Squares Estmators Y Y β 1 + β X + u weekly famly expendtures X weekly famly ncome For a gven level of x, the expected level of food expendtures

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

Linear Regression Analysis: Terminology and Notation

Linear Regression Analysis: Terminology and Notation ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented

More information

Chapter 14 Simple Linear Regression

Chapter 14 Simple Linear Regression Chapter 4 Smple Lnear Regresson Chapter 4 - Smple Lnear Regresson Manageral decsons often are based on the relatonshp between two or more varables. Regresson analss can be used to develop an equaton showng

More information

Reminder: Nested models. Lecture 9: Interactions, Quadratic terms and Splines. Effect Modification. Model 1

Reminder: Nested models. Lecture 9: Interactions, Quadratic terms and Splines. Effect Modification. Model 1 Lecture 9: Interactons, Quadratc terms and Splnes An Manchakul amancha@jhsph.edu 3 Aprl 7 Remnder: Nested models Parent model contans one set of varables Extended model adds one or more new varables to

More information

NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION MTH352/MH3510 Regression Analysis

NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION MTH352/MH3510 Regression Analysis NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION 014-015 MTH35/MH3510 Regresson Analyss December 014 TIME ALLOWED: HOURS INSTRUCTIONS TO CANDIDATES 1. Ths examnaton paper contans FOUR (4) questons

More information

Statistics MINITAB - Lab 2

Statistics MINITAB - Lab 2 Statstcs 20080 MINITAB - Lab 2 1. Smple Lnear Regresson In smple lnear regresson we attempt to model a lnear relatonshp between two varables wth a straght lne and make statstcal nferences concernng that

More information

Introduction to Regression

Introduction to Regression Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes

More information

Introduction to Generalized Linear Models

Introduction to Generalized Linear Models INTRODUCTION TO STATISTICAL MODELLING TRINITY 00 Introducton to Generalzed Lnear Models I. Motvaton In ths lecture we extend the deas of lnear regresson to the more general dea of a generalzed lnear model

More information

Composite Hypotheses testing

Composite Hypotheses testing Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

ANSWERS CHAPTER 9. TIO 9.2: If the values are the same, the difference is 0, therefore the null hypothesis cannot be rejected.

ANSWERS CHAPTER 9. TIO 9.2: If the values are the same, the difference is 0, therefore the null hypothesis cannot be rejected. ANSWERS CHAPTER 9 THINK IT OVER thnk t over TIO 9.: χ 2 k = ( f e ) = 0 e Breakng the equaton down: the test statstc for the ch-squared dstrbuton s equal to the sum over all categores of the expected frequency

More information

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE)

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE) ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE) June 7, 016 15:30 Frst famly name: Name: DNI/ID: Moble: Second famly Name: GECO/GADE: Instructor: E-mal: Queston 1 A B C Blank Queston A B C Blank Queston

More information

STAT 511 FINAL EXAM NAME Spring 2001

STAT 511 FINAL EXAM NAME Spring 2001 STAT 5 FINAL EXAM NAME Sprng Instructons: Ths s a closed book exam. No notes or books are allowed. ou may use a calculator but you are not allowed to store notes or formulas n the calculator. Please wrte

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours UNIVERSITY OF TORONTO Faculty of Arts and Scence December 005 Examnatons STA47HF/STA005HF Duraton - hours AIDS ALLOWED: (to be suppled by the student) Non-programmable calculator One handwrtten 8.5'' x

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

NUMERICAL DIFFERENTIATION

NUMERICAL DIFFERENTIATION NUMERICAL DIFFERENTIATION 1 Introducton Dfferentaton s a method to compute the rate at whch a dependent output y changes wth respect to the change n the ndependent nput x. Ths rate of change s called the

More information

Andreas C. Drichoutis Agriculural University of Athens. Abstract

Andreas C. Drichoutis Agriculural University of Athens. Abstract Heteroskedastcty, the sngle crossng property and ordered response models Andreas C. Drchouts Agrculural Unversty of Athens Panagots Lazards Agrculural Unversty of Athens Rodolfo M. Nayga, Jr. Texas AMUnversty

More information

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis Resource Allocaton and Decson Analss (ECON 800) Sprng 04 Foundatons of Regresson Analss Readng: Regresson Analss (ECON 800 Coursepak, Page 3) Defntons and Concepts: Regresson Analss statstcal technques

More information

ANOVA. The Observations y ij

ANOVA. The Observations y ij ANOVA Stands for ANalyss Of VArance But t s a test of dfferences n means The dea: The Observatons y j Treatment group = 1 = 2 = k y 11 y 21 y k,1 y 12 y 22 y k,2 y 1, n1 y 2, n2 y k, nk means: m 1 m 2

More information

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes 25/6 Canddates Only January Examnatons 26 Student Number: Desk Number:...... DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR Department Module Code Module Ttle Exam Duraton

More information

LOGIT ANALYSIS. A.K. VASISHT Indian Agricultural Statistics Research Institute, Library Avenue, New Delhi

LOGIT ANALYSIS. A.K. VASISHT Indian Agricultural Statistics Research Institute, Library Avenue, New Delhi LOGIT ANALYSIS A.K. VASISHT Indan Agrcultural Statstcs Research Insttute, Lbrary Avenue, New Delh-0 02 amtvassht@asr.res.n. Introducton In dummy regresson varable models, t s assumed mplctly that the dependent

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 31 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 6. Rdge regresson The OLSE s the best lnear unbased

More information

PubH 7405: REGRESSION ANALYSIS. SLR: INFERENCES, Part II

PubH 7405: REGRESSION ANALYSIS. SLR: INFERENCES, Part II PubH 7405: REGRESSION ANALSIS SLR: INFERENCES, Part II We cover te topc of nference n two sessons; te frst sesson focused on nferences concernng te slope and te ntercept; ts s a contnuaton on estmatng

More information

Limited Dependent Variables

Limited Dependent Variables Lmted Dependent Varables. What f the left-hand sde varable s not a contnuous thng spread from mnus nfnty to plus nfnty? That s, gven a model = f (, β, ε, where a. s bounded below at zero, such as wages

More information

The Ordinary Least Squares (OLS) Estimator

The Ordinary Least Squares (OLS) Estimator The Ordnary Least Squares (OLS) Estmator 1 Regresson Analyss Regresson Analyss: a statstcal technque for nvestgatng and modelng the relatonshp between varables. Applcatons: Engneerng, the physcal and chemcal

More information

Linear Approximation with Regularization and Moving Least Squares

Linear Approximation with Regularization and Moving Least Squares Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...

More information

9. Binary Dependent Variables

9. Binary Dependent Variables 9. Bnar Dependent Varables 9. Homogeneous models Log, prob models Inference Tax preparers 9.2 Random effects models 9.3 Fxed effects models 9.4 Margnal models and GEE Appendx 9A - Lkelhood calculatons

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Maxmum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models

More information

18. SIMPLE LINEAR REGRESSION III

18. SIMPLE LINEAR REGRESSION III 8. SIMPLE LINEAR REGRESSION III US Domestc Beers: Calores vs. % Alcohol Ftted Values and Resduals To each observed x, there corresponds a y-value on the ftted lne, y ˆ ˆ = α + x. The are called ftted values.

More information

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the Chapter 11 Student Lecture Notes 11-1 Lnear regresson Wenl lu Dept. Health statstcs School of publc health Tanjn medcal unversty 1 Regresson Models 1. Answer What Is the Relatonshp Between the Varables?.

More information

Correlation and Regression

Correlation and Regression Correlaton and Regresson otes prepared by Pamela Peterson Drake Index Basc terms and concepts... Smple regresson...5 Multple Regresson...3 Regresson termnology...0 Regresson formulas... Basc terms and

More information

PBAF 528 Week Theory Is the variable s place in the equation certain and theoretically sound? Most important! 2. T-test

PBAF 528 Week Theory Is the variable s place in the equation certain and theoretically sound? Most important! 2. T-test PBAF 528 Week 6 How do we choose our model? How do you decde whch ndependent varables? If you want to read more about ths, try Studenmund, A.H. Usng Econometrcs Chapter 7. (ether 3 rd or 4 th Edtons) 1.

More information

28. SIMPLE LINEAR REGRESSION III

28. SIMPLE LINEAR REGRESSION III 8. SIMPLE LINEAR REGRESSION III Ftted Values and Resduals US Domestc Beers: Calores vs. % Alcohol To each observed x, there corresponds a y-value on the ftted lne, y ˆ = βˆ + βˆ x. The are called ftted

More information

Module Contact: Dr Susan Long, ECO Copyright of the University of East Anglia Version 1

Module Contact: Dr Susan Long, ECO Copyright of the University of East Anglia Version 1 UNIVERSITY OF EAST ANGLIA School of Economcs Man Seres PG Examnaton 016-17 ECONOMETRIC METHODS ECO-7000A Tme allowed: hours Answer ALL FOUR Questons. Queston 1 carres a weght of 5%; Queston carres 0%;

More information

Multinomial logit regression

Multinomial logit regression 07/0/6 Multnomal logt regresson Introducton We now turn our attenton to regresson models for the analyss of categorcal dependent varables wth more than two response categores: Y car owned (many possble

More information

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise. Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the

More information

CHAPTER 6 GOODNESS OF FIT AND CONTINGENCY TABLE PREPARED BY: DR SITI ZANARIAH SATARI & FARAHANIM MISNI

CHAPTER 6 GOODNESS OF FIT AND CONTINGENCY TABLE PREPARED BY: DR SITI ZANARIAH SATARI & FARAHANIM MISNI CHAPTER 6 GOODNESS OF FIT AND CONTINGENCY TABLE Expected Outcomes Able to test the goodness of ft for categorcal data. Able to test whether the categorcal data ft to the certan dstrbuton such as Bnomal,

More information

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number

More information

e i is a random error

e i is a random error Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where + β + β e for,..., and are observable varables e s a random error How can an estmaton rule be constructed for the unknown

More information

Biostatistics 360 F&t Tests and Intervals in Regression 1

Biostatistics 360 F&t Tests and Intervals in Regression 1 Bostatstcs 360 F&t Tests and Intervals n Regresson ORIGIN Model: Y = X + Corrected Sums of Squares: X X bar where: s the y ntercept of the regresson lne (translaton) s the slope of the regresson lne (scalng

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VIII LECTURE - 34 ANALYSIS OF VARIANCE IN RANDOM-EFFECTS MODEL AND MIXED-EFFECTS EFFECTS MODEL Dr Shalabh Department of Mathematcs and Statstcs Indan

More information

Lecture 3: Probability Distributions

Lecture 3: Probability Distributions Lecture 3: Probablty Dstrbutons Random Varables Let us begn by defnng a sample space as a set of outcomes from an experment. We denote ths by S. A random varable s a functon whch maps outcomes nto the

More information

Lectures - Week 4 Matrix norms, Conditioning, Vector Spaces, Linear Independence, Spanning sets and Basis, Null space and Range of a Matrix

Lectures - Week 4 Matrix norms, Conditioning, Vector Spaces, Linear Independence, Spanning sets and Basis, Null space and Range of a Matrix Lectures - Week 4 Matrx norms, Condtonng, Vector Spaces, Lnear Independence, Spannng sets and Bass, Null space and Range of a Matrx Matrx Norms Now we turn to assocatng a number to each matrx. We could

More information

Lecture 4 Hypothesis Testing

Lecture 4 Hypothesis Testing Lecture 4 Hypothess Testng We may wsh to test pror hypotheses about the coeffcents we estmate. We can use the estmates to test whether the data rejects our hypothess. An example mght be that we wsh to

More information

Answers Problem Set 2 Chem 314A Williamsen Spring 2000

Answers Problem Set 2 Chem 314A Williamsen Spring 2000 Answers Problem Set Chem 314A Wllamsen Sprng 000 1) Gve me the followng crtcal values from the statstcal tables. a) z-statstc,-sded test, 99.7% confdence lmt ±3 b) t-statstc (Case I), 1-sded test, 95%

More information

= z 20 z n. (k 20) + 4 z k = 4

= z 20 z n. (k 20) + 4 z k = 4 Problem Set #7 solutons 7.2.. (a Fnd the coeffcent of z k n (z + z 5 + z 6 + z 7 + 5, k 20. We use the known seres expanson ( n+l ( z l l z n below: (z + z 5 + z 6 + z 7 + 5 (z 5 ( + z + z 2 + z + 5 5

More information

Lecture 2: Prelude to the big shrink

Lecture 2: Prelude to the big shrink Lecture 2: Prelude to the bg shrnk Last tme A slght detour wth vsualzaton tools (hey, t was the frst day... why not start out wth somethng pretty to look at?) Then, we consdered a smple 120a-style regresson

More information

Topic- 11 The Analysis of Variance

Topic- 11 The Analysis of Variance Topc- 11 The Analyss of Varance Expermental Desgn The samplng plan or expermental desgn determnes the way that a sample s selected. In an observatonal study, the expermenter observes data that already

More information

Lecture 16 Statistical Analysis in Biomaterials Research (Part II)

Lecture 16 Statistical Analysis in Biomaterials Research (Part II) 3.051J/0.340J 1 Lecture 16 Statstcal Analyss n Bomaterals Research (Part II) C. F Dstrbuton Allows comparson of varablty of behavor between populatons usng test of hypothess: σ x = σ x amed for Brtsh statstcan

More information