Introduction to Generalized Linear Models


INTRODUCTION TO STATISTICAL MODELLING TRINITY 00

I. Motivation

In this lecture we extend the ideas of linear regression to the more general idea of a generalized linear model (GLM). The essence of linear models is that the response variable is continuous and normally distributed: here we relax these assumptions and consider cases where the response variable is non-normal and in particular has a discrete distribution. Although these models are more general than linear models, nearly all of the techniques for testing hypotheses regarding the regression coefficients, and for checking the assumptions of the model, apply directly to a GLM. In addition, a linear model is simply a special type of generalized linear model, and thus all of the discussion below applies equally to linear models.

In an earlier lecture we saw that a typical statistical model can be expressed as an equation that equates the mean(s) of the response variable(s) to some function of a linear combination of the explanatory variables:

$$E[Y \mid X = x] = \eta(\beta_0 + \beta_1 X_1 + \dots + \beta_p X_p) = \eta[LC(X; \beta)]. \qquad (1)$$

In equation (1), the form of the function $\eta(\cdot)$ is known, as are $Y$ and $X$ (the latter for a particular choice of explanatory variables). However, the parameters of the model, $\beta = (\beta_0, \beta_1, \dots, \beta_p)$, are not known and must be estimated. The simple linear model is a special case of the above in which the function $\eta(\cdot)$ is the identity function.

Generalized linear models extend the ideas underlying the simple linear model

$$\mu_i = E[Y_i \mid X] = \beta_0 + \beta_1 X_{i1} + \beta_2 X_{i2} + \dots + \beta_p X_{ip}, \qquad Y_i \sim N(\mu_i, \sigma^2), \qquad (2)$$

where the $Y_i$ are independent, to the following more general situations:

1. Response variables can follow distributions other than the Normal distribution. They can be discrete, or logical (one of two categories).
2. The relationship between the response and the explanatory variables does not have to take on the simple linear form above.
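As a minimal sketch of equation (1), the mean of the response can be computed by passing the linear combination through a known function $\eta$. The function names, coefficient values and the two non-identity choices of $\eta$ below are illustrative assumptions, not part of the notes.

```python
import math

def linear_combination(beta, x):
    """Return beta_0 + beta_1*x_1 + ... + beta_p*x_p (beta has one more entry than x)."""
    return beta[0] + sum(b * xj for b, xj in zip(beta[1:], x))

def mean_response(beta, x, eta=lambda lc: lc):
    """Mean of Y given x: eta applied to the linear combination (identity by default)."""
    return eta(linear_combination(beta, x))

# Hypothetical coefficients and covariate values, chosen only for illustration.
beta = [0.5, 1.0, -2.0]
x = [2.0, 0.25]

identity_mean = mean_response(beta, x)                 # simple linear model
exp_mean = mean_response(beta, x, math.exp)            # a mean constrained to be positive
logistic_mean = mean_response(
    beta, x, lambda lc: 1.0 / (1.0 + math.exp(-lc))    # a mean constrained to (0, 1)
)
```

With the identity function the sketch reduces to the simple linear model; the other two choices show how the same linear combination can describe a positive mean or a probability.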

GLMs are based on the exponential family of distributions, which shares many of the desirable statistical properties of the Normal distribution. Consider a single random variable $Y$ whose probability distribution depends only on a single parameter $\theta$. The distribution of $Y$ belongs to the exponential family if it can be written in the form

$$f(y; \theta) = \exp\left[ \frac{y\theta - b(\theta)}{a(\phi)} + c(y, \phi) \right] \qquad (3)$$

where $a$, $b$ and $c$ are known functions. Furthermore, $\phi$ is a scale parameter that is treated as a nuisance parameter if it is unknown. If $\phi$ is known, this is an exponential-family model with canonical parameter $\theta$. Some well-known distributions that belong to the exponential family include the Normal, exponential, Poisson and binomial distributions.

For example, consider the discrete random variable $Y$ that follows a Poisson distribution with parameter $\lambda$. The probability function for $Y$ is

$$f(y; \lambda) = \frac{\lambda^{y} e^{-\lambda}}{y!}$$

where $y$ takes the values $0, 1, 2, \dots$. This can be rewritten as

$$f(y; \lambda) = \exp(y \log\lambda - \lambda - \log y!) = \exp(y\theta - \exp(\theta) - \log y!) = f(y; \theta).$$

This is exactly the form in (3) with $\theta = \log\lambda$, $\phi = 1$, $a(\phi) = 1$, $b(\theta) = \exp(\theta)$ and $c(y, \phi) = -\log y!$. Similarly, all the other distributions in the exponential family can be rewritten in form (3).

In the case of GLMs we require an extension of numerical estimation methods to estimate the parameters $\beta$ from the linear model in (2), to a more general situation where there is some non-linear function, $g$, relating $E(Y) = \mu$ to the linear component $x'\beta$, that is

$$g(\mu) = x'\beta,$$

where $g$ is called the link function. The estimation of the parameters is typically based on maximum likelihood. Although explicit mathematical expressions can be found for the estimators in some special cases, numerical optimisation methods are usually needed. These methods are included in most modern statistical software packages. It is not the aim of this course to go into any detail on estimation, since we will focus on the application of these models rather than their estimation.

GLMs are extensively used in the analysis of binary data (e.g. logistic regression) and count data (e.g. Poisson and log-linear models). We will consider some of these applications in the following lecture. In this lecture we will introduce the basic GLM setup and assumptions.
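The Poisson rewriting above can be checked numerically. The sketch below evaluates the general form (3) with $\theta = \log\lambda$, $a(\phi) = 1$, $b(\theta) = \exp(\theta)$ and $c(y, \phi) = -\log y!$, and compares it with the usual Poisson probability function. The function names and the value of $\lambda$ are assumptions made for illustration.

```python
import math

def poisson_pmf(y, lam):
    """Usual Poisson probability function: lam^y * exp(-lam) / y!."""
    return lam ** y * math.exp(-lam) / math.factorial(y)

def exp_family_density(y, theta, b, c, a_phi=1.0):
    """Exponential-family form (3): exp[(y*theta - b(theta)) / a(phi) + c(y, phi)]."""
    return math.exp((y * theta - b(theta)) / a_phi + c(y))

def poisson_exp_family(y, lam):
    """Poisson written in form (3) with canonical parameter theta = log(lam)."""
    theta = math.log(lam)
    # math.lgamma(y + 1) equals log(y!), so c(y) = -log(y!).
    return exp_family_density(y, theta, b=math.exp, c=lambda v: -math.lgamma(v + 1))

lam = 3.5  # hypothetical Poisson mean
pairs = [(poisson_pmf(y, lam), poisson_exp_family(y, lam)) for y in range(11)]
agree = all(math.isclose(p, q, rel_tol=1e-9) for p, q in pairs)
```

The two expressions agree up to floating-point rounding, which is all the identity claims.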

II. Basic GLM setup and assumptions

We will define the generalized linear model in terms of a set of independent random variables $Y_1, Y_2, \dots, Y_N$, each with a distribution from the exponential family of distributions. The generalized linear model has three components:

1. The random component: the response variables, $Y_1, Y_2, \dots, Y_N$, are assumed to share the same distribution from the exponential family of distributions introduced in the previous section, with $E(Y_i) = \mu_i$ and constant variance $\sigma^2$. This part describes how the response variable is distributed.
2. The systematic component: covariates (explanatory variables) $x_1, x_2, \dots, x_p$ produce a linear predictor $\eta$ given by $\eta = \sum_{j=1}^{p} x_j \beta_j$.
3. The link function, $g$, between the random and systematic components. It describes how the covariates are related to the random component, i.e. $\eta = g(\mu)$, where $\mu = E(Y)$ and $g$ can be any monotone differentiable function.

An important aspect of generalized linear models that we need to keep in mind is that they assume independent (or at least uncorrelated) observations. A second important assumption is that there is a single error term in the model. This corresponds to one of the assumptions made for the linear model, namely that the only error in the model has to do with the response variable: we will assume that the $X$ variables are measured without error. In the case of generalized linear models we no longer assume constant variance of the residuals, although we still have to know how the variance depends on the mean. The variance function $V(\mu)$ describes how the variance of $Y_i$ is related to its mean $\mu_i$. The form of this function is determined by the distribution that is assumed.

III. Goodness of fit and comparing models

Overview

A very important aspect of generalized linear models, and indeed of all statistical models (see the earlier lectures, up to Lecture 4), is to evaluate the relevance of our model for our data and how well it fits the data. In statistical terms this is referred to as goodness of fit. We are also interested in comparing different models and selecting a model that is reasonably simple, but that provides a good fit. This involves finding a balance between improving the fit on one side and not unnecessarily increasing the complexity of the model on the other.

In a statistical modelling framework we perform hypothesis tests to compare how well two related models fit the data. In the generalized linear model framework, the two models we compare should have the same probability distribution and link function, but they can differ with regard to the number of parameters in the systematic component of the model. The simpler model, relating to the null hypothesis $H_0$, is therefore a special case of the other, more general model. If the simpler model fits the data as well as the more general model, it is retained on the grounds of parsimony and $H_0$ cannot be rejected. If the more general model provides a significantly better fit than the simple model, then $H_0$ is rejected in favour of the alternative hypothesis $H_1$, which corresponds to the more general model.

In order to make these comparisons we need goodness of fit statistics to describe how well the models fit. These statistics can be based on any of a number of criteria, such as the maximum value of the log-likelihood function, the minimum value of the sum of squares criterion, or a composite statistic based on the residuals. If $f_Y(y; \theta)$ is the density function for a random variable $Y$ given the parameter $\theta$, then the log-likelihood based on a set of independent observations of $Y$, $y_1, y_2, \dots, y_n$, is defined as

$$l(\mu; y) = \sum_{i=1}^{n} \log f(y_i; \theta_i)$$

where $\mu = (\mu_1, \mu_2, \dots, \mu_n)$. It is important to note the subtle shift in emphasis from the density function. The density $f(y; \theta)$ is considered as a function of $y$ for fixed $\theta$, whereas the log-likelihood is primarily considered as a function of $\theta$ for the particular data observed, $(y_1, y_2, \dots, y_n)$.

In order to test the hypotheses above, sampling distributions of the goodness of fit statistics are required. In the following subsection we consider one such goodness of fit criterion, the deviance, in a bit more detail.

Finally, we will say something about the choice of scale for the analysis. It is an important aspect of the model selection process, although scaling problems are considerably reduced in the generalized linear model setup. The normality and constant variance assumptions of the linear regression model are, for instance, no longer a requirement. The choice of scale is largely dependent on the purpose for which the scale will be used.
It is also important to keep in mind that no single scale will simultaneously produce all the desired properties.

The Deviance

One way of assessing the adequacy of a model is to compare it with a more general model with the maximum number of parameters that can be estimated. It is referred to as the saturated model. In the saturated model there is basically one parameter per observation. The deviance assesses the goodness of fit for the model by looking at the difference between the log-likelihood functions of the saturated model and the model under investigation, i.e. $l(b_{sat}; y) - l(b; y)$. Here $b_{sat}$ denotes the maximum likelihood estimator of the parameter vector of the saturated model, $\beta_{sat}$, and $b$ is the maximum likelihood estimator of the parameters of the model under investigation, $\beta$. The maximum likelihood estimator is the estimator that maximises the likelihood function. The deviance is defined as

$$D = 2\{ l(b_{sat}; y) - l(b; y) \}.$$

A small deviance implies a good fit. The sampling distribution of the deviance is approximately $\chi^2(m - p, \nu)$, where $\nu$ is the non-centrality parameter. The deviance has an exact $\chi^2$ distribution if the response variables $Y_i$ are normally distributed. In this case, however, $D$ depends on $\mathrm{var}(Y_i) = \sigma^2$ which, in practice, is usually unknown. This prevents the direct use of the deviance as a goodness of fit statistic in this case. For other distributions of the $Y_i$, the deviance may only be approximately chi-square. It must be noted that this approximation can be very poor for limited amounts of data. In the case of the binomial and Poisson distributions, for example, $D$ can be calculated and used directly as a goodness of fit statistic. If the scale parameter $\phi$ is unknown or known to have a value other than one, we use a scaled version of the deviance, $D/\phi$, which we call the scaled deviance.

The deviance forms the basis for most hypothesis testing for generalized linear models. Suppose we are interested in comparing the fit of two models. These models need to have the same probability distribution and the same link function. The models also need to be hierarchical, which means that the systematic component of the simpler model $M_0$ is a special case of the linear component of the more general model $M_1$.

Consider the null hypothesis

$$H_0: \beta = \beta_0 = (\beta_1, \dots, \beta_q)^T$$

that corresponds to model $M_0$, and a more general hypothesis

$$H_1: \beta = \beta_1 = (\beta_1, \dots, \beta_p)^T$$

that corresponds to model $M_1$, with $q < p < N$. We can test $H_0$ against $H_1$ using the difference of the deviance statistics

$$\Delta D = D_0 - D_1 = 2\{ l(b_{sat}; y) - l(b_0; y) \} - 2\{ l(b_{sat}; y) - l(b_1; y) \} = 2\{ l(b_1; y) - l(b_0; y) \}.$$
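The deviance and the difference $\Delta D$ can be sketched for a small Poisson example. The counts below are made up for illustration, and the two models are chosen so that their maximum likelihood fits are available in closed form: $M_0$ has a single common mean ($q = 1$) and $M_1$ has one mean per group ($p = 2$), so no numerical optimisation is needed. All names are assumptions of this sketch.

```python
import math

def _y_log_mu(y, mu):
    # Convention: y * log(mu) is taken as 0 when y == 0.
    return 0.0 if y == 0 else y * math.log(mu)

def poisson_loglik(y, mu):
    """Poisson log-likelihood l(mu; y) = sum(y*log(mu) - mu - log(y!))."""
    return sum(_y_log_mu(yi, mi) - mi - math.lgamma(yi + 1) for yi, mi in zip(y, mu))

def deviance(y, mu):
    """D = 2{l(b_sat; y) - l(b; y)}; the saturated model fits mu_i = y_i."""
    return 2.0 * (poisson_loglik(y, y) - poisson_loglik(y, mu))

# Hypothetical counts in two groups.
group_a = [2, 3, 6, 7, 8]
group_b = [9, 10, 12, 15, 14]
y = group_a + group_b

# M0: one common mean. M1: a separate mean for each group.
mu0 = [sum(y) / len(y)] * len(y)
mu1 = [sum(group_a) / len(group_a)] * len(group_a) + \
      [sum(group_b) / len(group_b)] * len(group_b)

d0 = deviance(y, mu0)
d1 = deviance(y, mu1)
delta_d = d0 - d1  # compared with a chi-squared distribution on p - q = 1 d.f.

# Upper tail of a chi-squared distribution with 1 degree of freedom.
p_value = math.erfc(math.sqrt(delta_d / 2.0))
```

The saturated log-likelihood cancels in the difference, so `delta_d` equals twice the difference of the two fitted log-likelihoods, as in the expression for $\Delta D$ above.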

If both models describe the data well, then $D_0$ follows a $\chi^2(N - q)$ distribution and $D_1$ follows a $\chi^2(N - p)$ distribution. It then follows that $\Delta D$ has a $\chi^2(p - q)$ distribution under certain independence assumptions. If the value of $\Delta D$ is consistent with the $\chi^2(p - q)$ distribution we would generally choose the model $M_0$ corresponding to $H_0$ because it is simpler. If the value of $\Delta D$ is in the critical region of the $\chi^2(p - q)$ distribution we would reject $H_0$ in favour of $H_1$ on the grounds that model $M_1$ provides a significantly better description of the data. It must be noted that this model too may not fit the data particularly well.

If the deviance can be calculated from the data, then $\Delta D$ provides a good method for hypothesis testing. The sampling distribution of $\Delta D$ is usually better approximated by the chi-squared distribution than is the sampling distribution of a single deviance.

Model checking

As was the case for the simple linear model, we should perform model checking after we have fitted a particular generalized linear model. As before, we should look at whether the model is reasonable and we should investigate whether the various assumptions we make when we fit the model and draw inference from it are satisfied. If the checks and investigations do reveal that there is a problem, then there are a number of different solutions available to us. These were discussed in the lecture notes on model checking for linear models.

In this section we discuss graphical techniques available to us for trying to detect systematic departures from our generalized linear model that may, for example, be the result of an incorrectly specified link function or variance function, or a misspecification of the explanatory variables in the systematic component of the model.

As was the case for the linear model, many of these graphical techniques entail using the residual values from our fitted model. For generalized linear models we require an extended definition of residuals, applicable to distributions other than the Normal distribution.
Many of these residuals are discussed in detail in McCullagh and Nelder (1989). These residuals can be used to assess various aspects of the adequacy of a fitted generalized linear model that we have mentioned above. The residuals should be unrelated to the explanatory variables and they can also be used to identify any unusual values that may require some further investigation. Various plots of the residuals can be used to assess these properties. A systematic pattern in the residual plots can, for example, be indicative of an unsuitable link function, a wrong scale for one or more of the predictors, or the omission of a quadratic term in a predictor.

Examples of extended definitions of residuals that are widely used in model checking for GLMs include the Pearson and deviance residuals. The Pearson residuals are just rescaled versions of the raw or response residuals and are defined as

$$r_{P,i} = \frac{y_i - \hat\mu_i}{\sqrt{V(\hat\mu_i)}}.$$

Here $V(\mu)$ is the variance function. The name is taken from the fact that for the Poisson distribution the Pearson residual is just the signed square root of the component of the Pearson $X^2$ goodness-of-fit statistic, so that $\sum_i r_{P,i}^2 = X^2$.

If the deviance is used as a measure of discrepancy of a generalized linear model, then each unit contributes a quantity $d_i$ to the deviance, so that $D = \sum_i d_i$. The deviance residual is defined as

$$r_{D,i} = \mathrm{sign}(y_i - \hat\mu_i)\sqrt{d_i}.$$

Most often standardised versions of the above residuals are used in model checking. The standardised versions of the Pearson and deviance residuals are given by

$$r'_{P,i} = \frac{y_i - \hat\mu_i}{\sqrt{\hat\phi\, V(\hat\mu_i)(1 - h_i)}} \qquad \text{and} \qquad r'_{D,i} = \frac{r_{D,i}}{\sqrt{\hat\phi\,(1 - h_i)}}$$

respectively. In the above, $h_i$ is the equivalent of the leverage that we have defined for the linear model. In general the deviance residual, either unstandardized or standardized, is preferred to the Pearson residual.

Below we will discuss a few basic plots of the (standardized) residuals that can be used to check the validity of our model.

1. Informal checks using the residuals

It is almost standard procedure to consider scatterplots of the residuals against some function of the fitted values. Scatterplots of the standardized deviance residuals against the estimated linear predictor $\hat\eta$, or against the fitted values transformed to the constant-information scale of the error distribution, are recommended for this purpose. For a few commonly used error distributions the following transformed values are recommended:

$\hat\mu$ for Normal errors,
$2\sqrt{\hat\mu}$ for Poisson errors,
$2\sin^{-1}\sqrt{\hat\mu}$ for binomial errors,
$2\log\hat\mu$ for gamma errors.

The residuals in this plot should be centred at zero and have a constant range. Typical deviations from this pattern include curvature in the residuals with the mean and a systematic change in the range of the residuals with fitted value. Curvature may arise from several causes, including the wrong choice of link function, the wrong choice of scale for one or more of the covariates, or the omission of a quadratic term in a covariate.

A second useful scatterplot is that of the residuals against an explanatory variable in the linear predictor. This plot should exhibit the same form as the plot above. The presence of a systematic trend usually arises for the same reasons as for the plot above. Additionally, the trend may also be the result of a faulty scale in another explanatory variable that is closely correlated with the one under investigation.

The third scatterplot that we will consider is termed an added-variable plot. This is equivalent to the partial regression plot that we considered for the linear model. This plot helps us to check whether an omitted explanatory variable should be included in the linear predictor. The added-variable plot for a particular candidate explanatory variable is formed by

(a) finding the unstandardized residuals for the existing generalized linear model with response variable $Y$ and any already included explanatory variables,
(b) finding the unstandardized residuals for another linear model in which the candidate explanatory variable is treated as the response, using the same linear predictor as for $Y$, and
(c) plotting the first set of residuals against the second set.

The presence of a trend in the points in this plot indicates that you might consider including the particular candidate variable in the model as an explanatory variable, and the shape of the trend can tell you what forms of the variable you might include.

2. Checking the variance function

A plot of the absolute residuals against the fitted values gives an informal check on the adequacy of the assumed variance function. An ill-chosen variance function will result in a trend in the mean.
A positive trend indicates that the current variance function is increasing too slowly with the mean, and vice versa.

3. Checking the link function

An informal check involves examining the plot of the scale-adjusted dependent variable against $\hat\eta$, the estimated linear predictor. This should approximately be a straight line. For link functions of the power family an upward curvature in the plot points to a link with higher power than that used. A downward curvature points to a lower power. For binary data this plot is uninformative. McCullagh and Nelder (1989) discuss more formal methods for this situation.

Note that checks for the link function are affected by failure to establish the correct scales for one or more of the explanatory variables in the linear predictor. This can be validated using partial residual plots, which are described below.

4. Checking the scales of explanatory variables

The partial residual plot is a useful tool for checking whether the correct scales have been used for the explanatory variables. In its generalized form the partial residual is defined by

$$u = z - \hat\eta + \hat\gamma x$$

where $z$ is the adjusted dependent variable, $\hat\eta$ the fitted linear predictor and $\hat\gamma$ the parameter estimate for the explanatory variable $x$. The plot of $u$ against $x$ provides an informal check of whether the scale of $x$ is satisfactory. A correctly specified scale should result in an approximately linear plot. The form of the plot may suggest a suitable alternative if the scale is not appropriate. Note that distortions in this plot may also occur when the scales of other explanatory variables are wrong, which may require that we look at partial residual plots for several explanatory variables.

5. Checks for outlying or influential points

We also need to check for individual points that may differ from the general pattern set by the remainder of the points. We defined the leverage $h_i$ for individual points in the context of the linear model as a measure by which to judge their influence on the fit. We can consider these measures for GLMs as well, but we must note that a point at the extreme of the explanatory variable's range will not necessarily have a high leverage if its weight is small.

Cook's distance was introduced as a measure of influence for the linear model in a previous lecture. Adapted versions of Cook's distance can be used for generalized linear models.

The Studentized residuals

$$e_i^* = r_i^* = \frac{Y_i - \hat Y_{(i)}}{\widehat{se}(Y_i - \hat Y_{(i)})}$$

that were introduced in the context of the linear model can also be used to assess the consistency of individual points. One-step approximations of these residuals exist that are appropriate for GLMs.

To interpret the $n$ values that we get for the leverage, Cook's distance and Studentized residuals respectively, we need some measure to assess how large extreme values would be in a sample of a given size even if no unusual points were present. Normal plots can be used for this purpose. There are two forms of Normal plots: the half-Normal plot and the Normal plot. We will not go into the theoretical detail underlying these plots.
We will only mention that the half-Normal plot is appropriate for non-negative quantities like the leverage and Cook's distance, while for the Studentized residuals there are two options: either a half-Normal plot of $|r^*|$ or a full Normal plot of $r^*$ itself. For either plot the ordered values of the statistic are plotted against the expected order statistics of a Normal sample. Extreme points will appear at the extremes of the plot, and may possibly deviate from the trend indicated by the remainder of the points. Note that this trend is not necessarily linear in the case of the leverage or Cook's distances.
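The residual definitions and the half-Normal plot described in this section can be sketched together for a Poisson model, where $V(\mu) = \mu$ and $\phi = 1$. The sketch computes unstandardized Pearson and deviance residuals (no leverage adjustment) and pairs the ordered absolute deviance residuals with approximate expected half-Normal order statistics. The data, the fitted means, the function names and the plotting-position formula $\Phi^{-1}\{(i + n - 1/8)/(2n + 1/2)\}$ are assumptions of this sketch; the notes do not specify an approximation.

```python
import math
from statistics import NormalDist

def pearson_residual(y, mu):
    """(y - mu) / sqrt(V(mu)) with the Poisson variance function V(mu) = mu."""
    return (y - mu) / math.sqrt(mu)

def deviance_residual(y, mu):
    """sign(y - mu) * sqrt(d), where d is the unit's Poisson deviance contribution."""
    y_log = 0.0 if y == 0 else y * math.log(y / mu)
    d = 2.0 * (y_log - (y - mu))
    return math.copysign(math.sqrt(max(d, 0.0)), y - mu)

def half_normal_positions(n):
    """Approximate expected order statistics of |Z| for a Normal sample of size n."""
    std_normal = NormalDist()
    return [std_normal.inv_cdf((i + n - 0.125) / (2 * n + 0.5)) for i in range(1, n + 1)]

# Hypothetical counts with an intercept-only fit (every fitted mean is the sample mean).
y = [2, 3, 6, 7, 8]
mu = [sum(y) / len(y)] * len(y)

r_p = [pearson_residual(yi, mi) for yi, mi in zip(y, mu)]
r_d = [deviance_residual(yi, mi) for yi, mi in zip(y, mu)]

# Points for a half-Normal plot: expected order statistic against ordered |r_D|.
plot_points = list(zip(half_normal_positions(len(r_d)), sorted(abs(r) for r in r_d)))
```

Points in `plot_points` that sit well away from the trend of the rest would be the ones to examine further; standardized residuals would additionally need the leverages $h_i$ from a fitted model.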

6. Checks for correlations in the errors

An index plot of the residuals can be used to assess correlation in the residuals. If the residuals are independent this plot should fluctuate randomly without systematic pattern. If the residuals are correlated, special modelling methods are needed.

K. Javaras and W. Vos (00)

References

Davison, A. C. and Snell, E. J. (1991). Residuals and diagnostics. Chapter 4 of Hinkley et al. (1991).
Dobson, A. (1990). An Introduction to Generalized Linear Models (2nd ed.). Boca Raton, FL: Chapman and Hall/CRC.
Hinkley, D. V., Reid, N. and Snell, E. J., eds (1991). Statistical Theory and Modelling: In Honour of Sir David Cox. London: Chapman & Hall.
McCullagh, P. and Nelder, J. A. (1989). Generalized Linear Models (2nd ed.). London: Chapman and Hall.

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

Diagnostics in Poisson Regression. Models - Residual Analysis

Diagnostics in Poisson Regression. Models - Residual Analysis Dagnostcs n Posson Regresson Models - Resdual Analyss 1 Outlne Dagnostcs n Posson Regresson Models - Resdual Analyss Example 3: Recall of Stressful Events contnued 2 Resdual Analyss Resduals represent

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

Kernel Methods and SVMs Extension

Kernel Methods and SVMs Extension Kernel Methods and SVMs Extenson The purpose of ths document s to revew materal covered n Machne Learnng 1 Supervsed Learnng regardng support vector machnes (SVMs). Ths document also provdes a general

More information

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition) Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.

More information

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests Smulated of the Cramér-von Mses Goodness-of-Ft Tests Steele, M., Chaselng, J. and 3 Hurst, C. School of Mathematcal and Physcal Scences, James Cook Unversty, Australan School of Envronmental Studes, Grffth

More information

Polynomial Regression Models

Polynomial Regression Models LINEAR REGRESSION ANALYSIS MODULE XII Lecture - 6 Polynomal Regresson Models Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Test of sgnfcance To test the sgnfcance

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

The Ordinary Least Squares (OLS) Estimator

The Ordinary Least Squares (OLS) Estimator The Ordnary Least Squares (OLS) Estmator 1 Regresson Analyss Regresson Analyss: a statstcal technque for nvestgatng and modelng the relatonshp between varables. Applcatons: Engneerng, the physcal and chemcal

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

Composite Hypotheses testing

Composite Hypotheses testing Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter

More information
