BAYESIAN APPROACH FOR SELECTION BIAS CORRECTION IN REGRESSION. Labeed Mokatrin. Submitted to the. Faculty of the College of Arts and Sciences

Size: px
Start display at page:

Download "BAYESIAN APPROACH FOR SELECTION BIAS CORRECTION IN REGRESSION. Labeed Mokatrin. Submitted to the. Faculty of the College of Arts and Sciences"

Transcription

1

2 BAESIAN APPROACH FOR SELECTION BIAS CORRECTION IN REGRESSION By Labeed Mokatrn Submtted to the Faculty of the College of Arts and Scences of Amercan Unversty n Partal Fulfllment of the Requrements for the Degree of Doctor of Phlosophy In Statstcs Char: Jun Lu Ph.D. Mary Gray Ph.D. Dean of the College of Arts and Scences Monca Jackson Ph.D. Date 0 Amercan Unversty Washngton D.C. 006

3 BAESIAN APPROACH FOR SAMPLE SELECTION BIAS CORRECTION IN REGRESSION B Labeed Mokatrn ABSTRACT Selecton bas occurs when samples are self-selected rather than randomly selected from the target populaton. Ths s a well-known problem and has been extensvely studed n research studes n statstcs and economcs. In ths work I adopt a Bayesan approach to correct sample selecton bas under the self-selecton setup proposed n Heckman model. Bayesan methods treat the populaton parameters of nterest as random varables nstead of unknown constants. The dstrbutons of these random parameters are called pror dstrbutons. Statstcal nference s based on the posteror dstrbuton whch combnes nformaton from the data and the pror. Markov Chan Monte Carlo MCMC methods are used for Bayesan computaton of the posteror dstrbutons. The results from the proposed Bayesan method are compared to that of Heckman s two-step estmaton va varous smulaton studes. A comprehensve smulaton study s conducted where varous scenaros are consdered for the smulaton setup and desgn. Furthermore n addton to the most common self-selecton setup the new approach s extended to handle self-selecton wth Bnary outcome model.

4 ACKNOWLEDGMENTS Frst I would lke to thank my dssertaton advsor Dr. Jun Lu. Hs advce nsghtful crtcsms and patent encouragement aded the wrtng of ths dssertaton n nnumerable ways. I would also lke to thank the commttee members Dr. Mary Gray and Dr. Monca Jackson. Ths work would not have been possble wthout ther valued advce and suggestons. Dr. Gray s steadfast support was greatly needed and deeply apprecated. I would lke to thank the faculty and staff members of the department of Mathematcs and Statstcs at AU n partcular Dr. Behzad Jalala and Lnda Greene for all ther help. I am forever grateful to my famly Shreen Adam and Rany for ther contnued support and encouragement. The numerous sacrfces they have made over the last few years allowed me to reach mportant mlestones n my professonal career. Fnally I would lke to extend my grattude to my mom and dad for all ther support and enthusasm for my educaton throughout my lfe.

5 TABLE OF CONTENTS ABSTRACT... ACKNOWLEDGMENTS... LIST OF TABLES... v LIST OF llustratons... v Chapter. INTRODUCTION... The Problem of Sample Selecton Bas... Heckman s Two-Step Method... Lmtatons of Heckman s Method and Known Alternatves... 3 Bayesan Approach SELF-SELECTED SAMPLING MODEL AND BIAS... 6 Self-Selected Samplng Model... 6 Self-Selected Samplng Bas... 9 MLE and Heckman Correcton MODEL DEVELOPMENT... 7 Bref Introducton to Bayesan Methods and MCMC... 7 Mssng Values and Latent Varables... 0 Pror Dstrbuton of Lnear Component... Samplng Scheme of Lnear Component SIMULATION STUD... 5 v

6 Women Wage Example... 5 Comprehensve Smulaton Study CASE STUD: PLACEMENT EAM AND MATH ACHIEVEMENT BINAR SELECTIVIT MODEL Generalzed Lnear Model Bayesan Estmaton Data Example DISCUSSION REFERENCES v

7 LIST OF TABLES Table. Women Wage Data Varable Descrpton Estmaton Usng OLS and Heckman Models Estmaton Usng Bayesan Methods Smulaton Results for Selecton Rate Smulaton Results for Correlaton Level Smulaton Results for Multcollnearty Smulaton Results for Sample Sze Varable Descrptons for Placement Exam Data Estmaton Results for Placement Exam Data Estmaton Results for Bnary Selectvty v

8 LIST OF ILLUSTRATIONS Fgure. Sample Paths for Comprehensve Smuaton Densty Plots for Comprehensve Smuaton Movng Averages for Comprehensve Smuaton Sample Paths for Placement Exam Example Densty Plots for Placement Exam Example Movng Averages for Placement Exam Example Sample Paths for Bnary Selectvty Model Densty Plots for Bnary Selectvty Model Movng Averages for Bnary Selectvty Model v

9 CHAPTER INTRODUCTION The Problem of Sample Selecton Bas In some socology and economcs studes samples are self-selected rather than randomly selected from the target populaton. Bas can occur when usng self-selected samples because the selecton crtera are often correlated wth the varables of nterest. Such bas s often called the Sample Selecton Bas. For example Mansk & Wse 983 studed the relatonshp between SAT scores and potental college achevement. The researchers could only sample from students who were already admtted to college but not from all students who could potentally go to college. In ths case students who scored well n the SAT were more lkely to attend college and hence are more lkely to be selected nto the sample. Another example s the study of women s educaton background and ther earnngs Heckman 979. Samples were selected from women wth labor force partcpaton. However ndvduals only jon the labor force f ther potental earnngs or occupatonal status meet some crtera. As a result samplng from women n labor force gnored the women who had low potental earnngs. Heckman s Two-Step Method Heckman 976 rased the ssue of sample selecton bas when a dependent

10 varable n the regresson has values that are mssng not at random. He proposed to estmate the full nformaton maxmum lkelhood FIML by way of a two-step method. Ths method s called a Lmted Informaton Maxmum Lkelhood LIML. FIML s a well-known econometrc technque for estmatng equaton models n whch the parameters of all equatons are estmated smultaneously wth all the nformaton n the model Maddala 977. Smlar to FIML LIML s a maxmum lkelhood bass for estmatng one structural equaton or a proper subset of structural equatons from a system of equatons Anderson and Rubn n 949. Heckman 976 dscussed the common structure of statstcal models of lmted dependent varable as well as a smple estmator for ths model. He presented a unfed summary of statstcal model selecton and lmted dependent varables. Heckman 979 proposed a soluton to sample selecton bas usng the two-step estmator method. Accordng to ths method n the frst step we use probt regresson to model the sample selecton process. A new varable called the Inverse Mlls Rato s calculated based on the probt regresson results. In the second step we add the Inverse Mlls rato to the regresson analyss as an ndependent varable and smply use Ordnary Least Squares OLS to estmate the regresson coeffcents. Heckman s two-step estmaton procedure s easy to mplement. It has been well recognzed n the appled felds such as economcs and socology as a correcton for sample selecton bas. Examples of the applcaton of Heckman s two-step estmaton can be found n for example Mroz 987 Nawata 994 and Leung and u 996.

11 3 Lmtatons of Heckman s Method and Known Alternatves Dscussons of Heckman s two-step estmaton and other approaches to sample selecton bas present themselves readly n the lterature n the last two decades. Wnshp and Mare 99 dscussed the dffcultes and lmtaton of several sample selecton bas correcton technques. They show how self-selecton leads to based estmates n regresson revew models that have been proposed dscuss Heckman's estmator and ts lmtatons and dscuss other approaches to selecton such as nonparametrc approaches to estmatng selecton models. They suggest that when selecton s an ssue researchers should present estmates usng a varety of methods because the results may depend on the method used. Nawata 993 analyzes methods for estmatng models wth selecton bas by comparng Maxmum Lkelhood Estmaton MLE and Heckman's two-step estmator wth Monte Carlo experments. The results show that Heckman's two-step estmator can perform well when there s no multcollnearty between the Inverse Mlls Rato and the explanatory varables. However t wll perform relatvely poorly when multcollnearty exsts and MLE becomes more effcent. Stolzenberg and Relles 997 provde mathematcal tools to assst ntuton about selecton bas n concrete emprcal analyss. They ndcate that there s no general soluton to the selecton bas problem but they present a new decomposton of selecton bas. In ths decomposton the analyst should be able to develop ntuton and make reasonable judgments about the source severty and drecton of sample selecton bas n a partcular analyss. The authors also lst several bas correcton procedures that are

12 4 avalable. They suggest that the safest approach to sample selecton bas problem s frst to understand how nonrandom selecton occurs n the data. If the data seem to be selected as descrbed by Heckman then t s approprate to use Heckman's two-step model. Pahn 000 dscusses Monte Carlo studes of Heckman s correcton and llustrates a crtque of Heckman's estmator. He ndcates that the explanatory varables n Heckman s two-step model may have a large set of varables n common whch causes collnearty wth the Inverse Mlls Rato. Pahn concludes that we should dagnose collnearty problems before decdng whch estmator to use. If there s no collnearty between the regressors and the Inverse Mlls Rato the Heckman two-part model s the most robust approach. On the other hand f collnearty problems exst the MLE approach s preferable to Heckman s two-step method. Bayesan Approach In ths study we propose a Bayesan approach to correct sample selecton bas under the self-selecton setup proposed n Heckman 979. Bayesan methods treat the populaton parameters of nterest as random varables nstead of unknown constants. The dstrbutons of these random parameters are called pror dstrbutons. Often both expert knowledge and mathematcal convenence play a role n selectng a partcular type of pror dstrbuton. Statstcal nference s based on the posteror dstrbuton whch combnes nformaton from the data and the pror. We use Markov Chan Monte Carlo MCMC methods for Bayesan computaton of the posteror dstrbutons n ths study. We also compare the performance of the proposed Bayesan method and that of Heckman s two-step estmaton va smulaton study.

13 5 In the next chapter we gve a more detaled ntroducton to the sample selecton problem and Heckman's two-step estmaton. Usng the same assumpton on the data collecton as used by Heckman a Bayesan model to correct the selecton bas s ntroduced n chapter three. A smulaton study s presented n chapter four where we demonstrate the proposed Bayesan method and compare the estmates from varous approaches usng the Women Wage data example and also by usng a comprehensve smulaton study where varous scenaros wll be consdered for the smulaton setup and desgn. In chapter fve we wll apply the Bayesan model to a real-world data example by usng AU students placement exam data. In chapter sx we wll extend the proposed Bayesan method to the Generalzed Lnear Model GLM.

14 CHAPTER SELF-SELECTED SAMPLING MODEL AND BIAS Self-Selected Samplng Model A famous example of sample selecton bas s the estmaton of the wage equaton Pahn 000. When tryng to estmate the results of schoolng on the wage rate the researcher faces the problem that some ndvduals who have receved schoolng do not work. These ndvduals have not receved an offer that meets ther reservaton wage. If we assume a postve relatonshp between schoolng and wages people wth lttle schoolng wll on average have a lower offered wage and therefore a lower employment rate than those wth more years of schoolng. But we only observe the wage offers whch exceed an ndvdual s reservaton wage. As a consequence we only observe the wages of those people wth few years of schoolng that receve comparatvely hgh wage offers. In ths case there s self-selected samplng and the OLS estmate s based. In ths example smple OLS regresson of wages on years of schoolng wll lead to bas estmates because the sample workng people s unrepresentatve of the populaton one s nterested n all people who have receved schoolng. The selecton problem can be vewed as a problem of mssng observatons except that they are not mssng at random. A lnear regresson model wth self-selecton samples can be presented usng the followng two-equaton model: 6

15 7 u We call Equaton the observaton equaton and Equaton the selecton equaton. In the prevous example ndvduals who are only able to acheve low wage rate gven ther level of schoolng wll decde not to work. Therefore the probablty that ther offered wage s below ther reservaton wage s hghest. In other words and u can be expected to be postvely correlated whch causes sample selectons bas. When observatons are mssng at random Equaton can stll be estmated by OLS. Typcally there are three causes of non-randomly mssng observatons: censorng truncaton or self-selected samplng. A sample s censored when observatons on are not avalable n some range and are reported at a cutoff value but the explanatory varables are all avalable. When observaton on the are also unavalable the sample s sad to be truncated. When self-selected samplng occurs observatons on are recorded only f another varable value. In ths artcle we dscuss self-selected samplng. s observed only f takes on a value above or below some cutoff s greater than a cutoff value C meanng the th subject s selected. Wthout loss of generalty we can assume the cutoff C to be 0. From equaton the populaton regresson functon s E [ 3 ]

16 8 The regresson functon for the ncomplete sample s E[ selecton rule] E[ selecton rule] E[ u ] 4 The last term n Equaton 4 s equal 0 f and u are uncorrelated and not equal 0 otherwse. two scenaros: Dependng on whether s drectly observed or not we consder the followng Assumng Scenaro A s fully observed we have u 5 mssng f 0 Otherwse 6 Assumng Scenaro B s not fully observed then we observe a dummy D 0 f 0 Otherwse Hence we can wrte the observaton equaton as: mssng f Otherwse D D where In practce model B s used more often than model A. We descrbe the bas that arses from each model n the next secton. 7 8

17 9 Self-Selected Samplng Bas Consder scenaro A. The regresson functon for the subsample where the data are avalable can be wrtten as: u E u E E E 9 Assumng u has a bvarate normal dstrbuton ~ N u 0 hence. exp z z u f Then the correcton wll have the followng form E Now consder scenaro B. The regresson functon for the subsample where the data are avalable can be wrtten as: 0 0 u E u E E D E 3 then the correcton wll have the followng form D E 4 where and are the standardzed normal densty and dstrbuton functons respectvely.

18 We can wrte Equaton 4 as: E w 5 D equaton 5 hghlghts the omtted varable that causes OLS estmaton of Equaton to be based. The varable s the hazard rato or the Inverse Mlls Rato. For both scenaros A and B Equatons and 4 show that the estmated wll be unbased when s uncorrelated wth selecton process s "gnorable". 0 u 0 so that the data are mssng randomly or the In general assume that and u follow a jont dstrbuton functon f u where θ s a fnte set of parameters. Applyng the Bayes rule we can wrte: f u d du E[ u ] f u d du 6 Here could be a nonlnear functon of and the parameters θ. Ths means that the condtonal expectaton of gven and the probablty that s observed wll be equal to the usual regresson functon plus a nonlnear functon of the selecton equaton regressors Equaton 4. that has non-zero mean as we showed n Therefore when estmatng we can conclude that the estmated ntercept wll be based because the mean of the resduals s not zero. Also f and are not completely uncorrelated.e. they have varables n common or they are correlated the estmated slope coeffcent wll be based because there s an omtted varable n the regresson namely that s correlated wth the ncluded varable. We can see that even f and are ndependent the fact that the data s nonrandomly mssng wll ntroduce heteroskedastcty to the error term so OLS s not fully effcent.

19 MLE and Heckman Correcton There are two major exstng approaches for estmatng the self-selected sample model under the assumpton of bvarate normal. The frst method s FIML and the second s Heckman's well-known two-step procedure. We dscuss each of these methods and adopt Heckman s method as a benchmark for smulaton comparsons because of ts popularty. We wll consder scenaro B n the selecton stage for both methods. In practce t s more common to assume that s not fully observed. In the maxmum lkelhood approach we specfy a complete model setup as n Equatons and and we assume the followng jont dstrbuton for u 0 ~ N u 0 7 We typcally assume a bvarate normal dstrbuton wth zero and means and correlaton. There s no generally accepted name for ths model. The restrcton s used to smplfy the calculatons of the lkelhood functon. We dvde the observatons nto groups accordng to the type of data observed. Each group of observatons wll have a dfferent form for the lkelhood. For example for the sample selecton model there are two types of observatons: those where >0 and those where s not observed and we know that 0. For those where 0. For these observatons the lkelhood functon s the probablty of the jont event and 0. We can wrte ths probablty for the th observaton as the followng:

20 0 0 du u du u f u P f P f P 8 Thus the probablty of an observaton for whch we see the data s the densty functon at the pont multpled by the condtonal probablty dstrbuton for gven the value of that was observed. For those where s not observed and we know that 0. For these observatons the lkelhood functon s just the margnal probablty that 0. We have no ndependent nformaton on. Ths probablty s wrtten as 0 u P P 9 therefore f we assume the frst N observatons have 0 and the rest have 0 then the log lkelhood for the complete sample of observatons s the followng:

21 3 N log L ; data [ log log N0 log N 0 log ] 0 In the above log lkelhood there are N 0 observatons where we do not observe and there are N observatons where we do observe. Then N 0 N N. The parameter estmates for the sample selecton model can be obtaned by maxmzng ths lkelhood functon wth respect to ts arguments. As an alternatve to MLE Heckman 979 developed a two-step model that s wdely used for sample selecton bas. Heckman s model s based on two latent dependent varables. The steps of Heckman's estmaton are: a Estmate n Equaton usng a probt model; b Use the estmated to calculate E u ; c Estmate n equaton 5 by replacng E u wth. Estmaton of Equaton 5 by OLS gves consstent parameter estmates but specal formulas are needed to get correct standard errors because the errors V are correlated.

22 4 If 0 the usual formula provdes a consstent estmate of the covarance matrx of the parameters n the second-stage regresson. Heckman suggests that we use the t-test of the coeffcent on the varable as a test of sample selecton bas. Melno 98 shows that ths represents the optmal test of selectvty bas under the mantaned dstrbutonal assumptons as t s based on the same moment as the Lagrange multpler test. That s both the Lagrange multpler test and the t-test for the coeffcent on are based on the correlaton between the errors n the prmary equaton and the errors from the selecton equaton. Note that the Inverse Mlls Rato s the error from the probt equaton explanng selecton. In other words Heckman s proposal s to estmate the Inverse Mlls Rato n a probt model and then estmate Equaton 5 by OLS to obtan consstent estmates of and Although Heckman s two-step procedure gves a consstent estmator varous papers crtcze ts small sample propertes. Many clams were that the predctve power of subsample OLS or the two-step model s at least as good as the one of Heckman s procedure or MLE. Here the two-step model gves the condtonal expectaton of wages. Daun 984 contends that the condtonal expectaton s of nterest to us. In addton we nterpret the coeffcent of the two part model wth the same way that we estmate the

23 wage equaton by subsample OLS. Stolzenberg and Relles 990 provde evdence that the hgher the correlaton between the error terms the greater the superorty of the maxmum lkelhood and maybe OLS estmator over Heckman procedure n terms of effcency. The most mportant lne of crtcsm of Heckman s procedure s based on practcal rather than theoretcal grounds. If the set of 5 varables that affect the wage n the wage equaton are almost dentcal wth the set of varables that affect labor force partcpaton n selecton equaton then the second step of Heckman's method s only dentfed through the nonlnearty of the Inverse Mlls Rato. In many practcal cases we only observe values wthn the quas-lnear not completely lnear range of the nverse mlls rato. Then we need varables that are good predctors of labor force partcpaton and do not appear n whch are dffcult to fnd n practce. Most studes fnd that the two-step approach can be unrelable n the absence of excluson restrctons. Generally an excluson restrcton s requred to generate credble estmates: there must be at least one varable whch appears wth a non-zero coeffcent n the selecton equaton but does not appear n the equaton of nterest. If no such varable s avalable t may be dffcult to correct for samplng selectvty. Leung and u 996 conclude that ths result s due to expermental desgn. They fnd that Heckman s twostep estmator s effectve provded that at least one dsplays suffcent varaton to nduce tal behavor n the Inverse Mlls Rato. Under certan crcumstances even when ts assumptons and formal requrements are satsfed the two-step selecton bas

24 correcton s known to produce estmates that are farther from true parameter values than estmates obtaned by uncorrected OLS. Puhan 000 strongly recommends exploratory work to check for collnearty problems before decdng on whch estmator to apply. If there s no collnearty between the regressors and the Inverse Mlls Rato the Heckman two-part model s the most robust approach. On the other hand f collnearty problems exst the MLE approach s preferable to Heckman s two-step method. In the next chapter we propose a Bayesan method to estmate sample selecton bas. We study ts behavor by comparng our estmates to Heckman's estmates. 6

25 CHAPTER 3 MODEL DEVELOPMENT Bref Introducton to Bayesan Methods and MCMC The Bayesan approach s fundamentally dfferent from the conventonal approach. In the conventonal approach a sample... n s drawn from a populaton wth an unknown but fxed parameter θ. Knowledge about θ s obtaned from the observed random sample. In the Bayesan approach θ s consdered to be a random varable and ts varaton can be descrbed by a probablty dstrbuton called the pror dstrbuton. Ths s a subjectve dstrbuton based on the researcher's belef and s formulated before the data are seen hence pror. When a sample from a populaton ndexed by θ s observed the pror dstrbuton s updated wth the nformaton n the sample. The updated pror s called the posteror dstrbuton. The Bayesan approach s concerned wth generatng the posteror dstrbuton of the parameters and provdes a more complete pcture of the uncertanty n the estmaton of unknown parameters especally after the confoundng effects of nusance parameters are removed. A complete ntroducton to Bayesan analyss can be found n Lee 997 and Draper 000. The foundaton of Bayesan statstcs s Bayes' Theorem whch s used to update the posteror dstrbuton. Bayes Theorem s named after Thomas Bayes He calculated the probablty of a new event on the bass of earler probablty estmates 7

26 8 that have been derved from emprcal data. Bayes work became the bass of a statstcal technque whch s now called Bayesan statstcs or Bayes method. The basc prncple of Bayes theorem s as follows. If event A occurred the probablty that event E also occurred s E p E A p E p E A p A p E p E A p A E P In structured modelng and analyss Bayes method can be wrtten n the followng equaton. Assume we observe data y from dstrbuton wth parameter of θ and we wsh to make nference about another random varable θ where θ s drawn from some dstrbuton πθ. Then d y p y p y p y p y p y p where y s a vector of the observed data and θ s the unknown parameters. The posteror probablty condtonal on y s pθy. The pror dstrbuton s πθ and t can be nformatve or non-nformatve. An nformatve pror expresses specfc defnte nformaton about a varable. The lkelhood functon s pyθ when t s regarded as a functon of θ for a fxed * y. The pror predctve dstrbuton also called the margnal dstrbuton of y s py. Wth Bayes model we can estmate the posteror dstrbuton pθy by ntegratng the full Bayes equaton the lkelhood and pror probablty functons. For example f y represents a random sample from N then we have: exp y y p

27 9 under the non-nformatve pror: y the posteror dstrbuton s: y n y Another approach s to use MCMC smulaton to obtan the posteror dstrbuton. Metropols 953 showed how ths method helps n constructng a Markov Chan wth statonary dstrbuton. The method was generalzed by Hastngs 970 and s now wdely used to sample from analytcally ntractable probablty dstrbutons arsng n statstcs Glks 996; Robert and Casella 999. The effcency of MCMC methods s of sgnfcant practcal mportance and loosely speakng s determned by the convergence rate of the chan. In contrast to the maxmum lkelhood method the MCMC Bayesan method s useful and relable even for fnte sample szes snce convergence results depend only on the number of teratons. The man advantage of Bayesan methodology s that n the absence of much data the pror dstrbuton carres a lot of weght; but the more data that are observed the less nfluence the pror dstrbuton has on the posteror dstrbuton. The most common crtcsm of Bayesan methodology s that snce there s no sngle correct pror dstrbuton then all conclusons drawn from the posteror dstrbuton are suspect. We develop a Bayesan method for estmatng the parameters of the self-selected samplng model. We mplement MCMC methods and Gbbs samplng to facltate

28 0 computaton for the posteror estmates. We also conduct a smulaton study to determne the performance of the MCMC algorthm for varous pror dstrbutons. Mssng Values and Latent Varables Recall scenaro B n secton. and latent varable u where N x and ~ N0 ~ One can show that P D P 0 x where s the cumulatve densty functon cdf of N 0. The prors for other parameters reman the same: x ~ BVN x note that s a latent varable and t needs to be sampled. Also m... n are mssng and they wll be sampled as well. The below steps show how to sample If D then y s observed for... m gven ntal or sampled values MCMC of s and we can show that normally dstrbuted wth followng condtonal mean and condtonal varance: ; y D ~ N x x and 0 We can sample 0. from truncated Normal by the above equaton and

29 If D 0 then y s observed for m... n hence sample x ~ BVN x wth the restrcton 0 snce D 0. One way to do ths s to generate jontly untl we get a sample wth 0. Now recall scenaro B s fully observed and s not. The jont dstrbuton s: x ~ BVN x for mssng m... n one can sample ; ~ N x x Pror Dstrbuton of Lnear Component Consder the multvarate regresson model for... n where s an m-vector of dependent varables for unt ; s an m p matrx of ndependent varables for unt β s a p-vector of regresson coeffcents; and s the error term. The error terms are mutually ndependent random varables from a multvarate normal dstrbuton wth mean zero and covarance matrx Σ ~ 0. A well-accepted Bayesan approach s to consder the normal dstrbuton as the pror of β s because t s qute flexble. We wll consder the pror dstrbutons: ~ N p ~ IW m df H N m

30 Where IW m df H s the m -dmensonal nverted Wshart dstrbuton wth df pror degrees of freedom and scale parameters H wth densty functon: p H H p m m p p p 4 exp m tr p m Now recall Heckman's sample selecton model Equatons and : u for the Bayesan analyss of the above model we assume the jont dstrbuton of and u s bvarate normal: u ~ N0 where ρ s the correlaton coeffcent. To compute ths model we need to provde the pror dstrbuton usng β ~ N p ~ N p q snce ρ s unknown we use the Inverse Wshart dstrbuton: ~ IW m df H

31 3 Samplng Scheme of Lnear Component Consderng the multvarate normal dstrbuton ~ N where ~ N And ~ the condtonal posteror of β can be calculated usng f y f f the condtonal posteror s proportonal to the exponent part so we can wrte: ] exp[ ] exp[ ] exp[ ] exp[ T T T T f takng the frst and second dervatves wth respect to β we can fnd the condtonal precson matrx Ω. Thus the frst dervatve: f f T and the second dervatve: f f T T therefore the varance or the condtonal precson matrx s:. T settng the frst dervatve equal to zero we can calculate the mean 0 T T T T T T we can wrte: T T T T T thus T T T T T T and

32 4 T therefore ] [ ~ MVN f T the above posteror mean s a weghted average of the data and the pror mean wth weghts gven by the data pror precson matrces and. The condtonal posteror of can be calculated usng y f f thus the condtonal posteror can be wrtten as: m y f f exp m T m x y x y the above can be expressed as: ] exp[ H x y x y tr m T df m n also we can wrte: ] exp[ H S m df where m T x y x y S ths leads to ~ H S m df Wshart The model developed n ths secton wll be used to analyze data examples and smulated data n the next chapter. We show the results of MCMC smulatons carred out and evaluate samplng propertes of the estmators dscussed n ths secton.

33 CHAPTER 4 SIMULATION STUD In ths chapter we apply the proposed Bayesan approach by conductng two smulatons studes usng MCMC methods and Gbbs samplng to facltate computaton of posteror estmates. The frst smulaton study uses an artfcal data example to determne the performance of MCMC usng varous prors. The second smulaton s a comprehensve smulaton study usng a generated data wth the ablty to test varous data scenaros such as: sample mssng rate resduals correlaton multcollnearty and sample sze. Women Wage Example Dscussons n the context of labor economcs concernng labor force populaton wages and earnngs hghlght the mportance of sample selecton. One representatve example s the estmaton of women s wages. Snce we only observe the wages of women who enter the workforce our sample represents only one part of the wage offer dstrbuton. Other secondary wage groups such as marred women and teenagers are not represented. Therefore estmaton procedures may nvolve certan bas when appled to the secondary wage groups. Ths s an example of self-selected samplng bas. We propose a Bayesan MCMC algorthm to estmate parameters of a selfselected samplng model. We consder the labor force example n the STATA user manual. Ths data s used to llustrate Heckman s approach by predctng women's wages from ther educaton and age. To evaluate the performance of the proposed Bayesan approach and compare wth other methods we smulate sample selecton as t was 5

34 6 specfed n the example and perform the MCMC algorthm usng dfferent Wshart pror specfcatons. MCMC methods use smulaton of Markov chans n the parameter space. The Markov chans are defned n such a way that the posteror dstrbuton n the gves statstcal nference problem s asymptotc dstrbuton. Ths allows usng averages to approxmate the desred posterors expectatons. Several standard algorthms to defne such Markov chans exst ncludng Gbbs samplng and Metropols-Hastng. Usng these algorthms t s possble to mplement posteror smulaton n essentally any problem whch allows pontwse evaluaton of pror dstrbuton and lkelhood functons. The data contan a sample of 000 observatons of 5 varables. A bref descrpton of the varables that are relevant for our analyss s shown n Table From among the 000 observatons; we observe wage data for only 343. The remanng 657 women were not n the pad work force and so dd not receve wages. We are nterested n modelng two thngs: the decson of the women to enter the labor force and predctng women s hourly wage. We wll consder a reasonable assumpton that the women s decson to enter the labor force s a functon of age martal status the number of chldren and her level of educaton. Also the wage rate a woman earns s a functon of her age and educaton.

35 7 Table Women Wage Data Varable Descrpton Varable Name Age Educaton Marred Chldren Wage Defnton Age of the woman Number of years of educaton of the woman Dummy varable equal to f the woman s marred 0 otherwse Number of chldren that the woman has n her household Hourly wage of the woman We begn wth OLS estmaton of the regresson model usng only the observatons that have wage data. The estmates can be found n Table see `OLS- Selected Wage' row. Ths analyss would be fne f n fact the mssng wage data were mssng completely at random. However the decson to work or not work was made by the ndvdual woman. Thus those who were not workng consttute a self-selected sample and not a random sample. It s lkely that some of the women who would have earned low wages chose not to work. If so ths would account for much of the mssng wage data. Thus t s lkely that we wll over-estmate the wages of the women n the populaton. So somehow we need to account for nformaton that we have on the nonworkng women. We attempt to do ths by replacng the mssng values wth zeros for wage varable. The estmates can be found n Table see OLS- Non-mssng Wage row.

36 8 Table Estmaton Usng OLS and Heckman Models Method Parameter Parameter Estmate Standard Error Bas OLS Full Wage Intercept NA OLS Selected Wage Intercept OLS Non-Mssng Wage Intercept Heckman Intercept OLS Full Wage Educaton NA OLS Selected Wage Educaton OLS Non-Mssng Wage Educaton Heckman Educaton OLS Full Wage Age NA OLS Selected Wage Age OLS Non-Mssng Wage Age Heckman Age OLS Full Wage Inverse Mlls NA NA NA OLS Selected Wage Inverse Mlls NA NA NA OLS Non-Mssng Wage Inverse Mlls NA NA NA Heckman Inverse Mlls NA Ths analyss s also troublng. It s true that we are usng data from all 000 women but zero s not a far estmate of what the women would have earned f they had chosen to work. It s lkely that ths model wll under-estmate the wages of women n the populaton. The soluton to our quandary s to use the Heckman selecton model Heckman 979. The Heckman selecton model allows us to use nformaton from non-workng women to mprove the estmates of the parameters n the regresson model. The Heckman selecton model provdes consstent asymptotcally effcent estmates for all parameters n the model. In our example we have one model predctng wages and one model predctng whether a woman wll be workng. We wll use martal status chldren

37 9 educaton and age to predct selecton. In addton to the two equatons Heckman estmates ρ actually the nverse hyperbolc tangent of ρ the correlaton of the resduals n the two equatons and Σ actually the log of Σ the standard error of the resduals of the wage equaton. Then λ = ρσ. The estmates can be found n Table see Heckman row. Recall that we do have full wage nformaton on all 000 women. We can therefore run a regresson usng the full wage nformaton to use as a comparson. The estmates can be found n Table see OLS- Full Wage row. The Selected Wage model tends to over-estmate wages; the Non-Mssng Wage model tends to severely under-estmate wages; and the Heckman model does the best job n predctng wages. Fnally we consder the Bayesan approach to predctng women's wage from ther educaton and age. In ths approach the posteror dstrbutons are too complcated to evaluate analytcally. However by usng MCMC methods and Gbbs samplng ths posteror dstrbuton can be sampled ndrectly by generatng a sample of parameter values from the condtonal dstrbuton of nterest. Posteror Bayes estmates are then obtaned from the generated samples. We estmate the parameters usng the MCMC algorthm as the followng:. Use Bayesan approach usng many loops for Gbbs samplng / MCMC to repeatedly sample from the condtonal dstrbuton Sample latent varable n selecton stage Update mssng y s and mean predcton of y s Update and jontly Update usng Wshart pror whch leads to Wshart posteror

38 30. Repeat the above steps 500 tmes The goal s to see how ths method performs when we use dfferent prors and dfferent correlatons of the error terms n the two-model equaton. We run the algorthm for 500 teratons after convergence dscardng the frst 500 teratons. The estmates are n Table 3. Comparng the results n Table and Table 3 we fnd that the Bayesan approach s provdng estmates that are at least as effectve as the Heckman s estmates and are better than the OLS usng selected wage. The Wshart dstrbuton s an objectve pror because the posteror mean wll be affected by the pror choce. We perform our analyss under two nstances of a Wshart pror: Wsh3H & Wsh4H. Also we change the nverse scale matrx n the Wshart pror n usng dfferent values of σ: The results n Table 3 show that the parameters dd not change much compared wth the results n Table. Ths ndcates that the Bayesan approach s performng as well as the Heckman approach or even better n some scenaros. We can see good estmates for under Wshart3H wth Sgma0000. The standard error and bas are low compared to other estmates.

39 3 Table 3 Estmaton Usng Bayesan Methods Wshart Pror Inverse Scale Matrx Parameter Estmate Intercept Standard Error Bas Parameter Estmate Educaton Standard Error Bas 3 H H H H H H Age Wshart Pror Inverse Scale Matrx Parameter Estmate Standard Error Bas 3 H H H H H H Intercept Wshart Pror Inverse Scale Matrx Parameter Estmate Standar d Error Bas Parameter Estmate Educaton Standard Error Bas 4 H H H H H H Age Wshart Pror Inverse Scale Matrx Parameter Estmate Standar d Error Bas 3 H H H H H H

40 3 Comprehensve Smulaton Study Here we conduct a more comprehensve smulaton study to further nvestgate the effect of pror dstrbutons and the robustness of the Bayesan approach. Unlke the work n the prevous secton the data sets here wll be generated from several specfc condtons. More specfcally I consder the effects of the fracton of selecton correlaton between selecton of regresson models sample szes and the fracton of the ndependent varables that appear n both selecton and regresson model. The smulated data wll be analyzed by both Heckman s two-stage estmator and the Bayesan methods proposed above. The estmates from both methods wll be evaluated and compared n terms of Bas and RMSE Root Mean Square Error. The Bas of an estmator s the dfference between the estmator s expected value and the true parameter value of the parameter beng estmated. An estmator or decson rule wth zero bas s called unbased. Otherwse the estmator s sad to be based. RMSE s based on two sums of squares: Sum of Squares Total SST and Sum of Squares Error SSE. SST measures how far the data are from the mean and SSE measures how far the data are from the model s predcted vales. The data set wll be generated usng the two model equaton wth the below selfselected samples. Observaton stage: Selecton stage: u Where 0 ~ N u 0

41 33 Note that s the observaton stage and s the selecton stage. Wthout loss of generalty we wll observe y... y yn n m where z z... zn are greater than the selecton cutoff value c. Hence y... n y n ym wll be consdered mssng at the analyss step. We consder the followng scenaros at the data generaton step: Change the level or percentage of mssng values Change the value of to control the level of correlaton Change the two scalars x and x to control the level of multcollnearty Change the sample sze The data are smulated as follows: we generate desgn matrces and n observaton and screenng stage wth 3 vector for each wth ther frst rows fxed as one to make constant terms for each equaton. The two other rows n and are ndependently generated from a unform dstrbuton on [0]. I set the true parameter value for 0 as 3 respectvely and I generate random error and u from a bvarate normal densty wth zero mean and varance-covarance matrx. In each case 500 Gbbs samples were drawn the frst 500 were dscarded and the remanng 000 were used for posteror nference. I tred multple runs to ensure convergence of the results. The man smulaton ddn t nclude Thnnng strategy of reducng autocorrelaton by storng only every m th pont after the burn-n perod however we are confdent wth the results because we use dfferent settngs wth dfferent startng ponts and the results are obtaned from 00 replcatons to avod systematc mstakes. Snce 500 mght not be long enough for MCMC to converge I

42 34 selected one smulaton scenaro from one chan and ran 000 MCMC samples usng two dfferent startng values. The frst 000 were dscarded as burn-n and we appled thnnng of 5 leavng 000 effectve posteror samples to show plot behavor and how the dstrbuton converges. Convergence refers to the dea that eventually the MCMC and Gbbs Sampler that we choose dd eventually reach a statonary dstrbuton whch s also our target dstrbuton. To test the results we generated the followng dagnoss plots: trace plots to show the samplng path kernel densty to show the posteror densty functon and movng averages of posteror samples to show that samples are convergng to smlar values. One way to see f our chan has converged s to see how well our chan s mxng or movng around the parameter space. If our chan s takng a long tme to move around the parameter space then t wll take longer to converge. We can see how well our chan s mxng through vsual nspecton. We wll dscuss these nspectons for every parameter. Fgure shows the samplng paths for 0 from two dfferent startng chans. Ths fgure contans plots known as trace plots of the teraton number aganst the value of the draw of the parameters at each teraton. These plots are useful to show whether our chan s convergng to the same value or gets stuck n certan areas of the parameter space whch ndcate bad mxng. Our results show that all samples converge to the same dstrbuton and there seems to be large spread for estmates.

43 Fgure. Sample Path for Comprehensve Smulaton 35

44 36 Fgure shows the posteror densty plots for the estmates from the two MCMC chans wth the normal densty curve. The plots show strong evdence for convergence for 0 and whch s reflected n the dstrbutons. Usually non-convergence s reflected n multmodal dstrbuton and ths s especally true f the kernel densty s not just mult-modal but lumpy. Fgure 3 shows the plots of the movng averages of 0 from the two MCMC chans. The x-axs represents the number of teratons and the y-axs shows the posteror mean from these teratons. As a result all the paths are beleved to be statonary n an acceptable rang. When comparng the two settngs 0 and seem to converge to the same value farly quckly however convergence does not seem to be apparent whch s consstent wth the results n prevous studes. The followng sectons summarze the smulaton results for the varous set-ups we mentoned earler. For each sample of data generated we obtan MCMC estmates by calculatng the mean of the condtonal posteror denstes for the smulated samples RMSE and Bas. We also nclude the estmates from Heckman s method and from OLS usng all data and subset data.

45 Fgure. Densty Plots for Comprehensve Smulaton 37

46 Fgure 3. Movng Averages for Comprehensve Smulaton 38

47 39 Selecton Rate Table 4 shows the smulaton results for the regresson coeffcents usng dfferent selecton rate scenaros. To test the effect of the mssng rate level we use the followng two levels: 50% and 0%. In both scenaros we notce reducton n RMSE and n bas n most coeffcents when usng a Bayesan approach. By comparng the Bayesan method to Heckman s method we can see 64% 0.4 versus 0.05 reducton n bas for 0 n the frst scenaro and 93% reducton n Bas n the second scenaro. Ths shows sgnfcant mprovement n 0 estmates usng Bayesan methods comparng to Heckman s method when the mssng rate s low. We also notce 0% reducton n RMSE n the second scenaro for both and. The RMSE for n Bayesan s 0.7 versus 0.83 n Heckman. There s also a slght reducton n RMSE for n the second scenaro - RMSE of 0.79 n Bayesan versus 0.8 n Heckman. The results of both scenaros ndcate that the Bayesan approach s performng as well as the Heckman approach or even better.

48 40 Table 4 Smulaton Results for Selecton Rate Selecton Rate Method Parameter Mean RMSE Bas 50% OLS all data β % OLS subset β % Heckman β % Bayesan β % OLS all data β % OLS subset β % Heckman β % Bayesan β % OLS all data β % OLS subset β % Heckman β % Bayesan β % OLS all data β % OLS subset β % Heckman β % Bayesan β % OLS all data β % OLS subset β % Heckman β % Bayesan β % OLS all data β % OLS subset β % Heckman β % Bayesan β Note. Sample sze = 80 correlaton level = 0.50 and multcollnearty COR < 0..

49 4 Correlaton Level Table 5 shows the smulaton results for the regresson coeffcents usng dfferent correlaton levels. The correlaton level s controlled by the value of correlaton between the error terms cor u. To check the Bayesan approach s performance we assgn the followng three correlaton values: and The overall results show that the Bayesan approach provdes a sgnfcant reducton n RMSE and n bas when correlaton level s hgh. We see reducton n bas n the frst two scenaros 0.3 and 0.5 and less reducton for and bas when the correlaton level s hgh cor u =0.75. The results for the lowest correlaton level cor u =0.3 show slght reducton n Bas for all coeffcents wth no sgnfcant mprovement n RMSE. However sgnfcant reducton n RMSE seems to exst n the hgh correlaton scenaros.

50 4 Table 5 Smulaton Results for Correlaton Level Correlaton Method Parameter Mean RMSE Bas 0.30 OLS all data β OLS subset β Heckman β Bayesan β OLS all data β OLS subset β Heckman β Bayesan β OLS all data β OLS subset β Heckman β Bayesan β OLS all data β OLS subset β Heckman β Bayesan β OLS all data β OLS subset β Heckman β Bayesan β OLS all data β OLS subset β Heckman β Bayesan β OLS all data β OLS subset β Heckman β Bayesan β OLS all data β OLS subset β Heckman β Bayesan β OLS all data β OLS subset β Heckman β Bayesan β Note. Sample sze = 80 selecton rate = 0% and multcollnearty COR < 0..

51 43 Multcollnearty The results n Table 6 show the smulaton results for the regresson coeffcents usng three dfferent multcollnearty levels. Ths level s controlled by the level of correlaton between two explanatory varables n observaton and screenng stages [ cor x x ]. To test the effect of multcollnearty we use the followng levels: and.00. The thrd scenaro [ cor x x =] refers to the case where the screenng stage and the observaton stage contan one common explanatory varable. The Bayesan method seems to provde best results for all coeffcents when the multcollnearty level s hgh. The thrd case show large decrease n RMSE and bas wth a Bayesan approach comparng to Heckman. The Bas for 0 dropped 63% and the RMSE for dropped 5% n Bayesan estmaton when comparng to Heckman method. There seems to be no sgnfcant mprovement wth the Bayesan approach n the frst scenaro where the multcollnearty level s very low. In ths case Heckman s approach seems to be a good choce for estmaton but defntely not when a hgh level of multcollnearty exsts.

52 44 Table 6 Smulaton Results for Multcollnearty Multcollnearty Method Parameter Mean RMSE Bas 0.06 OLS all data β OLS subset β Heckman β Bayesan β OLS all data β OLS subset β Heckman β Bayesan β OLS all data β OLS subset β Heckman β Bayesan β OLS all data β OLS subset β Heckman β Bayesan β OLS all data β OLS subset β Heckman β Bayesan β OLS all data β OLS subset β Heckman β Bayesan β OLS all data β OLS subset β Heckman β Bayesan β OLS all data β OLS subset β Heckman β Bayesan β OLS all data β OLS subset β Heckman β Bayesan β Note. Sample sze = 80 selecton rate = 0% and correlaton level = 0.50.

53 45 Sample Sze Fnally Table 7 shows the smulaton results for the regresson coeffcents usng three dfferent sample szes N= Ths s another stuaton where the results of both scenaros ndcate that the Bayesan approach s performng as well as the Heckman approach or even better. The results for varous sample sze show smlar slght reducton of less than 0% n RMSE average for Bayesan method comparng to Heckman for all coeffcents. However we see sgnfcant mprovement n Bas for Bayesan method for 0. The Bas reducton n Bayesan method comparng to Heckman for 0 exceeds 50% n all scenaros. Ths comprehensve smulaton study used varous scenaros that a researcher could face when dealng wth a real data. The results proved the effectveness of the Bayesan approach and showed the lmtatons of Heckman s approach partcularly when faced wth a hgh level of multcollnearty. A detaled nvestgaton of the multcollnearty ssue can be found n Leung and u 996. They show that the degree of multcollnearty s the man decson drver to judge the approprateness of the LIML and FIML estmates n relaton to the two-part model. In emprcal analyss n wage equatons for example the standard procedure to solve the multcollnearty problem s to fnd varables that determne the probablty to work selecton equaton but not the wage rate observaton equaton drectly. Practcal examples for these varables could be the ncome of the spouse household ncome etc. However these varables are not always avalable n practcal stuatons.

54 46 Table 7 Smulaton Results for Sample Sze Sample Sze Method Parameter Mean RMSE Bas N=80 OLS all data β N=80 OLS subset β N=80 Heckman β N=80 Bayesan β N=80 OLS all data β N=80 OLS subset β N=80 Heckman β N=80 Bayesan β N=80 OLS all data β N=80 OLS subset β N=80 Heckman β N=80 Bayesan β N=0 OLS all data β N=0 OLS subset β N=0 Heckman β N=0 Bayesan β N=0 OLS all data β N=0 OLS subset β N=0 Heckman β N=0 Bayesan β N=0 OLS all data β N=0 OLS subset β N=0 Heckman β N=0 Bayesan β N=60 OLS all data β N=60 OLS subset β N=60 Heckman β N=60 Bayesan β N=60 OLS all data β N=60 OLS subset β N=60 Heckman β N=60 Bayesan β N=60 OLS all data β N=60 OLS subset β N=60 Heckman β N=60 Bayesan β Note. Selecton rate = 0% selecton level = 0.50 and multcollnearty COR < 0..

55 CHAPTER 5 CASE STUD: PLACEMENT EAM AND MATH ACHIEVEMENT In ths chapter we apply the Bayesan model to a real-world data example by usng AU students placement exam data. The goal s to nvestgate how the students placement exam scores are assocated wth ther frst year math achevement. All AU students are supposed to take the math placement exam and regster for approprate math/stat courses accordngly. The problem s that nformaton about the students frst year math achevements grades s only avalable for those who actually take and complete ther frst year math courses. We wsh to forecast outcomes n the whole pool of freshmen but are forced to rely on a subset chosen non-randomly. The data contan 0 freshmen students wth 4 varables. A bref descrpton of the varables that are relevant for our analyss s shown n Table 8. From among the 0 students we observe students grades for only 75. The remanng 60 students dd not regster or complete a math class n fall 00 and so dd not receve a grade. We are nterested to see how placement exam score are related to the student s math grade. Due to the self-selecton we need to perform accurate estmaton by correctng for sample selecton bas. The student grade s a functon of hs placement score and the recommended class. A dummy varable called Basc Level was created to determne the type of recommended math class e.g. Basc Algebra Appled Calculus etc.. If the recommended class type s classfed as basc the new dummy varable value s equal to ; otherwse the value s equal to zero. Ths new dummy varable was ncluded n the observaton stage as an explanatory varable. 47

56 48 Table 8 Varable Descrptons for Placement Exam Data Varable Name Placement Score Basc Level Grade Defnton Recommended class based on placement exam score Student s placement exam score Dummy varable equal to f the recommended class type s classfed as a basc level; otherwse the value s equal to zero Student s grade for the class We begn wth OLS estmaton of the regresson model usng only the observatons that have grade data. The estmates can be found n Table 9 n `OLS- subset' row. The estmated coeffcent shows a very small effect of placement exam score on the student s math achevement = Ths analyss would be fne f n fact the mssng grade data were mssng completely at random. However the decson to regster for a math class or not was made by the ndvdual student. Thus those who were not regstered consttute a self-selected sample and not a random sample. It s lkely that some of the students who had low placement scores chose not to regster for any math class. If so ths would account for much of the mssng grade data. Thus t s lkely that we wll over-estmate the grade of the student n the populaton. On the other hand the Heckman s method s supposed to allow us to use nformaton from non-regstered students to mprove the estmates of the parameters n the regresson model. However Heckman s method shows negatve estmates and low

57 49 negatve effect of placement exam score on the student s math achevement = Table 9 Case Study Results for Placement Exam Data Intercept β0 Score β Basc Level β Mean Std. Error Mean Std. Error Mean Std. Error OLS subset Heckman Bayesan Fnally we consder the Bayesan approach usng MCMC methods and Gbbs samplng where the posteror dstrbuton can be sampled ndrectly by generatng a sample of parameter values from the condtonal dstrbuton of nterest. Posteror Bayes estmates are then obtaned from the generated samples. In the 0000 samples the frst 000 are dscarded and we use thnnng of leavng 9000 effectve posteror samples. Fgure 4 shows the sample path for 0 usng two dfferent startng ponts. The plots show that all values converge to the same dstrbuton. Fgure 5 shows the posteror densty plot for 0 and the two settngs show normal dstrbutons. Fgure 6 shows the movng averages for 0 from the two MCMC chans. The plots ndcate that the two MCMC chans are convergng to the same value for 0 and. The Movng Averages for Basc Level from the two settngs seem not to converge to the

58 50 same value. The Bayesan estmate for n Table 9 seem to support ths result. Ths ndcates low effect of Basc Level on student s math achevement. Table 9 shows that the estmated effect of placement score on student s grade s postve when applyng the Bayesan model. Such relatonshp of placement score s not dentfed n estmates usng Heckman s method. However Basc Level seems to have less effect n the Bayesan method compared wth Heckman s method. There seems to be no mprovement n the standard errors usng the Bayesan approach for all coeffcents. The Bayesan standard errors seem to be larger than those of the other two estmaton methods Heckman and OLS.

59 Fgure 4. Sample Paths for Placement Exam Example 5

60 Fgure 5. Densty Plots for Placement Exam Example 5

61 Fgure 6. Movng Averages for Placement Exam Example 53

62 CHAPTER 6 BINAR SELECTIVIT MODEL In ths chapter the proposed Bayesan method s extended to the Generalzed Lnear Model GLM. The GLM extends the lnear regresson model n order to accommodate non-normal responses e.g. bnomal data frequency data etc. to lnear equaton va a lnk functon. Examples of GLM nclude well-known models such as logstc regresson and log-lnear models Posson regresson for frequency tables etc. Conceptually the Bayesan specfcaton s straghtforward. We need to assgn a pror for regresson coeffcents as n the prevous regresson examples. There s no closed form soluton avalable but t s smple to obtan samples from posterors va MCMC. Generalzed Lnear Model Let y...yn denote n ndependent observatons on a response varable and treat y as a realzaton of a random varable. In GLM we assume that y that s part of the exponental famly wth three man components random systematc and lnk. The random part s the dstrbuton of the observatons the systematc component s the lnear combnaton of explanatory varables and the lnk functon s the lnk between the random part and the systematc component. The exponental famly s defned as the followng: y b f y; exp c y 4 a where and are locaton and scale parameters respectvely. The mean and varance are: 54

63 55 ' '' E y b and V y b a and we assume that the expected value s a lnear functon of x. g x ' 4 where s the lnear predctor g. s the lnk functon x are the predctors and s a vector of unknown parameters regresson coeffcents. Our current model employs a lnear regresson n the observed stage and a probt/logstc regresson n the selecton stage. Let us consder U ~ N x ' whch follows where P x x ' t t / exp z The relaton s lnearzed by the nverse normal transformaton ' x x j j The cutoff value of U s fxed and the mean of U s changng wth x. p j The goal s to have probt or GLM n the observed as n the selecton stage. We wll obtan ths by usng two latent varables n each stage and we wll be able to allow correlaton between these two varables. Ths s smlar to Heckman s selecton model except that now we have a bnary outcomes n the observaton stage. Assume the followng selecton setup: dz

64 56 Otherwse f D 0 0 and ~ x N s a latent varable. 0 0 sn * or D f g Ms or D and f or D and f The probt regresson model can expressed as: * x P ~ x N s also a latent varable. Therefore the latent varables can be expressed as the followng Bvarate Normal: ~ x x BVN Bayesan Estmaton The Bayesan setup for the GLM s an extenson of the framework we have used for regresson models. Suppose we have g we need to choose a pror densty for the parameters. The posteror densty s then expressed as: ; ; ; ; y f y y f d d y f y f y where y s the margnal lkelhood of the data obtaned by ntegratng the lkelhood condtonal on the unknown regresson coeffcent and dsperson parameter across the pror densty. The jont posteror densty of unobserved and gven s N x y x y y L D L y D y * * } {

65 57 Let ] [ ' ' from the above jont posteror we now nfer condtonal posterors and mplement Gbbs sampler. We start wth samplng the condtonal posteror of from ~ * x x BVN D therefore ~ * z N D - - Truncated at left by 0 f D ~ * z N D - - Truncated at rght by 0 f 0 D where z x x n a smlar way we calculate the condtonal posteror of. We can get a result for when D and * whch also truncated normal. ~ * N D - - Truncated at left by 0 f D and * ~ * N D - - Truncated at left by 0 otherwse where x x To sample we use the followng pror N y let 00 ' ' x x and ' W and we can get the condtonal posteror functon of whch s normal densty ~ B N where ' W B and B B ' 0

66 58 can sample We can sample and teratvely by drawng gven and vce versa. We and from the posteror margnal dstrbuton at each teraton. Ths margnal dstrbuton s condtonal only on the data and not on any parameters. And then we can sample from the same posteror full condtonal dstrbuton as the followng: Sample and from ts posteror margnal dstrbuton. Sample from the same posteror full condtonal dstrbuton as descrbed prevously.

67 59 Data Example The prevous model s appled to the same data example we used n Chapter 5 AU Students Replacement Exam Score Data except that the response varable s now bnary. For each student I assgned a new varable called Pass that takes the value of f Grade>3.0 Pass and otherwse 0 Fal. I begn wth MLE estmaton for the bvarate probt model usng only the observatons that have grade data. The estmates can be found n ths frst row n Table 0. As dscussed n prevous examples t s lkely that we wll over-estmate proporton of Passed students n the populaton. Bayesan method s appled after runnng 0000 of teratons MCMC usng Gbbs Sampler wth 000 as burn-n. The results of ths method can be found n ths frst row n Table 0. Table 0 Model Results for Bnary Selectvty Intercept Score Basc Level Mean Std. Error Mean Std. Error Mean Std. Error MLE Bayesan Fgure 7 shows the sample path for 0 usng two dfferent startng ponts. The plots show that all values converge to the same dstrbuton. Fgure 8 shows the posteror densty plot for 0 and the two settngs show normal dstrbuton. Fgure

68 60 9 shows the Movng Averages for 0 from the two MCMC chans. The graph ndcates that the two MCMC chans are convergng to the same value. The results n Table 0 ndcates that the Bayesan approach s performng at least as well as the MLE approach. The Score coeffcent seems to ndcate postve relatonshp wth Grade. Ths relatonshp was reversed wth negatve coeffcent for Score n MLE method. Another advantage n the Bayesan approach seems to be n the slght reducton of the standard error for all coeffcents. In general we would expect strong postve relatonshp between the placement exam score and the fnal outcome whether the student passed or faled math class n the frst semester. Smlarly we would expect larger correlaton between Basc Level whch s based on the student s placement exam score and whether the student passed or faled math class. However these strong correlatons are not present n the data. Ths s manly due to the lmtaton of the avalable varables that we have to use n our model. The varable Score s used n both stages selecton and observaton whch leads to hgh multcollnearty. These results confrm that the parameters estmates are not very effectve when multcollnearty exsts n the model.

69 6 Fgure 7. Sample Paths for Bnary Selectvty 6

A Robust Method for Calculating the Correlation Coefficient

A Robust Method for Calculating the Correlation Coefficient A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010 Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Survey Workng Group Semnar March 29, 2010 1 Outlne Introducton Proposed method Fractonal mputaton Approxmaton Varance estmaton Multple mputaton

More information

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

Economics 130. Lecture 4 Simple Linear Regression Continued

Economics 130. Lecture 4 Simple Linear Regression Continued Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do

More information

Basically, if you have a dummy dependent variable you will be estimating a probability.

Basically, if you have a dummy dependent variable you will be estimating a probability. ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

Limited Dependent Variables

Limited Dependent Variables Lmted Dependent Varables. What f the left-hand sde varable s not a contnuous thng spread from mnus nfnty to plus nfnty? That s, gven a model = f (, β, ε, where a. s bounded below at zero, such as wages

More information

January Examinations 2015

January Examinations 2015 24/5 Canddates Only January Examnatons 25 DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR STUDENT CANDIDATE NO.. Department Module Code Module Ttle Exam Duraton (n words)

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

Andreas C. Drichoutis Agriculural University of Athens. Abstract

Andreas C. Drichoutis Agriculural University of Athens. Abstract Heteroskedastcty, the sngle crossng property and ordered response models Andreas C. Drchouts Agrculural Unversty of Athens Panagots Lazards Agrculural Unversty of Athens Rodolfo M. Nayga, Jr. Texas AMUnversty

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

Bayesian predictive Configural Frequency Analysis

Bayesian predictive Configural Frequency Analysis Psychologcal Test and Assessment Modelng, Volume 54, 2012 (3), 285-292 Bayesan predctve Confgural Frequency Analyss Eduardo Gutérrez-Peña 1 Abstract Confgural Frequency Analyss s a method for cell-wse

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

Composite Hypotheses testing

Composite Hypotheses testing Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise. Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module 3 LOSSY IMAGE COMPRESSION SYSTEMS Verson ECE IIT, Kharagpur Lesson 6 Theory of Quantzaton Verson ECE IIT, Kharagpur Instructonal Objectves At the end of ths lesson, the students should be able to:

More information

Markov Chain Monte Carlo Lecture 6

Markov Chain Monte Carlo Lecture 6 where (x 1,..., x N ) X N, N s called the populaton sze, f(x) f (x) for at least one {1, 2,..., N}, and those dfferent from f(x) are called the tral dstrbutons n terms of mportance samplng. Dfferent ways

More information

Lecture 3 Stat102, Spring 2007

Lecture 3 Stat102, Spring 2007 Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture

More information

Linear Approximation with Regularization and Moving Least Squares

Linear Approximation with Regularization and Moving Least Squares Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...

More information

An Introduction to Censoring, Truncation and Sample Selection Problems

An Introduction to Censoring, Truncation and Sample Selection Problems An Introducton to Censorng, Truncaton and Sample Selecton Problems Thomas Crossley SPIDA June 2003 1 A. Introducton A.1 Basc Ideas Most of the statstcal technques we study are for estmatng (populaton)

More information

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva Econ 39 - Statstcal Propertes of the OLS estmator Sanjaya DeSlva September, 008 1 Overvew Recall that the true regresson model s Y = β 0 + β 1 X + u (1) Applyng the OLS method to a sample of data, we estmate

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U) Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of

More information

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes 25/6 Canddates Only January Examnatons 26 Student Number: Desk Number:...... DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR Department Module Code Module Ttle Exam Duraton

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

Structure and Drive Paul A. Jensen Copyright July 20, 2003

Structure and Drive Paul A. Jensen Copyright July 20, 2003 Structure and Drve Paul A. Jensen Copyrght July 20, 2003 A system s made up of several operatons wth flow passng between them. The structure of the system descrbes the flow paths from nputs to outputs.

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study

More information

e i is a random error

e i is a random error Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where + β + β e for,..., and are observable varables e s a random error How can an estmaton rule be constructed for the unknown

More information

Lecture Notes on Linear Regression

Lecture Notes on Linear Regression Lecture Notes on Lnear Regresson Feng L fl@sdueducn Shandong Unversty, Chna Lnear Regresson Problem In regresson problem, we am at predct a contnuous target value gven an nput feature vector We assume

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 31 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 6. Rdge regresson The OLSE s the best lnear unbased

More information

Outline. Zero Conditional mean. I. Motivation. 3. Multiple Regression Analysis: Estimation. Read Wooldridge (2013), Chapter 3.

Outline. Zero Conditional mean. I. Motivation. 3. Multiple Regression Analysis: Estimation. Read Wooldridge (2013), Chapter 3. Outlne 3. Multple Regresson Analyss: Estmaton I. Motvaton II. Mechancs and Interpretaton of OLS Read Wooldrdge (013), Chapter 3. III. Expected Values of the OLS IV. Varances of the OLS V. The Gauss Markov

More information

Global Sensitivity. Tuesday 20 th February, 2018

Global Sensitivity. Tuesday 20 th February, 2018 Global Senstvty Tuesday 2 th February, 28 ) Local Senstvty Most senstvty analyses [] are based on local estmates of senstvty, typcally by expandng the response n a Taylor seres about some specfc values

More information

On an Extension of Stochastic Approximation EM Algorithm for Incomplete Data Problems. Vahid Tadayon 1

On an Extension of Stochastic Approximation EM Algorithm for Incomplete Data Problems. Vahid Tadayon 1 On an Extenson of Stochastc Approxmaton EM Algorthm for Incomplete Data Problems Vahd Tadayon Abstract: The Stochastc Approxmaton EM (SAEM algorthm, a varant stochastc approxmaton of EM, s a versatle tool

More information

Online Appendix to: Axiomatization and measurement of Quasi-hyperbolic Discounting

Online Appendix to: Axiomatization and measurement of Quasi-hyperbolic Discounting Onlne Appendx to: Axomatzaton and measurement of Quas-hyperbolc Dscountng José Lus Montel Olea Tomasz Strzaleck 1 Sample Selecton As dscussed before our ntal sample conssts of two groups of subjects. Group

More information

LOGIT ANALYSIS. A.K. VASISHT Indian Agricultural Statistics Research Institute, Library Avenue, New Delhi

LOGIT ANALYSIS. A.K. VASISHT Indian Agricultural Statistics Research Institute, Library Avenue, New Delhi LOGIT ANALYSIS A.K. VASISHT Indan Agrcultural Statstcs Research Insttute, Lbrary Avenue, New Delh-0 02 amtvassht@asr.res.n. Introducton In dummy regresson varable models, t s assumed mplctly that the dependent

More information

Testing for seasonal unit roots in heterogeneous panels

Testing for seasonal unit roots in heterogeneous panels Testng for seasonal unt roots n heterogeneous panels Jesus Otero * Facultad de Economía Unversdad del Rosaro, Colomba Jeremy Smth Department of Economcs Unversty of arwck Monca Gulett Aston Busness School

More information

Generalized Linear Methods

Generalized Linear Methods Generalzed Lnear Methods 1 Introducton In the Ensemble Methods the general dea s that usng a combnaton of several weak learner one could make a better learner. More formally, assume that we have a set

More information

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Maxmum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models

More information

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation Econ 388 R. Butler 204 revsons Lecture 4 Dummy Dependent Varables I. Lnear Probablty Model: the Regresson model wth a dummy varables as the dependent varable assumpton, mplcaton regular multple regresson

More information

NUMERICAL DIFFERENTIATION

NUMERICAL DIFFERENTIATION NUMERICAL DIFFERENTIATION 1 Introducton Dfferentaton s a method to compute the rate at whch a dependent output y changes wth respect to the change n the ndependent nput x. Ths rate of change s called the

More information

Markov Chain Monte Carlo (MCMC), Gibbs Sampling, Metropolis Algorithms, and Simulated Annealing Bioinformatics Course Supplement

Markov Chain Monte Carlo (MCMC), Gibbs Sampling, Metropolis Algorithms, and Simulated Annealing Bioinformatics Course Supplement Markov Chan Monte Carlo MCMC, Gbbs Samplng, Metropols Algorthms, and Smulated Annealng 2001 Bonformatcs Course Supplement SNU Bontellgence Lab http://bsnuackr/ Outlne! Markov Chan Monte Carlo MCMC! Metropols-Hastngs

More information

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition) Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes

More information

The Geometry of Logit and Probit

The Geometry of Logit and Probit The Geometry of Logt and Probt Ths short note s meant as a supplement to Chapters and 3 of Spatal Models of Parlamentary Votng and the notaton and reference to fgures n the text below s to those two chapters.

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.

More information

Chapter 9: Statistical Inference and the Relationship between Two Variables

Chapter 9: Statistical Inference and the Relationship between Two Variables Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,

More information

III. Econometric Methodology Regression Analysis

III. Econometric Methodology Regression Analysis Page Econ07 Appled Econometrcs Topc : An Overvew of Regresson Analyss (Studenmund, Chapter ) I. The Nature and Scope of Econometrcs. Lot s of defntons of econometrcs. Nobel Prze Commttee Paul Samuelson,

More information

4.3 Poisson Regression

4.3 Poisson Regression of teratvely reweghted least squares regressons (the IRLS algorthm). We do wthout gvng further detals, but nstead focus on the practcal applcaton. > glm(survval~log(weght)+age, famly="bnomal", data=baby)

More information

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests Smulated of the Cramér-von Mses Goodness-of-Ft Tests Steele, M., Chaselng, J. and 3 Hurst, C. School of Mathematcal and Physcal Scences, James Cook Unversty, Australan School of Envronmental Studes, Grffth

More information

Primer on High-Order Moment Estimators

Primer on High-Order Moment Estimators Prmer on Hgh-Order Moment Estmators Ton M. Whted July 2007 The Errors-n-Varables Model We wll start wth the classcal EIV for one msmeasured regressor. The general case s n Erckson and Whted Econometrc

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson

More information

A Comparative Study for Estimation Parameters in Panel Data Model

A Comparative Study for Estimation Parameters in Panel Data Model A Comparatve Study for Estmaton Parameters n Panel Data Model Ahmed H. Youssef and Mohamed R. Abonazel hs paper examnes the panel data models when the regresson coeffcents are fxed random and mxed and

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

Statistics for Business and Economics

Statistics for Business and Economics Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear

More information

Properties of Least Squares

Properties of Least Squares Week 3 3.1 Smple Lnear Regresson Model 3. Propertes of Least Squares Estmators Y Y β 1 + β X + u weekly famly expendtures X weekly famly ncome For a gven level of x, the expected level of food expendtures

More information

Stat260: Bayesian Modeling and Inference Lecture Date: February 22, Reference Priors

Stat260: Bayesian Modeling and Inference Lecture Date: February 22, Reference Priors Stat60: Bayesan Modelng and Inference Lecture Date: February, 00 Reference Prors Lecturer: Mchael I. Jordan Scrbe: Steven Troxler and Wayne Lee In ths lecture, we assume that θ R; n hgher-dmensons, reference

More information

Uncertainty as the Overlap of Alternate Conditional Distributions

Uncertainty as the Overlap of Alternate Conditional Distributions Uncertanty as the Overlap of Alternate Condtonal Dstrbutons Olena Babak and Clayton V. Deutsch Centre for Computatonal Geostatstcs Department of Cvl & Envronmental Engneerng Unversty of Alberta An mportant

More information

This column is a continuation of our previous column

This column is a continuation of our previous column Comparson of Goodness of Ft Statstcs for Lnear Regresson, Part II The authors contnue ther dscusson of the correlaton coeffcent n developng a calbraton for quanttatve analyss. Jerome Workman Jr. and Howard

More information

Marginal Effects in Probit Models: Interpretation and Testing. 1. Interpreting Probit Coefficients

Marginal Effects in Probit Models: Interpretation and Testing. 1. Interpreting Probit Coefficients ECON 5 -- NOE 15 Margnal Effects n Probt Models: Interpretaton and estng hs note ntroduces you to the two types of margnal effects n probt models: margnal ndex effects, and margnal probablty effects. It

More information

Chapter 15 Student Lecture Notes 15-1

Chapter 15 Student Lecture Notes 15-1 Chapter 15 Student Lecture Notes 15-1 Basc Busness Statstcs (9 th Edton) Chapter 15 Multple Regresson Model Buldng 004 Prentce-Hall, Inc. Chap 15-1 Chapter Topcs The Quadratc Regresson Model Usng Transformatons

More information

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution Department of Statstcs Unversty of Toronto STA35HS / HS Desgn and Analyss of Experments Term Test - Wnter - Soluton February, Last Name: Frst Name: Student Number: Instructons: Tme: hours. Ads: a non-programmable

More information

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method Appled Mathematcal Scences, Vol. 7, 0, no. 47, 07-0 HIARI Ltd, www.m-hkar.com Comparson of the Populaton Varance Estmators of -Parameter Exponental Dstrbuton Based on Multple Crtera Decson Makng Method

More information

Chapter 15 - Multiple Regression

Chapter 15 - Multiple Regression Chapter - Multple Regresson Chapter - Multple Regresson Multple Regresson Model The equaton that descrbes how the dependent varable y s related to the ndependent varables x, x,... x p and an error term

More information

Statistics II Final Exam 26/6/18

Statistics II Final Exam 26/6/18 Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the

More information

Inner Product. Euclidean Space. Orthonormal Basis. Orthogonal

Inner Product. Euclidean Space. Orthonormal Basis. Orthogonal Inner Product Defnton 1 () A Eucldean space s a fnte-dmensonal vector space over the reals R, wth an nner product,. Defnton 2 (Inner Product) An nner product, on a real vector space X s a symmetrc, blnear,

More information

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models Computaton of Hgher Order Moments from Two Multnomal Overdsperson Lkelhood Models BY J. T. NEWCOMER, N. K. NEERCHAL Department of Mathematcs and Statstcs, Unversty of Maryland, Baltmore County, Baltmore,

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed

More information

Factor models with many assets: strong factors, weak factors, and the two-pass procedure

Factor models with many assets: strong factors, weak factors, and the two-pass procedure Factor models wth many assets: strong factors, weak factors, and the two-pass procedure Stanslav Anatolyev 1 Anna Mkusheva 2 1 CERGE-EI and NES 2 MIT December 2017 Stanslav Anatolyev and Anna Mkusheva

More information

Kernel Methods and SVMs Extension

Kernel Methods and SVMs Extension Kernel Methods and SVMs Extenson The purpose of ths document s to revew materal covered n Machne Learnng 1 Supervsed Learnng regardng support vector machnes (SVMs). Ths document also provdes a general

More information

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9 Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,

More information

First Year Examination Department of Statistics, University of Florida

First Year Examination Department of Statistics, University of Florida Frst Year Examnaton Department of Statstcs, Unversty of Florda May 7, 010, 8:00 am - 1:00 noon Instructons: 1. You have four hours to answer questons n ths examnaton.. You must show your work to receve

More information

Introduction to Regression

Introduction to Regression Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes

More information

Homework Assignment 3 Due in class, Thursday October 15

Homework Assignment 3 Due in class, Thursday October 15 Homework Assgnment 3 Due n class, Thursday October 15 SDS 383C Statstcal Modelng I 1 Rdge regresson and Lasso 1. Get the Prostrate cancer data from http://statweb.stanford.edu/~tbs/elemstatlearn/ datasets/prostate.data.

More information

Econometrics: What's It All About, Alfie?

Econometrics: What's It All About, Alfie? ECON 351* -- Introducton (Page 1) Econometrcs: What's It All About, Ale? Usng sample data on observable varables to learn about economc relatonshps, the unctonal relatonshps among economc varables. Econometrcs

More information

Estimation: Part 2. Chapter GREG estimation

Estimation: Part 2. Chapter GREG estimation Chapter 9 Estmaton: Part 2 9. GREG estmaton In Chapter 8, we have seen that the regresson estmator s an effcent estmator when there s a lnear relatonshp between y and x. In ths chapter, we generalzed the

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

Rockefeller College University at Albany

Rockefeller College University at Albany Rockefeller College Unverst at Alban PAD 705 Handout: Maxmum Lkelhood Estmaton Orgnal b Davd A. Wse John F. Kenned School of Government, Harvard Unverst Modfcatons b R. Karl Rethemeer Up to ths pont n

More information

Linear Regression Analysis: Terminology and Notation

Linear Regression Analysis: Terminology and Notation ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented

More information

The Ordinary Least Squares (OLS) Estimator

The Ordinary Least Squares (OLS) Estimator The Ordnary Least Squares (OLS) Estmator 1 Regresson Analyss Regresson Analyss: a statstcal technque for nvestgatng and modelng the relatonshp between varables. Applcatons: Engneerng, the physcal and chemcal

More information

Parametric fractional imputation for missing data analysis

Parametric fractional imputation for missing data analysis Secton on Survey Research Methods JSM 2008 Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Wayne Fuller Abstract Under a parametrc model for mssng data, the EM algorthm s a popular tool

More information

Limited Dependent Variables and Panel Data. Tibor Hanappi

Limited Dependent Variables and Panel Data. Tibor Hanappi Lmted Dependent Varables and Panel Data Tbor Hanapp 30.06.2010 Lmted Dependent Varables Dscrete: Varables that can take onl a countable number of values Censored/Truncated: Data ponts n some specfc range

More information

An (almost) unbiased estimator for the S-Gini index

An (almost) unbiased estimator for the S-Gini index An (almost unbased estmator for the S-Gn ndex Thomas Demuynck February 25, 2009 Abstract Ths note provdes an unbased estmator for the absolute S-Gn and an almost unbased estmator for the relatve S-Gn for

More information

Non-Mixture Cure Model for Interval Censored Data: Simulation Study ABSTRACT

Non-Mixture Cure Model for Interval Censored Data: Simulation Study ABSTRACT Malaysan Journal of Mathematcal Scences 8(S): 37-44 (2014) Specal Issue: Internatonal Conference on Mathematcal Scences and Statstcs 2013 (ICMSS2013) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES Journal

More information

P R. Lecture 4. Theory and Applications of Pattern Recognition. Dept. of Electrical and Computer Engineering /

P R. Lecture 4. Theory and Applications of Pattern Recognition. Dept. of Electrical and Computer Engineering / Theory and Applcatons of Pattern Recognton 003, Rob Polkar, Rowan Unversty, Glassboro, NJ Lecture 4 Bayes Classfcaton Rule Dept. of Electrcal and Computer Engneerng 0909.40.0 / 0909.504.04 Theory & Applcatons

More information

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems Numercal Analyss by Dr. Anta Pal Assstant Professor Department of Mathematcs Natonal Insttute of Technology Durgapur Durgapur-713209 emal: anta.bue@gmal.com 1 . Chapter 5 Soluton of System of Lnear Equatons

More information

JAB Chain. Long-tail claims development. ASTIN - September 2005 B.Verdier A. Klinger

JAB Chain. Long-tail claims development. ASTIN - September 2005 B.Verdier A. Klinger JAB Chan Long-tal clams development ASTIN - September 2005 B.Verder A. Klnger Outlne Chan Ladder : comments A frst soluton: Munch Chan Ladder JAB Chan Chan Ladder: Comments Black lne: average pad to ncurred

More information

Module 9. Lecture 6. Duality in Assignment Problems

Module 9. Lecture 6. Duality in Assignment Problems Module 9 1 Lecture 6 Dualty n Assgnment Problems In ths lecture we attempt to answer few other mportant questons posed n earler lecture for (AP) and see how some of them can be explaned through the concept

More information

Econ107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10)

Econ107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10) I. Defnton and Problems Econ7 Appled Econometrcs Topc 9: Heteroskedastcty (Studenmund, Chapter ) We now relax another classcal assumpton. Ths s a problem that arses often wth cross sectons of ndvduals,

More information