Mismeasured Variables in Econometric Analysis: Problems from the Right and Problems from the Left. Jerry Hausman 1

Size: px
Start display at page:

Download "Mismeasured Variables in Econometric Analysis: Problems from the Right and Problems from the Left. Jerry Hausman 1"

Transcription

1 DRAFT, January 3, 2001 Please do not quote wthout permsson Forthcomng: Journal of Economc Perspectves Msmeasured Varables n Econometrc Analyss: Problems from the Rght and Problems from the Left Jerry Hausman 1 The effect of msmeasured varables n statstcal and econometrc analyss s one of the oldest known problems, datng from the 1880 s n Adcock (1887). In the most straghtforward regresson analyss wth a sngle regressor varable under classcal msmeasurement assumptons the least squares estmate s downward based n magntude towards zero. 2 At MIT I have called ths the Iron law of econometrcs the magntude of the estmate s usually smaller than expected. Whle a msmeasured rght hand sde varable creates ths problem, a msmeasured left hand sde varable under classcal assumptons does not lead to bas. The only result s less precson n the estmated coeffcent and a lower t-statstc. In ths paper I consder three recent developments for msmeasurement econometrc models that may be less famlar to readers. The typcal soluton to the rght hand sde msmeasurement problem s to use nstrumental varables to acheve a consstent estmate. 3 However, n the last decade a lterature has arsen regardng the use of weak nstruments that can result n sgnfcant fnte sample bas. 4 Thus, relance on an nstrumental varable estmator to solve themsmeasurement problem may be msplaced n a partcular applcaton. Here, I dscuss a specfcaton test of Hahn and Hausman (1999) that permts a test of the weak nstrument hypothess. Next, for non-lnear models wth msmeasured varables, applcaton of nstrumental varables leads to nconsstent results. At MIT I call ths the forbdden regresson where mproper use of nstrumental varables leads to nconsstency. A paper 1 Department of Economcs, MIT, jhausman@mt.edu. Ths paper s gven n memory of Zv Grlches. Jason Abrevaya provded helpful comments. 2 Surveys of these classcal results are found n Agner, Hsao, Kapteyn and Wansbeek, T. (1984), Fuller (1987), and Hausman, Newey, and Powell (1995). Henceforth, I wll assume that the true parameter s postve so that I wll not repeat the n magntude qualfer. 3 Of course, a soluton may not exst f sutable nstruments cannot be found. 4 See e.g. Bound, Jaeger, and Baker (1997).

2 2 by Hausman, Ichmura, Newey and Powell (HINP) (1991) proposes a consstent estmator for the polynomal regresson specfcaton n the presence of msmeasurement. I dscuss that estmator here and a recent extenson of the approach to general non-lnear specfcatons by Schennach (2000). Thus, consstent estmators now are known for msmeasured non-lnear regresson models. Lastly, I return to msmeasured left hand sde varables. Many lmted dependent varable models n econometrcs are nconsstently estmated f the left hand sde varables have measurement error. 5 I dscuss probt/logt type models of bnary outcomes or dscrete choce and demonstrate that nconsstency occurs. I also dscuss consstent estmators of these models wth measurement errors and also specfcaton tests. Interestngly, these models do not requre nstrumental varables for consstent estmaton so long as a monotoncty condton s satsfed. I next consder duraton models and demonstrate that under measurement errors on the left hand sde varables, nconsstency wll agan result for general models. But so long as a stochastc domnance condton s satsfed, these models can agan be estmated consstently wthout the use of an nstrument. The last model I consder wth a msmeasured left hand sde varable s the quantle regresson (QR) model. Agan nconsstent estmates result, but I do not currently have a soluton to ths problem. I. Lnear Models Wth Msmeasured Varables I begn wth the classc lnear regresson msmeasurement model. I assume a lnear specfcaton after means of the varables have been removed wth the rght hand sde varable msmeasured wth uncorrelated measurement error where z s the true unobserved regressor and x s the observed varable that contans measurement error: y x = βz + ε = βx + ε βη = βx + τ = 1,..., n = z + η (1) 5 All the models dscussed here wll also be nconsstently estmated f the rght hand sde varable s msmeasured. However, I do not dscuss that topc specfcally n the paper snce the specfcatons can be consdered as nonlnear specfcatons.

3 3 I wll assume that the sample s ndependent and dentcally dstrbuted (..d.) and that z s the unobserved true varable and that the error n observaton η s mean zero and s uncorrelated wth z and wth ε. 6 Other rght hand sde varables, assumed to be measured wthout error, have been partalled out of the model. I wll assume that β > 0. The classc result s that usually the least squares (OLS) estmator b < β. 7 Ths result causes the ron law of econometrcs that I suggested above, and t s also called attenuaton n the statstcs lterature. In terms of a large sample result, p lm b = αβ < β where α = σ z / σ x = σ z /( σ z + σ η ) < 1. Thus, the amount of large sample bas depends on the rato of the varance of the sgnal (true varable) to the sum of the varance of the sgnal and the varance of the nose (error n measurement). 8 Another well-known classc result arses f the left hand sde and rght hand sde varables are nterchanged n the regresson specfcaton. The nverse of the OLS estmator of the coeffcent n the reverse regresson, g, gves an upward based estmate of the true coeffcent β. In large samples we have the bounds, p lmb < β < p lm g. A less well known result tells the wdth of the bounds snce 2 b / g = R where the R 2 arses from the regresson equaton. Thus, for cross secton econometrcs where R 2 s often about 0.3 the bounds can be qute wde whle n a tmes seres applcaton f the R 2 s qute hgh, msmeasurement wll not have a large effect on the least squares estmate. 9 As a last result, note that f the left hand sde varable s measured wth error but not the rght hand sde varable, the OLS estmator b would be unbased under the usual Gauss-Markov assumptons. 10 Indeed, the new measurement error term, ω, assumed to 6 The..d. assumpton wll be mantaned throughout the paper. 7 Ths result does not hold n general f the measurement error s correlated wth z or wth ε, although the downward bas stll often occurs. A straghtforward calculaton demonstrates the effect of correlated measurement error on the estmator b. 8 For the effect on the estmated coeffcents of the other varables n the regresson specfcaton that have been partalled out see Mejer and Wansbeek (2000). 9 In tmes seres, the approprate R 2 s after partallng out other rght hand sde varables, whch can reduce the orgnal R 2 by qute a lot. Also, note that ths result demonstrates that the tmes seres verson of the permanent ncome hypothess, n ts most smple form of equaton (1) wth z and x permanent and measured ncome, does not lead to a sgnfcantly greater measured MPC because the R 2 n tme seres data s qute hgh. 10 By Gauss-Markov assumptons I mean the assumptons that lead to OLS beng the best lnear unbased estmator.

4 4 be uncorrelated wth ε would not be separately dentfed because the measured varable q = + ω would be equvalent to replacng y by q n the orgnal regresson y specfcaton and denotng the resdual as ν = ε + ω. The only result would be reduced precson n the estmate b, a lower t-statstc, and a reduced R 2. However, f both left and rght hand sde varables are msmeasured wth errors of measurement uncorrelated wth each other and wth ε, the b remans the same as above but g ncreases because the R 2 has decreased snce ν has replaced ε and σ 2 ν 2 2 = σ ε + σ ω. Thus, the bounds for the true coeffcent ncrease when measurement error occurs n both the left and rght hand sde varables. Most solutons to the msmeasurement problem for the lnear specfcaton n econometrcs depend on the use of nstrumental varables. 11 Instrumental varables are assumed to be correlated wth the true z but uncorrelated wth ether ε and η. Let the vector of nstrumental varables be w and the nstrumental varable (IV) estmator wll use a lnear combnaton of the w to acheve a consstent estmator b IV of the true coeffcent β. In the usual case under Gauss-Markov assumptons 2SLS wll gve the effcent nstrumental varable estmator, whle n more general stuatons of condtonal heteroscedastcty the Whte (1984) effcent IV estmator should be used. 12 However, a sgnfcant understandng has emerged over the past few years that IV estmaton of the errors-n-varables model can lead to problems of nference n the stuaton of weak nstruments, whch can arse when the nstruments do not have a hgh degree of explanatory power for the msmeasured varable(s) or when the number of nstruments becomes large. The stuaton of weak nstruments causes conventonal (frst order) asymptotctheory to provde a poor gude to fnte sample nference. These problems of nference n the weak nstrument stuaton can arse when conventonal (frst order) asymptotc nference technques are used. In partcular, conventonal frst order 11 Of course, t has long been known that consstent estmators exst n the non-normal case, whch do not requre nstrumental varables; see e.g. Agner et. al. (1984). However, these estmators have not been used very much n econometrcs. 12 Henceforth, I wll confne the choce of estmators and tests to the Gauss-Markov setup of no heteroscedastcty or seral correlaton.

5 5 asymptotcs can lead to a lack of ndcaton of a problem even though sgnfcant (large sample) bas s present because estmated standard errors are not very accurate. Hahn-Hausman (HH 1999) take a new approach to dentfyng the potental problem and use hgher order asymptotc dstrbuton theory to determne f the conventonal frst order IV asymptotcs are relable n a partcular stuaton. HH demonstrate that the fnte sample bas depends on 3 factors: (1) bas s a monotoncally 2 ncreasng functon of β σ 2 η the varance of the measurement error multpled by the true coeffcent; (2) bas s a monotoncally ncreasng functon of K the number of nstrumental varables; and (3) bas s a monotoncally decreasng functon of the R 2 of the reduced form specfcaton of x regressed on the nstrumental varables w. The new specfcaton test takes the general approach of Hausman (1978) and estmates the same parameter(s) n two dfferent ways. In partcular, we compare the dfference of the forward (conventonal) 2SLS estmator of the coeffcent of the rght hand sde msmeasured varable wth the reverse regresson 2SLS estmator of the same unknown parameter when the normalzaton s changed. 13 Under the null hypothess that conventonal frst order asymptotcs provdes a relable gude, the two estmates should be very smlar. Indeed, they have untary correlaton accordng to frst order asymptotc dstrbuton theory. If the IV estmator s workng well, the forward and reverse estmators take the form under conventonal (frst order) asymptotcs, ( b IV g ), p lm n = 0 so that the forward and reverse estmators for β should be IV qute close. However, f they are not close where the measure of closeness s determned by second order asymptotcs n HH, then problems of nferences are lkely to be present. When second order asymptotc dstrbuton theory s used, the two estmators wll dffer due to second order bas terms. The HH test subtracts off these bas terms and then sees whether the resultng dfference n the two estmates satsfes the results of second order asymptotc theory. If t does and the second order bas term s small, the HH test does not reject the use of frst order asymptotc theory. If the new specfcaton test rejects HH then consder estmaton of the equaton by second-order unbased estmators of the type frst proposed by Nagar (1959). A second specfcaton test s then used to see 13 Thus, 2SLS s appled to the forward and reverse regresson, whch was dscussed above.

6 6 f these second-order unbased estmator provdes a sutable bass for nference. The HH specfcaton test may be qute useful n cross secton stuatons wth a msmeasured rght had sde varable because of the sgnfcant fnte sample bas that often exsts along wth nstruments that do not have hgh explanatory power for the msmeasured varable. II. Non-Lnear Models Wth Msmeasured Varables The lnear model wth measurement error s equvalent to a lnear smultaneous equaton model, so that two stage least squares or a closely related estmator s used. However, ths relatonshp no longer holds n the nonlnear regresson framework as noted by Y. Amemya (1985). The reason that 2SLS no longer leads to a consstent estmator n the nonlnear errors n varables problem s because the error of measurement s no longer addtvely separable from the true varable n the nonlnear regresson model. Applcaton of an IV estmator such as 2SLS or nonlnear 2SLS (N2SLS) leads to nconsstent estmates. A straghtforward way n whch to see why 2SLS or N2SLS does not yeld consstent estmators n the nonlnear errors n varable model s to consder the lnear n parameters and nonlnear n varables specfcaton for equaton (1) where z s replaced by the nonlnear functon g z ), whch s assumed to be a suffcently smooth ( functon to do Taylor approxmatons. Replacng the unobservable z wth the observed varable x leads to hgher order terms of the Taylor expanson whch enter the regresson error term but contan the true varable z. Thus, as demonstrated by Hausman-Newey- Powell (HNP, 1995), IV estmaton wll be nconsstent because the nstruments wll necessarly be correlated wth both the msmeasured rght hand sde and the error term n the econometrc specfcaton. Hausman-Ichmura-Newey-Powell (HINP, 1991) demonstrate how to solve the problem and acheve consstent estmates n the case of a polynomal specfcaton: K j y = β j ( z ) + ε = 1,...,n (2) j=0

7 7 where the unobserved varable z s replaced by x, whch s observed wth measurement error η as n equaton (1). HINP develop a consstent estmator under ether of two sets of assumptons. The frst s a repeated measurement specfcaton where w = z + v and v s ndependent of z, where the stronger (than no correlaton) assumpton s requred because of the nonlnear specfcaton. The second specfcaton s the nstrumental varables specfcaton x z + = wδ v where v s ndependent of q so that = w δ + v + η. The unknown parameter vector δ s estmated usng OLS. The HINP (1991) approach uses estmates of hgher order moments to derve consstent estmators. The usual IV approach multples through equaton (1) by where d s the estmate of δ. Then takng probablty lmts allows a soluton of the w d, normal equatons for b. In the polynomal specfcaton HINP use hgher order moments of the ndcator varable p w or the nstrument varable p ( w d) to multply both sdes of the equaton. HINP(1991) derve recursve relatonshps, whch permt calculaton of the estmates of the elements of the unknown parameter vector and soluton for the vector b whch s a consstent estmate of the unknown parameters β j n equaton (2). HINP also derve asymptotc varance estmators for the b vector. HNP(1995) use ths approach to estmate Engel curves for famly expendture n the U.S. Estmaton of Engel curves has long been an area of nterest among econometrcans where the "Leser (1963)-Workng" form of Engel curve n whch budget shares are regressed on the log of ncome or expendture has been wdely adopted n recent research. However, n a notable paper Gorman (1981) consdered Engel curves n whch ether expendture or budget shares are specfed as polynomals n functons of expendture, e.g. log of expendture. Gven an "exactly aggregable functon", Gorman demonstrates that the rank of the matrx of coeffcents for the polynomal terms n ncome s at most three. HNP nvestgate Engel curve specfcatons of the Gorman form and provde tests of hs rank three restrcton. Few studes of Engel curves have used estmators other than least squares or nonlnear least squares. HNP nvestgate two alternatve sets of nstruments: expendture n other perods whch follows from a lfe cycle model approach to consumpton or

8 8 determnants of ncome and expendture such as educaton and age. HNP use the 1982 U.S. Consumer Expendture Survey (CES). The CES collects data from famles over four quarters so that HNP can apply the repeated measurement technque. HNP use budget share and total expendture for each famly from 1982:1 and for the repeated measurement total expendture from 1982:2. They estmate Engel curves on fve commodty groups: food, clothng, recreaton, health care, and transportaton. HNP fnd that the usual assumpton of constant budget share elastctes, whch the Leser-Workng specfcaton mposes, appears nconsstent wth the 1982 CES data. HNP also fnd that a Hausman (1978) type specfcaton test of the IV estmates versus the OLS estmates strongly rejects the OLS estmates, ndcatng the mportance of the measurement error specfcaton. Thus, HNP fnd strong evdence that use of current expendture n estmaton of Engel curves on mcro data leads to errors n varables problems. Lastly, HNP explore the Gorman results. For the polynomal specfcaton of equaton (2), the rank restrcton takes the form that the rato of coeffcents of the cubc terms to the coeffcents of the quadratc terms wll be constant across budget share equatons. HNP estmate the "Gorman statstc" to see whether the coeffcents n the cubc specfc of equaton (2) have rank three. HNP fnd a rather remarkable result. HNP fnd the ratos of the coeffcents to be extremely close n actual values and estmated precsely. The ratos of the coeffcents for the dfferent budget share equatons are 25.0, -25.2, -25.1, , and The near equalty of the ratos provdes strong support for the Gorman hypothess. Thus, use of consstent estmates of a msmeasurement model for Engel curve analyss provdes an nterestng result that a restrcton from economc theory apples to the data. In a recent doctoral thess at MIT, Schennach (2000) has extended the HINP (1991) results from the polynomal specfcaton of equaton (2) to the consstent estmaton of nonlnear models wth measurement errors n the explanatory varables when one repeated observaton exsts. Thus, regresson models wth nonlnear functons such as g ( z ), where z s an unobserved varable can be estmated consstently. Schennach uses the Fourer transform to convert the ntegral equatons that relate the 14 The HINP consstent estmates provde stronger support for the Gorman hypothess than do the OLS estmates.

9 9 dstrbuton of the unobserved true varables to the msmeasured observed varables nto algebrac equatons. She then solves these equatons to dentfy moments of the true unobserved varables. These moments are used to construct a tradtonal nonlnear least squares estmator. Whle HINP restrcted ther estmator to polynomal moments of the unobserved varables and ts product wth the left hand sde varable, Schennach allows for arbtrary functons of the msmeasured rght hand sde varable x. 15 Thus, the approach s consderably more flexble than the HINP approach, although t does nvolve calculaton of the emprcal Fourer transforms. Schennach estmates nonlnear Engel curves usng CES data from 1984:1 usng the next quarter expendture as the repeated measurement. For both of her nonlnear specfcaton she fnds that for 4 out or the 5 budget categores that she estmates, a Hausman (1978) test rejects the tradtonal NLS estmator n favor of her estmator that allows for msmeasured rght hand sde varables. Thus, both HNP (1995) and Schennach (2000) fnd that n estmaton of Engel curves, a least squares approach whch uses current expendture nstead of a permanent ncome approach leads to nconsstent estmates where the usual downward bas (attenuaton) s present n the least squares estmates. III. Measurement Error n the Left Hand Sde Varables: Probt and Logt I now consder the consequences of msmeasurement n the left hand sde varable n bnary outcome models where the observed left hand sde varable s a functon of an unobserved (latent) dependent varable. Ths specfcaton often arses n lmted dependent varable models. The outcome s dfferent from the usual regresson specfcaton where a msmeasured left hand sde varable does not lead to problems, as dscussed above. In the lmted dependent varable specfcaton, msclassfcaton of the left hand sde varable leads to based and nconsstent estmators. The usual latent varable specfcaton has an observed latent varable y * and an * * observed varable y = 1 f y 0 and y = 0 f y < 0. The regresson specfcaton that allows for msclassfcaton s: 15 In partcular, consstent estmaton for probt and logt models wth msmeasured rght hand sde (explanatory) varables follows.

10 10 q = α 0 + ( 1 α 0 α1) F( z β ) + ε (3) where α 0 s the probablty that q = 1 although the true y = 0, α 1 s the probablty that q = 0 although the true y = 1, and F s the cumulatve dstrbuton for the probt model or logt model. Wth no msmeasurement α = α 0 and nonlnear least squares (NLS) 0 1 = or maxmum lkelhood estmaton (MLE) of equaton (3), mposng the no msclassfcaton assumpton, leads to consstent estmates. However, f msclassfcaton s present, both NLS and MLE lead to based and nconsstent estmaton. However, f equaton (3) s estmate by NLS allowng for msclassfcaton, Hausman, Abrevaya, and Scott-Morton (HAS 1998) demonstrate that consstent estmaton results. Thus, the estmated coeffcents of α 0 and α 1 provde a specfcaton test for msmeasurement. Smlarly, HAS demonstrate that MLE, whch permts for msclassfcaton, s straghtforward and consstent estmates result so long as a monotoncty condton holds: α +α 1. Ths monotoncty condton s relatvely weak snce t says that the 0 1 < combned probablty of msclassfcaton s not so hgh that on average you cannot tell whch result actually occurred. Thus, two results arse n the bnary lmted dependent varable cases whch are dfferent from the classcal regresson specfcaton wth measurement error n the left hand sde varable: (1) nconsstent estmaton results from the msmeasurement and (2) consstent estmaton does not requre nstrumental varables. The analyss extends to dscrete response models wth more than two categores, and MLE estmaton s consstent. HAS fnd that relatvely small amounts of msclassfcaton, as lttle as 2%, can lead to sgnfcant amounts of large sample bas n Monte Carlo experments. The assumpton of normally dstrbuted dsturbances requred by the probt specfcaton or extreme value dsturbances for the logt specfcaton, used n the specfcaton of F n equaton (3), s not necessary for consstent estmaton. Thus, HAS also demonstrate that the monotoncty condton allows for semparametrc estmaton where no cumulatve dstrbuton needs to be assumed. HAS demonstrate that the maxmum rank correlaton (MRC) estmator of Han (1987) provdes a consstent

11 11 estmator for β. A further advantage of the MRC estmator s that a more flexble model of msclassfcaton s suffcent for consstent estmaton as the exact form of msclassfcaton need not be specfed or estmated. HAS then demonstrate how to non-parametrcally estmate the response functon F as a functon of z b, the consstent estmate, usng an sotonc regresson (IR) method. The IR method works n the current stuaton snce F s monotonc n z β snce t s a dstrbuton functon. The IR technque mposes a monotoncty condton on the lefthand sde varable n a regresson specfcaton. The IR estmator s pontwse consstent, and HAS develop asymptotc standard errors and confdence ntervals for the consstent estmator of F. As an emprcal example HAS specfy and estmate a model of job change usng both the CPS (Current Populaton Survey) and PSID (Panel Study of Income Dynamcs) data sets. The results from the CPS data set, dscussed here, demonstrate strong evdence of msclassfcaton. Usng the MLE of a probt specfcaton whch allows for msclassfcaton, HAS fnd that α 0, the probablty of msclassfcaton for non-job changers s estmated to be 6% and s very precsely estmated, and α 1, the probablty of msclassfcaton for job changers s estmated to be 31% and s also very precsely estmated. The MLE estmates of many of the rght hand sde varables change by large amounts when msclassfcaton s permtted. HAS then use the MRC and IR estmators that do not mpose functonal form restrctons. They fnd that α 0, the probablty of msclassfcaton for non-job changers s estmated to be 4% and s very precsely estmated, and α 1, the probablty of msclassfcaton for job changers s estmated to be 40% and s also very precsely estmated. Thus, HAS fnd that the semparametrc estmates are qute smlar to the MLEestmates that allow for msclassfcaton. Also, the MRC estmates of most of the rght hand sde varables change by only relatvely small amounts compared to the parametrc MLE estmates that allow for msclassfcaton. Thus, msclassfcaton appears to be a potentally serous problem n mcro data where nconsstent results may arse. Whle the combnaton MRC/IR approach s qute flexble, the lmted emprcal experence of HAS appears to

12 12 demonstrate that the MLE approach to probt or logt allowng for msclassfcaton may gve reasonable results n many actual emprcal stuatons. IV. Measurement Error n the Left Hand Sde Varables: More General Models the form: The model n the last secton s a partcular example of a class of models that take y = f ( β ) + ε * z (4) where f s strctly ncreasng and y * s an unobserved (latent) varable. To allow for msmeasurement n ths more general stuaton Abrevaya-Hausman (AH 1999) model the observed left hand sde varable q as a stochastc functon of the underlyng y *. AH demonstraton that ths model allows for bnary choce wth msclassfcaton as n HAS (1998), msmeasured dscrete dependent varables, and msmeasured contnuous dependent varables. In ths latter stuaton the observed left hand sde varable s contnuous and s a functon of the unobserved (latent) varable and a random dsturbance so that q = h y, ω ). Ths last model specfcaton ncludes the ncreasngly used ( * duraton and hazard models. AH consder sem-parametrc estmaton of these models usng ether the MRC estmators or the more general monotone rank estmator (MRE) developed by Cavanagh and Sherman (1998). To acheve dentfcaton and allow consstent estmaton usng the MRC and MRE estmators AH develop a suffcent condton on the msmeasurement process. The suffcent condton s that the dstrbuton for the observed varable q for a hgher latent y * stochastcally domnates the dstrbuton of q j for a lower latent y * j. Thus, the effect of the msmeasurement cannot on average permute the orderng of the observed left hand sde varables wth respect to the orderng of the unobserved latent varables. So the effect of the msmeasurement cannot be too large on average. The use of the noton of frst order stochastc domnance s famlar from mcroeconomcs

13 13 where t s used to order portfolos or rsky dstrbutons. 16 In terms of an actual problem wth msmeasurement n the left hand sde varable the approprate queston to ask to satsfy the suffcent condton s: Are observatonal unts wth larger true values for ther left hand sde varable more lkely to report larger values than observatons unts wth smaller true values? If the answer s yes, the AH technques can be used. Agan nstrumental varables are not requred for consstent estmaton. AH consder ther approach for the duraton models such as the well-known Cox (1972) partal lkelhood model, the Han-Hausman (1990) and Meyer (1990) specfcatons, and more generally the proportonal hazard model. A hazard model, n the context of the return to employment for an unemployed person, answers the queston of what s the probablty of becomng employed n the next tme perod condtonal on beng unemployed up to the prevous tme perod. Thus, the hazard model has a smlarty to dscrete response models, but t allows for the effect of past outcomes on the probablty of the return to employment to affect the outcome n the current perod. AH demonstrate that conventonal MLE estmaton of commonly used duraton models and hazard models s nconsstent when the left hand sde varable s msmeasured, wth the sngle excepton of the hghly restrctve Webull duraton model. 17 So f the person were actually unemployed for say 24 weeks and answered 26 weeks n the survey, msmeasurement wllexst and would, n general, lead to nconsstent estmaton. AH fnd n Monte Carlo experments that both the Cox model and the Han-Hausman-Meyer (HHM) models have coeffcent estmates that are attenuated, e.g. based towards zero. Thus, duraton model specfcaton that do not allow for msmeasurement n the left hand sde varable have results smlar to the classcal regresson model wth msmeasurement n the rght hand sde varable. AH demonstrate that MRC and MRE estmators do well n the Monte Carlo experments n takng account of the msmeasurement and estmatng the unknown parameters. AH dscuss the fndng of msmeasured duraton data n survey or unemployment duraton data. Msmeasured duratons are common to all data sets that have survey 16 See e.g. Mas-Colell, Whnston, and Green (1995), p The Webull specfcaton requres a monotonc (baselne) hazard, whch makes t too restrctve for most problems. Also, the partcular applcaton cannot allow for censorng (check) whch s typcally present n applcatons of duraton models.

14 14 responses, and AH concentrate on the Survey of Income and Program partcpaton (SIPP) data set. Some common fndngs are that a sgnfcant number of reportng error are found n the CPS about 37% of unemployed workers overstated unemployment duratons; longer spells have a hgher proporton of reportng errors, and responses tend to be focal responses for nstance at a number of weeks that corresponds to an nteger month amount, e.g. 4 or 8 weeks. AH conduct an emprcal analyss on unemployment duraton n the SIPP data and use the Cox and HHM models, whch do not allow for msmeasurement as well as a sem-parametrc MRE model that does permt msmeasurement. Whle many of the MRE parameter estmates for the demographc varables are smlar to the Cox and HHM parameter estmates, one potentally mportant dfference s found. Allowng for msmeasurement fnds a much smaller and statstcally nsgnfcant effect on the UI beneft levels on unemployment duraton. The prevous wage nteracted wth recevng UI contnues to be sgnfcant, but the actual sze of the UI beneft ceases to be sgnfcant. In partcular the commonly used replacement-rate specfcaton s rejected when msmeasurement s permtted n the unemployment duratons. AH conclude that msmeasurement n the left hand sde varable can have an mportant effect n duraton models. V. An Unsolved Problem: Measurement Error n the Left Hand Sde Varable of Quantle Regresson Models Koenker and Bassett (KB 1978) ntroduced the quantle regresson (QR) estmator nto econometrcs. The QR estmator gves a vew of the entre condtonal dstrbuton rather than just, say, the condtonal mean. The advantage of the QR estmator s that t s a robust estmator of the regresson specfcaton so that t s not as senstve to outlers,.e. extreme observatons. 18 Alternatvely, the QR estmator performs well on effcency grounds when the stochastc dsturbance has thck tals that depart from the normal (Gaussan) assumpton. Lastly, the QR estmator does well n the heteroscedastc stuaton. The QR estmator estmates a regresson specfcaton at a number of θ th regresson quantles for 0 < θ < 1 n the regresson model specfcaton ε ( θ ) = y z β ( θ ). The QR estmator thus generalzes the regresson medan estmator

15 15 for the regresson quantle of θ = 1/ 2. Interestng applcatons of the QR estmator are ncreasngly common. Buchnsky (1994) used the QR technque to explore changes n the wage dstrbuton n the CPS over a 25 year perod. Buchnsky focuses on changes n the returns to educaton and experence at dfferent ponts of the wage dstrbuton, gven that he beleves that heteroscedastcty s lkely to be an mportant factor. He fnds that the returns to educaton and experence are dfferent at the dfferent quantles, θ = {0.1, 0.25, 0.5, 0.75, 0.9}. To the best of my knowledge, no one has explored the effect of msmeasured left hand sde varables for the QR approach. As before, assume that the measured left hand sde varable q = + ω. Msmeasurement n the left hand sde varable mght be y expected n some applcatons. For nstance, n the Buchnsky paper the man left hand sde varable s the weekly wage, defned n the CPS as the total ncome from wages and salares last year dvded by the number of weeks worked last year. Both numerator and denomnator arse from survey responses n the CPS (check). However, t s not dffcult to demonstrate that a msmeasured left hand sde varable wll lead to nconsstent QR estmaton because the stochastc dsturbance s now ε + ω. The presence of the error n measurement attenuates the quantles coeffcent estmates, leadng to estmates that are lnear combnaton of the actual quantle coeffcents β (θ ) for dfferent θ s. As an emprcal example I took a QR specfcaton wth a constant and one rght hand varable n the presence of heteroscedastcty and then added a (standard) normal measurement error to the left hand sde varable. The results for estmates of the slope coeffcent are n Table 1: 18 See the paper by Professor Koenker n ths ssue.

16 16 Table 1: Estmated QR Slope Coeffcents wth and wthout Measurement Error (ME) 19 Quantle No ME Normal ME As expected, the QR slope estmates wth normal measurement error are attenuated towards the medan coeffcent because of the mxture of the regresson error ε and the measurement error ω. Whle the QR estmator s robust to certan data features that create problems for the classcal lnear regresson model, t s not robust to a msmeasured left hand sde varable, whch does not create problems for the classcal lnear regresson model. I have not yet found a soluton to ths problem, smlar to the soluton for the lmted dependent varable problems and duraton model problems. Fndng a soluton seems a good topc for future research. 19 I used 10,000 observatons n the Monte Carlo runs. The rght hand sde varable s drawn from a unform dstrbuton. The specfcaton contans sgnfcant condtonal heteroscedastcty.

17 17 References Abrevaya J. and J. Hausman (1999), Semparametrc Estmaton wth Msmeasured Dependent Varables: An Applcaton to Duraton Models for Unemployment Spells, Annales D Econome et de Statstque, 55-56, Bound, J., D. A. Jaeger, and R. M. Baker (1997), Problems wth Instrumental Varables Estmaton when the Correlaton Between the Instruments and the Endogenous Explanatory Varable s Weak, Journal of the Amercan Statstcal Assocaton, 90, Adcock, R. (1887), "A Problem n Least Squares", The Analyst, 5, Agner, D.J., Hsao, C., Kapteyn A., Wansbeek, T. (1984), "Latent Varable Models n Econometrcs", n Z. Grlches and M. D. Intrlgator, Handbook of Econometrcs, vol. II, Amemya, Y. (1985), "Instrumental Varable Estmator for the Nonlnear Errors-n- Varables Model," Journal of Econometrcs, 28, Buchnsky, M. Changes n the U.S. Wage Structure : Applcaton of Quantle Regresson, Econometrca, 62, Cavanagh C., and R. Sherman (1998), Rank Estmators for Monotonc Index Models, Journal of Econometrcs, 84, Cox D.R. (1972), Regresson Models and Lfe Tables (wth dscusson), Journal of the Royal Statstcal Socety, B, 34: Gorman, W.M. (1981), "Some Engel Curves", n A. Deaton ed., Essays n the Theory and Measurement of Consumer Behavour n Honor of Sr Rchard Stone, Cambrdge: Cambrdge Unv. Press. Hahn J. and J. Hausman (1999), A New Specfcaton Test for the Valdty of Instrumental Varables, forthcomng Econometrca. Han A. (1987), Non-Parametrc Analyss of a Generalzed Regresson Model: The Maxmum Rank Correlaton Estmator, Journal of Econometrcs, 35: Han A. and J. Hausman (1990), Flexble Parametrc Estmaton of duraton and Competng Rsk Models, Journal of Appled Econometrcs, 5, Hausman, J. (1978), "Specfcaton Tests n Econometrcs", Econometrca, 46,

18 18 Hausman J., J. Abrevaya, and F. Scott-Morton (1998), Msclassfcaton of an Dependent Varable n a Dscrete-Response Settng, Journal of Econometrcs, 87, Hausman, J., H. Ichmura, W. Newey, and J. Powell (1991), "Measurement Errors n Polynomal Regresson Models," Journal of Econometrcs, 50, Hausman, J., W. Newey, and J. Powell (1995), "Measurement Errors n Polynomal Regresson Models," Journal of Econometrcs, 65, Koenker R. and G. Bassett, Regresson Quantles, Econometrca, 46, Leser, C.E.V. (1963), "Forms of Engel Functons", Econometrca, 31, Lvtan N. (1961), "Errors n Varables and Engel Curve Analyss", Econometrca, 29, Mas-Colell A., M. Whnston, and J. Green, Mcroeconomc Theory, Oxford Unv. Press, Mejer E. and T. Wansbeek, Measurement Error n a Sngle Regressor, Economc Letters, 69, Meyer B. (1990), Unemployment Insurance and Unemployment Spells, Econometrca, 58, Nagar, A.L. (1959): The Bas and Moment Matrx of the General k-class Estmators of the Parameters n Smultaneous Equatons, Econometrca, 27, Schennach S. (2000), Estmaton of Nonlnear Models wth Measurement Error, unpublshed MIT Ph.D. thess. Whte H. (1984), Asymptotc Theory for Econometrcans, New York: Academc Press.

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

Factor models with many assets: strong factors, weak factors, and the two-pass procedure

Factor models with many assets: strong factors, weak factors, and the two-pass procedure Factor models wth many assets: strong factors, weak factors, and the two-pass procedure Stanslav Anatolyev 1 Anna Mkusheva 2 1 CERGE-EI and NES 2 MIT December 2017 Stanslav Anatolyev and Anna Mkusheva

More information

Basically, if you have a dummy dependent variable you will be estimating a probability.

Basically, if you have a dummy dependent variable you will be estimating a probability. ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy

More information

Limited Dependent Variables

Limited Dependent Variables Lmted Dependent Varables. What f the left-hand sde varable s not a contnuous thng spread from mnus nfnty to plus nfnty? That s, gven a model = f (, β, ε, where a. s bounded below at zero, such as wages

More information

Chapter 20 Duration Analysis

Chapter 20 Duration Analysis Chapter 20 Duraton Analyss Duraton: tme elapsed untl a certan event occurs (weeks unemployed, months spent on welfare). Survval analyss: duraton of nterest s survval tme of a subject, begn n an ntal state

More information

Primer on High-Order Moment Estimators

Primer on High-Order Moment Estimators Prmer on Hgh-Order Moment Estmators Ton M. Whted July 2007 The Errors-n-Varables Model We wll start wth the classcal EIV for one msmeasured regressor. The general case s n Erckson and Whted Econometrc

More information

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition) Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes 25/6 Canddates Only January Examnatons 26 Student Number: Desk Number:...... DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR Department Module Code Module Ttle Exam Duraton

More information

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number

More information

Economics 130. Lecture 4 Simple Linear Regression Continued

Economics 130. Lecture 4 Simple Linear Regression Continued Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do

More information

January Examinations 2015

January Examinations 2015 24/5 Canddates Only January Examnatons 25 DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR STUDENT CANDIDATE NO.. Department Module Code Module Ttle Exam Duraton (n words)

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

LOGIT ANALYSIS. A.K. VASISHT Indian Agricultural Statistics Research Institute, Library Avenue, New Delhi

LOGIT ANALYSIS. A.K. VASISHT Indian Agricultural Statistics Research Institute, Library Avenue, New Delhi LOGIT ANALYSIS A.K. VASISHT Indan Agrcultural Statstcs Research Insttute, Lbrary Avenue, New Delh-0 02 amtvassht@asr.res.n. Introducton In dummy regresson varable models, t s assumed mplctly that the dependent

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

Andreas C. Drichoutis Agriculural University of Athens. Abstract

Andreas C. Drichoutis Agriculural University of Athens. Abstract Heteroskedastcty, the sngle crossng property and ordered response models Andreas C. Drchouts Agrculural Unversty of Athens Panagots Lazards Agrculural Unversty of Athens Rodolfo M. Nayga, Jr. Texas AMUnversty

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

Exam. Econometrics - Exam 1

Exam. Econometrics - Exam 1 Econometrcs - Exam 1 Exam Problem 1: (15 ponts) Suppose that the classcal regresson model apples but that the true value of the constant s zero. In order to answer the followng questons assume just one

More information

Linear Regression Analysis: Terminology and Notation

Linear Regression Analysis: Terminology and Notation ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented

More information

CHAPTER 8. Exercise Solutions

CHAPTER 8. Exercise Solutions CHAPTER 8 Exercse Solutons 77 Chapter 8, Exercse Solutons, Prncples of Econometrcs, 3e 78 EXERCISE 8. When = N N N ( x x) ( x x) ( x x) = = = N = = = N N N ( x ) ( ) ( ) ( x x ) x x x x x = = = = Chapter

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Maxmum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models

More information

III. Econometric Methodology Regression Analysis

III. Econometric Methodology Regression Analysis Page Econ07 Appled Econometrcs Topc : An Overvew of Regresson Analyss (Studenmund, Chapter ) I. The Nature and Scope of Econometrcs. Lot s of defntons of econometrcs. Nobel Prze Commttee Paul Samuelson,

More information

A Comparative Study for Estimation Parameters in Panel Data Model

A Comparative Study for Estimation Parameters in Panel Data Model A Comparatve Study for Estmaton Parameters n Panel Data Model Ahmed H. Youssef and Mohamed R. Abonazel hs paper examnes the panel data models when the regresson coeffcents are fxed random and mxed and

More information

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise. Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva Econ 39 - Statstcal Propertes of the OLS estmator Sanjaya DeSlva September, 008 1 Overvew Recall that the true regresson model s Y = β 0 + β 1 X + u (1) Applyng the OLS method to a sample of data, we estmate

More information

Properties of Least Squares

Properties of Least Squares Week 3 3.1 Smple Lnear Regresson Model 3. Propertes of Least Squares Estmators Y Y β 1 + β X + u weekly famly expendtures X weekly famly ncome For a gven level of x, the expected level of food expendtures

More information

e i is a random error

e i is a random error Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where + β + β e for,..., and are observable varables e s a random error How can an estmaton rule be constructed for the unknown

More information

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution Department of Statstcs Unversty of Toronto STA35HS / HS Desgn and Analyss of Experments Term Test - Wnter - Soluton February, Last Name: Frst Name: Student Number: Instructons: Tme: hours. Ads: a non-programmable

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

The Ordinary Least Squares (OLS) Estimator

The Ordinary Least Squares (OLS) Estimator The Ordnary Least Squares (OLS) Estmator 1 Regresson Analyss Regresson Analyss: a statstcal technque for nvestgatng and modelng the relatonshp between varables. Applcatons: Engneerng, the physcal and chemcal

More information

Chapter 3. Two-Variable Regression Model: The Problem of Estimation

Chapter 3. Two-Variable Regression Model: The Problem of Estimation Chapter 3. Two-Varable Regresson Model: The Problem of Estmaton Ordnary Least Squares Method (OLS) Recall that, PRF: Y = β 1 + β X + u Thus, snce PRF s not drectly observable, t s estmated by SRF; that

More information

ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE

ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE P a g e ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE Darmud O Drscoll ¹, Donald E. Ramrez ² ¹ Head of Department of Mathematcs and Computer Studes

More information

9. Binary Dependent Variables

9. Binary Dependent Variables 9. Bnar Dependent Varables 9. Homogeneous models Log, prob models Inference Tax preparers 9.2 Random effects models 9.3 Fxed effects models 9.4 Margnal models and GEE Appendx 9A - Lkelhood calculatons

More information

Limited Dependent Variables and Panel Data. Tibor Hanappi

Limited Dependent Variables and Panel Data. Tibor Hanappi Lmted Dependent Varables and Panel Data Tbor Hanapp 30.06.2010 Lmted Dependent Varables Dscrete: Varables that can take onl a countable number of values Censored/Truncated: Data ponts n some specfc range

More information

Midterm Examination. Regression and Forecasting Models

Midterm Examination. Regression and Forecasting Models IOMS Department Regresson and Forecastng Models Professor Wllam Greene Phone: 22.998.0876 Offce: KMC 7-90 Home page: people.stern.nyu.edu/wgreene Emal: wgreene@stern.nyu.edu Course web page: people.stern.nyu.edu/wgreene/regresson/outlne.htm

More information

Chapter 15 Student Lecture Notes 15-1

Chapter 15 Student Lecture Notes 15-1 Chapter 15 Student Lecture Notes 15-1 Basc Busness Statstcs (9 th Edton) Chapter 15 Multple Regresson Model Buldng 004 Prentce-Hall, Inc. Chap 15-1 Chapter Topcs The Quadratc Regresson Model Usng Transformatons

More information

Lectures - Week 4 Matrix norms, Conditioning, Vector Spaces, Linear Independence, Spanning sets and Basis, Null space and Range of a Matrix

Lectures - Week 4 Matrix norms, Conditioning, Vector Spaces, Linear Independence, Spanning sets and Basis, Null space and Range of a Matrix Lectures - Week 4 Matrx norms, Condtonng, Vector Spaces, Lnear Independence, Spannng sets and Bass, Null space and Range of a Matrx Matrx Norms Now we turn to assocatng a number to each matrx. We could

More information

Statistics II Final Exam 26/6/18

Statistics II Final Exam 26/6/18 Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the

More information

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Mamum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models for

More information

The Geometry of Logit and Probit

The Geometry of Logit and Probit The Geometry of Logt and Probt Ths short note s meant as a supplement to Chapters and 3 of Spatal Models of Parlamentary Votng and the notaton and reference to fgures n the text below s to those two chapters.

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

Continuous vs. Discrete Goods

Continuous vs. Discrete Goods CE 651 Transportaton Economcs Charsma Choudhury Lecture 3-4 Analyss of Demand Contnuous vs. Dscrete Goods Contnuous Goods Dscrete Goods x auto 1 Indfference u curves 3 u u 1 x 1 0 1 bus Outlne Data Modelng

More information

NUMERICAL DIFFERENTIATION

NUMERICAL DIFFERENTIATION NUMERICAL DIFFERENTIATION 1 Introducton Dfferentaton s a method to compute the rate at whch a dependent output y changes wth respect to the change n the ndependent nput x. Ths rate of change s called the

More information

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems Numercal Analyss by Dr. Anta Pal Assstant Professor Department of Mathematcs Natonal Insttute of Technology Durgapur Durgapur-713209 emal: anta.bue@gmal.com 1 . Chapter 5 Soluton of System of Lnear Equatons

More information

Statistics Chapter 4

Statistics Chapter 4 Statstcs Chapter 4 "There are three knds of les: les, damned les, and statstcs." Benjamn Dsrael, 1895 (Brtsh statesman) Gaussan Dstrbuton, 4-1 If a measurement s repeated many tmes a statstcal treatment

More information

4.3 Poisson Regression

4.3 Poisson Regression of teratvely reweghted least squares regressons (the IRLS algorthm). We do wthout gvng further detals, but nstead focus on the practcal applcaton. > glm(survval~log(weght)+age, famly="bnomal", data=baby)

More information

Econ107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10)

Econ107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10) I. Defnton and Problems Econ7 Appled Econometrcs Topc 9: Heteroskedastcty (Studenmund, Chapter ) We now relax another classcal assumpton. Ths s a problem that arses often wth cross sectons of ndvduals,

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

Now we relax this assumption and allow that the error variance depends on the independent variables, i.e., heteroskedasticity

Now we relax this assumption and allow that the error variance depends on the independent variables, i.e., heteroskedasticity ECON 48 / WH Hong Heteroskedastcty. Consequences of Heteroskedastcty for OLS Assumpton MLR. 5: Homoskedastcty var ( u x ) = σ Now we relax ths assumpton and allow that the error varance depends on the

More information

Polynomial Regression Models

Polynomial Regression Models LINEAR REGRESSION ANALYSIS MODULE XII Lecture - 6 Polynomal Regresson Models Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Test of sgnfcance To test the sgnfcance

More information

Estimation: Part 2. Chapter GREG estimation

Estimation: Part 2. Chapter GREG estimation Chapter 9 Estmaton: Part 2 9. GREG estmaton In Chapter 8, we have seen that the regresson estmator s an effcent estmator when there s a lnear relatonshp between y and x. In ths chapter, we generalzed the

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

An Introduction to Censoring, Truncation and Sample Selection Problems

An Introduction to Censoring, Truncation and Sample Selection Problems An Introducton to Censorng, Truncaton and Sample Selecton Problems Thomas Crossley SPIDA June 2003 1 A. Introducton A.1 Basc Ideas Most of the statstcal technques we study are for estmatng (populaton)

More information

Lecture 4 Hypothesis Testing

Lecture 4 Hypothesis Testing Lecture 4 Hypothess Testng We may wsh to test pror hypotheses about the coeffcents we estmate. We can use the estmates to test whether the data rejects our hypothess. An example mght be that we wsh to

More information

Inference from Data Partitions

Inference from Data Partitions Inference from Data Parttons Rcardo Bórquez and Melvn Hnch May 18, 2010 1 Introducton Consder a statonary process X = {X t, t = 1, 2,..., T } defned on (Ω, F, {F t }, P ) where F denotes the Borel sets,

More information

STAT 405 BIOSTATISTICS (Fall 2016) Handout 15 Introduction to Logistic Regression

STAT 405 BIOSTATISTICS (Fall 2016) Handout 15 Introduction to Logistic Regression STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson Ths handout covers materal found n Secton 3.7 of your text. You may also want to revew regresson technques n Chapter. In ths handout,

More information

Inner Product. Euclidean Space. Orthonormal Basis. Orthogonal

Inner Product. Euclidean Space. Orthonormal Basis. Orthogonal Inner Product Defnton 1 () A Eucldean space s a fnte-dmensonal vector space over the reals R, wth an nner product,. Defnton 2 (Inner Product) An nner product, on a real vector space X s a symmetrc, blnear,

More information

Kernel Methods and SVMs Extension

Kernel Methods and SVMs Extension Kernel Methods and SVMs Extenson The purpose of ths document s to revew materal covered n Machne Learnng 1 Supervsed Learnng regardng support vector machnes (SVMs). Ths document also provdes a general

More information

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests Smulated of the Cramér-von Mses Goodness-of-Ft Tests Steele, M., Chaselng, J. and 3 Hurst, C. School of Mathematcal and Physcal Scences, James Cook Unversty, Australan School of Envronmental Studes, Grffth

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 14 Multple Regresson Models 1999 Prentce-Hall, Inc. Chap. 14-1 Chapter Topcs The Multple Regresson Model Contrbuton of Indvdual Independent Varables

More information

Composite Hypotheses testing

Composite Hypotheses testing Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE)

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE) ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE) June 7, 016 15:30 Frst famly name: Name: DNI/ID: Moble: Second famly Name: GECO/GADE: Instructor: E-mal: Queston 1 A B C Blank Queston A B C Blank Queston

More information

[ ] λ λ λ. Multicollinearity. multicollinearity Ragnar Frisch (1934) perfect exact. collinearity. multicollinearity. exact

[ ] λ λ λ. Multicollinearity. multicollinearity Ragnar Frisch (1934) perfect exact. collinearity. multicollinearity. exact Multcollnearty multcollnearty Ragnar Frsch (934 perfect exact collnearty multcollnearty K exact λ λ λ K K x+ x+ + x 0 0.. λ, λ, λk 0 0.. x perfect ntercorrelated λ λ λ x+ x+ + KxK + v 0 0.. v 3 y β + β

More information

Chapter 6. Supplemental Text Material

Chapter 6. Supplemental Text Material Chapter 6. Supplemental Text Materal S6-. actor Effect Estmates are Least Squares Estmates We have gven heurstc or ntutve explanatons of how the estmates of the factor effects are obtaned n the textboo.

More information

UNR Joint Economics Working Paper Series Working Paper No Further Analysis of the Zipf Law: Does the Rank-Size Rule Really Exist?

UNR Joint Economics Working Paper Series Working Paper No Further Analysis of the Zipf Law: Does the Rank-Size Rule Really Exist? UNR Jont Economcs Workng Paper Seres Workng Paper No. 08-005 Further Analyss of the Zpf Law: Does the Rank-Sze Rule Really Exst? Fungsa Nota and Shunfeng Song Department of Economcs /030 Unversty of Nevada,

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an

More information

Global Sensitivity. Tuesday 20 th February, 2018

Global Sensitivity. Tuesday 20 th February, 2018 Global Senstvty Tuesday 2 th February, 28 ) Local Senstvty Most senstvty analyses [] are based on local estmates of senstvty, typcally by expandng the response n a Taylor seres about some specfc values

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VIII LECTURE - 34 ANALYSIS OF VARIANCE IN RANDOM-EFFECTS MODEL AND MIXED-EFFECTS EFFECTS MODEL Dr Shalabh Department of Mathematcs and Statstcs Indan

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrcs of Panel Data Jakub Mućk Meetng # 8 Jakub Mućk Econometrcs of Panel Data Meetng # 8 1 / 17 Outlne 1 Heterogenety n the slope coeffcents 2 Seemngly Unrelated Regresson (SUR) 3 Swamy s random

More information

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation Econ 388 R. Butler 204 revsons Lecture 4 Dummy Dependent Varables I. Lnear Probablty Model: the Regresson model wth a dummy varables as the dependent varable assumpton, mplcaton regular multple regresson

More information

3/3/2014. CDS M Phil Econometrics. Vijayamohanan Pillai N. CDS Mphil Econometrics Vijayamohan. 3-Mar-14. CDS M Phil Econometrics.

3/3/2014. CDS M Phil Econometrics. Vijayamohanan Pillai N. CDS Mphil Econometrics Vijayamohan. 3-Mar-14. CDS M Phil Econometrics. Dummy varable Models an Plla N Dummy X-varables Dummy Y-varables Dummy X-varables Dummy X-varables Dummy varable: varable assumng values 0 and to ndcate some attrbutes To classfy data nto mutually exclusve

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

THE CHINESE REMAINDER THEOREM. We should thank the Chinese for their wonderful remainder theorem. Glenn Stevens

THE CHINESE REMAINDER THEOREM. We should thank the Chinese for their wonderful remainder theorem. Glenn Stevens THE CHINESE REMAINDER THEOREM KEITH CONRAD We should thank the Chnese for ther wonderful remander theorem. Glenn Stevens 1. Introducton The Chnese remander theorem says we can unquely solve any par of

More information

Testing for seasonal unit roots in heterogeneous panels

Testing for seasonal unit roots in heterogeneous panels Testng for seasonal unt roots n heterogeneous panels Jesus Otero * Facultad de Economía Unversdad del Rosaro, Colomba Jeremy Smth Department of Economcs Unversty of arwck Monca Gulett Aston Busness School

More information

THE ROYAL STATISTICAL SOCIETY 2006 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE

THE ROYAL STATISTICAL SOCIETY 2006 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE THE ROYAL STATISTICAL SOCIETY 6 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER I STATISTICAL THEORY The Socety provdes these solutons to assst canddates preparng for the eamnatons n future years and for

More information

Linear Approximation with Regularization and Moving Least Squares

Linear Approximation with Regularization and Moving Least Squares Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...

More information

Conjugacy and the Exponential Family

Conjugacy and the Exponential Family CS281B/Stat241B: Advanced Topcs n Learnng & Decson Makng Conjugacy and the Exponental Famly Lecturer: Mchael I. Jordan Scrbes: Bran Mlch 1 Conjugacy In the prevous lecture, we saw conjugate prors for the

More information

Workshop: Approximating energies and wave functions Quantum aspects of physical chemistry

Workshop: Approximating energies and wave functions Quantum aspects of physical chemistry Workshop: Approxmatng energes and wave functons Quantum aspects of physcal chemstry http://quantum.bu.edu/pltl/6/6.pdf Last updated Thursday, November 7, 25 7:9:5-5: Copyrght 25 Dan Dll (dan@bu.edu) Department

More information

CHAPER 11: HETEROSCEDASTICITY: WHAT HAPPENS WHEN ERROR VARIANCE IS NONCONSTANT?

CHAPER 11: HETEROSCEDASTICITY: WHAT HAPPENS WHEN ERROR VARIANCE IS NONCONSTANT? Basc Econometrcs, Gujarat and Porter CHAPER 11: HETEROSCEDASTICITY: WHAT HAPPENS WHEN ERROR VARIANCE IS NONCONSTANT? 11.1 (a) False. The estmators are unbased but are neffcent. (b) True. See Sec. 11.4

More information

Systems of Equations (SUR, GMM, and 3SLS)

Systems of Equations (SUR, GMM, and 3SLS) Lecture otes on Advanced Econometrcs Takash Yamano Fall Semester 4 Lecture 4: Sstems of Equatons (SUR, MM, and 3SLS) Seemngl Unrelated Regresson (SUR) Model Consder a set of lnear equatons: $ + ɛ $ + ɛ

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

JAB Chain. Long-tail claims development. ASTIN - September 2005 B.Verdier A. Klinger

JAB Chain. Long-tail claims development. ASTIN - September 2005 B.Verdier A. Klinger JAB Chan Long-tal clams development ASTIN - September 2005 B.Verder A. Klnger Outlne Chan Ladder : comments A frst soluton: Munch Chan Ladder JAB Chan Chan Ladder: Comments Black lne: average pad to ncurred

More information

Statistics for Business and Economics

Statistics for Business and Economics Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear

More information

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data Lab : TWO-LEVEL NORMAL MODELS wth school chldren popularty data Purpose: Introduce basc two-level models for normally dstrbuted responses usng STATA. In partcular, we dscuss Random ntercept models wthout

More information

= z 20 z n. (k 20) + 4 z k = 4

= z 20 z n. (k 20) + 4 z k = 4 Problem Set #7 solutons 7.2.. (a Fnd the coeffcent of z k n (z + z 5 + z 6 + z 7 + 5, k 20. We use the known seres expanson ( n+l ( z l l z n below: (z + z 5 + z 6 + z 7 + 5 (z 5 ( + z + z 2 + z + 5 5

More information

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module 3 LOSSY IMAGE COMPRESSION SYSTEMS Verson ECE IIT, Kharagpur Lesson 6 Theory of Quantzaton Verson ECE IIT, Kharagpur Instructonal Objectves At the end of ths lesson, the students should be able to:

More information

Lecture 12: Discrete Laplacian

Lecture 12: Discrete Laplacian Lecture 12: Dscrete Laplacan Scrbe: Tanye Lu Our goal s to come up wth a dscrete verson of Laplacan operator for trangulated surfaces, so that we can use t n practce to solve related problems We are mostly

More information

A Monte Carlo Study for Swamy s Estimate of Random Coefficient Panel Data Model

A Monte Carlo Study for Swamy s Estimate of Random Coefficient Panel Data Model A Monte Carlo Study for Swamy s Estmate of Random Coeffcent Panel Data Model Aman Mousa, Ahmed H. Youssef and Mohamed R. Abonazel Department of Appled Statstcs and Econometrcs, Instute of Statstcal Studes

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information