Inference under shape restrictions

Size: px
Start display at page:

Download "Inference under shape restrictions"

Transcription

1 Iferece uder shape restrictios Joachim Freyberger Brado Reeves July 3, 207 Abstract We propose a uiformly valid iferece method for a ukow fuctio or parameter vector satisfyig certai shape restrictios. The method applies very geerally, amely to a wide rage of fiite dimesioal ad oparametric problems, such as regressios or istrumetal variable estimatio, to both kerel or series estimators, ad to may differet shape restrictios. A major applicatio of our iferece method is to costruct uiform cofidece bads for a ukow fuctio of iterest. Our cofidece bads are asymptotically equivalet to stadard urestricted cofidece bads if the true fuctio strictly satisfies all shape restrictios, but they ca be much smaller if some of the shape restrictios are bidig or close to bidig. We illustrate these sizable width gais as well as the wide applicability of our method i Mote Carlo simulatios ad i a empirical applicatio. Keywords: Shape restrictios, iferece, oparametric, uiform cofidece bads. We thak Richard Bludell, Iva Caay, Bruce Hase, Joel Horowitz, Philipp Ketz, Matt Maste, Fracesca Moliari, Taisuke Otsu, Jack Porter, Azeem Shaikh, Xiaoxia Shi, Alex Torgovitsky, Daiel Wilhelm, ad semiar particpats at UW Madiso, UCL, LSE, Bosto College, Northwester Uiversity, ad Humboldt Uiversity for helpful commets ad discussios. We also thak Richard Bludell, Joel Horowitz, ad Matthias Parey for sharig their data. Departmet of Ecoomics, Uiversity of Wiscosi - Madiso. jfreyberger@ssc.wisc.edu. Departmet of Ecoomics, Uiversity of Wiscosi - Madiso. breeves@wisc.edu.

2 Itroductio Researchers ca ofte use either parametric or oparametric methods to estimate the parameters of a model. Parametric estimators have favorable properties, such as good fiite sample precisio ad fast rates of covergece, ad it is usually straightforward to use them for iferece. However, parametric models are ofte misspecified. Specifically, ecoomic theory rarely implies a particular fuctioal form, such as a liear or quadratic demad fuctio, ad coclusios draw from a icorrect parametric model ca be misleadig. Noparametric methods, o the other had, do ot impose strog fuctioal form assumptios, but as a cosequece, cofidece itervals obtaied from them are ofte much wider. I this paper we explore shape restrictios i order to restrict the class of fuctios but without imposig arbitrary parametric assumptios. Shape restrictios are ofte reasoable assumptios, such as assumig that the retur to eductio is positive, ad they ca be implied by ecoomic theory. For example, demad fuctios are geerally mootoically decreasig i prices, cost fuctios are mootoe icreasig, homogeeous of degree, ad cocave i iput prices, Egel curves of ormal goods are mootoically icreasig, ad utility fuctios of risk averse agets are cocave. There is a log history of estimatio uder shape restrictios i ecoometrics ad statistics, goig back to Hildreth 954 ad Bruk 955, ad obtaiig shape restricted estimators is simple i may settigs. Moreover, shape restricted estimators ca have much better fiite sample properties, such as lower mea squared errors, compared to urestricted estimators. Oe would therefore hope that the improved fiite sample precisio traslates to smaller cofidece sets. Usig shape restrictios for iferece is much more complicated tha simply obtaiig a restricted estimator. The mai reaso is that the distributio of the restricted estimator depeds o where the shape restrictios bid, which is ukow a priori. I this paper we propose a uiformly valid iferece method, which icorporates shape restrictios ad ca be used to test hypotheses about a ukow fuctio or parameter vector. The method applies very geerally, amely to a wide rage of fiite dimesioal ad oparametric problems, such as regressios or istrumetal variable estimatio, to both kerel or series estimators, ad to may differet shape restrictios. Oe major applicatio of our iferece method is to costruct uiform cofidece bads for a fuctio. Such a bad cosists of a lower boud fuctio ad a upper boud fuctio such that the true fuctio is betwee them with at least a pre-specified probability. Our cofidece bads have desirable properties. I particular, they are asymptotically equivalet to stadard urestricted cofidece bads if the true fuctio strictly satisfies all shape 2

3 restrictios e.g. if the true fuctio is strictly icreasig but the shape restrictio is that it is weakly icreasig. However, if for the true fuctio some of the shape restrictios are bidig or close to bidig, the cofidece bads are geerally much smaller. decrease i the width reflects the icreased precisio of the costraied estimator. Moreover, the bads always iclude the shape restricted estimator of the fuctio ad are therefore ever empty. Fially, the proposed method provides uiformly valid iferece over a large class of distributios, which i particular implies that the cofidece bads do ot suffer from uder-coverage if some of the shape restrictios are close to bidig. These cases are empirically relevat. For example, demad fuctios are likely to be strictly decreasig, but oparametric estimates are ofte ot mootoe, suggestig that the demad fuctio is close to costat for some prices. To the best of our kowledge, eve i a regressio model uder mootoicity, there are o existig uiform cofidece bads, which are ever empty, uiformly valid, ad yield width reductios whe the shape restrictios are bidig or close to bidig. Furthermore, our method applies very geerally. For example, our paper is the first to provide such iferece results for the oparametric istrumetal variables NPIV model uder geeral shape costraits. Similar to may other iferece problems i ostadard settigs, istead of tryig to obtai cofidece sets directly from the asymptotic distributio of the estimator, our iferece procedure is based o test iversio. 2 The This meas that we start by testig the ull hypothesis that the true parameter vector θ 0 is equal to some fixed value θ. I series estimatio θ 0 represets the coefficiets i the series approximatio of a fuctio ad θ 0 ca therefore grow i dimesio as the sample size icreases. The major advatage of the test iversio approach is that uder the ull hypothesis we kow exactly which of the shape restrictios are bidig or close to bidig. Therefore, uder the ull hypothesis, we ca approximate the distributio of the estimator i large samples ad we ca decide whether or ot we reject the ull hypothesis. We ca the collect all values for which we do ot reject, which form a cofidece set for θ 0. To obtai uiform cofidece bads, or cofidece sets for other fuctios of θ 0, we project o the cofidece set for θ 0 see Sectio 2 for a simple illustratio. We choose the test statistic i such a way that our cofidece bads are asymptotically equivalet to stadard urestricted cofidece bads if θ 0 is sufficietly i the iterior of the parameter space. Thus, i this case, the cofidece bads have the right coverage asymptotically. If Aalogously to may other papers, closeess to the boudary is relative to the sample size. 2 Other ostadard iferece settigs iclude autoregressive models e.g. Mikusheva 2007, weak idetificatio e.g. Adrews ad Cheg 202, ad partial idetificatio e.g. Adrews ad Soares

4 some of the shape restrictios are bidig or close to bidig, our iferece procedure will geerally be coservative due to the projectio. However, i these cases we also obtai very sizable width gais compared to a stadard urestricted bad. Furthermore, due to test iversio ad projectios, our iferece method ca be computatioally demadig. We give a sese of the computatioal costs i Sectio 6. We also briefly describe recet computatioal advaces, which might help to mitigate these costs. I Mote Carlo simulatios we costruct uiform cofidece bads i a series regressio framework as well as i the NPIV model uder a mootoicity costrait. I the NPIV model the gais of usig shape restrictios are geerally much higher. For example, we show that with a fourth order polyomial approximatio of the true fuctio, the average width gais ca be up to 73%, depedig o the slope of the true fuctio. We also provide a empirical applicatio, where we estimate demad fuctios for gasolie, subject to the fuctios beig weakly decreasig. The width gais from usig shape restrictios are betwee 25% ad 45% i this settig. We ow explai how our paper fits ito the related literature. There is a vast literature o estimatio uder shape restrictios goig back to Hildreth 954 ad Bruk 955 who suggest estimators uder cocavity ad mootoicity restrictios, respectively. Other related work icludes, amog may others, Mukerjee 988, Dierckx 980, Ramsay 988, Mamme 99a, Mamme 99b, Mamme ad Thomas-Aga 999, Hall ad Huag 200, Haag, Hoderlei, ad Pedakur 2009, Du, Parmeter, ad Racie 203, ad Wag ad She 203. See also Delecroix ad Thomas-Aga 2000 ad Hederso ad Parmeter 2009 for additioal refereces. May of the early papers focus o implemetatio issues ad subsequet papers discuss rates of covergece of shape restricted estimators. May iferece results, such as those by Mamme 99b, Groeeboom, Jogbloed, ad Weller 200, Dette, Neumeyer, ad Pilz 2006, Birke ad Dette 2007, ad Pal ad Woodroofe 2007 are for poits of the fuctio where the shape restrictios do ot bid. It is also well kow that a shape restricted estimator has a ostadard distributio if the shape restrictios bid; see for example Wright 98 ad Geyer 994. Freyberger ad Horowitz 205 provide iferece methods i a partially idetified NPIV model uder shape restrictios with discrete regressors ad istrumets. Empirical applicatios iclude Matzki 994, Lewbel 995, Ait-Sahalia ad Duarte 2003 ad Bludell, Horowitz, ad Parey 202, 207. There is also a iterestig literature o risk bouds e.g. Zhag 2002, Chatterjee, Gutuboyia, ad Se 205, Chetverikov ad Wilhelm 207 showig, amog others, that a restricted estimator ca have a faster rate of covergece tha a urestricted estimator whe the 4

5 true fuctio is close to the boudary. Specifically, the results i Chetverikov ad Wilhelm 207 imply that a mootoe estimator i the NPIV settig does ot suffer from a slow rate of covergece due to the ill-posed iverse problem if the true fuctio is close to costat. There is also a large, less related literature o testig shape restrictios. Usig existig methods, uiform cofidece bads uder shape restrictios ca be obtaied i three distict ways. First, oe could obtai a stadard urestricted cofidece bad ad itersect it with all fuctios which satisfy the shape restrictios see for example Dümbge 998, A drawback of the resultig bads is that they ca be empty with positive probability ad hece, they do ot satisfy the reasoableess property of Müller ad Norets 206. Furthermore, i our simulatios, our bads are o average much arrower tha such mootoized bads. The secod possibility is to use the rearragemet approach of Cherozhukov, Feradez-Val, ad Galicho 2009, which works with mootoicity restrictios ad is very easy to implemet. However, the average width does ot chage by rearragig the bad. Fially, oe could use a two step procedure recetly suggested by Horowitz ad Lee 207 i a kerel regressio framework with very geeral costraits. I the first step, they estimate the poits where the shape restrictios bid. I the secod step, they estimate the fuctio uder equality costraits ad hece, they obtai a asymptotically ormally distributed estimator, which they ca use to obtai uiform cofidece bads. While their approach is computatioally much simpler tha ours, their mai result leads to bads which ca suffer from uder-coverage if some of the shape restrictios are close to bidig. They also suggest usig a bias correctio term to improve the fiite sample coverage probability, but they do ot provide ay theoretical results for this method. Cherozhukov, Newey, ad Satos 205 develop a geeral testig procedure, which allows, amog others, testig shape restrictios ad obtaiig cofidece regios for fuctioals uder shape restrictios. Eve though there is some overlap i the settigs where both methods apply, the techical argumets are very differet. Their method is also based o test iversio ad it is robust to partial idetificatio, but it is restricted to coditioal momet models ad series estimators. We allow for a geeral setup ad estimators, but we assume poit idetificatio. Sice our focus is o testig a growig parameter vector, we are able to obtai uiform cofidece bads ext to cofidece sets for fuctioals. However, if the mai object of iterest is a sigle fuctioal, their approach might be computatioally simpler because they test fixed values of the fuctioal directly, rather tha projectig o a cofidece set for the etire parameter vector. Fially, our paper builds o previous work o iferece i ostadard problems, most 5

6 importatly the papers of Adrews 999, 200 o estimatio ad testig whe a parameter is o the boudary of the parameter space. The mai differece of our paper to Adrews work is that we allow testig for a growig parameter vector while Adrews cosiders a vector of a fixed dimesio. Moreover, we show that our iferece method is uiformly valid whe the parameters ca be either at the boudary, close to the boudary, or away from the boudary. We also use differet test statistics because we ivert them to obtai cofidece bads. Thus, while the geeral approach is similar, the details of the argumets are very differet. Ketz 207 has a similar setup as Adrews but allows for certai parameter sequeces that are close to the boudary uder o-egativity costraits. Outlie: The remaider of the paper is orgaized as follows. We start by illustratig the most importat features of our iferece approach i a very simple example. Sectio 3 discusses a geeral settig, icludig high level assumptios for uiformly valid iferece. Sectios 4 ad 5 provide low level coditios i a regressio framework for both series ad kerel estimatio ad the NPIV model, respectively. The remaiig sectios cotai Mote Carlo simulatios, the empirical applicatio, ad a coclusio. Proofs of the results from Sectios 4 ad 5, computatioal details, ad additioal simulatio results are i a supplemetary appedix with sectio umbers S., S.2, etc.. Notatio: For ay matrix A, A deotes the Frobeius orm. For ay square matrix A, A S = sup x = Ax deotes the spectral orm. For a positive semi-defiite matrix Ω ad a vector a let a Ω = a Ωa. Let λ mi A ad λ max A deote the smallest ad the largest eigevalue of a symmetric square matrix A. For a sequece of radom variables X ad a class of distributios P we say that X = o p ε uiformly over P P if sup P P P X δε 0 for ay δ > 0. We say that X = O p ε uiformly over P P if for ay δ > 0 there are M δ ad N δ such that sup P P P X M δ ε δ for all N δ. 2 Illustrative example We ow illustrate the mai features of our method i a very simple example. We the explai how these ideas ca easily be geeralized before itroducig the geeral setup i Sectio 3. Suppose that X Nθ 0, I 2 2 ad that we observe a radom sample {X i } i= of X. Deote the sample average by X. We are iterested i estimatig θ 0 uder the assumptio that θ 0, θ 0,2. A urestricted estimator of θ 0, deoted by ˆθ ur, is ˆθ ur = arg mi θ R 2 θ X 2 + θ 2 X

7 Hece ˆθ ur = X. Aalogously, a restricted estimator is ˆθ r = arg mi θ R 2 : θ θ 2 θ X 2 + θ 2 X 2 2 = arg mi θ R 2 : θ θ 2 0 = arg mi θ R 2 : θ θ 2 0 θ ˆθ ur 2 θ θ 0 ˆθ ur θ 0 2. Let λ = θ θ 0. From a chage of variables it the follows that ˆθr θ 0 = arg mi λ R 2 : λ λ 2 θ 0,2 θ 0, Let Z N0, I 2 2. Sice ˆθ ur θ 0 N0, I 2 2 we get λ ˆθ ur θ 0 2. ˆθr θ 0 d = arg mi λ R 2 : λ λ 2 θ 0,2 θ 0, λ Z 2, where d = meas that the radom variables o the left ad right side have the same distributio. Notice that while the distributio of ˆθ ur θ 0 does ot deped o θ 0 ad, the distributio of ˆθ r θ 0 depeds o θ 0,2 θ 0,, which measures how close θ 0 is to the boudary of the parameter space relative to. We deote a radom variable which has the same distributio as ˆθ r θ 0 by Z θ 0. As a example, suppose that θ 0, = θ 0,2. The Z θ 0 is the projectio of Z o the set {z R 2 : z z 2 }. A 95% cofidece regio for θ 0 usig the urestricted estimator ca be costructed by fidig the costat c ur such that P max{ Z, Z 2 } c ur = It the follows immediately that P ˆθ ur, c ur θ 0, ˆθ ur, + c ur ad ˆθ ur,2 c ur θ 0,2 ˆθ ur,2 + c ur = Thus CI ur = { θ R 2 : ˆθur, c ur θ ˆθ ur, + c ur ad ˆθ ur,2 c ur θ 2 ˆθ ur,2 + c } ur is a 95% cofidece set for θ 0. While there are may differet 95% cofidece regios for θ 0, rectagular regios are particularly easy to report especially i larger dimesios, because oe oly has to report the extreme poits of each coordiate. 7

8 Similarly, ow lookig at the restricted estimator, for each θ R 2 let c r, θ be such that ad defie CI r as { θ R 2 : θ θ 2, ˆθ r, c r,θ Agai, by costructio P θ 0 CI r = P max{ Z, θ, Z,2 θ } c r, θ = θ ˆθ r, + c r,θ, ˆθ r,2 c r,θ θ 2 ˆθ r,2 + c } r,θ. Figure illustrates the relatio betwee c ur ad c r, θ. The first pael shows a radom sample of Z. The dashed square cotais all z R 2 such that max{ z, z 2 } c ur. The secod pael displays the correspodig radom sample of Z θ 0 whe θ 0,2 θ 0, = 0, Figure : Scatter plots of samples illustratig relatio betwee critical values 4 Urestricted 4 p 30;2! 3 0; = Z2 0 Z Z Z p 30;2! 3 0; = p 30;2! 3 0; = Z2 0 Z Z Z 8

9 which is simply the projectio of Z o the set {z R 2 : z z 2 }. I particular, for each realizatio z we have z θ 0 = z if z z 2 ad z θ 0 = 0.5z + z 2, z + z 2 if z > z 2. Therefore, if max{ z, z 2 } c ur, the also max{ z, θ 0, z,2 θ 0 } c ur, which immediately implies that c r, θ 0 c ur. The solid square cotais all z R 2 such that max{ z, z 2 } c r, θ 0, which is strictly iside the dashed square. The third ad fourth pael show a similar situatios with θ 0,2 θ 0, = ad θ 0,2 θ 0, = 5, respectively. As θ 0,2 θ 0, icreases, the percetage projected o the solid lie decreases ad therefore c r, θ 0 gets closer to c ur. Moreover, oce θ 0,2 θ 0, is large eough, c r, θ 0 = c ur. Figure 2 shows the resultig cofidece regios for θ 0 whe = 00, coditioal o specific realizatios of ˆθ ur ad ˆθ r. The cofidece sets deped o these realizatios, but give ˆθ ur ad ˆθ r, they do ot deped o θ 0. The dashed red square is CI ur ad the solid Figure 2: Cofidece regios 0.6 ^3 ur = 0; 0 0 ad ^3 r = 0; ^3 ur = 0; 0: 0 ad ^3 r = 0; 0: ^3 ur = 0; 0:3 0 ad ^3 r = 0; 0: ^3 ur = 0:;!0: 0 ad ^3 r = 0;

10 blue lies are the boudary of CI r. I the first pael ˆθ ur = ˆθ r = 0, 0. Sice ˆθ ur = ˆθ r ad c r, θ c ur for all θ R 2, it holds that CI r CI ur. Also otice that sice c r, θ depeds o θ, CI r is ot a triagle as opposed to the set CI ur {θ R : θ θ 2 }. The secod ad the third pael display similar situatios with ˆθ ur = ˆθ r = 0, 0. ad ˆθ ur = ˆθ r = 0, 0.3, respectively. I both cases, CI r CI ur. It also follows from the previous discussio that if ˆθ ur = ˆθ r ad if ˆθ ur,2 ˆθ ur, is large eough the CI ur = CI r. Cosequetly, for ay fixed θ 0 with θ 0, < θ 0,2, it holds that P CI r = CI ur. However, this equivalece does ot hold if θ 0 is at the boudary or close to the boudary. Furthermore, it the holds with positive probability that CI ur {θ R : θ θ 2 } =, while CI r always cotais ˆθ r. The fourth pael illustrates that if ˆθ ur ˆθ r, the CI r is ot a subset of CI ur. The set CI r is a exact 95% cofidece set for θ 0, but it caot simply be characterized by its extreme poits ad it ca be hard to report with more tha two dimesios. Nevertheless, we ca use it to costruct a rectagular cofidece set. To do so, for j =, 2 defie ˆθ L r,j = mi θ CI r θ j ad ˆθU r,j = max θ CI r θ j ad CI r = { θ R 2 : θ θ 2 ad ˆθ r, L θ ˆθ r, U ad ˆθ r,2 L θ 2 ˆθ } r,2 U. It the holds by costructio that CI r CI r ad therefore P θ 0 CI r Moreover, just as before, if ˆθ ur = ˆθ r, the CI r CI ur. If for example ˆθ ur = ˆθ r = 0, 0, the ˆθ r,2 U = ˆθ r, L = c ur / but ˆθ r, U = ˆθ r,2 L < c ur /, which ca be see from the first pael of Figure 2. Hece, relative to the cofidece set from the urestricted estimator, we obtai width gais for the upper ed of the first dimesio ad the lower ed of the secod dimesio. The width gais decrease as ˆθ ur moves away from the boudary ito the iterior of Θ R. Moreover, for ay ˆθ ur ad ˆθ r ad j =, 2 we get ˆθ r,j U ˆθ r,j L 2c ur /. Thus, the sides of the square {θ R 2 : ˆθ r, L θ ˆθ r, U ad ˆθ r,2 L θ 2 ˆθ r,2} U are ever loger tha the sides of the square CI ur. Fially, if ˆθ ur is sufficietly i the iterior of Θ R, the CI r = CI ur, which is a importat feature of our iferece method. We get this equivalece i the iterior of Θ R because we ivert a test based o a particular type of test statistic, amely max{ Z, Z 2 }. If we started out with a differet test statistic, such as Z 2 + Z2, 2 we would ot obtai CI r = CI ur i the iterior of Θ R. We retur to this result more geerally i Sectio 3.2 ad discuss possible alterative ways of costructig cofidece regios i Sectio 8. This method of costructig cofidece sets is easy to geeralize. As a first step, let Θ R be a restricted parameter space ad let Q θ be a populatio objective fuctio. Suppose that the urestricted estimator ˆθ ur miimizes Q θ. Also suppose that Q θ is a quadratic 0

11 fuctio i θ which implies that 2 Q θ does ot deped o θ. The with ˆΩ = 2 Q θ we get Q θ = Q ˆθ ur + Q ˆθ ur θ ˆθ ur + 2 θ ˆθ ur ˆΩθ ˆθur ad sice Q ˆθ ur = 0 it holds that ˆθ r = arg mi θ Θ R θ ˆθ ur 2ˆΩ. Hece, ˆθ r is simply the projectio of ˆθ ur o Θ R. Thus, just as before, we ca use a chage of variables ad the characterize the distributio of ˆθ r θ 0 i terms of the distributio of ˆθ ur θ 0 ad a local parameter space that depeds o θ 0 ad. 3 Geeral setup I this sectio we discuss a geeral framework ad provide coditios for uiformly valid iferece. We start with a iformal overview of the iferece method ad provide the formal assumptios ad results i Sectio 3.. regios for geeral fuctios of the parameter vector. Let Θ R K I Sectio 3.2 we discuss rectagular cofidece be the parameter space ad let Θ R Θ be a restricted parameter space. Ifereces focuses o θ 0 Θ R. I a example discussed i Sectio 4.2 we have θ 0 = EY X = x... EY X = x K, ad K icreases with the sample size. I this case, the cofidece regios we obtai are aalogous to the oes i the simple example above. For series estimatio we take θ 0 R K such that g 0 x px θ 0, where g 0 is a ukow fuctio of iterest ad px is a vector of basis fuctios. A rectagular cofidece regio for certai fuctios of θ 0 ca the be iterpreted as a uiform cofidece bad for g 0 ; see Sectio 4.3 for details. Eve though θ 0 ad Θ may deped o the sample size, we omit the subscripts for brevity. As explaied i Sectio 2, i may applicatios we ca obtai a restricted estimator as a projectio of a urestricted estimator o the restricted parameter space. More geerally, we assume that there exist ˆθ ur ad ˆθ r such that ˆθ r is approximately the projectio of ˆθ ur o Θ R uder some orm ˆΩ see Assumptio below for a formal statemet. Moreover, sice the rate of covergece may be slower tha /, let κ be a sequece of umbers such that κ as. The ˆθ r arg mi θ ˆθ ur 2ˆΩ θ Θ R = arg mi κ θ θ 0 κ ˆθ ur θ 0 2ˆΩ. θ Θ R

12 Next defie Λ θ 0 = {λ R K : λ = κ θ θ 0 for some θ Θ R }. The κ ˆθ r θ 0 arg mi λ κ ˆθ ur θ 0 2ˆΩ. λ Λ θ 0 We will also assume that κ ˆθ ur θ 0 is approximately N0, Σ distributed see Assumptio 2 for a formal statemet ad that we have a cosistet estimator of Σ, deoted by ˆΣ. Now let Z N0, I K K be idepedet of ˆΣ ad ˆΩ ad defie Z θ, ˆΣ, ˆΩ = arg mi λ ˆΣ /2 Z 2ˆΩ. λ Λ θ We will use the distributio of Z θ 0, ˆΣ, ˆΩ to approximate the distributio of κ ˆθ r θ 0. This idea is aalogous to Adrews 999, 200; see for example Theorem 2e i Adrews 999. The mai differeces are that θ 0 ca grow i dimesios as ad that our local parameter space Λ θ 0 depeds o because we allow θ 0 to be close to the boudary. Now for θ Θ R cosider testig H 0 : θ 0 = θ based o a test statistic T, which depeds o κ ˆθ r θ ad ˆΣ. For example T κ ˆθ r θ, ˆΣ κ ˆθ r,k = max θ k. k=,...,k ˆΣkk We reject H 0 if ad oly if T κ ˆθ r θ, ˆΣ > c α, θ, ˆΣ, ˆΩ, where c α, θ, ˆΣ, ˆΩ = if{c R : P T Z θ, ˆΣ, ˆΩ, ˆΣ c ˆΣ, ˆΩ α}. Our α cofidece set for θ 0 is the CI = {θ Θ R : T κ ˆθ r θ, ˆΣ c α, θ, ˆΣ, ˆΩ}. To guaratee that P θ 0 CI α uiformly over a class of distributios P we require P T κ ˆθ r θ 0, ˆΣ c α, θ 0, ˆΣ, ˆΩ α 0. sup P P Notice that if ˆθ r was exactly the projectio of ˆθ ur o Θ R, if κ ˆθ ur θ 0 was exactly N0, Σ distributed, if Σ ad Ω were kow, ad if T Z θ 0, Σ, Ω, Σ was cotiuously distributed, the by costructio P T κ ˆθ r θ 0, Σ c α, θ 0, Σ, Ω = α, 2

13 just as i the simple example i Sectio 2. Therefore, the assumptios below simply guaratee that the various approximatio errors are small ad that small approximatio errors oly have a small impact o the distributio of the test statistic. 3. Assumptios ad mai result Let ε be a sequece of positive umbers with ε 0. We discuss the role of ε after statig the assumptios. Let P be a set of distributios satisfyig the followig assumptios. 3 Assumptio. There exists a symmetric, positive semi-defiite matrix ˆΩ such that ad R = o p ε uiformly over P P. κ ˆθ r θ 0 = arg mi λ κ ˆθ ur θ 0 2ˆΩ + R λ Λ θ 0 Assumptio 2. There exist symmetric, positive defiite matrices Ω ad Σ ad a sequece of radom variables Z N0, Σ such that λ mi Ω /2 κ ˆθ ur θ 0 Z = o p ε uiformly over P P. Assumptio 3. There exists a costat C λ > 0 such that /C λ λ mi Σ C λ, /C λ λ max Ω C λ ad λ max Σ λ mi Ω ˆΣ Σ 2 S = o p ε 2 /K uiformly over P P. ad Assumptio 4. Θ R is closed ad covex ad θ 0 Θ R. λ max Σ λ mi Ω 2 ˆΩ Ω S = o p ε 2 /K Assumptio 5. Let Σ ad Σ 2 be ay symmetric ad positive defiite matrices such that /B λ mi Σ B ad /B λ mi Σ 2 B for some costat B > 0. There exists a costat C, possibly depedig o B, such that for ay z R K ad z 2 R K T z, Σ T z 2, Σ C z z 2 ad T z, Σ T z, Σ 2 C z Σ Σ 2 S. Assumptio 6. There exists δ 0, α such that for all β [α δ, α + δ] ad sup P T Z θ 0, Σ, Ω, Σ c β, θ 0, Σ, Ω ε β 0 P P sup P T Z θ 0, Σ, Ω, Σ c β, θ 0, Σ, Ω + ε β 0. P P 3 Eve though θ 0 depeds o P P, we do ot make the depedece explicit i the otatio. 3

14 As demostrated above, if ˆθ ur maximizes Q θ ad if 2 Q θ does ot deped o θ, the Assumptio holds with R = 0 ad ˆΩ = 2 Q θ. Adrews 999 provides geeral sufficiet coditios for a small remaider i a quadratic expasio. The assumptio also holds by costructio if we simply project ˆθ ur o Θ R to obtai ˆθ r. More geerally, the assumptio does ot ecessarily require ˆθ ur to be a urestricted estimator of a criterio fuctio, which may ot eve exist i some settigs if the criterio fuctio is ot defied outside of Θ R. Eve i these cases, ˆθ r is usually a approximate projectio of a asymptotically ormally distributed estimator o Θ R. 4 Assumptio 2 ca be verified usig a couplig argumet ad the rate of covergece of ˆθ ur ca be slower tha /. Assumptio 3 esures that the estimatio errors of ˆΣ ad ˆΩ are egligible. If λ mi Ω is bouded away from 0 ad if λ max Σ is bouded, the the assumptio simply states that ˆΣ Σ S = o p ε / K ad ˆΩ Ω S = o p ε 2 /K, which is easy to verify i specific examples. Allowig λ mi Ω 0 is importat for ill-posed iverse problems such as NPIV. We explai i Sectios 4 ad 5 that both /C λ λ mi Σ C λ ad /C λ λ max Ω C λ hold uder commo assumptios i a variety of settigs. We could adapt the assumptios to allow for λ mi Σ 0 ad λ max Ω, but this would require much more otatio. Assumptio 4 holds for example with liear iequality costraits of the form Θ R = {θ R K : Aθ b}. Other examples of covex shape restrictios for series estimators are mootoicity, covexity/cocavity, icreasig returs to scale, or homogeeity of a certai degree, but we rule out Slutzki restrictios, which Horowitz ad Lee 207 allow for. The assumptio implies that Λ θ 0 is closed ad covex as well. The mai purpose of this assumptio is to esure that the projectio o Λ θ 0 is oexpasive, ad thus, we could replace it with a higher level assumptio. 5 Assumptio 5 imposes cotiuity coditios o the test statistic. We provide several examples of test statistics satisfyig this assumptio i Sectios 4 ad 5. Assumptio 6 is a cotiuity coditio o the distributio of T Z θ 0, Σ, Ω, Σ, which requires that its distributio fuctio does ot become too steep too quickly as icreases. It is usually referred to as a ati-cocetratio coditio ad it is ot ucommo i these type of testig problems; see e.g. Assumptio 6.7 of Cherozhukov, Newey, ad Satos 205. If the distributio fuctio is cotiuous for ay fixed K, the the assumptio is a abstract rate coditio o how fast K ca diverge relative to ε. As explaied below, to get aroud this assumptio we could take c α, θ, ˆΣ, ˆΩ + ε istead of c α, θ, ˆΣ, ˆΩ as the critical value. Also 4 See Ketz 207 for the costructio of such a estimator. ˆθur does ot eve have to be a feasible estimator ad we could simply replace κ ˆθ ur θ 0 by a radom variable Ẑ, which is allowed for by our geeral formulatio; specifically see Z T i Adrews I.e. we use arg mi λ Λθ 0 λ z ˆΩ arg mi λ Λθ 0 λ z 2 ˆΩ ˆΩ C z z 2 ˆΩ for some C > 0. 4

15 otice that Assumptios 5 impose very little restrictios o the shape restrictios ad hece, they are isufficiet to guaratee that the distributio fuctio of T Z θ 0, Σ, Ω, Σ is cotiuous. We ow get the followig result. Theorem. Suppose Assumptios 5 hold. The lim if If i additio Assumptio 6 holds the P sup P P if P T κ ˆθ r θ 0, ˆΣ c α, θ 0, ˆΣ, ˆΩ + ε α. P P T κ ˆθ r θ 0, ˆΣ c α, θ 0, ˆΣ, ˆΩ α 0. The first part of Theorem implies that if we take c α, θ, ˆΣ, ˆΩ + ε for ay fixed ε > 0 as the critical value, the the rejectio probability is asymptotically at most α uder the ull hypothesis, eve if Assumptio 6 does ot hold. I this case, ε ca go to 0 arbitrarily slowly. A alterative iterpretatio is that with c α, θ, ˆΣ, ˆΩ as the critical value ad without Assumptio 6, the rejectio probability might be larger tha α i the limit, but the resultig cofidece set is arbitrarily close to the α cofidece set. The secod part states that the test has the right size asymptotically if Assumptios 6 hold. 3.2 Rectagular cofidece sets for fuctios The previous results yield asymptotically valid cofidece regios for θ 0. However, these regios might be hard to report if K is large ad they may ot be the mai object of iterest. For example, we might be more iterested i a uiform cofidece bad for a fuctio rather tha a cofidece regio of the coefficiets i the series expasio. We ow discuss how we ca use these regios to obtai rectagular cofidece sets for fuctios h : R K R L usig projectios, similar as i Sectio 2 where hθ = θ. Rectagular cofidece regios are easy to report because we oly have to report the extreme poits of each coordiate, which is crucial whe L is large. Our method applies to geeral fuctios, such as fuctio values or average derivatives i oparameteric estimatio. I our applicatios we focus o uiform cofidece bads, which we ca obtai usig specific fuctios h, as explaied i Sectios 4 ad 5. Defie CI = {θ Θ R : T κ ˆθ r θ, ˆΣ c α, θ, ˆΣ, ˆΩ} ad let ĥ L l = if h lθ ad ĥ U l = sup h l θ, l =,..., L. θ CI θ CI 5

16 Notice that if θ 0 CI, the ĥl l h l θ 0 ad ĥu l h l θ 0 for all l =,..., L. We therefore obtai the followig corollary. 6 Corollary. Suppose Assumptios 6 hold. The lim if ĥl if P l h l θ 0 P P ĥu l for all l =,..., L α. A projectio for ay T satisfyig the assumptios above yields a rectagular cofidece regio with coverage probability at least α i the limit. I the examples discussed i Sectios 4 ad 5 we pick T such that the resultig cofidece regio is ocoservative for θ 0 i the iterior of Θ R, just as the cofidece sets i Figure 2. I these examples h l θ = c l +q l θ, where c l is a costat ad q l R L, ad possibly L > K. We the let T κ ˆθ r θ, ˆΣ κ q ˆθr θ = max q ˆΣq,..., κ q L ˆθr θ q L ˆΣqL. Now suppose that for ay θ CI, the critical value does ot deped o θ, which will be the case with probability approachig if θ 0 is i the iterior of the parameter space. That is cθ, ˆΣ, ˆΩ = ĉ. The CI = θ Θ q ˆΣq l l R : h l ˆθ r ĉ κ h l θ h l ˆθ r + ĉ q l ˆΣq l κ for all l =,..., L. Moreover, by the defiitios of the ifimum ad the supremum as the largest lower boud ad smallest upper boud respectively, it holds that q ˆΣq ĥ L l l q ˆΣq l h l ˆθ r ĉ ad ĥ U l l l h l ˆθ r + ĉ κ κ for all l =,..., L ad thus, Cosequetly P ĥ L l h l θ 0 ĥu l for all l =,..., L θ 0 CI. ĥl l h l θ 0 ĥu l for all l =,..., L = P θ 0 CI. We state a formal result, which guaratees that the projectio based cofidece set does ot suffer from over-coverage if θ 0 is sufficietly i the iterior of the parameter space, i Corollary A i the appedix. The results ca be exteded to oliear fuctios h alog the lies of Freyberger ad Rai Uder Assumptios - 5 oly, we could project o {θ Θ R : T κ ˆθ r θ, ˆΣ c α, θ, ˆΣ, ˆΩ + ε } to obtai the same coclusio as i Corollary. 6

17 4 Coditioal mea estimatio I this sectio we provide sufficiet coditios for Assumptios 5 whe Y = g 0 X + U, EU X = 0 ad Y, X ad U are scalar radom variables. We also explai how we ca use the projectio results to obtai uiform cofidece bads for g 0. We first assume that X is discretely distributed to illustrate that the iferece method ca easily be applied to fiite dimesioal models. We the let X be cotiuously distributed ad discuss both kerel ad series estimators. Throughout, we assume that the data is a radom sample {Y i, X i } i=. The proofs of all results i this ad the followig sectio are i the supplemetary appedix. 4. Discrete regressors Suppose that X is discretely distributed with support X = {x,..., x K }, where K is fixed. Let θ 0 = EY X = x... EY X = x K ad ˆθ ur = i= Y ix i =x i= X i=x... i= Y ix i =x K i= X i=x K. Defie σ 2 x k = V aru X = x k ad px k = P X = x k > 0, ad let σ 2 x Σ = diag px,..., σ2 x K px K ad where ˆpx k = i= X i = x k ad ˆσ ˆΣ 2 x = diag ˆpx,..., ˆσ2 x K, ˆpx K ˆσ 2 x k = i= Y i 2 X i = x k i= X i = x k i= Y 2 ix i = x k i= X. i = x k Let Θ R be a covex subset of R K, such as Θ R = {θ R K : Aθ b}. Now defie ˆθ r = arg mi θ ˆθ ur 2ˆΣ θ Θ R ad hece ˆΩ = ˆΣ. Other weight fuctios ˆΩ, such as the idetity matrix, are possible choices as well. We discuss this issue further i Sectio 8. As a test statistic we use { } T z, ˆΣ = max z / ˆΣ,..., z K / ˆΣ KK 7

18 because the resultig cofidece regio of the urestricted estimator is rectagular, aalogous to the oe i Sectio 2. We ow get the followig result. Theorem 2. Let P be the class of distributios satisfyig the followig assumptios. The. {Y i, X i } i= is a iid sample from the distributio of Y, X with σ 2 x k [/C, C], px k /C, ad EU 4 X = x k C for all k =,..., K ad for some C > Θ R is closed ad covex ad θ 0 Θ R. 3. = oε 3. lim if if P T ˆθ r θ 0, ˆΣ c α, θ 0, ˆΣ, ˆΩ + ε α. P P If i additio Assumptio 6 holds the sup P P P T ˆθ r θ 0, ˆΣ c α, θ 0, ˆΣ, ˆΩ Next let h l θ = θ l for l =,..., K. α 0. The the results i Sectio 3.2 yield a rectagular cofidece regio for θ 0, which ca be iterpreted as a uiform cofidece bad for g 0 x,..., g 0 x K. Moreover, Corollary A i the appedix shows that the bad is ocoservative if θ 0 is sufficietly i the iterior of the parameter space. 4.2 Kerel regressio We ow suppose that X is cotiuously distributed with desity f X. We deote its support by X ad assume that X = [x, x]. Let {x,..., x K } X ad θ 0 = EY X = x... EY X = x K. Here K icreases as the sample size icreases ad thus, our setup is very similar to Horowitz ad Lee 207. Let K be a kerel fuctio ad h the badwidth. The urestricted estimator is i= ˆθ ur = Y ik x X i h K x X i... i= h i= ik Y xk X i h i= K xk X i h Defie B = Ku2 du ad σ 2 x = V aru X = x ad let σ 2 x B Σ = diag f X x,..., σ2 x K B, f X x K 8.

19 ad ˆσ ˆΣ 2 x B = diag ˆf X x,..., ˆσ2 x K B, ˆf X x K where ˆf X x k = h i= K x k X i h ad ˆσ 2 x k = i= Y i 2 x K k X i h i= K x k X i h i= Y x ik k X i i= K x k X i h h Just as before, let Θ R be covex such as Θ R = {θ R K : Aθ b} ad defie ˆθ r = arg mi θ ˆθ ur 2ˆΣ, θ Θ R implyig that ˆΩ = ˆΣ. Fially, as before we let { } T z, ˆΣ = max z / ˆΣ,..., z K / ˆΣ KK. We get the followig result. Theorem 3. Let P be the class of distributios satisfyig the followig assumptios. The. The data {Y i, X i } i= is a iid sample where X = [x, x]. a g 0 x ad f X x are twice cotiuously differetiable with uiformly bouded fuctio values ad derivatives. if x X f X x /C for some C > 0. b σ 2 x is twice cotiuously differetiable, the fuctio ad derivatives are uiformly bouded o X, ad if x X σ 2 x /C for some C > 0. c EY 4 X = x C for some C > x k x k > 2h for all k ad x > x + h ad x K < x h. 3. K is a bouded ad symmetric pdf with support [, ]. 4. Θ R is closed ad covex ad θ 0 Θ R. 5. K h 5 = oε 2 ad K5/2 h lim if sup P P = oε 3. ˆΣ if P T h ˆθ r θ 0, c α, θ 0, ˆΣ, ˆΩ + ε α. P P If i additio Assumptio 6 holds the P T ˆΣ h ˆθ r θ 0, c α, θ 0, ˆΣ, ˆΩ 9 α 0. 2.

20 The first assumptio cotais stadard smoothess ad momet coditios. The secod assumptio guaratees that estimators of g 0 x k ad g 0 x l for k l are idepedet, just as i Horowitz ad Lee 207, ad it also avoids complicatios associated with x k beig too close to the boudary of the support. The third assumptio imposes stadard restrictios o the kerel fuctio ad the fourth assumptio has bee discussed before. The fifth assumptio cotais rate coditios. Notice that with a fixed K, these rates are the stadard coditios for asymptotic ormality with udersmoothig i kerel regressio. The rate coditios also imply that K h 0, which is similar to Horowitz ad Lee 207. Oce agai with h l θ = θ l for l =,..., K the results i Sectio 3.2 yield a rectagular cofidece regio for θ 0, which is a uiform cofidece bad for g 0 x,..., g 0 x K. Remark. While we use the Nadaraya-Watso estimator for simplicity, the geeral theory also applies to other estimators, such as local polyomial estimators. Aother possibility is to use a bias corrected estimator ad the adjusted stadard errors suggested by Caloico, Cattaeo, ad Farrell 207. Fially, the geeral theory ca also be adapted to icorporate a worst-case bias as i Armstrog ad Kolesár 206 istead of usig the udersmoothig assumptio; see Sectio S.2 for details. 4.3 Series regressio I this sectio we agai assume that X X is cotiuously distributed, but we use a series estimator. Oe advatage of a series estimator is that it yields uiform cofidece bads for the etire fuctio g 0, rather tha just a vector of fuctio values. Let p K x R K be a vector of basis fuctios ad write g 0 x p K x θ 0 for some θ 0 Θ R. We agai let Θ R be a covex set such as {θ R K : Aθ b}. For example, we could impose the costraits p K x j θ 0 for j =,..., J. Notice that J is ot restricted, ad we could eve impose p K x θ 0 for all x X if it is computatioally feasible. 7 The urestricted ad restricted estimators are ˆθ ur = arg mi θ R K Y i p K X i θ 2 i= ad ˆθ r = arg mi θ Θ R Y i p K X i θ 2, i= 7 For example, with quadratic splies p K x θ 0 reduces to fiitely may iequality costraits. 20

21 respectively. The assumptios esure that both miimizers are uique with probability approachig. Sice the objective fuctio is quadratic i θ 0 we have ˆθr θ 0 = arg mi λ ˆθ ur θ 0 2ˆΩ, λ Λ θ 0 where ˆΩ = i= p K X i p K X i ad Ω = EˆΩ. Defie Σ = Ep K X i p K X i EU 2 i p K X i p K X i Ep K X i p K X i. Also let Ûi = Y i p K X i ˆθur ad ˆΣ = ˆΩ Ûi 2 p K X i p K X i i= Let ˆσx = p K x ˆΣp K x. We use the test statistic T ˆθ r θ 0, ˆΣ = sup x X ˆΩ. p K x ˆθr θ 0 ˆσx. The followig theorem provides coditios to esure that cofidece sets for θ 0 have the correct coverage asymptotically. We the explai how we ca use these sets to costruct uiform cofidece bads for g 0 x. To state the theorem, let ξk = sup x X p K x. Theorem 4. Let P be the class of distributios satisfyig the followig assumptios. The. The data {Y i, X i } i= is a iid sample from the distributio of Y, X with EU 2 X [/C, C] ad EU 4 X C for some C > The basis fuctios p k are orthoormal o X with respect to the L 2 orm ad f X x [/C, C] for all x X ad some C > Θ R is closed ad covex ad θ 0 Θ R is such that for some costats C g ad γ > 0 4. K 2γ = oε 2, ξk2 K 4 lim if sup P P sup g 0 x p K x θ 0 C g K γ. x X = oε 6, ad ξk4 K 3 = oε 2. if P T ˆθ r θ 0, ˆΣ c α, θ 0, ˆΣ, ˆΩ + ε α. P P If i additio Assumptio 6 holds the P T ˆθ r θ 0, ˆΣ c α, θ 0, ˆΣ, ˆΩ 2 α 0.

22 The first assumptio imposes stadard momet coditios. The mai role of the secod assumptio is to guaratee that the miimum eigevalues of Σ ad Ω are bouded ad bouded away from 0. The third assumptio says that g 0 ca be well approximated by a fuctio satisfyig the costraits, ad the fourth assumptio provides rate coditios. For asymptotic ormality of oliear fuctioals Newey 997 assumes that K 2γ + ξk 4 K 2 0. For orthoormal polyomials ξk = C p K ad for splies ξk = C s K. Thus, our rate coditios are slightly stroger tha the oes i Newey 997, but we also obtai cofidece sets for the K dimesioal vector θ 0, which we ca trasform to uiform cofidece bads for g 0. The last rate coditio, ξk4 K 3 that varu i X i = σ 2 > 0. = oε 2, is ot eeded uder the additioal assumptios Remark 2. I a fiite dimesioal regressio framework with K = K, the third assumptio always holds ad the fourth assumptio oly requires that. I this case the secod assumptio ca be replaced with the full rak coditio λ mi Ep K Xp K X /C. To obtai a uiform cofidece bad for g 0 X, defie ad let CI = {θ Θ R : T ˆθ r θ, ˆΣ c α, θ, ˆΣ, ˆΩ} ĝ l x = mi θ CI p K x θ ad ĝ u x = max θ CI p K x θ. Also otice that p K x 2 is bouded away from 0 if the basis fuctios cotai the costat fuctio. We get the followig result. Corollary 2. Suppose the assumptios of Theorem 4 ad Assumptio 6 hold. suppose that if x X p K x 2 > /C for some costat C > 0. The Further lim if if P ĝ lx g 0 x ĝ u x x X α. P P Remark 3. Without ay restrictios o the parameter space, ivertig our test statistic results i a uiform cofidece bad where the width of the bad is proportioal to the stadard deviatio of the estimated fuctio for each x. This bad ca also be obtaied as a projectig o the uderlyig cofidece set for θ 0 ; see Freyberger ad Rai 207 for this equivalece result. If θ 0 is sufficietly i the iterior of the parameter space, a applicatio of Corollary A shows that the restricted bad is equivalet to that bad with probability approachig. I this case the projectio based bad is ot coservative. 22

23 Remark 4. Similar as before, Assumptio 6 is ot eeded if the bad is obtaied by projectig o {θ Θ R : T ˆθ r θ, ˆΣ c α, θ, ˆΣ, ˆΩ + ε } Remark 5. The results ca be exteded to a partially liear model of the form Y = g 0 X + X 2γ 0 + U. The parameter vector θ 0 would the cotai both γ 0 ad the coefficiets of the series approximatio of g 0. 5 Istrumetal variables estimatio As the fial applicatio of the geeral method we cosider the NPIV model Y = g 0 X + U, EU Z = 0, where X ad Z are cotiuously distributed scalar radom variables with bouded support. We assume for otatioal simplicity that X ad Z have the same support, X, but this assumptio is without loss of geerality because X ad Z ca always be trasformed to have support o [0, ]. We assume that EU 2 Z = σ 2 to focus o the complicatios resultig from the ill-posed iverse problem. Here, the data is a radom sample {Y i, X i, Z i } i=. As before, let p K x R K be a vector of basis fuctios ad write g 0 x p K x θ 0 for some θ 0 Θ R, where Θ R is a covex subset of R K. Let P X be the K matrix, where the ith row is p K X i ad defie P Z aalogously. Let Y be the vector cotaiig Y i. Let ad ˆθ ur = arg mi θ R K Y P X θ P Z P ZP Z P ZY P X θ ˆθ r = arg mi θ Θ R Y P X θ P Z P ZP Z P ZY P X θ. For simplicity we use the same basis fuctio as well as the same umber of basis fuctios for X i ad Z i. Our results ca be geeralized to allow for differet basis fuctios ad more istrumets tha regressors. Sice the objective fuctio is quadratic i θ 0 we have ˆθr θ 0 = arg mi λ ˆθ ur θ 0 2ˆΩ, λ Λ θ 0 where ˆΩ = P X P ZP Z P Z P Z P X. Furthermore, let Q XZ = Ep K X i p K Z i. The Σ = σ 2 Q XZ Ep K Z i p K Z i Q XZ, which we estimate by ˆΣ = ˆσ 2 ˆΩ with ˆσ 2 = i= Û 2 i ad Ûi = Y i p K X i ˆθur. 23

24 As before, ˆσx = p K x ˆΣp K x ad the test statistic is T ˆθ r θ 0, ˆΣ = sup x X p K x ˆθr θ 0 ˆσx. The followig theorem provides coditios to esure that cofidece sets for θ 0 have the correct coverage, ad aalogously to before we ca trasform these sets to uiform cofidece bads for g 0 x. As before, let ξk = sup x X p K x. Theorem 5. Let P be the class of distributios satisfyig the followig assumptios.. The data {Y i, X i, Z i } i= is a iid sample from the distributio of Y, X, Z with EU 2 Z = σ 2 [/C, C] ad EU 4 Z C for some C > The fuctios p k are orthoormal o X with respect to the L 2 orm ad the desities of X ad Z are uiformly bouded above ad bouded away from Θ R is closed ad covex ad for some fuctio bk ad θ 0 Θ R sup g 0 x p K x θ 0 bk. x X 4. λ mi Q XZ Q XZ τ K > 0 ad λ max Q XZ Q XZ [/C, C] for some C <. 5. bk2 τ 2 K = oε 2 ad ξk2 K 4 τ 6 K = oε 6. The lim if sup P P if P T ˆθ r θ 0, ˆΣ c α, θ 0, ˆΣ, ˆΩ + ε α. P P If i additio Assumptio 6 holds the P T ˆθ r θ 0, ˆΣ c α, θ 0, ˆΣ, ˆΩ α 0. Assumptios 3 of the theorem are very similar to those of Theorem 4. Assumptio 4 defies a measure of ill-posedess τ K, which affects the rate coditios. It is easy to show that λ max Q XZ Q XZ is bouded as log as f XZ is square itegrable. However, λ max Q XZ Q XZ C also allows for X = Z as a special case. I fact, i this case, τ K is bouded away from 0 ad all assumptios reduce to the oes i the series regressio framework with homoskedasticity. Moreover, similar to Remark 2, the assumptios also allow for K to be fixed i which case all coditios reduce to stadard assumptios i a parametric IV framework. Remark 5. Fially, the results ca also be exteded to a partially liear model; see 24

25 6 Mote Carlo simulatios To ivestigate fiite sample properties of our iferece method we simulate data from the model Y = g 0 X + U, EU Z = 0, where X [, ] ad g 0 X = c F 4 X +. 2 Here, F is the cdf of a t-distributio with oe degree of freedom ad we vary the costat c. Figure 3 shows the fuctio for = 5, 000 ad c {0, 0, 20, 30, 40, 50}. Clearly, c = 0 belogs to the costat fuctio. As c icreases the slope of g 0 x icreases for every x. Let X, Z, ad U be joitly ormally distributed with varu = 0.25 ad var Z = var X =. Let X = 2F X X Uif[, ] ad Z = 2F Z Z Uif[, ]. We cosider two DGPs. First, we let cov X, U = 0. Thus, X is exogeous ad we use the series estimator described i Sectio 4.3. Secod, we let cov X, Z = 0.7 ad cov X, U = 0.5 ad use the NPIV estimator. I both cases we focus o uiform cofidece bads for g 0. I this sectio we report results with Legedre polyomials as basis fuctio. I Sectio S.4 i the supplemet we report qualitatively very similar results for quadratic splies. For the series regressio settig we take =, 000 ad for NPIV we use = 5, 000. We take sample sizes large eough such that the urestricted estimator has good coverage properties for a sufficietly large umber of series terms, which helps i aalyzig how coservative the restricted cofidece bads ca be. All results are based o, 000 Mote Carlo simulatios. Figure 3: g 0 for differet values of c c = 0 c = 0 c = 20 c = 30 c = c = 50 25

26 We impose the restrictio that g 0 is weakly decreasig ad we eforce this costrait o 0 equally spaced poits. We solve for the uiform cofidece bads o 30 equally spaced grid poit. Usig fier grids has almost o impact o the results, but icreases the computatioal costs. 8 To solve the optimizatio problems, we have to calculate c α θ, ˆΣ, ˆΩ, which is ot available i closed form. To do so, we take 2000 draws from a multivariate ormal distributio ad use them to estimate the distributio fuctio of T Z θ, ˆΣ, ˆΩ, ˆΣ usig a kerel estimator ad Silverma s rule of thumb badwidth. We the take the α quatile of the estimated distributio fuctio as the critical value. Estimatig the distributio fuctio simply as a step fuctio yields almost idetical critical values for ay give θ, but our costructio esures that the estimated critical value is a smooth fuctio of θ. The umber of draws from the ormal distributio is aalogous to the umber of bootstrap samples i other settigs ad usig more draws has almost o impact o our results. Tables ad 2 show the simulatio results for the series regressio model ad the NPIV model, respectively. The first colum is the order of the polyomial ad K = 2 belogs to a liear fuctio. We use the same umber of basis fuctios for X ad Z, but usig K + 3 for the istrumet matrix yields very similar results. The third ad fourth colums show the coverage rates of uiform cofidece bads usig the urestricted ad shape restricted method, respectively. The omial coverage rate is For a cofidece bad [ĝ l x, ĝ u x] defie the average width as j= ĝ ux j ĝ l x j, where {x j } 30 j= are the grid poits. Colums 5 ad 6 show the medias of the average widths of the, 000 simulated data sets for the urestricted ad restricted estimator, respectively. Let width s ur ad width s r be the average widths i data set s. The last colums shows the media of width s ur width s r/width s ur across the, 000 simulated data sets. Eve though the mea gais are very similar, we report the media gais to esure that our gais are ot maily caused by extreme outcomes. I Table we ca see that the urestricted estimator has coverage rates close to 0.95 if c = 0. For K = 2 ad K = 3, the coverage probability drops sigificatly below 0.95 whe c is large because icreasig c also icreases the approximatio bias. For larger values of K, the coverage probability of the urestricted bad is close to 0.95 for all reported values of c. Due to the projectio, the coverage probability of the restricted bad teds to be above the oe of the urestricted bad. Whe c is large eough, such as c = 0 with K = 2, the two bads are idetical with very large probability. The average width of the urestricted bad does ot deped o c. O the other had, the average width of the restricted bad is 8 I the applicatio we use a grid of 00 poits for the uiform cofidece bads, but we use a coarser grid for the simulatios, because our reported results are based o 78, 000 estimated cofidece bads i total. 26

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n. Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator

More information

Inference under shape restrictions

Inference under shape restrictions Inference under shape restrictions Joachim Freyberger Brandon Reeves July 31, 2017 Abstract We propose a uniformly valid inference method for an unknown function or parameter vector satisfying certain

More information

Kernel density estimator

Kernel density estimator Jauary, 07 NONPARAMETRIC ERNEL DENSITY ESTIMATION I this lecture, we discuss kerel estimatio of probability desity fuctios PDF Noparametric desity estimatio is oe of the cetral problems i statistics I

More information

Efficient GMM LECTURE 12 GMM II

Efficient GMM LECTURE 12 GMM II DECEMBER 1 010 LECTURE 1 II Efficiet The estimator depeds o the choice of the weight matrix A. The efficiet estimator is the oe that has the smallest asymptotic variace amog all estimators defied by differet

More information

Lecture 19: Convergence

Lecture 19: Convergence Lecture 19: Covergece Asymptotic approach I statistical aalysis or iferece, a key to the success of fidig a good procedure is beig able to fid some momets ad/or distributios of various statistics. I may

More information

11 THE GMM ESTIMATION

11 THE GMM ESTIMATION Cotets THE GMM ESTIMATION 2. Cosistecy ad Asymptotic Normality..................... 3.2 Regularity Coditios ad Idetificatio..................... 4.3 The GMM Iterpretatio of the OLS Estimatio.................

More information

Statistical Inference Based on Extremum Estimators

Statistical Inference Based on Extremum Estimators T. Rotheberg Fall, 2007 Statistical Iferece Based o Extremum Estimators Itroductio Suppose 0, the true value of a p-dimesioal parameter, is kow to lie i some subset S R p : Ofte we choose to estimate 0

More information

Convergence of random variables. (telegram style notes) P.J.C. Spreij

Convergence of random variables. (telegram style notes) P.J.C. Spreij Covergece of radom variables (telegram style otes).j.c. Spreij this versio: September 6, 2005 Itroductio As we kow, radom variables are by defiitio measurable fuctios o some uderlyig measurable space

More information

Rank tests and regression rank scores tests in measurement error models

Rank tests and regression rank scores tests in measurement error models Rak tests ad regressio rak scores tests i measuremet error models J. Jurečková ad A.K.Md.E. Saleh Charles Uiversity i Prague ad Carleto Uiversity i Ottawa Abstract The rak ad regressio rak score tests

More information

Lecture 6 Simple alternatives and the Neyman-Pearson lemma

Lecture 6 Simple alternatives and the Neyman-Pearson lemma STATS 00: Itroductio to Statistical Iferece Autum 06 Lecture 6 Simple alteratives ad the Neyma-Pearso lemma Last lecture, we discussed a umber of ways to costruct test statistics for testig a simple ull

More information

Summary. Recap ... Last Lecture. Summary. Theorem

Summary. Recap ... Last Lecture. Summary. Theorem Last Lecture Biostatistics 602 - Statistical Iferece Lecture 23 Hyu Mi Kag April 11th, 2013 What is p-value? What is the advatage of p-value compared to hypothesis testig procedure with size α? How ca

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Summary and Discussion on Simultaneous Analysis of Lasso and Dantzig Selector

Summary and Discussion on Simultaneous Analysis of Lasso and Dantzig Selector Summary ad Discussio o Simultaeous Aalysis of Lasso ad Datzig Selector STAT732, Sprig 28 Duzhe Wag May 4, 28 Abstract This is a discussio o the work i Bickel, Ritov ad Tsybakov (29). We begi with a short

More information

32 estimating the cumulative distribution function

32 estimating the cumulative distribution function 32 estimatig the cumulative distributio fuctio 4.6 types of cofidece itervals/bads Let F be a class of distributio fuctios F ad let θ be some quatity of iterest, such as the mea of F or the whole fuctio

More information

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

Advanced Analysis. Min Yan Department of Mathematics Hong Kong University of Science and Technology

Advanced Analysis. Min Yan Department of Mathematics Hong Kong University of Science and Technology Advaced Aalysis Mi Ya Departmet of Mathematics Hog Kog Uiversity of Sciece ad Techology September 3, 009 Cotets Limit ad Cotiuity 7 Limit of Sequece 8 Defiitio 8 Property 3 3 Ifiity ad Ifiitesimal 8 4

More information

Empirical Processes: Glivenko Cantelli Theorems

Empirical Processes: Glivenko Cantelli Theorems Empirical Processes: Gliveko Catelli Theorems Mouliath Baerjee Jue 6, 200 Gliveko Catelli classes of fuctios The reader is referred to Chapter.6 of Weller s Torgo otes, Chapter??? of VDVW ad Chapter 8.3

More information

Introductory statistics

Introductory statistics CM9S: Machie Learig for Bioiformatics Lecture - 03/3/06 Itroductory statistics Lecturer: Sriram Sakararama Scribe: Sriram Sakararama We will provide a overview of statistical iferece focussig o the key

More information

Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence

Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence Chapter 3 Strog covergece As poited out i the Chapter 2, there are multiple ways to defie the otio of covergece of a sequece of radom variables. That chapter defied covergece i probability, covergece i

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS

A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS J. Japa Statist. Soc. Vol. 41 No. 1 2011 67 73 A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS Yoichi Nishiyama* We cosider k-sample ad chage poit problems for idepedet data i a

More information

Frequentist Inference

Frequentist Inference Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for

More information

Output Analysis and Run-Length Control

Output Analysis and Run-Length Control IEOR E4703: Mote Carlo Simulatio Columbia Uiversity c 2017 by Marti Haugh Output Aalysis ad Ru-Legth Cotrol I these otes we describe how the Cetral Limit Theorem ca be used to costruct approximate (1 α%

More information

Chapter 6 Infinite Series

Chapter 6 Infinite Series Chapter 6 Ifiite Series I the previous chapter we cosidered itegrals which were improper i the sese that the iterval of itegratio was ubouded. I this chapter we are goig to discuss a topic which is somewhat

More information

A statistical method to determine sample size to estimate characteristic value of soil parameters

A statistical method to determine sample size to estimate characteristic value of soil parameters A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig

More information

Lecture 33: Bootstrap

Lecture 33: Bootstrap Lecture 33: ootstrap Motivatio To evaluate ad compare differet estimators, we eed cosistet estimators of variaces or asymptotic variaces of estimators. This is also importat for hypothesis testig ad cofidece

More information

Notes On Median and Quantile Regression. James L. Powell Department of Economics University of California, Berkeley

Notes On Median and Quantile Regression. James L. Powell Department of Economics University of California, Berkeley Notes O Media ad Quatile Regressio James L. Powell Departmet of Ecoomics Uiversity of Califoria, Berkeley Coditioal Media Restrictios ad Least Absolute Deviatios It is well-kow that the expected value

More information

ECE 901 Lecture 12: Complexity Regularization and the Squared Loss

ECE 901 Lecture 12: Complexity Regularization and the Squared Loss ECE 90 Lecture : Complexity Regularizatio ad the Squared Loss R. Nowak 5/7/009 I the previous lectures we made use of the Cheroff/Hoeffdig bouds for our aalysis of classifier errors. Hoeffdig s iequality

More information

Definition 4.2. (a) A sequence {x n } in a Banach space X is a basis for X if. unique scalars a n (x) such that x = n. a n (x) x n. (4.

Definition 4.2. (a) A sequence {x n } in a Banach space X is a basis for X if. unique scalars a n (x) such that x = n. a n (x) x n. (4. 4. BASES I BAACH SPACES 39 4. BASES I BAACH SPACES Sice a Baach space X is a vector space, it must possess a Hamel, or vector space, basis, i.e., a subset {x γ } γ Γ whose fiite liear spa is all of X ad

More information

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample. Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized

More information

w (1) ˆx w (1) x (1) /ρ and w (2) ˆx w (2) x (2) /ρ.

w (1) ˆx w (1) x (1) /ρ and w (2) ˆx w (2) x (2) /ρ. 2 5. Weighted umber of late jobs 5.1. Release dates ad due dates: maximimizig the weight of o-time jobs Oce we add release dates, miimizig the umber of late jobs becomes a sigificatly harder problem. For

More information

5. Likelihood Ratio Tests

5. Likelihood Ratio Tests 1 of 5 7/29/2009 3:16 PM Virtual Laboratories > 9. Hy pothesis Testig > 1 2 3 4 5 6 7 5. Likelihood Ratio Tests Prelimiaries As usual, our startig poit is a radom experimet with a uderlyig sample space,

More information

Inference under shape restrictions

Inference under shape restrictions Inference under shape restrictions Joachim Freyberger Brandon Reeves June 7, 2018 Abstract We propose a uniformly valid inference method for an unknown function or parameter vector satisfying certain shape

More information

Economics 241B Relation to Method of Moments and Maximum Likelihood OLSE as a Maximum Likelihood Estimator

Economics 241B Relation to Method of Moments and Maximum Likelihood OLSE as a Maximum Likelihood Estimator Ecoomics 24B Relatio to Method of Momets ad Maximum Likelihood OLSE as a Maximum Likelihood Estimator Uder Assumptio 5 we have speci ed the distributio of the error, so we ca estimate the model parameters

More information

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Discrete Mathematics for CS Spring 2008 David Wagner Note 22 CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig

More information

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1 EECS564 Estimatio, Filterig, ad Detectio Hwk 2 Sols. Witer 25 4. Let Z be a sigle observatio havig desity fuctio where. p (z) = (2z + ), z (a) Assumig that is a oradom parameter, fid ad plot the maximum

More information

Chapter 6 Principles of Data Reduction

Chapter 6 Principles of Data Reduction Chapter 6 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 0 Chapter 6 Priciples of Data Reductio Sectio 6. Itroductio Goal: To summarize or reduce the data X, X,, X to get iformatio about a

More information

1 Introduction to reducing variance in Monte Carlo simulations

1 Introduction to reducing variance in Monte Carlo simulations Copyright c 010 by Karl Sigma 1 Itroductio to reducig variace i Mote Carlo simulatios 11 Review of cofidece itervals for estimatig a mea I statistics, we estimate a ukow mea µ = E(X) of a distributio by

More information

Slide Set 13 Linear Model with Endogenous Regressors and the GMM estimator

Slide Set 13 Linear Model with Endogenous Regressors and the GMM estimator Slide Set 13 Liear Model with Edogeous Regressors ad the GMM estimator Pietro Coretto pcoretto@uisa.it Ecoometrics Master i Ecoomics ad Fiace (MEF) Uiversità degli Studi di Napoli Federico II Versio: Friday

More information

(A sequence also can be thought of as the list of function values attained for a function f :ℵ X, where f (n) = x n for n 1.) x 1 x N +k x N +4 x 3

(A sequence also can be thought of as the list of function values attained for a function f :ℵ X, where f (n) = x n for n 1.) x 1 x N +k x N +4 x 3 MATH 337 Sequeces Dr. Neal, WKU Let X be a metric space with distace fuctio d. We shall defie the geeral cocept of sequece ad limit i a metric space, the apply the results i particular to some special

More information

Last Lecture. Wald Test

Last Lecture. Wald Test Last Lecture Biostatistics 602 - Statistical Iferece Lecture 22 Hyu Mi Kag April 9th, 2013 Is the exact distributio of LRT statistic typically easy to obtai? How about its asymptotic distributio? For testig

More information

Stat 421-SP2012 Interval Estimation Section

Stat 421-SP2012 Interval Estimation Section Stat 41-SP01 Iterval Estimatio Sectio 11.1-11. We ow uderstad (Chapter 10) how to fid poit estimators of a ukow parameter. o However, a poit estimate does ot provide ay iformatio about the ucertaity (possible

More information

Sequences and Series of Functions

Sequences and Series of Functions Chapter 6 Sequeces ad Series of Fuctios 6.1. Covergece of a Sequece of Fuctios Poitwise Covergece. Defiitio 6.1. Let, for each N, fuctio f : A R be defied. If, for each x A, the sequece (f (x)) coverges

More information

Asymptotic distribution of the first-stage F-statistic under weak IVs

Asymptotic distribution of the first-stage F-statistic under weak IVs November 6 Eco 59A WEAK INSTRUMENTS III Testig for Weak Istrumets From the results discussed i Weak Istrumets II we kow that at least i the case of a sigle edogeous regressor there are weak-idetificatio-robust

More information

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio

More information

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight)

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight) Tests of Hypotheses Based o a Sigle Sample Devore Chapter Eight MATH-252-01: Probability ad Statistics II Sprig 2018 Cotets 1 Hypothesis Tests illustrated with z-tests 1 1.1 Overview of Hypothesis Testig..........

More information

Optimally Sparse SVMs

Optimally Sparse SVMs A. Proof of Lemma 3. We here prove a lower boud o the umber of support vectors to achieve geeralizatio bouds of the form which we cosider. Importatly, this result holds ot oly for liear classifiers, but

More information

Lecture 2. The Lovász Local Lemma

Lecture 2. The Lovász Local Lemma Staford Uiversity Sprig 208 Math 233A: No-costructive methods i combiatorics Istructor: Ja Vodrák Lecture date: Jauary 0, 208 Origial scribe: Apoorva Khare Lecture 2. The Lovász Local Lemma 2. Itroductio

More information

Infinite Sequences and Series

Infinite Sequences and Series Chapter 6 Ifiite Sequeces ad Series 6.1 Ifiite Sequeces 6.1.1 Elemetary Cocepts Simply speakig, a sequece is a ordered list of umbers writte: {a 1, a 2, a 3,...a, a +1,...} where the elemets a i represet

More information

Supplemental Material: Proofs

Supplemental Material: Proofs Proof to Theorem Supplemetal Material: Proofs Proof. Let be the miimal umber of traiig items to esure a uique solutio θ. First cosider the case. It happes if ad oly if θ ad Rak(A) d, which is a special

More information

Statistics 511 Additional Materials

Statistics 511 Additional Materials Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability

More information

1 Review and Overview

1 Review and Overview DRAFT a fial versio will be posted shortly CS229T/STATS231: Statistical Learig Theory Lecturer: Tegyu Ma Lecture #3 Scribe: Migda Qiao October 1, 2013 1 Review ad Overview I the first half of this course,

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

Lecture 3. Properties of Summary Statistics: Sampling Distribution

Lecture 3. Properties of Summary Statistics: Sampling Distribution Lecture 3 Properties of Summary Statistics: Samplig Distributio Mai Theme How ca we use math to justify that our umerical summaries from the sample are good summaries of the populatio? Lecture Summary

More information

Ω ). Then the following inequality takes place:

Ω ). Then the following inequality takes place: Lecture 8 Lemma 5. Let f : R R be a cotiuously differetiable covex fuctio. Choose a costat δ > ad cosider the subset Ωδ = { R f δ } R. Let Ωδ ad assume that f < δ, i.e., is ot o the boudary of f = δ, i.e.,

More information

Chi-Squared Tests Math 6070, Spring 2006

Chi-Squared Tests Math 6070, Spring 2006 Chi-Squared Tests Math 6070, Sprig 2006 Davar Khoshevisa Uiversity of Utah February XXX, 2006 Cotets MLE for Goodess-of Fit 2 2 The Multiomial Distributio 3 3 Applicatio to Goodess-of-Fit 6 3 Testig for

More information

LECTURE 14 NOTES. A sequence of α-level tests {ϕ n (x)} is consistent if

LECTURE 14 NOTES. A sequence of α-level tests {ϕ n (x)} is consistent if LECTURE 14 NOTES 1. Asymptotic power of tests. Defiitio 1.1. A sequece of -level tests {ϕ x)} is cosistet if β θ) := E θ [ ϕ x) ] 1 as, for ay θ Θ 1. Just like cosistecy of a sequece of estimators, Defiitio

More information

Rates of Convergence by Moduli of Continuity

Rates of Convergence by Moduli of Continuity Rates of Covergece by Moduli of Cotiuity Joh Duchi: Notes for Statistics 300b March, 017 1 Itroductio I this ote, we give a presetatio showig the importace, ad relatioship betwee, the modulis of cotiuity

More information

Lecture 3 The Lebesgue Integral

Lecture 3 The Lebesgue Integral Lecture 3: The Lebesgue Itegral 1 of 14 Course: Theory of Probability I Term: Fall 2013 Istructor: Gorda Zitkovic Lecture 3 The Lebesgue Itegral The costructio of the itegral Uless expressly specified

More information

Estimation for Complete Data

Estimation for Complete Data Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of

More information

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1. Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio

More information

Mathematical Statistics - MS

Mathematical Statistics - MS Paper Specific Istructios. The examiatio is of hours duratio. There are a total of 60 questios carryig 00 marks. The etire paper is divided ito three sectios, A, B ad C. All sectios are compulsory. Questios

More information

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d Liear regressio Daiel Hsu (COMS 477) Maximum likelihood estimatio Oe of the simplest liear regressio models is the followig: (X, Y ),..., (X, Y ), (X, Y ) are iid radom pairs takig values i R d R, ad Y

More information

Statistical inference: example 1. Inferential Statistics

Statistical inference: example 1. Inferential Statistics Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either

More information

Study the bias (due to the nite dimensional approximation) and variance of the estimators

Study the bias (due to the nite dimensional approximation) and variance of the estimators 2 Series Methods 2. Geeral Approach A model has parameters (; ) where is ite-dimesioal ad is oparametric. (Sometimes, there is o :) We will focus o regressio. The fuctio is approximated by a series a ite

More information

Lecture 2: Monte Carlo Simulation

Lecture 2: Monte Carlo Simulation STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?

More information

62. Power series Definition 16. (Power series) Given a sequence {c n }, the series. c n x n = c 0 + c 1 x + c 2 x 2 + c 3 x 3 +

62. Power series Definition 16. (Power series) Given a sequence {c n }, the series. c n x n = c 0 + c 1 x + c 2 x 2 + c 3 x 3 + 62. Power series Defiitio 16. (Power series) Give a sequece {c }, the series c x = c 0 + c 1 x + c 2 x 2 + c 3 x 3 + is called a power series i the variable x. The umbers c are called the coefficiets of

More information

Accuracy Assessment for High-Dimensional Linear Regression

Accuracy Assessment for High-Dimensional Linear Regression Uiversity of Pesylvaia ScholarlyCommos Statistics Papers Wharto Faculty Research -016 Accuracy Assessmet for High-Dimesioal Liear Regressio Toy Cai Uiversity of Pesylvaia Zijia Guo Uiversity of Pesylvaia

More information

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4 MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.

More information

Lecture 11 October 27

Lecture 11 October 27 STATS 300A: Theory of Statistics Fall 205 Lecture October 27 Lecturer: Lester Mackey Scribe: Viswajith Veugopal, Vivek Bagaria, Steve Yadlowsky Warig: These otes may cotai factual ad/or typographic errors..

More information

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 3 9/11/2013. Large deviations Theory. Cramér s Theorem

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 3 9/11/2013. Large deviations Theory. Cramér s Theorem MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/5.070J Fall 203 Lecture 3 9//203 Large deviatios Theory. Cramér s Theorem Cotet.. Cramér s Theorem. 2. Rate fuctio ad properties. 3. Chage of measure techique.

More information

EFFECTIVE WLLN, SLLN, AND CLT IN STATISTICAL MODELS

EFFECTIVE WLLN, SLLN, AND CLT IN STATISTICAL MODELS EFFECTIVE WLLN, SLLN, AND CLT IN STATISTICAL MODELS Ryszard Zieliński Ist Math Polish Acad Sc POBox 21, 00-956 Warszawa 10, Polad e-mail: rziel@impagovpl ABSTRACT Weak laws of large umbers (W LLN), strog

More information

MA131 - Analysis 1. Workbook 3 Sequences II

MA131 - Analysis 1. Workbook 3 Sequences II MA3 - Aalysis Workbook 3 Sequeces II Autum 2004 Cotets 2.8 Coverget Sequeces........................ 2.9 Algebra of Limits......................... 2 2.0 Further Useful Results........................

More information

REGRESSION WITH QUADRATIC LOSS

REGRESSION WITH QUADRATIC LOSS REGRESSION WITH QUADRATIC LOSS MAXIM RAGINSKY Regressio with quadratic loss is aother basic problem studied i statistical learig theory. We have a radom couple Z = X, Y ), where, as before, X is a R d

More information

Basis for simulation techniques

Basis for simulation techniques Basis for simulatio techiques M. Veeraraghava, March 7, 004 Estimatio is based o a collectio of experimetal outcomes, x, x,, x, where each experimetal outcome is a value of a radom variable. x i. Defiitios

More information

Product measures, Tonelli s and Fubini s theorems For use in MAT3400/4400, autumn 2014 Nadia S. Larsen. Version of 13 October 2014.

Product measures, Tonelli s and Fubini s theorems For use in MAT3400/4400, autumn 2014 Nadia S. Larsen. Version of 13 October 2014. Product measures, Toelli s ad Fubii s theorems For use i MAT3400/4400, autum 2014 Nadia S. Larse Versio of 13 October 2014. 1. Costructio of the product measure The purpose of these otes is to preset the

More information

Expectation and Variance of a random variable

Expectation and Variance of a random variable Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio

More information

Math Solutions to homework 6

Math Solutions to homework 6 Math 175 - Solutios to homework 6 Cédric De Groote November 16, 2017 Problem 1 (8.11 i the book): Let K be a compact Hermitia operator o a Hilbert space H ad let the kerel of K be {0}. Show that there

More information

S1 Notation and Assumptions

S1 Notation and Assumptions Statistica Siica: Supplemet Robust-BD Estimatio ad Iferece for Varyig-Dimesioal Geeral Liear Models Chumig Zhag Xiao Guo Che Cheg Zhegju Zhag Uiversity of Wiscosi-Madiso Supplemetary Material S Notatio

More information

1 Duality revisited. AM 221: Advanced Optimization Spring 2016

1 Duality revisited. AM 221: Advanced Optimization Spring 2016 AM 22: Advaced Optimizatio Sprig 206 Prof. Yaro Siger Sectio 7 Wedesday, Mar. 9th Duality revisited I this sectio, we will give a slightly differet perspective o duality. optimizatio program: f(x) x R

More information

Measure and Measurable Functions

Measure and Measurable Functions 3 Measure ad Measurable Fuctios 3.1 Measure o a Arbitrary σ-algebra Recall from Chapter 2 that the set M of all Lebesgue measurable sets has the followig properties: R M, E M implies E c M, E M for N implies

More information

STA Object Data Analysis - A List of Projects. January 18, 2018

STA Object Data Analysis - A List of Projects. January 18, 2018 STA 6557 Jauary 8, 208 Object Data Aalysis - A List of Projects. Schoeberg Mea glaucomatous shape chages of the Optic Nerve Head regio i aimal models 2. Aalysis of VW- Kedall ati-mea shapes with a applicatio

More information

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test.

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test. Math 308 Sprig 018 Classes 19 ad 0: Aalysis of Variace (ANOVA) Page 1 of 6 Itroductio ANOVA is a statistical procedure for determiig whether three or more sample meas were draw from populatios with equal

More information

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS MASSACHUSTTS INSTITUT OF TCHNOLOGY 6.436J/5.085J Fall 2008 Lecture 9 /7/2008 LAWS OF LARG NUMBRS II Cotets. The strog law of large umbers 2. The Cheroff boud TH STRONG LAW OF LARG NUMBRS While the weak

More information

MAT1026 Calculus II Basic Convergence Tests for Series

MAT1026 Calculus II Basic Convergence Tests for Series MAT026 Calculus II Basic Covergece Tests for Series Egi MERMUT 202.03.08 Dokuz Eylül Uiversity Faculty of Sciece Departmet of Mathematics İzmir/TURKEY Cotets Mootoe Covergece Theorem 2 2 Series of Real

More information

Math 61CM - Solutions to homework 3

Math 61CM - Solutions to homework 3 Math 6CM - Solutios to homework 3 Cédric De Groote October 2 th, 208 Problem : Let F be a field, m 0 a fixed oegative iteger ad let V = {a 0 + a x + + a m x m a 0,, a m F} be the vector space cosistig

More information

TR/46 OCTOBER THE ZEROS OF PARTIAL SUMS OF A MACLAURIN EXPANSION A. TALBOT

TR/46 OCTOBER THE ZEROS OF PARTIAL SUMS OF A MACLAURIN EXPANSION A. TALBOT TR/46 OCTOBER 974 THE ZEROS OF PARTIAL SUMS OF A MACLAURIN EXPANSION by A. TALBOT .. Itroductio. A problem i approximatio theory o which I have recetly worked [] required for its solutio a proof that the

More information

6 Sample Size Calculations

6 Sample Size Calculations 6 Sample Size Calculatios Oe of the major resposibilities of a cliical trial statisticia is to aid the ivestigators i determiig the sample size required to coduct a study The most commo procedure for determiig

More information

Journal of Multivariate Analysis. Superefficient estimation of the marginals by exploiting knowledge on the copula

Journal of Multivariate Analysis. Superefficient estimation of the marginals by exploiting knowledge on the copula Joural of Multivariate Aalysis 102 (2011) 1315 1319 Cotets lists available at ScieceDirect Joural of Multivariate Aalysis joural homepage: www.elsevier.com/locate/jmva Superefficiet estimatio of the margials

More information

x iu i E(x u) 0. In order to obtain a consistent estimator of β, we find the instrumental variable z which satisfies E(z u) = 0. z iu i E(z u) = 0.

x iu i E(x u) 0. In order to obtain a consistent estimator of β, we find the instrumental variable z which satisfies E(z u) = 0. z iu i E(z u) = 0. 27 However, β MM is icosistet whe E(x u) 0, i.e., β MM = (X X) X y = β + (X X) X u = β + ( X X ) ( X u ) \ β. Note as follows: X u = x iu i E(x u) 0. I order to obtai a cosistet estimator of β, we fid

More information

Estimation of a population proportion March 23,

Estimation of a population proportion March 23, 1 Social Studies 201 Notes for March 23, 2005 Estimatio of a populatio proportio Sectio 8.5, p. 521. For the most part, we have dealt with meas ad stadard deviatios this semester. This sectio of the otes

More information

Fall 2013 MTH431/531 Real analysis Section Notes

Fall 2013 MTH431/531 Real analysis Section Notes Fall 013 MTH431/531 Real aalysis Sectio 8.1-8. Notes Yi Su 013.11.1 1. Defiitio of uiform covergece. We look at a sequece of fuctios f (x) ad study the coverget property. Notice we have two parameters

More information

Machine Learning Theory Tübingen University, WS 2016/2017 Lecture 12

Machine Learning Theory Tübingen University, WS 2016/2017 Lecture 12 Machie Learig Theory Tübige Uiversity, WS 06/07 Lecture Tolstikhi Ilya Abstract I this lecture we derive risk bouds for kerel methods. We will start by showig that Soft Margi kerel SVM correspods to miimizig

More information

Output Analysis (2, Chapters 10 &11 Law)

Output Analysis (2, Chapters 10 &11 Law) B. Maddah ENMG 6 Simulatio Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should be doe

More information

Lecture Notes 15 Hypothesis Testing (Chapter 10)

Lecture Notes 15 Hypothesis Testing (Chapter 10) 1 Itroductio Lecture Notes 15 Hypothesis Testig Chapter 10) Let X 1,..., X p θ x). Suppose we we wat to kow if θ = θ 0 or ot, where θ 0 is a specific value of θ. For example, if we are flippig a coi, we

More information

Statistical Pattern Recognition

Statistical Pattern Recognition Statistical Patter Recogitio Classificatio: No-Parametric Modelig Hamid R. Rabiee Jafar Muhammadi Sprig 2014 http://ce.sharif.edu/courses/92-93/2/ce725-2/ Ageda Parametric Modelig No-Parametric Modelig

More information

Stochastic Simulation

Stochastic Simulation Stochastic Simulatio 1 Itroductio Readig Assigmet: Read Chapter 1 of text. We shall itroduce may of the key issues to be discussed i this course via a couple of model problems. Model Problem 1 (Jackso

More information

Department of Mathematics

Department of Mathematics Departmet of Mathematics Ma 3/103 KC Border Itroductio to Probability ad Statistics Witer 2017 Lecture 19: Estimatio II Relevat textbook passages: Larse Marx [1]: Sectios 5.2 5.7 19.1 The method of momets

More information

Statistical and Mathematical Methods DS-GA 1002 December 8, Sample Final Problems Solutions

Statistical and Mathematical Methods DS-GA 1002 December 8, Sample Final Problems Solutions Statistical ad Mathematical Methods DS-GA 00 December 8, 05. Short questios Sample Fial Problems Solutios a. Ax b has a solutio if b is i the rage of A. The dimesio of the rage of A is because A has liearly-idepedet

More information