Generalized Empirical Likelihood Estimators and Tests under Partial, Weak and Strong Identification

Size: px
Start display at page:

Download "Generalized Empirical Likelihood Estimators and Tests under Partial, Weak and Strong Identification"

Transcription

1 Generalized Empirical Likelihood Estimators and Tests under Partial, Weak and Strong Identification Patrik Guggenberger Department of Economics U.C.L.A. Richard J. Smith cemmap U.C.L and I.F.S. and Department of Economics University of Warwick First Draft: June 2002; This revision: October 2004

2 Abstract The purpose of this paper is to describe the performance of generalized empirical likelihood (GEL) methods for time series instrumental variable models specified by nonlinear moment restrictions as in Stock and Wright (2000) when identification may be weak. The paper makes two main contributions. Firstly, we show that all GEL estimators are first-order equivalent under weak identification. The GEL estimator under weak identification is inconsistent and has a nonstandard asymptotic distribution. Secondly, the paper proposes new GEL test statistics, which have chi-square asymptotic null distributions independent of the strength or weakness of identification. Consequently, unlike those for Wald and likelihood ratio statistics, the size of tests formed from these statistics is not distorted by the strength or weakness of identification. Modified versions of the statistics are presented for tests of hypotheses on parameter subvectors when the parameters not under test are strongly identified. Monte Carlo results for the linear instrumental variable regression model suggest that tests based on these statistics have very good size properties even in the presence of conditional heteroskedasticity. The tests have competitive power properties, especially for thick tailed or asymmetric error distributions. JEL Classification: C12, C31, C32 Keywords: Generalized Empirical Likelihood, Nonlinear Moment Conditions, Similar Tests, Weak Identification. This paper is a revision of Guggenberger s job market paper Generalized Empirical Likelihood Tests under Partial, Weak, and Strong Identification. We are thankful to the Editor, P.C.B. Phillips, and three referees for very helpful suggestions on an earlier version of this paper. Guggenberger gratefully acknowledges the continuous help and support of his advisor Donald Andrews who played a prominent role in the formulation of this paper. He thanks Peter Phillips and Joseph Altonji for their extremely valuable comments. We also thank Vadim Marner for help with the simulation section and John Chao, Guido Imbens, Michael Jansson, Frank Kleibergen, Marcelo Moreira, Jonathan Wright and Motohiro Yogo for helpful comments. Aspects of this research have been presented at the 2003 Econometric Society European Meetings, York Econometrics Workshop 2004, Seminaire Malinvaud, CREST- INSEE, and seminars at Albany, Alicante, Austin Texas, Brown, Chicago, Chicago GSB, Harvard/MIT, Irvine, Konstanz, Laval, Madison Wisconsin, Mannheim, Maryland, Penn, NYU, PennState, Pittsburgh, Princeton, Rice, Riverside, Rochester, San Diego, Texas A&M, UCLA, USC and Yale and we would like to thank all the participants. Guggenberger and Smith received financial support through a Carl Arvid Anderson Prize Fellowship and a 2002 Leverhulme Major Research Fellowship, respectively. Corresponding author: Patrik Guggenberger, Bunche Hall 8385, Department of Economics, U.C.L.A., Box , Los Angeles, CA guggenbe@econ.ucla.edu

3 1 Introduction It is often the case that the instrumental variables available to empirical researchers are only weakly correlated with the endogenous variables. That is, identification is weak. Phillips (1989), Nelson and Startz (1990) and a large literature following these early contributions show that in such situations classical normal and chi-square asymptotic approximations to the finite-sample distributions of instrumental variables estimators and statistics can be very poor. For example, even though likelihood ratio and Wald test statistics are asymptotically chi-square, use of chi-square critical values can lead to extreme size distortions in finite samples, see Dufour (1997). The purpose then of this paper is to ascertain the performance of generalized empirical likelihood (GEL) methods [Newey and Smith (2004), henceforth NS, Smith (1997, 2001)] for time series instrumental variable models specified by nonlinear moment restrictions when identification may be weak [as in Stock and Wright (2000), henceforth SW]. In particular, the paper makes two principal contributions. Firstly, the asymptotic distribution of the GEL estimator is derived for a weakly identified set-up. Secondly, the paper proposes new, theoretically and computationally attractive GEL test statistics. The asymptotic null distribution of these statistics is chi-square under partial [Phillips (1989)], weak [SW] and strong identification. Thus, thesizeoftests formed from these statistics is invariant to the strength or weakness of identification. Importantly, we also provide asymptotic power results for the various statistics suggestedinthispaper. GEL estimators and test statistics are alternatives to those based on generalized method of moments (GMM); see Hansen (1982), Newey (1985) and Newey and West (1987). GEL has received considerable attention recently due to its competitive bias properties. For example, NS show that for many models the asymptotic bias of empirical likelihood (EL) does not grow with the number of moment restrictions, while that of GMM estimators grows without bound, a finding that may imply favorable properties for GEL-based test statistics. Similar to the findings in Phillips (1984, 1989) and SW for limited information maximum likelihood (LIML), two stage least squares (2SLS) and GMM, GEL estimators of weakly identified parameters have nonstandard asymptotic distributions and are in general inconsistent. Therefore, inference based on the classical normal approximation is inappropriate under weak identification. As in NS for strong identification, the first-order asymptotics of the GEL estimator under weak identification do not depend on the choice of the GEL criterion function. This finding is rather surprising and contrasts with 2SLS and LIML estimators whose first-order asymptotic theory differs under weak identification. The statistics proposed here are asymptotically pivotal in contrast to classical Wald and likelihood ratio statistics no matter the strength of identification. The first statistic GELR ρ is based on the GEL criterion function and may be thought of as a nonparametric likelihood ratio statistic. Two further statistics generalize Kleibergen s (2001) GMM-based K-statistic to the GEL context. Like the K-statistic which is a quadratic form in the first order derivative vector of the continuous updating GMM objective function, the second GEL statistic S ρ is a score-type statistic being [1]

4 a quadratic form in the GEL criterion score vector. The third statistic LM ρ is similar in structure to a GMM Lagrange multiplier statistic [Newey and West (1987)] and is asymptotically equivalent to the score-type statistic being a quadratic form in thesamplemomentvector. Confidence regions constructed from the K- andgel score-type statistics are never empty and contain the continuous updating estimator (CUE) and GEL estimator respectively. All forms of GEL statistics admit limiting chi-square null distributions with degrees of freedom equal to the number of instrumental variables or moment conditions for the first statistic and the dimension of the parameter vector for the second and third statistics. In over-identified situations, therefore, tests based on the latter statistics should be expected to have better power properties than those based on the former. In many cases, an applied researcher is interested in inference on a parameter subvector rather than the whole parameter vector. Modified versions of these statistics are therefore suggested for the subvector case when the remaining parameters are strongly identified. Monte Carlo simulations for the i.i.d. linear instrumental variable model with a wide range of error distributions compare our test statistics to several others, including homoskedastic and heteroskedastic versions of the K-statistic of Kleibergen (2001, 2002a) and the similar conditional likelihood ratio statistic LR M of Moreira (2003). We find that our tests have very good size properties even in the presence of conditional heteroskedasticity. In contrast, the homoskedastic version of the K-statistic of Kleibergen (2002a) and the LR M -statistic of Moreira (2003) are size-distorted under conditional heteroskedasticity. Our tests have competitive power properties, especially for thick-tailed or asymmetric error distributions. Given the nonparametric construction of the GEL estimator, robustness of GEL-based test statistics to different error distributions should be expected. Like SW, our paper allows for both i.i.d. and martingale difference sequences (m.d.s.) but does not apply to more general time series models, see assumption M θ (ii) below. Allowing for m.d.s. observations covers various cases of intertemporal Euler equations applications and regression models with m.d.s. errors. Therefore, the extension from the i.i.d. linear (dealt with in the first chapter of the dissertation of the first author, Guggenberger (2003)) to the particular time series setting with nonlinear moment restrictions considered here seems worthwhile, especially since there is essentially no cost (in terms of complications of the proofs) to making this extension. The proofs for consistency and for the asymptotic distribution of the GEL estimator build on Guggenberger (2003) which adapt those given in NS for the i.i.d. strongly identified context. Subsequent to the i.i.d. linear version of this paper, two related papers have appeared. Firstly, Caner (2003) derives the asymptotic distribution of the exponential tilting (ET) estimator [see Imbens, Spady and Johnson (1998) and Kitamura and Stutzer (1997)] under weak identification with nonlinear moment restrictions for independent observations. Caner (2003) also obtains an ET version of the K-statistic for nonlinear moment restrictions. Secondly, Otsu (2003) considers GEL based tests under weak identification in a more general time series setting than considered here and examines the GEL criterion function statistic GELR ρ and a modified version of [2]

5 the K-statistic based on Kitamura and Stutzer s (1997) and Smith s (2001) kernel smoothed GEL estimator that is efficient under strong identification. The remainder of the paper is organized as follows. In Section 2, the model and the assumptions are discussed, the GEL estimator is briefly reviewed and the asymptotic distribution of the GEL estimator under weak identification is derived. Section 3 introduces the GEL-based test statistics. We derive their asymptotic limiting distribution and show that it is unaffected by the degree of identification. Section 4 generalizes these results to hypotheses involving subvectors of the unknown parameter vector. Section 5 describes the simulation results. All proofs are relegated to the Appendix. The following notation is used in the paper. The symbols d, p and denote convergence in distribution, convergence in probability and weak convergence of empirical processes, respectively. For the latter, see Andrews (1994) for a definition. For convergence almost surely we write a.s. and with probability approaching 1 is replaced by w.p.a.1. The space C i (M) contains all functions that are i times continuously differentiable on M. For a symmetric matrix A, A > 0 means that A is positive definite and λ min (A) and λ max (A) denote the smallest and largest eigenvalue of A in absolute value, respectively. By A 0 wedenotethetransposeofamatrixa. For a full column rank matrix A R k p and positive definite matrix K R k k,wedenotebyp A (K) the oblique projection matrix A(A 0 K 1 A) 1 A 0 K 1 on the column space of A in the metric K and define M A (K) :=I k P A (K), where I k is the k-dimensional identity matrix; we abbreviate this notation to P A and M A if K = I k.thesymbol denotes the Kronecker product. Furthermore, vec(m) stands for the column vectorization of the k p matrix M, i.e. if M =(m 1,..., m p ) then vec(m) =(m 0 1,..., m0 p )0. By we denote the Euclidean norm. 2 Estimation This section is concerned with the asymptotic distribution of the GEL estimator when some elements of the parameter vector of interest may be only weakly identified. Intuitively, then, the moment conditions which define the model may not be particularly informative about these parameters. 2.1 Model We consider models specified by a finite number of moment restrictions. Let {z i : i = 1,..., n} be R l -valued data and, for each n N, g n : G Θ R k a given function, where G R l and Θ R p denotes the parameter space. The model has a true parameter θ 0 for which the moment condition is satisfied. For g n (z i, θ) we will usually write g i (θ). Eg n (z i, θ 0 )=0 (2.1) [3]

6 Example 1: (I.i.d. Linear Instrumental Variable (IV) Regression.) Guggenberger (2003, first chapter) discusses in detail GEL estimation and testing for this model under weak identification. The structural form (SF) equation is given by and the reduced form (RF) for Y by y = Y θ 0 + u, (2.2) Y = ZΠ + V, (2.3) where y, u R n, Y,V R n p, Z R n k and Π R k p. The matrix Y may contain both exogenous and endogenous variables, Y = (X, W) say, where X R n p X and W R n p W denote the respective observation matrices of exogenous and endogenous variables. The variables Z =(X, Z W ) constitute a set of instruments for the endogenous variables W. Thefirst p X columns of Π equal the first p X columns of I k and the first p X columns of V are 0. DenotebyY i,v i,z i,... (i =1,..., n) the i th row of the matrix Y, V, Z,... written as a column vector. Assuming the instruments and the structural error are uncorrelated, Eu i Z i =0, it follows that Eg i (θ 0 )=0, where for each i =1,..., n, g i (θ) :=(y i Yi 0θ)Z i. Note that in this example g i (θ) depends on n if the reduced form coefficient matrix Π is modeled to depend on n, see Staiger and Stock (1997), where Π n = n 1/2 C for a fixed matrix C. Example 2: (Conditional Moment Restrictions.) As in SW the moment conditions may result from conditional moment restrictions. Assume E[h(Y i, θ 0 ) F i ]=0, where h : H Θ R k 1, H R k 2 and F i is the information set at time i. Let Z i be a k 3 -dimensional vector of instruments contained in F i. If g i (θ) :=h(y i, θ) Z i, then Eg i (θ 0 )=0follows by taking iterated expectations. In (2.1), k = k 1 k 3 and l = k 2 + k Assumptions This section is concerned with the asymptotic distribution of the GEL estimator for θ when some components of θ 0 =(α 0 0, β0 0) 0, α 0 say, α 0 A, A R p A, are only weakly identified. Intuitively, this means that the moment condition (2.1) is not very informative about α 0. For parameter vectors θ =(α 0, β 0 0) 0, Eg n (z i, θ) maybeveryclose to zero, not only for α close to α 0 but also when α is far from α 0. In that case, the restriction Eg n (z i, θ 0 )=0is not very helpful for making inference on α 0. Assumption ID below provides a theoretical asymptotic framework for this phenomenon, which is taken from Assumption C in SW (p.1061). We refer the reader to SW (pp ) which provides substantial detailed motivation for this assumption and an explanation of why it models α 0 as weakly and β 0 as strongly identified. To describe the moment and distributional assumptions, we require some additional notation. bg(θ) :=n 1 n P g i (θ), b G(θ) :=n 1 [4] n P G i (θ),

7 Ψ n (θ) :=n 1/2 (bg(θ) Ebg(θ)), bω(θ) :=n 1 P n g i (θ)g i (θ) 0, where, if defined, G i (θ) :=( g i / θ)(θ) R k p. For notational convenience, a subscript n has been omitted in certain expressions. Define the k k matrices 1 Ω(θ):= lim n En 1 np g i (θ)g i (θ) 0, (θ 1, θ 2 ):= lim n EΨ n(θ 1 )Ψ n (θ 2 ) 0 and (θ) := (θ, θ). Let θ =(α 0, β 0 ) 0, where α A, A R p A, β B, B R p B let N B denote an open neighborhood of β 0. and p A + p B = p. Also Assumption Θ: The true parameter θ 0 =(α 0 0, β0 0) 0 is in the interior of the compact space Θ = A B. Assumption ID: (i) Ebg(θ) =n 1/2 m 1n (θ)+m 2 (β), where m 1n,m 1 : Θ R k and m 2 : B R k are continuous functions such that m 1n (θ) m 1 (θ) uniformly on Θ, m 1 (θ 0 )=0and m 2 (β) =0if and only if β = β 0 ; (ii) m 2 C 1 (N ); (iii) Let M 2 (β) :=( m 2 / β)(β) R k p B. M 2 (β 0 ) has full column rank p B. Next we detail the necessary moment assumptions. 2 Assumption M: (i) max 1 i n sup θ Θ g i (θ) = o p (n 1/2 ); (ii) Ω( ) is in C 0 (A {β 0 }) and bounded on Θ, Ω(θ) is nonsingular for all θ A {β 0 }, sup θ Θ bω(θ) Ω(θ) = o p (1), sup θ A N n 1 P n g i(θ)g i (θ) 0 = O p (1); (iii) Ψ n Ψ, where Ψ(θ) is a Gaussian stochastic process on Θ with mean zero and covariance function EΨ(θ 1 )Ψ(θ 2 ) 0 = (θ 1, θ 2 ). For each ε > 0 there exists a M ε < such that Pr(sup θ A N Ψ(θ) M ε ) > 1 ε. M(i) adapts Assumption 1(d) of NS, E sup β B g i (β) ξ < for some ξ > 2, from the i.i.d. setting with strong identification (p A =0and thus θ = β and Θ = B) to the weakly identified set-up considered here. A sufficient condition for M(i) in the time series context and under ID is given by sup i 1 E sup g i (θ) ξ < for some ξ > 2. (2.4) θ Θ Indeed, a simple application of the Markov inequality shows that (2.4) implies max 1 i n sup θ Θ g i (θ) = O p (n 1/ξ )=o p (n 1/2 ). See the Appendix for a proof. M(ii), which adapts Assumption 1(e) of NS to the weakly identified set-up, ensures that bω(θ) is 1 Note that (θ) is Ω(θ) in SW. We choose our notation for Ω(θ) for consistency with NS. 2 Weak convergence here is defined with respect to the sup-norm on function spaces and Euclidean norm on R k. [5]

8 nonsingular for θ A N. M(iii) is essentially the high level Assumption B of SW (p.1059) that states that Ψ n obeys a functional central limit theorem. In Assumption B,SW provide primitive sufficient conditions for their Assumption B that can also be found in Andrews (1994). Note that the definition of weak convergence [Andrews (1994, p.2250)] and M(iii) imply that sup θ Θ Ψ n (θ) d sup θ Θ Ψ(θ) and, thus, also that sup θ Θ bg(θ) Ebg(θ) p 0. IntheproofofTheorem2below,werequire sup θ A N Ψ(θ) bounded in probability. It is interesting to note that for i.i.d. data, an application of the Borel-Cantelli Lemma shows that M(i) is implied by Assumption 1(d) of NS even if ξ =2; see Owen (1990, Lemma 3) for a proof. Hence, using Lemmas 7-9 given below, their Assumption 1(d) can be weakened to ξ 2 for the consistency and asymptotic normality of the GEL estimator under strong identification with i.i.d. data (see their Theorems 3.1 and 3.2). Therefore, for i.i.d. data, identical assumptions guarantee consistency and asymptotic normality for both GEL and two step efficient GMM estimators [Hansen (1982)]. Example 1 (cont.): See Guggenberger (2003). For the linear IV model (2.2) Assumption ID can be expressed as Assumption ID : Π = Π n =(Π An, Π B ) R k (p A+p B ), where p A + p B = p. For a fixed matrix C A R k p A, Π An = n 1/2 C A and Π B has full column rank. Under Assumption ID, i.i.d. data and instrument exogeneity it follows that Ebg(θ) =Eg i (θ) =E(Z i Z 0 i)(n 1/2 C A, Π B )(θ 0 θ), which implies that in the notation of ID(i), m 1n (θ) =m 1 (θ) =E(Z i Zi 0)C A(α 0 α) and m 2 (β) =E(Z i Zi 0)Π B(β 0 β). Also, note that Assumption ID includes the partially identified model of Phillips (1989). In particular, choosing p A and setting C A =0, one obtains a model in which Π may have any desired (less than full) rank. We now give simple sufficient conditions that imply Assumption M. Let U := (u, V ). Assumption M : (i) {(U i,z i ):i 1} are i.i.d.; (ii) EZ i Ui 0 =0; (iii) E Z i 4 <, Q ZZ := E(Z i Zi 0) > 0, Eu2 i Z izi 0, Eu iv ij Z i Zi 0 and EV ijv ik Z i Zi 0 exist and are finite for j, k =1,...,p, where V ij denotes the j th component of the vector V i ; (iv) Ω(θ) is nonsingular for all θ A {β 0 }. Assumptions M (i) and (ii) state that errors and exogenous variables are jointly i.i.d. and the standard instrument exogeneity assumption is satisfied whereas M (iii) and (iv) are technical conditions. The following lemma shows that Assumption M in the linear model implies Assumption M. Lemma 1 Suppose that Assumptions ID, M and Θ hold in the linear IV model (2.2). Then Assumption ID and M hold. [6]

9 Therefore the various technical conditions of Assumption M reduce to very simple moment conditions in the linear model. Note that M implies E[sup θ Θ g i (θ) ξ ] < for ξ =2. However, we do not need the assumption E[sup θ Θ g i (θ) ξ ] < for a ξ > 2 to prove n 1/2 -consistency of the GEL estimator of the strongly identified parameters. Assumption HOM: (Conditional Homoskedasticity) E(U i Ui 0 Z i)=σ U > 0. HOM, which is used in Staiger and Stock (1997), is sufficient for Assumption M (iv). That is, Assumptions M (i)-(iii) and HOM imply M (iv) under ID. This follows from Ω(θ) =Q ZZ vασ 0 uva v α for θ A {β 0 }, where vα 0 := (1, (α 0 α) 0 ) and Σ uva is the (1 + p A ) (1 + p A ) upper left submatrix of Σ U. However, M is more general than HOM because it allows for conditional heteroskedasticity. For example, u i = Z i ζ i,whereζ i N(0, 1) is independent of Z i N(0,I k ), is compatible with M. 2.3 The GEL Estimator This subsection provides a formal definition of the GEL estimator of θ 0. Let ρ be a real-valued function Q R, where Q is an open interval of the real line that contains 0 and bλ n (θ) :={λ R k : λ 0 g i (θ) Q for i =1,..., n}. (2.5) If defined, let ρ j (v) :=( j ρ/ v j )(v) and ρ j := ρ j (0) for nonnegative integers j. The GEL estimator is the solution to a saddle point problem 3 where b θρ := arg min θ Θ sup λ b Λ n(θ) bp ρ (θ, λ), (2.6) P bp ρ (θ, λ) :=(2 n ρ(λ 0 g i (θ))/n) 2ρ 0. (2.7) Assumption ρ: (i) ρ is concave on Q; (ii) ρ is C 2 in a neighborhood of 0 and ρ 1 = ρ 2 = 1. 3 For compact Θ, continuousρ and g i (i =1,..., n), theexistenceofanargmin b θ may be shown. In fact, sup b λ bλ n (θ) P (θ, λ), viewed as a function in θ, can be shown to be lower semicontinuous (ls). Afunctionf(x) is ls at x 0 if, for each real number c such that c<f(x 0), there exists an open neighborhood U of x 0 such that c<f(x) holds for all x U. The function f is said to be ls if it is ls at each x 0 of its domain. It is easily shown that ls functions on compact sets take on their minimum. Uniqueness of b θ, however, is not implied. As a simple example, consider the i.i.d. linear IV model in (2.2) when p =2and let the two components Y ij, (j =1, 2), ofy i be independent Bernoulli random variables. Then, for each n, the probability that Y i1 = Y i2 for every i =1,..., n is positive. If Y i1 = Y i2 for every i =1,..., n and b θ Θ is an argmin of sup b λ Λn(θ) b P (θ, λ), theneachθ Θ with θ 1 + θ 2 = b θ 1 + b θ 2 is as well. To uniquely define b θ, we could, for example, do the following. From the set of all vectors θ Θ that minimize sup b λ Λn(θ) b P (θ, λ), let b θ be the vector that has smallest first component. (If that does not pin down b θ uniquely, choose from the remaining vectors according to the second component and so on.) [7]

10 The definition of the GEL estimator b θ ρ is adopted from NS. We slightly modify their definition of bp ρ (θ, λ) by recentering and rescaling which simplifies the presentation. We usually write bp (θ, λ) for bp ρ (θ, λ) and b θ for b θ ρ. The most popular GEL estimators are the CUE, the EL and the ET estimator which correspond to ρ(v) = (1 + v) 2 /2, ρ(v) = ln(1 v) and ρ(v) = exp v, respectively. The EL estimator was introduced by Imbens (1997), Owen (1988, 1990) and Qin and Lawless (1994) and the ET estimator by Imbens, Spady and Johnson (1998) and Kitamura and Stutzer (1997). For a recent survey of GEL methods see Imbens (2002) First-Order Equivalence This subsection obtains the asymptotic distribution of the GEL estimator b θ ρ under Assumption ID. Theorem 2 shows that the weakly identified parameters of θ 0 are estimated inconsistently and their GEL estimator has a nonstandard limiting distribution while the GEL estimator of the strongly identified parameters is n 1/2 -consistent but no longer asymptotically normal. Analogous results are available for LIML or more generally for GMM; see Phillips (1984) and SW, Theorem 1. The rather surprising finding is that the first-order asymptotic theory under ID is identical for all GEL estimators b θ ρ,aslongasρ satisfies Assumption ρ. 5 This is in contrast to the asymptotic theory for k-class estimators under weak identification. As shown in Staiger and Stock (1997, Theorem 1), the nonstandard asymptotic distribution of the k-class estimator depends on κ defined by n(k 1) d κ. Therefore, LIML and 2SLS are not first-order equivalent under weak identification. If defined, let λ(θ) be such that bp (θ, λ(θ)) = max bp (θ, λ). λ Λ b n(θ) For θ =(α 0, β 0 ) 0 Θ and b R p B let P (θ,b):=[ψ(θ)+m 1 (θ)+m 2 (β)b] 0 Ω(θ) 1 [Ψ(θ)+m 1 (θ)+m 2 (β)b]. The next theorem establishes the asymptotic behavior of b θ =(bα 0, b β 0 ) 0 under Assumption ID. Theorem 2 Suppose Assumptions Θ, ID,M and ρ are satisfied. (i) bα is in general inconsistent and n 1/2 ( b β β 0 )=O p (1). 4 A choice of bω(θ) 1 as the weighting matrix W T (θ T (θ)) in SW ((2.2), p.1058), i.e. ( P n gi(θ)gi(θ)0 /n) 1, results in the CUE which is the GEL estimator based on ρ(v) = (1+v) 2 /2; see NS, Theorem 2.1. Hansen, Heaton and Yaron (1996) and Pakes and Pollard (1989) define the (GMM) CUE using the centered weighting matrix ( P n (g i(θ) bg(θ))(g i (θ) bg(θ)) 0 /n) 1. However, as shown in NS, fn.1, both versions of the CUE are numerically identical. 5 The proof of Theorem 2 uses a second order Taylor expansion of P b ρ(θ, λ) in λ about 0 in which the only impact of ρ asymptotically is through ρ 1 and ρ 2 which are both 1. [8]

11 (ii) The following more precise result holds. For any fixed M>0let B M := {b R p B : b M} and define θ αb := (α 0, β n 1/2 b 0 ) 0. Then, for (α,b) A B M, n bp (θ αb, λ(θ αb )) P αb := P ((α 0, β 0 0) 0,b). Assume there exists a random element (α,b ) A R p B such that a.s. P α b < inf (α,b) (A R p B )\G P αb for every open set G that contains (α, β ).Then (bα,n 1/2 ( b β β 0 )) d (α,b ). Remark 1: Theorem 2(ii) is analogous to Theorem 1 in SW (p.1062) for GMM. Note that from (A.5) λ( b θ)= (Ω((bα 0, β 0 0 )0 )+o p (1)) 1 bg( b θ)=o p (n 1/2 ) as bg( b θ)= O p (n 1/2 ). Moreover, using the proof of Theorem 2 it can be shown that n 1/2 λ( b θ) d Ω((α 0, β 0 0) 0 ) 1 M M2 (β 0 )(Ω((α 0, β 0 0) 0 ))[Ψ((α 0, β 0 0) 0 )+m 1 ((α 0, β 0 0) 0 )]. Therefore, like β, b although n 1/2 -consistent, n 1/2 λ( b θ) admits a nonstandard asymptotic distribution, see Caner (2003). If p A =0, where all parameters are strongly identified, n 1/2 λ( b θ) d N(0, Ω 1 M M2 (Ω) M M2 (Ω) 0 Ω 1 ), where M 2 := M 2 (β 0 ), Ω := Ω(β 0 ) and := (β 0 ). The covariance matrix reduces to Ω 1 M M2 (Ω) in the i.i.d. case. The proof of Theorem 2 also provides a formula (A.7) for b (α) :=argmin b R p B P αb for α A. In particular, if p A =0, (A.7) shows that where n 1/2 (ˆβ β 0 ) d N(0,V(β 0 )), V (β 0 ):=(M 0 2Ω 1 M 2 ) 1 M 0 2Ω 1 Ω 1 M 2 (M 0 2Ω 1 M 2 ) 1. The matrix V (β 0 ) simplifies to (M2 0Ω 1 M 2 ) 1 in the i.i.d. case and thus the above formula coincides with Theorem 3.2 of NS. However, the asymptotic variance matrix of n 1/2 ( β β b 0 ) in the time series context is in general different from that in NS and the estimator β b as defined above would thus be inefficient. Block methods as in Kitamura (1997) or kernel-smoothing methods as in Smith (2001) can be used for efficient GEL estimation in a time series context with strong identification. In the case p A > 0, the fact that the asymptotic distribution of the strongly identified parameter estimates is in general nonnormal is a consequence of the inconsistent estimation of α 0. Remark 2: Given the nonnormal asymptotic distribution of the GMM and GEL parameter estimates under weak identification (established in Theorem 1 in SW and Theorem 2 above, respectively) the asymptotic distribution of test statistics based on these estimators, such as t- or Wald statistics, will also be nonstandard and nonpivotal. Furthermore, these limiting distributions depend on quantities that cannot be consistently estimated [see Staiger and Stock (1997, p.564)] which militates against their use for the construction of test statistics or confidence regions for θ 0. The next section introduces alternative approaches that overcome these difficulties. Example 1 (cont.): The specialization of Theorem 2 to the i.i.d. modelofexample1wasderivedinguggenberger(2003). linear IV [9]

12 3 Test Statistics This section proposes several statistics to test the simple hypothesis H 0 : θ = θ 0 versus H 1 : θ 6= θ 0. We establish that they are asymptotically pivotal quantities and have limiting chi-square null distributions under Assumption ID. Therefore these statistics lead to tests whose size properties are unaffected by the strength or weakness of identification. For the time series set-up considered here there are at least two other statistics that share this property, namely Anderson and Rubin s (1949) ARand Kleibergen s (2001, 2002a) K-statistic. The first statistic GELR ρ (θ) that we describe may be interpreted as a likelihood ratio statistic. It has an asymptotic χ 2 (k) null distribution and is first-order equivalent to the AR-statistic. The second set of statistics in this section, S ρ (θ) and LM ρ (θ), arebasedonthefocof bp ρ (θ, λ) with respect to θ. Each has a limiting χ 2 (p) null distribution and is first-order equivalent to the K-statistic. For a recent survey on robust inference methods with weak identification, see Stock, Wright and Yogo (2002). To motivate the first statistic, consider an i.i.d. setting. In this case, GELR EL (θ) may be thought of in terms of the empirical likelihood ratio statistic R(θ), where 6 R(θ) := Q sup { n w i w 1,...,w n (1/n) P n P w i g i (θ) =0,w i > 0, n w i =1}. (3.1) The criterion function R(θ) can be interpreted as a nonparametric likelihood ratio. Indeed, for fixed θ Θ and given g i (θ), (i =1,..., n), the numerator of R(θ) is the maximal probability of observing the given sample g i (θ), (i =1,..., n), over all discrete probability distributions (w 1,..., w n ) on the sample such that the sample analogue P n w ig i (θ) =0of the moment condition (2.1) is satisfied. The denominator (1/n) n equals the unrestricted maximal probability. It can then be shown that 2lnR(θ 0 )= n bp EL (θ 0, λ(θ 0 )), where λ(θ 0 ) is the vector of Lagrange multipliers associated with the k moment restrictions P n w ig i (θ 0 )=0in the constrained maximization problem (3.1). Therefore, the renormalized criterion function of the EL estimator has an interpretation as 2 times the logarithm of the likelihood ratio statistic R(θ 0 ). Generalizing from the i.i.d. to the time series set-up and from EL to arbitrary ρ, the first statistic we consider is the renormalized GEL criterion function (2.7) GELR ρ (θ) :=n bp ρ (θ, λ(θ)). (3.2) Secondly, following Kleibergen s (2001) suggestion of constructing a statistic from the FOC with respect to θ but in the GMM framework, we construct a test statistic basedonthegelfocfor b θ. If the minimum of the objective function bp (θ, λ(θ)) is obtained in the interior of Θ, the score vector with respect to θ must equal 0 at b θ, i.e. λ( b np θ) 0 ρ 1 (λ( b θ) 0 g i ( b θ))g i ( b θ)/n =0 0. (3.3) 6 NS show that under certain conditions including {z i : i 1} i.i.d., ˆθ EL =argmax θ Θ ln R(θ). Thus ln R(θ) can be interpreted as the criterion function of the EL estimator. [10]

13 For θ Θ, define the k p matrix P D ρ (θ) := n ρ 1 (λ(θ) 0 g i (θ))g i (θ)/n. (3.4) Thus,(3.3)maybewrittenasλ( b θ) 0 D ρ ( b θ)=0 0. The test statistic is therefore given as a quadratic form in the score vector λ(θ) 0 D ρ (θ) evaluated at the hypothesized parameter vector θ ³ S ρ (θ) :=nλ(θ) 0 D ρ (θ) D ρ (θ) 0 eω(θ) 1 D ρ (θ) 1 Dρ (θ) 0 λ(θ), (3.5) where ρ is any function satisfying Assumption ρ and eω(θ) is a consistent estimator of (θ). We also consider the following variant of S ρ (θ) ³ LM ρ (θ) :=nbg(θ) 0 eω(θ) 1 D ρ (θ) D ρ (θ) 0 eω(θ) 1 D ρ (θ) 1 Dρ (θ) 0 eω(θ) 1 bg(θ) (3.6) that substitutes (θ) 1 bg(θ) for λ(θ) in S ρ (θ); see (A.8) in the Appendix, where it is shown that n 1/2 λ(θ) = (θ) 1 n 1/2 bg(θ)+o p (1). The statistic LM ρ (θ) is similar to a GMM Lagrange multiplier statistic given in Newey and West (1987). To make the origin of the above test statistics clearer, we adopted the notation LM ρ (θ) and S ρ (θ) respectively in place of K ρ (θ) and Kρ L (θ) previously given to the statistics in Guggenberger (2003). To use these statistics for hypothesis tests or for the constructionofconfidence regions one needs a consistent estimator eω(θ) of (θ). Under assumptions given below, the sample average bω(θ) may be used for eω(θ). 7 Note that when ρ(v) = (1 + v) 2 /2, that is, in the case of a GEL CUE criterion, the GEL statistics S ρ (θ) (3.5) and LM ρ (θ) (3.6) are then identical and given in closed form by (3.6) with λ(θ) = bω(θ) bg(θ) in the definition of D CUE (θ),where bω(θ) denotes any generalized inverse of bω(θ). As noted above the GEL and GMM CUE are numerically identical. However, although the structures of the two statistics coincide, in general, the statistic LM CUE (θ) and Kleibergen s (2001) K-statistic based on the GMM CUE are not identical. The reason is that, in general, the first order derivatives of the GMM and GEL CUE objective functions are not equal. The K-statistic in Kleibergen (2001) is basedonthefocofthegmmcuecriterionbg(θ) 0 eω(θ) 1 bg(θ). It replaces D CUE (θ) in LM CUE (θ) by bg(θ) ev (θ)(i p (eω(θ) 1 bg(θ))), where ev (θ) is an estimator for lim n E{n 1 P n P n j=1 [G i(θ) EG i (θ)][(i p g j (θ) 0 ) E(I p g j (θ) 0 )]}. The particular assumptions made on (θ) determine the choice of estimators eω(θ) and ev (θ). If the sample average bω(θ) is used for eω(θ) and P n G i(θ)(i p g i (θ) 0 )/n for ev (θ), then the statistic LM CUE (θ) and the K-statistic coincide. Some intuition for these test statistics is provided under strong identification. Under strong identification, NS show consistency of b θ. Therefore, if the FOC (3.3) 7 Alternatively, instead of using uniform weights in the definition of b Ω(θ) onecoulduseempirical probabilities that are associated with each GEL estimator; see section 2 of NS. However, preliminary Monte Carlo simulations (not reported here) showed no clear improvement in the performance of the test statistics. [11]

14 hold at b θ, then, at least asymptotically, they also hold at the true value θ 0. The statistic S ρ (θ) can then be interpreted as a quadratic form whose criterion is expected to be small at the true value θ 0. If, however, all parameters are weakly identified this argument is no longer valid. From Theorem 2, b θ is no longer consistent for θ 0. Therefore, although the FOC hold at b θ, this does not imply automatically that they also approximately hold at the true value θ 0. However, it can be shown that under weak identification the FOC λ(θ) 0 D ρ (θ) =0 0 not only hold at b θ w.p.a.1 but are satisfied to order O p (T 1 ) uniformly over θ Θ. Thus, under weak identification the FOC do not pin down the true value θ 0. Consequently, the power properties of hypothesis tests for θ 0 basedonthestatisticss ρ (θ) or LM ρ (θ) should be expected to be better under strong rather than weak identification. Size properties however are not affected by the strength or weakness of identification. This is corroborated by the Monte Carlo simulations reported below and theoretically by Theorem 4. We now turn to consider the asymptotic distribution of GELR ρ (θ) evaluated at avectorθ =(α 0, β 0 0) 0, thus allowing for a fixed alternative in the weakly identified components. We need the following local version of Assumption M. Assumption M θ : Let θ =(α 0, β 0 0 )0 A {β 0 }.Suppose(i) max 1 i n g i (θ) = o p (n 1/2 ); (ii) (θ) > 0, bω(θ) p (θ), n 1 P n g i(θ)g i (θ) 0 = O p (1); (iii) Ψ n (θ) d Ψ(θ), where Ψ(θ) N(0, (θ)). Note that for θ =(α 0, β 0 0) 0 M θ (iii) and ID imply that bg(θ) p 0. Thus, under M θ (iii) and ID the assumption bω(θ) p (θ) in M θ (ii) is equivalent to the assumption n 1 P n (g i(θ) bg(θ))(g i (θ) bg(θ)) 0 p (θ) for θ =(α 0, β 0 0) 0, which is Assumption D in SW. The assumption rules out many interesting time series cases. However, it is more general than an i.i.d. assumption. The assumption allows for m.d.s. and thus covers various intertemporal Euler equations applications and regression models with m.d.s. errors. As in SW, a possible application is the intertemporally separable CCAPM. Without assuming bω(θ) p (θ), a limiting chi-square distribution would no longer obtain in the following theorems. The problem arises because the GEL estimator as defined in (2.6) is not efficient in the time series set-up considered here. Theorem 3 Suppose ID, M θ (i)-(iii) and ρ hold for θ =(α 0, β 0 0) 0.Then GELR ρ (θ) d χ 2 (k, δ), where the noncentrality parameter δ = m 1 (θ) 0 (θ) 1 m 1 (θ). Inparticular, GELR ρ (θ 0 ) d χ 2 (k). To describe the asymptotic distribution of the statistics LM ρ (θ 0 ) and S ρ (θ 0 ),we need the following additional assumptions. Write G i (θ) =(G ia (θ),g ib (θ)), where the matrices G ia (θ) and G ib (θ) areofcolumndimensionp A and p B, respectively. Let θ =(α 0, β 0 0) 0 A {β 0 } and M Θ an open neighborhood of θ. [12]

15 Assumption M θ : (cont.) (iv) bg( ) is differentiable at θ a.s. for each θ M, bg(θ) is integrable (with respect to the probability measure) for all θ M, sup θ M bg(θ) is integrable, m 1n C 1 (Θ) and M 1n ( ) :=( m 1n / θ)( ) converges uniformly on Θ to some function; (v) n 1 P n (vecg ia(θ))gi 0(θ) p A (θ) ( A (θ) is definedin(vii)),eω(θ) p (θ), bg B (θ) :=n 1 P n G ib(θ) p E bg B (θ); (vi) n 1 P n G ia(θ) g i (θ) = O p (1), n 3/2 P n P G ib(θ) g i (θ) = o p (1); (vii) n 1/2 n ((vec(g ia(θ) EG ia (θ))) 0, (g i (θ) Eg i (θ)) 0 ) 0 d N(0,V(θ)), where V (θ) := lim n var(n 1/2 P n ((vecg ia(θ)) 0,g i (θ) 0 ) 0 ) R k(p A+1) k(p A +1) is positive definite. In M θ (vii) write µ AA V (θ) = A 0 A (θ), where AA (θ) R p Ak p A k. Assumption M θ (iv) allows the interchange of the order of integration and differentiation in Assumption ID, i.e. ( Ebg/ θ)(θ) =E bg(θ). It also guarantees that M 1n (θ) M 1 (θ) :=( m 1 / θ)(θ). Assumption ID and M θ thus imply that E bg(θ) =n 1/2 M 1n (θ)+(0,m 2 (β 0 )) (0,M 2 (β 0 )), (3.7) where by ID the limit matrix (0,M 2 (β 0 )) is of deficient rank p B. Assumption M θ (v) is comparable to M θ (ii), where bω(θ) p (θ) was assumed and extends M θ (ii) to crossproduct terms in vecg ia (θ) and g i (θ). AssumptionM θ (vi) contains additional weak technical conditions that guarantee that certain expressions in the proof of Theorem 4 are asymptotically negligible. The key assumption is M θ (vii) which is a stronger version of M θ (iii) and states that a CLT theorem holds simultaneously for the centered g i (θ) and part of the derivative matrix, namely vecg ia (θ). Write LM ρ (θ) =nbg 0 eω 1 D(D 0 eω 1 D) 1 D 0 eω 1 bg, where D = D ρ (θ) and eω = eω(θ). As shown in the proof of Theorem 4, for θ =(α 0, β 0 0) 0, Assumptions ID, ρ, M θ (i)-(vi) and bg A (θ) :=n 1 P n G ia(θ) p E bg A (θ) imply that D p (0,M 2 (β 0 )). Therefore, the probability limit of D 0 eω 1 D is not invertible without renormalisation. Define D := DΛ where the p p diagonal matrix Λ := diag(n 1/2,...,n 1/2, 1,...,1) with first p A diagonal elements equal to n 1/2 and the remainder equal to unity. Hence, LM ρ (θ) =nbg 0 eω 1 D (D 0 eω 1 D ) 1 D 0 eω 1 bg. (3.8) In the proof of Theorem 4 we show that under Assumptions ID, ρ and M θ (i)-(vi) µ vecd IkpA = vec(0, M 2 (β 0 ))+ A (θ) (θ) n 1/2 n P µ vecgia (θ) g i (θ) +o p (1). The additional M θ (vii), in particular the full rank assumption on V (θ), ensuresthat D 0 eω 1 D has full rank w.p.a.1. Assumption M θ (vii) is closely related to Kleibergen s (2001) Assumption 1. Unlike Kleibergen (2001), however, we assume ID which, as just [13]

16 shown, requires that we are specific about which part of the derivative matrix G i (θ) together with g i (θ) satisfies a CLT with full rank covariance matrix, namely G ia (θ) which corresponds to the weakly identified parameters. Assumption ID possesses the advantage that we can obtain the asymptotic distribution of the test statistics under fixedalternativesoftheformθ =(α 0, β 0 0) 0 and therefore derive asymptotic power results. Theorem 4 Suppose ID, M θ (i)-(vii) and ρ hold for θ =(α 0, β 0 0) 0.Then, S ρ (θ),lm ρ (θ) d (W (α)+ζ) 0 (W (α)+ζ), where the random p-vector W(α) is defined in (A.11) of the Appendix, ζ N(0,I p ) and W and ζ are independent. We have W(α 0 ) 0 and therefore S ρ (θ 0 ),LM ρ (θ 0 ) d χ 2 (p). Remark 1: The proof of Theorem 4 crucially hinges on the fact that n 1/2 λ(θ 0 ) and vecd ρ (θ 0 ) (suitably normalized) from the FOC (3.3) are asymptotically jointly normally distributed and, moreover, are asymptotically independent. A similar result is critical also for Kleibergen s (2001) K-statistic which generalizes Brown and Newey s (1998) analysis of efficient GMM moment estimation to the weakly identified set-up. Therefore, by using an appropriate weighting matrix in the quadratic forms (3.5) and (3.6) that define S ρ (θ 0 ) and LM ρ (θ 0 ), respectively, we immediately obtain the limiting χ 2 (p) null distribution of Theorem 4. Remark 2: Theorems 3 and 4 provide a straightforward method to construct confidence regions or hypothesis tests on θ 0. For example, a critical region for a test of the hypothesis H 0 : θ = θ 0 versus H 1 : θ 6= θ 0 at significance level r is given by {GELR ρ (θ 0 ) χ 2 r(k)}, where χ 2 r(k) denotes the (1 r)-critical value from the χ 2 (k) distribution. A (1 r)-confidence region for θ 0 is obtained by inverting the justdescribed test, i.e. {θ Θ : GELR ρ (θ) χ 2 r(k)}. Confidence regions and hypothesis tests based on S ρ (θ) and LM ρ (θ) may be constructed in a similar fashion. Remark 3: Theorems 3 and 4 demonstrate that GELR ρ (θ 0 ), S ρ (θ 0 ) and LM ρ (θ 0 ) are asymptotically pivotal statistics under weak and strong identification. Therefore, the size of tests based on these statistics should not vary much with the strength or weakness of identification in finite samples. However, these results also show that under weak identification hypothesis tests based on these statistics are inconsistent. For example, the noncentrality parameter δ does not diverge for increasing sample size and therefore the rejection rate under the alternative does not converge to 1. This is intuitively reasonable because if identification is weak one cannot learn much about α 0 from the data. Remark 4: A drawback of GELR ρ (θ 0 ) is that its limiting null distribution has degrees of freedom equal to k, the number of moment conditions rather than the dimension of the parameter vector. In general, this has a negative impact on the power properties of hypothesis tests based on GELR ρ (θ 0 ) in over-identified situations. [14]

17 On the other hand, the limiting null distribution of S ρ (θ 0 ) and LM ρ (θ 0 ) has degrees of freedom equal to p. Therefore the power of tests based on these statistics should not be negatively affected by a high degree of over-identification. Anderson and Rubin s (1949) AR-statistic has a χ 2 (k) limiting null distribution as well. Kleibergen (2002b) shows that it equals the sum of two independent statistics, namely the K- statistic [Kleibergen (2002a)] and a J-statistic [Hansen (1982)] that test location and misspecification, respectively. Mutatis mutandis, a similar decomposition may be given for the GELR ρ (θ 0 ) statistic in terms of S ρ (θ 0 ) or LM ρ (θ 0 ). Remark 5: SW (Theorem 2) derive the asymptotic distribution under weak identification of the analogue of GELR ρ (θ 0 ) for the (GMM) CUE which is also a χ 2 (k) null distribution. In the i.i.d. context, Qin and Lawless (1994, Theorem 2) propose the statistic 2lnR( b θ EL ) 2lnR(θ 0 ) to test the hypothesis H 0 : θ = θ 0 which is shown to be asymptotically distributed as χ 2 (p) under strong identification. However, due to the dependence on b θ EL, this statistic is no longer asymptotically pivotal and thus leads to size-distorted tests under weak identification. Example 1 (cont.): Guggenberger (2003) derives the results given in Theorems 3and4underAssumptionsΘ, ID,M andρ allowing for alternatives α A and Pitman drift in the data generating process for the strongly identified parameters to assess the asymptotic power properties of the tests; i.e. ID holds and for some fixed b R p B, y = Y (θ 0 +n 1/2 (0 0,b 0 ) 0 )+u. To simplify our presentation here we ignore the possibility of Pitman drift. Results for the i.i.d. linear IV model follow directly from the above theorems because, as is easily shown, Assumptions ID, M, ρ and V (θ) > 0 imply M θ for any consistent estimator eω(θ) of Ω(θ). Inparticular,V (θ) has a simple representation. For θ =(α 0, β 0 0) 0, Ω(θ) = (θ) and AA (θ) =E(V ia ViA 0 Z izi 0), where V ia consists of the first p A components of V i in (2.3). 4 Subvector Test Statistics We now assume that interest is focussed on the subvector α 0 R p A of θ 0 =(α 0 0, β0 0) 0. However, we no longer maintain Assumption ID. In particular, α 0 may not necessarily be weakly identified. To adapt the test statistics of section 3 to the subvector case, the basic idea is to replace β by a GEL estimator b β(α). To make this idea more rigorous, define the GEL estimator b β(α) for β 0 bβ(α) :=argmin sup bp ((α 0, β 0 ) 0, λ). (4.1) β B λ Λ b n(α 0,β 0 ) 0 We usually write β b for β(α) b where there is no ambiguity. A requirement of the analysis below is that β b p β 0 if α = α 0. Therefore, we assume that the nuisance parameters β 0 which are not involved in the hypothesis under test are strongly identified; see Theorem 2. On the other hand, the components of α 0 can be weakly or strongly identified and in Assumption ID α below we assume the former holds for α 01 [15]

18 and the latter for α 02, where α 0 =(α 0 01, α0 02 )0. The main advantage of the subvector test statistics introduced in this section is that asymptotically they have accurate sizes independent of whether α 0 is weakly or strongly identified. This property is not shared by classical tests based on Wald, likelihood ratio or Lagrange multiplier statistics. In general, they have correct size only if θ 0 is strongly identified. In contrast, the subvector tests in Guggenberger and Wolf (2004) based on a subsampling approach have exact asymptotic sizes without any additional identification assumption. Let θ =(α 0 1, α0 2, β0 ) 0, where α j A j, A j R p A j, (j =1, 2), pa1 + p A2 = p A and β B, B R p B.AlsoletN A 2 B be an open neighborhood of (α 02, β 0 ). Assumption A: Thetrueparameterθ 0 =(α 0 01, α0 02, β0 0) 0 is in the interior of the compact space Θ, where Θ = A 1 A 2 B. Assumption ID α :(i)ebg(θ) =n 1/2 m 1n (θ)+m 2 (α 2, β), where m 1n,m 1 : Θ R k and m 2 : A 2 B R k are continuous functions such that m 1n (θ) m 1 (θ) uniformly on Θ, m 1 (θ 0 )=0and m 2 (α 2, β) =0if and only if (α 2, β) =(α 02, β 0 ); (ii) m 2 C 1 (N ); (iii) Let M 2 ( ) :=( m 2 / (α 0 2, β0 ) 0 )( ) R k (p A 2 +p B ). M 2 (α 02, β 0 ) has full column rank p A2 + p B. Assumption ID α implies α 01 and (α 02, β 0 ) are weakly and strongly identified, respectively. Assumptions A and ID α adapt Assumptions Θ and ID in section 3 for the subvector case. Let b θα := (α 0, b β(α) 0 ) 0 and θ αβ := (α 0, β 0 ) 0. We now introduce the subvector statistics. Recall the definition of GELR ρ (θ) in (3.2). The GELR ρ subvector test statistic is given by GELR sub ρ (α) :=GELR ρ ( b θ α ). We need the following technical assumptions for our derivation of its asymptotic distribution. To obtain theoretical power properties, we again allow a fixed alternative for the weakly identified components, α 01 here. For a 1 A 1 let a := (a 0 1, α0 02 )0 be a fixed vector whose strongly identified component α 02 is the same as the corresponding component of the true parameter vector θ 0. Let M B be an open neighborhood of β 0. Assumption M α :(i)max 1 i n sup β B g i (θ aβ ) = o p (n 1/2 ); (ii) sup β B bω(θ aβ ) Γ(θ aβ ) p 0 for some matrix Γ( ) that is uniformly bounded on {θ aβ : β B}, continuous at θ aβ0 and Γ(θ aβ0 )= (θ aβ0 ) > 0 and n 1 P n g i(θ aβ0 )g i (θ aβ0 ) 0 = O p (1); (iii) Ψ n (θ aβ0 ) d Ψ(θ aβ0 ), where Ψ(θ aβ0 ) N(0, (θ aβ0 )); (iv) bg B ( ) := n 1 P n ( g i/ β)( ) exists at θ aβ a.s. for each β M, bg(θ aβ ) is integrable for all β M, sup β M bg B (θ aβ ) is integrable, m 1n / β( ) is continuous at θ aβ a.s. for each β M and m 1n / β(θ aβ ) converges uniformly over β M to some function; (v) bg(θ aβ ) p Ebg(θ aβ ) uniformly over β B, G b B (θ aβ ) p EG b B (θ aβ ) uniformly over [16]

19 β M; (vi) sup β M n 1 P n G ib(θ aβ ) = O p (1). Mutatis mutandis M α has the same interpretation as M θ. For example M α (ii) guarantees that λ max (bω( b θ a )) is bounded and λ min (bω( b θ a )) is bounded away from zero w.p.a.1 while M α (iv) and ID α imply that for β M we have E bg B (θ aβ )= n 1/2 ( m 1n / β)(θ aβ )+( m 2 / β)(α 02, β) ( m 2 / β)(α 02, β). ByID α this last matrix has full column rank for β = β 0. If we assume that the G ib (θ aβ ), (i =1,..., n), viewed as functions of β, are continuous at β 0 a.s., then we can simplify M α (vi) to n 1 P n G ib(θ aβ0 ) = O p (1). A similar comment holds for the assumptions in the continuation of M α below. Theorem 5 Assume 1 p A <p. Suppose Assumptions A, ID α,m α (i)-(vi) and ρ hold for some a 1 A 1 and a =(a 0 1, α0 02 )0.Then, GELR sub ρ (a) d χ 2 (k p B, δ), where the noncentrality parameter δ is given by δ := m 1 (θ aβ0 ) 0 (θ aβ0 ) 1 M M2β (α 02,β 0 )( (θ aβ0 ))m 1 (θ aβ0 ), where M 2β ( ) :=( m 2 / β)( ) R k p B.Inparticular, GELR sub ρ (α 0) d χ 2 (k p A ). Theorem 5 confirms that the subvector statistic GELRρ sub (α 0 ), like the full vector statistic GELR ρ (θ 0 ), is asymptotically pivotal. As above, this result can be used to construct hypothesis tests and confidence regions for α 0. We now generalize the statistics S ρ and LM ρ to the subvector case. The asymptotic variance matrices of n 1/2 bg( b θ α ) and n 1/2 λ( b θ α ) differ from those of n 1/2 bg(θ αβ0 ) and n 1/2 λ(θ αβ0 ). Therefore different weighting matrices are required in the quadratic forms defining these subvector statistics. In the Appendix (proof of Theorem 6) it is shown that for a =(a 0 1, α0 02 )0, λ( b θ a ) = argmax λ Λn( b b θ a) bp ( b θ a, λ) exists w.p.a.1 and that n 1/2 λ( b θ a ) is asymptotically normal with covariance matrix M(a), where for α =(α 0 1, α0 2 )0 R p A M(α) := (θ αβ0 ) 1 M M2β (α 2,β 0 )( (θ αβ0 )). (4.2) The first p A elements of the FOC (3.3), evaluated at b θ a,are For α R p A,let λ( b np θ a ) 0 ρ 1 (λ( b θ a ) 0 g i ( b θ a ))G ia ( b θ a )/n =0 0. (4.3) P D ρ (α) := n ρ 1 (λ( b θ α ) 0 g i ( b θ α ))G ia ( b θ α )/n R k p A, (4.4) [17]

Exponential Tilting with Weak Instruments: Estimation and Testing

Exponential Tilting with Weak Instruments: Estimation and Testing Exponential Tilting with Weak Instruments: Estimation and Testing Mehmet Caner North Carolina State University January 2008 Abstract This article analyzes exponential tilting estimator with weak instruments

More information

Subsampling Tests of Parameter Hypotheses and Overidentifying Restrictions with Possible Failure of Identification

Subsampling Tests of Parameter Hypotheses and Overidentifying Restrictions with Possible Failure of Identification Subsampling Tests of Parameter Hypotheses and Overidentifying Restrictions with Possible Failure of Identification Patrik Guggenberger Department of Economics U.C.L.A. Michael Wolf Department of Economics

More information

GENERALIZED EMPIRICAL LIKELIHOOD TESTS

GENERALIZED EMPIRICAL LIKELIHOOD TESTS GENERALIZED EMPIRICAL LIKELIHOOD TESTS IN TIME SERIES MODELS WITH POTENTIAL IDENTIFICATION FAILURE Patrik Guggenberger Richard J. Smith THE INSTITUTE FOR FISCAL STUDIES DEPARTMENT OF ECONOMICS, UCL cemmap

More information

What s New in Econometrics. Lecture 15

What s New in Econometrics. Lecture 15 What s New in Econometrics Lecture 15 Generalized Method of Moments and Empirical Likelihood Guido Imbens NBER Summer Institute, 2007 Outline 1. Introduction 2. Generalized Method of Moments Estimation

More information

Specification Test for Instrumental Variables Regression with Many Instruments

Specification Test for Instrumental Variables Regression with Many Instruments Specification Test for Instrumental Variables Regression with Many Instruments Yoonseok Lee and Ryo Okui April 009 Preliminary; comments are welcome Abstract This paper considers specification testing

More information

High Dimensional Empirical Likelihood for Generalized Estimating Equations with Dependent Data

High Dimensional Empirical Likelihood for Generalized Estimating Equations with Dependent Data High Dimensional Empirical Likelihood for Generalized Estimating Equations with Dependent Data Song Xi CHEN Guanghua School of Management and Center for Statistical Science, Peking University Department

More information

Spring 2017 Econ 574 Roger Koenker. Lecture 14 GEE-GMM

Spring 2017 Econ 574 Roger Koenker. Lecture 14 GEE-GMM University of Illinois Department of Economics Spring 2017 Econ 574 Roger Koenker Lecture 14 GEE-GMM Throughout the course we have emphasized methods of estimation and inference based on the principle

More information

Approximate Distributions of the Likelihood Ratio Statistic in a Structural Equation with Many Instruments

Approximate Distributions of the Likelihood Ratio Statistic in a Structural Equation with Many Instruments CIRJE-F-466 Approximate Distributions of the Likelihood Ratio Statistic in a Structural Equation with Many Instruments Yukitoshi Matsushita CIRJE, Faculty of Economics, University of Tokyo February 2007

More information

A CONDITIONAL LIKELIHOOD RATIO TEST FOR STRUCTURAL MODELS. By Marcelo J. Moreira 1

A CONDITIONAL LIKELIHOOD RATIO TEST FOR STRUCTURAL MODELS. By Marcelo J. Moreira 1 Econometrica, Vol. 71, No. 4 (July, 2003), 1027 1048 A CONDITIONAL LIKELIHOOD RATIO TEST FOR STRUCTURAL MODELS By Marcelo J. Moreira 1 This paper develops a general method for constructing exactly similar

More information

Testing Overidentifying Restrictions with Many Instruments and Heteroskedasticity

Testing Overidentifying Restrictions with Many Instruments and Heteroskedasticity Testing Overidentifying Restrictions with Many Instruments and Heteroskedasticity John C. Chao, Department of Economics, University of Maryland, chao@econ.umd.edu. Jerry A. Hausman, Department of Economics,

More information

What s New in Econometrics. Lecture 13

What s New in Econometrics. Lecture 13 What s New in Econometrics Lecture 13 Weak Instruments and Many Instruments Guido Imbens NBER Summer Institute, 2007 Outline 1. Introduction 2. Motivation 3. Weak Instruments 4. Many Weak) Instruments

More information

Instrumental Variables Estimation and Weak-Identification-Robust. Inference Based on a Conditional Quantile Restriction

Instrumental Variables Estimation and Weak-Identification-Robust. Inference Based on a Conditional Quantile Restriction Instrumental Variables Estimation and Weak-Identification-Robust Inference Based on a Conditional Quantile Restriction Vadim Marmer Department of Economics University of British Columbia vadim.marmer@gmail.com

More information

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors Laura Mayoral, IAE, Barcelona GSE and University of Gothenburg U. of Gothenburg, May 2015 Roadmap Testing for deviations

More information

Instrumental Variable Estimation with Heteroskedasticity and Many Instruments

Instrumental Variable Estimation with Heteroskedasticity and Many Instruments Instrumental Variable Estimation with Heteroskedasticity and Many Instruments Jerry A. Hausman Department of Economics M.I.T. Tiemen Woutersen Department of Economics University of Arizona Norman Swanson

More information

Lecture 11 Weak IV. Econ 715

Lecture 11 Weak IV. Econ 715 Lecture 11 Weak IV Instrument exogeneity and instrument relevance are two crucial requirements in empirical analysis using GMM. It now appears that in many applications of GMM and IV regressions, instruments

More information

GMM Estimation and Testing

GMM Estimation and Testing GMM Estimation and Testing Whitney Newey July 2007 Idea: Estimate parameters by setting sample moments to be close to population counterpart. Definitions: β : p 1 parameter vector, with true value β 0.

More information

Research Article Optimal Portfolio Estimation for Dependent Financial Returns with Generalized Empirical Likelihood

Research Article Optimal Portfolio Estimation for Dependent Financial Returns with Generalized Empirical Likelihood Advances in Decision Sciences Volume 2012, Article ID 973173, 8 pages doi:10.1155/2012/973173 Research Article Optimal Portfolio Estimation for Dependent Financial Returns with Generalized Empirical Likelihood

More information

INFERENCE APPROACHES FOR INSTRUMENTAL VARIABLE QUANTILE REGRESSION. 1. Introduction

INFERENCE APPROACHES FOR INSTRUMENTAL VARIABLE QUANTILE REGRESSION. 1. Introduction INFERENCE APPROACHES FOR INSTRUMENTAL VARIABLE QUANTILE REGRESSION VICTOR CHERNOZHUKOV CHRISTIAN HANSEN MICHAEL JANSSON Abstract. We consider asymptotic and finite-sample confidence bounds in instrumental

More information

Econometrics II - EXAM Answer each question in separate sheets in three hours

Econometrics II - EXAM Answer each question in separate sheets in three hours Econometrics II - EXAM Answer each question in separate sheets in three hours. Let u and u be jointly Gaussian and independent of z in all the equations. a Investigate the identification of the following

More information

A more powerful subvector Anderson and Rubin test in linear instrumental variables regression. Patrik Guggenberger Pennsylvania State University

A more powerful subvector Anderson and Rubin test in linear instrumental variables regression. Patrik Guggenberger Pennsylvania State University A more powerful subvector Anderson and Rubin test in linear instrumental variables regression Patrik Guggenberger Pennsylvania State University Joint work with Frank Kleibergen (University of Amsterdam)

More information

Conditional Linear Combination Tests for Weakly Identified Models

Conditional Linear Combination Tests for Weakly Identified Models Conditional Linear Combination Tests for Weakly Identified Models Isaiah Andrews JOB MARKET PAPER Abstract This paper constructs powerful tests applicable in a wide range of weakly identified contexts,

More information

A more powerful subvector Anderson Rubin test in linear instrumental variables regression

A more powerful subvector Anderson Rubin test in linear instrumental variables regression A more powerful subvector Anderson Rubin test in linear instrumental variables regression Patrik Guggenberger Department of Economics Pennsylvania State University Frank Kleibergen Department of Quantitative

More information

Likelihood Ratio Based Test for the Exogeneity and the Relevance of Instrumental Variables

Likelihood Ratio Based Test for the Exogeneity and the Relevance of Instrumental Variables Likelihood Ratio Based est for the Exogeneity and the Relevance of Instrumental Variables Dukpa Kim y Yoonseok Lee z September [under revision] Abstract his paper develops a test for the exogeneity and

More information

Estimation and Inference with Weak Identi cation

Estimation and Inference with Weak Identi cation Estimation and Inference with Weak Identi cation Donald W. K. Andrews Cowles Foundation Yale University Xu Cheng Department of Economics University of Pennsylvania First Draft: August, 2007 Revised: March

More information

Single Equation Linear GMM

Single Equation Linear GMM Single Equation Linear GMM Eric Zivot Winter 2013 Single Equation Linear GMM Consider the linear regression model Engodeneity = z 0 δ 0 + =1 z = 1 vector of explanatory variables δ 0 = 1 vector of unknown

More information

Partial Identification and Confidence Intervals

Partial Identification and Confidence Intervals Partial Identification and Confidence Intervals Jinyong Hahn Department of Economics, UCLA Geert Ridder Department of Economics, USC September 17, 009 Abstract We consider statistical inference on a single

More information

Closest Moment Estimation under General Conditions

Closest Moment Estimation under General Conditions Closest Moment Estimation under General Conditions Chirok Han Victoria University of Wellington New Zealand Robert de Jong Ohio State University U.S.A October, 2003 Abstract This paper considers Closest

More information

Lasso Maximum Likelihood Estimation of Parametric Models with Singular Information Matrices

Lasso Maximum Likelihood Estimation of Parametric Models with Singular Information Matrices Article Lasso Maximum Likelihood Estimation of Parametric Models with Singular Information Matrices Fei Jin 1,2 and Lung-fei Lee 3, * 1 School of Economics, Shanghai University of Finance and Economics,

More information

Inference with Weak Instruments

Inference with Weak Instruments Inference with Weak Instruments Donald W. K. Andrews 1 Cowles Foundation for Research in Economics Yale University James H. Stock Department of Economics Harvard University March 2005 Revised: September

More information

Adjusted Empirical Likelihood for Long-memory Time Series Models

Adjusted Empirical Likelihood for Long-memory Time Series Models Adjusted Empirical Likelihood for Long-memory Time Series Models arxiv:1604.06170v1 [stat.me] 21 Apr 2016 Ramadha D. Piyadi Gamage, Wei Ning and Arjun K. Gupta Department of Mathematics and Statistics

More information

The properties of L p -GMM estimators

The properties of L p -GMM estimators The properties of L p -GMM estimators Robert de Jong and Chirok Han Michigan State University February 2000 Abstract This paper considers Generalized Method of Moment-type estimators for which a criterion

More information

Asymptotic Distributions of Instrumental Variables Statistics with Many Instruments

Asymptotic Distributions of Instrumental Variables Statistics with Many Instruments CHAPTER 6 Asymptotic Distributions of Instrumental Variables Statistics with Many Instruments James H. Stock and Motohiro Yogo ABSTRACT This paper extends Staiger and Stock s (1997) weak instrument asymptotic

More information

Volume 30, Issue 1. Measuring the Intertemporal Elasticity of Substitution for Consumption: Some Evidence from Japan

Volume 30, Issue 1. Measuring the Intertemporal Elasticity of Substitution for Consumption: Some Evidence from Japan Volume 30, Issue 1 Measuring the Intertemporal Elasticity of Substitution for Consumption: Some Evidence from Japan Akihiko Noda Graduate School of Business and Commerce, Keio University Shunsuke Sugiyama

More information

Weak Identification in Maximum Likelihood: A Question of Information

Weak Identification in Maximum Likelihood: A Question of Information Weak Identification in Maximum Likelihood: A Question of Information By Isaiah Andrews and Anna Mikusheva Weak identification commonly refers to the failure of classical asymptotics to provide a good approximation

More information

Generalized Neyman Pearson optimality of empirical likelihood for testing parameter hypotheses

Generalized Neyman Pearson optimality of empirical likelihood for testing parameter hypotheses Ann Inst Stat Math (2009) 61:773 787 DOI 10.1007/s10463-008-0172-6 Generalized Neyman Pearson optimality of empirical likelihood for testing parameter hypotheses Taisuke Otsu Received: 1 June 2007 / Revised:

More information

Generalized Method of Moment

Generalized Method of Moment Generalized Method of Moment CHUNG-MING KUAN Department of Finance & CRETA National Taiwan University June 16, 2010 C.-M. Kuan (Finance & CRETA, NTU Generalized Method of Moment June 16, 2010 1 / 32 Lecture

More information

Weak Instruments, Weak Identification, and Many Instruments, Part II

Weak Instruments, Weak Identification, and Many Instruments, Part II NBER Summer Institute What s New in Econometrics ime Series Lecture 4 July 14, 2008 Weak Instruments, Weak Identification, and Many Instruments, Part II Revised July 22, 2008 4-1 Outline Lecture 3 1) What

More information

Missing dependent variables in panel data models

Missing dependent variables in panel data models Missing dependent variables in panel data models Jason Abrevaya Abstract This paper considers estimation of a fixed-effects model in which the dependent variable may be missing. For cross-sectional units

More information

Review of Classical Least Squares. James L. Powell Department of Economics University of California, Berkeley

Review of Classical Least Squares. James L. Powell Department of Economics University of California, Berkeley Review of Classical Least Squares James L. Powell Department of Economics University of California, Berkeley The Classical Linear Model The object of least squares regression methods is to model and estimate

More information

DA Freedman Notes on the MLE Fall 2003

DA Freedman Notes on the MLE Fall 2003 DA Freedman Notes on the MLE Fall 2003 The object here is to provide a sketch of the theory of the MLE. Rigorous presentations can be found in the references cited below. Calculus. Let f be a smooth, scalar

More information

Inference For High Dimensional M-estimates. Fixed Design Results

Inference For High Dimensional M-estimates. Fixed Design Results : Fixed Design Results Lihua Lei Advisors: Peter J. Bickel, Michael I. Jordan joint work with Peter J. Bickel and Noureddine El Karoui Dec. 8, 2016 1/57 Table of Contents 1 Background 2 Main Results and

More information

Robust Con dence Intervals in Nonlinear Regression under Weak Identi cation

Robust Con dence Intervals in Nonlinear Regression under Weak Identi cation Robust Con dence Intervals in Nonlinear Regression under Weak Identi cation Xu Cheng y Department of Economics Yale University First Draft: August, 27 This Version: December 28 Abstract In this paper,

More information

1 Procedures robust to weak instruments

1 Procedures robust to weak instruments Comment on Weak instrument robust tests in GMM and the new Keynesian Phillips curve By Anna Mikusheva We are witnessing a growing awareness among applied researchers about the possibility of having weak

More information

Closest Moment Estimation under General Conditions

Closest Moment Estimation under General Conditions Closest Moment Estimation under General Conditions Chirok Han and Robert de Jong January 28, 2002 Abstract This paper considers Closest Moment (CM) estimation with a general distance function, and avoids

More information

A Robust Test for Weak Instruments in Stata

A Robust Test for Weak Instruments in Stata A Robust Test for Weak Instruments in Stata José Luis Montiel Olea, Carolin Pflueger, and Su Wang 1 First draft: July 2013 This draft: November 2013 Abstract We introduce and describe a Stata routine ivrobust

More information

Estimation and Inference with Weak, Semi-strong, and Strong Identi cation

Estimation and Inference with Weak, Semi-strong, and Strong Identi cation Estimation and Inference with Weak, Semi-strong, and Strong Identi cation Donald W. K. Andrews Cowles Foundation Yale University Xu Cheng Department of Economics University of Pennsylvania This Version:

More information

Asymptotics for Nonlinear GMM

Asymptotics for Nonlinear GMM Asymptotics for Nonlinear GMM Eric Zivot February 13, 2013 Asymptotic Properties of Nonlinear GMM Under standard regularity conditions (to be discussed later), it can be shown that where ˆθ(Ŵ) θ 0 ³ˆθ(Ŵ)

More information

LECTURE ON HAC COVARIANCE MATRIX ESTIMATION AND THE KVB APPROACH

LECTURE ON HAC COVARIANCE MATRIX ESTIMATION AND THE KVB APPROACH LECURE ON HAC COVARIANCE MARIX ESIMAION AND HE KVB APPROACH CHUNG-MING KUAN Institute of Economics Academia Sinica October 20, 2006 ckuan@econ.sinica.edu.tw www.sinica.edu.tw/ ckuan Outline C.-M. Kuan,

More information

AN EMPIRICAL LIKELIHOOD RATIO TEST FOR NORMALITY

AN EMPIRICAL LIKELIHOOD RATIO TEST FOR NORMALITY Econometrics Working Paper EWP0401 ISSN 1485-6441 Department of Economics AN EMPIRICAL LIKELIHOOD RATIO TEST FOR NORMALITY Lauren Bin Dong & David E. A. Giles Department of Economics, University of Victoria

More information

Estimation of Dynamic Regression Models

Estimation of Dynamic Regression Models University of Pavia 2007 Estimation of Dynamic Regression Models Eduardo Rossi University of Pavia Factorization of the density DGP: D t (x t χ t 1, d t ; Ψ) x t represent all the variables in the economy.

More information

Generalized Method of Moments Estimation

Generalized Method of Moments Estimation Generalized Method of Moments Estimation Lars Peter Hansen March 0, 2007 Introduction Generalized methods of moments (GMM) refers to a class of estimators which are constructed from exploiting the sample

More information

Economics 583: Econometric Theory I A Primer on Asymptotics

Economics 583: Econometric Theory I A Primer on Asymptotics Economics 583: Econometric Theory I A Primer on Asymptotics Eric Zivot January 14, 2013 The two main concepts in asymptotic theory that we will use are Consistency Asymptotic Normality Intuition consistency:

More information

The outline for Unit 3

The outline for Unit 3 The outline for Unit 3 Unit 1. Introduction: The regression model. Unit 2. Estimation principles. Unit 3: Hypothesis testing principles. 3.1 Wald test. 3.2 Lagrange Multiplier. 3.3 Likelihood Ratio Test.

More information

A more powerful subvector Anderson Rubin test in linear instrumental variable regression

A more powerful subvector Anderson Rubin test in linear instrumental variable regression A more powerful subvector Anderson Rubin test in linear instrumental variable regression Patrik Guggenberger Department of Economics Pennsylvania State University Frank Kleibergen Department of Quantitative

More information

Regularized Empirical Likelihood as a Solution to the No Moment Problem: The Linear Case with Many Instruments

Regularized Empirical Likelihood as a Solution to the No Moment Problem: The Linear Case with Many Instruments Regularized Empirical Likelihood as a Solution to the No Moment Problem: The Linear Case with Many Instruments Pierre Chaussé November 29, 2017 Abstract In this paper, we explore the finite sample properties

More information

Long-Run Covariability

Long-Run Covariability Long-Run Covariability Ulrich K. Müller and Mark W. Watson Princeton University October 2016 Motivation Study the long-run covariability/relationship between economic variables great ratios, long-run Phillips

More information

GMM ESTIMATION AND UNIFORM SUBVECTOR INFERENCE WITH POSSIBLE IDENTIFICATION FAILURE

GMM ESTIMATION AND UNIFORM SUBVECTOR INFERENCE WITH POSSIBLE IDENTIFICATION FAILURE Econometric Theory, 30, 2014, 287 333. doi:10.1017/s0266466613000315 GMM ESTIMATION AND UNIFORM SUBVECTOR INFERENCE WITH POSSIBLE IDENTIFICATION FAILURE DONALD W.K. ANDREWS Cowles Foundation for Research

More information

Location Properties of Point Estimators in Linear Instrumental Variables and Related Models

Location Properties of Point Estimators in Linear Instrumental Variables and Related Models Location Properties of Point Estimators in Linear Instrumental Variables and Related Models Keisuke Hirano Department of Economics University of Arizona hirano@u.arizona.edu Jack R. Porter Department of

More information

Regression. Saraswata Chaudhuri, Thomas Richardson, James Robins and Eric Zivot. Working Paper no. 73. Center for Statistics and the Social Sciences

Regression. Saraswata Chaudhuri, Thomas Richardson, James Robins and Eric Zivot. Working Paper no. 73. Center for Statistics and the Social Sciences Split-Sample Score Tests in Linear Instrumental Variables Regression Saraswata Chaudhuri, Thomas Richardson, James Robins and Eric Zivot Working Paper no. 73 Center for Statistics and the Social Sciences

More information

Exogeneity tests and weak identification

Exogeneity tests and weak identification Cireq, Cirano, Départ. Sc. Economiques Université de Montréal Jean-Marie Dufour Cireq, Cirano, William Dow Professor of Economics Department of Economics Mcgill University June 20, 2008 Main Contributions

More information

Weak Instruments and the First-Stage Robust F-statistic in an IV regression with heteroskedastic errors

Weak Instruments and the First-Stage Robust F-statistic in an IV regression with heteroskedastic errors Weak Instruments and the First-Stage Robust F-statistic in an IV regression with heteroskedastic errors Uijin Kim u.kim@bristol.ac.uk Department of Economics, University of Bristol January 2016. Preliminary,

More information

OPTIMAL TWO-SIDED INVARIANT SIMILAR TESTS FOR INSTRUMENTAL VARIABLES REGRESSION DONALD W. K. ANDREWS, MARCELO J. MOREIRA AND JAMES H.

OPTIMAL TWO-SIDED INVARIANT SIMILAR TESTS FOR INSTRUMENTAL VARIABLES REGRESSION DONALD W. K. ANDREWS, MARCELO J. MOREIRA AND JAMES H. OPTIMAL TWO-SIDED INVARIANT SIMILAR TESTS FOR INSTRUMENTAL VARIABLES REGRESSION BY DONALD W. K. ANDREWS, MARCELO J. MOREIRA AND JAMES H. STOCK COWLES FOUNDATION PAPER NO. 1168 COWLES FOUNDATION FOR RESEARCH

More information

ROBUST CONFIDENCE SETS IN THE PRESENCE OF WEAK INSTRUMENTS By Anna Mikusheva 1, MIT, Department of Economics. Abstract

ROBUST CONFIDENCE SETS IN THE PRESENCE OF WEAK INSTRUMENTS By Anna Mikusheva 1, MIT, Department of Economics. Abstract ROBUST CONFIDENCE SETS IN THE PRESENCE OF WEAK INSTRUMENTS By Anna Mikusheva 1, MIT, Department of Economics Abstract This paper considers instrumental variable regression with a single endogenous variable

More information

Finite Sample Performance of A Minimum Distance Estimator Under Weak Instruments

Finite Sample Performance of A Minimum Distance Estimator Under Weak Instruments Finite Sample Performance of A Minimum Distance Estimator Under Weak Instruments Tak Wai Chau February 20, 2014 Abstract This paper investigates the nite sample performance of a minimum distance estimator

More information

Let us first identify some classes of hypotheses. simple versus simple. H 0 : θ = θ 0 versus H 1 : θ = θ 1. (1) one-sided

Let us first identify some classes of hypotheses. simple versus simple. H 0 : θ = θ 0 versus H 1 : θ = θ 1. (1) one-sided Let us first identify some classes of hypotheses. simple versus simple H 0 : θ = θ 0 versus H 1 : θ = θ 1. (1) one-sided H 0 : θ θ 0 versus H 1 : θ > θ 0. (2) two-sided; null on extremes H 0 : θ θ 1 or

More information

Increasing the Power of Specification Tests. November 18, 2018

Increasing the Power of Specification Tests. November 18, 2018 Increasing the Power of Specification Tests T W J A. H U A MIT November 18, 2018 A. This paper shows how to increase the power of Hausman s (1978) specification test as well as the difference test in a

More information

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3 Hypothesis Testing CB: chapter 8; section 0.3 Hypothesis: statement about an unknown population parameter Examples: The average age of males in Sweden is 7. (statement about population mean) The lowest

More information

A Primer on Asymptotics

A Primer on Asymptotics A Primer on Asymptotics Eric Zivot Department of Economics University of Washington September 30, 2003 Revised: October 7, 2009 Introduction The two main concepts in asymptotic theory covered in these

More information

University of Pavia. M Estimators. Eduardo Rossi

University of Pavia. M Estimators. Eduardo Rossi University of Pavia M Estimators Eduardo Rossi Criterion Function A basic unifying notion is that most econometric estimators are defined as the minimizers of certain functions constructed from the sample

More information

GMM based inference for panel data models

GMM based inference for panel data models GMM based inference for panel data models Maurice J.G. Bun and Frank Kleibergen y this version: 24 February 2010 JEL-code: C13; C23 Keywords: dynamic panel data model, Generalized Method of Moments, weak

More information

Instrumental Variables Estimation in Stata

Instrumental Variables Estimation in Stata Christopher F Baum 1 Faculty Micro Resource Center Boston College March 2007 1 Thanks to Austin Nichols for the use of his material on weak instruments and Mark Schaffer for helpful comments. The standard

More information

Single Equation Linear GMM with Serially Correlated Moment Conditions

Single Equation Linear GMM with Serially Correlated Moment Conditions Single Equation Linear GMM with Serially Correlated Moment Conditions Eric Zivot October 28, 2009 Univariate Time Series Let {y t } be an ergodic-stationary time series with E[y t ]=μ and var(y t )

More information

Goodness-of-Fit Tests for Time Series Models: A Score-Marked Empirical Process Approach

Goodness-of-Fit Tests for Time Series Models: A Score-Marked Empirical Process Approach Goodness-of-Fit Tests for Time Series Models: A Score-Marked Empirical Process Approach By Shiqing Ling Department of Mathematics Hong Kong University of Science and Technology Let {y t : t = 0, ±1, ±2,

More information

Robust Backtesting Tests for Value-at-Risk Models

Robust Backtesting Tests for Value-at-Risk Models Robust Backtesting Tests for Value-at-Risk Models Jose Olmo City University London (joint work with Juan Carlos Escanciano, Indiana University) Far East and South Asia Meeting of the Econometric Society

More information

Chapter 1. GMM: Basic Concepts

Chapter 1. GMM: Basic Concepts Chapter 1. GMM: Basic Concepts Contents 1 Motivating Examples 1 1.1 Instrumental variable estimator....................... 1 1.2 Estimating parameters in monetary policy rules.............. 2 1.3 Estimating

More information

NUCLEAR NORM PENALIZED ESTIMATION OF INTERACTIVE FIXED EFFECT MODELS. Incomplete and Work in Progress. 1. Introduction

NUCLEAR NORM PENALIZED ESTIMATION OF INTERACTIVE FIXED EFFECT MODELS. Incomplete and Work in Progress. 1. Introduction NUCLEAR NORM PENALIZED ESTIMATION OF IERACTIVE FIXED EFFECT MODELS HYUNGSIK ROGER MOON AND MARTIN WEIDNER Incomplete and Work in Progress. Introduction Interactive fixed effects panel regression models

More information

More Powerful Tests for Homogeneity of Multivariate Normal Mean Vectors under an Order Restriction

More Powerful Tests for Homogeneity of Multivariate Normal Mean Vectors under an Order Restriction Sankhyā : The Indian Journal of Statistics 2007, Volume 69, Part 4, pp. 700-716 c 2007, Indian Statistical Institute More Powerful Tests for Homogeneity of Multivariate Normal Mean Vectors under an Order

More information

Testing Statistical Hypotheses

Testing Statistical Hypotheses E.L. Lehmann Joseph P. Romano Testing Statistical Hypotheses Third Edition 4y Springer Preface vii I Small-Sample Theory 1 1 The General Decision Problem 3 1.1 Statistical Inference and Statistical Decisions

More information

ROBUST CONFIDENCE SETS IN THE PRESENCE OF WEAK INSTRUMENTS By Anna Mikusheva 1, MIT, Department of Economics. Abstract

ROBUST CONFIDENCE SETS IN THE PRESENCE OF WEAK INSTRUMENTS By Anna Mikusheva 1, MIT, Department of Economics. Abstract ROBUST CONFIDENCE SETS IN THE PRESENCE OF WEAK INSTRUMENTS By Anna Mikusheva 1, MIT, Department of Economics Abstract This paper considers instrumental variable regression with a single endogenous variable

More information

Online Appendix. j=1. φ T (ω j ) vec (EI T (ω j ) f θ0 (ω j )). vec (EI T (ω) f θ0 (ω)) = O T β+1/2) = o(1), M 1. M T (s) exp ( isω)

Online Appendix. j=1. φ T (ω j ) vec (EI T (ω j ) f θ0 (ω j )). vec (EI T (ω) f θ0 (ω)) = O T β+1/2) = o(1), M 1. M T (s) exp ( isω) Online Appendix Proof of Lemma A.. he proof uses similar arguments as in Dunsmuir 979), but allowing for weak identification and selecting a subset of frequencies using W ω). It consists of two steps.

More information

Robust Two Step Confidence Sets, and the Trouble with the First Stage F Statistic

Robust Two Step Confidence Sets, and the Trouble with the First Stage F Statistic Robust Two Step Confidence Sets, and the Trouble with the First Stage F Statistic Isaiah Andrews Discussion by Bruce Hansen September 27, 2014 Discussion (Bruce Hansen) Robust Confidence Sets Sept 27,

More information

CIRJE-F-577 On Finite Sample Properties of Alternative Estimators of Coefficients in a Structural Equation with Many Instruments

CIRJE-F-577 On Finite Sample Properties of Alternative Estimators of Coefficients in a Structural Equation with Many Instruments CIRJE-F-577 On Finite Sample Properties of Alternative Estimators of Coefficients in a Structural Equation with Many Instruments T. W. Anderson Stanford University Naoto Kunitomo University of Tokyo Yukitoshi

More information

The Influence Function of Semiparametric Estimators

The Influence Function of Semiparametric Estimators The Influence Function of Semiparametric Estimators Hidehiko Ichimura University of Tokyo Whitney K. Newey MIT July 2015 Revised January 2017 Abstract There are many economic parameters that depend on

More information

ORTHOGONALITY TESTS IN LINEAR MODELS SEUNG CHAN AHN ARIZONA STATE UNIVERSITY ABSTRACT

ORTHOGONALITY TESTS IN LINEAR MODELS SEUNG CHAN AHN ARIZONA STATE UNIVERSITY ABSTRACT ORTHOGONALITY TESTS IN LINEAR MODELS SEUNG CHAN AHN ARIZONA STATE UNIVERSITY ABSTRACT This paper considers several tests of orthogonality conditions in linear models where stochastic errors may be heteroskedastic

More information

Introduction to the Mathematical and Statistical Foundations of Econometrics Herman J. Bierens Pennsylvania State University

Introduction to the Mathematical and Statistical Foundations of Econometrics Herman J. Bierens Pennsylvania State University Introduction to the Mathematical and Statistical Foundations of Econometrics 1 Herman J. Bierens Pennsylvania State University November 13, 2003 Revised: March 15, 2004 2 Contents Preface Chapter 1: Probability

More information

Inference about Clustering and Parametric. Assumptions in Covariance Matrix Estimation

Inference about Clustering and Parametric. Assumptions in Covariance Matrix Estimation Inference about Clustering and Parametric Assumptions in Covariance Matrix Estimation Mikko Packalen y Tony Wirjanto z 26 November 2010 Abstract Selecting an estimator for the variance covariance matrix

More information

Econometrica, Vol. 69, No. 6 (November, 2001), ASYMPTOTIC OPTIMALITY OF EMPIRICAL LIKELIHOOD FOR TESTING MOMENT RESTRICTIONS

Econometrica, Vol. 69, No. 6 (November, 2001), ASYMPTOTIC OPTIMALITY OF EMPIRICAL LIKELIHOOD FOR TESTING MOMENT RESTRICTIONS Econometrica, Vol. 69, No. 6 (November, 200), 66 672 ASYMPTOTIC OPTIMALITY OF EMPIRICAL LIKELIHOOD FOR TESTING MOMENT RESTRICTIONS By Yuichi Kitamura introduction Economic theory often provides testable

More information

A Local Generalized Method of Moments Estimator

A Local Generalized Method of Moments Estimator A Local Generalized Method of Moments Estimator Arthur Lewbel Boston College June 2006 Abstract A local Generalized Method of Moments Estimator is proposed for nonparametrically estimating unknown functions

More information

ESTIMATION OF NONLINEAR BERKSON-TYPE MEASUREMENT ERROR MODELS

ESTIMATION OF NONLINEAR BERKSON-TYPE MEASUREMENT ERROR MODELS Statistica Sinica 13(2003), 1201-1210 ESTIMATION OF NONLINEAR BERKSON-TYPE MEASUREMENT ERROR MODELS Liqun Wang University of Manitoba Abstract: This paper studies a minimum distance moment estimator for

More information

Econ 510 B. Brown Spring 2014 Final Exam Answers

Econ 510 B. Brown Spring 2014 Final Exam Answers Econ 510 B. Brown Spring 2014 Final Exam Answers Answer five of the following questions. You must answer question 7. The question are weighted equally. You have 2.5 hours. You may use a calculator. Brevity

More information

Imbens/Wooldridge, Lecture Notes 13, Summer 07 1

Imbens/Wooldridge, Lecture Notes 13, Summer 07 1 Imbens/Wooldridge, Lecture Notes 13, Summer 07 1 What s New in Econometrics NBER, Summer 2007 Lecture 13, Wednesday, Aug 1st, 2.00-3.00pm Weak Instruments and Many Instruments 1. Introduction In recent

More information

Exercises Chapter 4 Statistical Hypothesis Testing

Exercises Chapter 4 Statistical Hypothesis Testing Exercises Chapter 4 Statistical Hypothesis Testing Advanced Econometrics - HEC Lausanne Christophe Hurlin University of Orléans December 5, 013 Christophe Hurlin (University of Orléans) Advanced Econometrics

More information

Joint Estimation of Risk Preferences and Technology: Further Discussion

Joint Estimation of Risk Preferences and Technology: Further Discussion Joint Estimation of Risk Preferences and Technology: Further Discussion Feng Wu Research Associate Gulf Coast Research and Education Center University of Florida Zhengfei Guan Assistant Professor Gulf

More information

Consistent Parameter Estimation for Conditional Moment Restrictions

Consistent Parameter Estimation for Conditional Moment Restrictions Consistent Parameter Estimation for Conditional Moment Restrictions Shih-Hsun Hsu Department of Economics National Taiwan University Chung-Ming Kuan Institute of Economics Academia Sinica Preliminary;

More information

A Robust Approach to Estimating Production Functions: Replication of the ACF procedure

A Robust Approach to Estimating Production Functions: Replication of the ACF procedure A Robust Approach to Estimating Production Functions: Replication of the ACF procedure Kyoo il Kim Michigan State University Yao Luo University of Toronto Yingjun Su IESR, Jinan University August 2018

More information

SIMILAR-ON-THE-BOUNDARY TESTS FOR MOMENT INEQUALITIES EXIST, BUT HAVE POOR POWER. Donald W. K. Andrews. August 2011

SIMILAR-ON-THE-BOUNDARY TESTS FOR MOMENT INEQUALITIES EXIST, BUT HAVE POOR POWER. Donald W. K. Andrews. August 2011 SIMILAR-ON-THE-BOUNDARY TESTS FOR MOMENT INEQUALITIES EXIST, BUT HAVE POOR POWER By Donald W. K. Andrews August 2011 COWLES FOUNDATION DISCUSSION PAPER NO. 1815 COWLES FOUNDATION FOR RESEARCH IN ECONOMICS

More information

A more powerful subvector Anderson Rubin test in linear instrumental variable regression

A more powerful subvector Anderson Rubin test in linear instrumental variable regression A more powerful subvector Anderson Rubin test in linear instrumental variable regression Patrik Guggenberger Department of Economics Pennsylvania State University Frank Kleibergen Department of Quantitative

More information

SMOOTHED BLOCK EMPIRICAL LIKELIHOOD FOR QUANTILES OF WEAKLY DEPENDENT PROCESSES

SMOOTHED BLOCK EMPIRICAL LIKELIHOOD FOR QUANTILES OF WEAKLY DEPENDENT PROCESSES Statistica Sinica 19 (2009), 71-81 SMOOTHED BLOCK EMPIRICAL LIKELIHOOD FOR QUANTILES OF WEAKLY DEPENDENT PROCESSES Song Xi Chen 1,2 and Chiu Min Wong 3 1 Iowa State University, 2 Peking University and

More information

Testing for Weak Identification in Possibly Nonlinear Models

Testing for Weak Identification in Possibly Nonlinear Models Testing for Weak Identification in Possibly Nonlinear Models Atsushi Inoue NCSU Barbara Rossi Duke University December 24, 2010 Abstract In this paper we propose a chi-square test for identification. Our

More information

GMM and SMM. 1. Hansen, L Large Sample Properties of Generalized Method of Moments Estimators, Econometrica, 50, p

GMM and SMM. 1. Hansen, L Large Sample Properties of Generalized Method of Moments Estimators, Econometrica, 50, p GMM and SMM Some useful references: 1. Hansen, L. 1982. Large Sample Properties of Generalized Method of Moments Estimators, Econometrica, 50, p. 1029-54. 2. Lee, B.S. and B. Ingram. 1991 Simulation estimation

More information