Correlation and Heterogeneity Robust Inference using Conservativeness of Test Statistic. London School of Economics November 24, 2011

Size: px
Start display at page:

Download "Correlation and Heterogeneity Robust Inference using Conservativeness of Test Statistic. London School of Economics November 24, 2011"

Transcription

1 Correlation and Heterogeneity Robust Inference using Conservativeness of Test Statistic Rustam Ibragimov Harvard University Ulrich K. Müller Princeton University London School of Economics November 24, 2011

2 Introduction Most economic data is observational data is plausibly not i.i.d.: think of countries, firms, or time series data Correlations and heterogeneity do not necessarily invalidate standard estimators, but standard errors and inference become a challenge 1

3 Consistent Variance Estimators White (1980) for heterogeneous and independent disturbances Newey and West (1987), Andrews (1991), Davidson and de Jong (2000) and others for time series disturbances Rogers (1993) and Arellano (1987) for clustered and panel data Conley (1999) for spatially correlated data based on a Law of Large Numbers, and thus require an infinite amount of independence poor small sample properties in many instances of interest 2

4 Inconsistent Variance Estimators Part of the problem is that sample variability of consistent variance estimators is neglected Approaches that account for sample variability of variance estimator: time series: Kiefer, Vogelsang and Bunzel (2000) and Kiefer and Vogelsang (2002, 2005), Müller (2006) panel data: Donald and Lang (2004), Hansen (2005) We add to this literature and develop a general approach to robust inference when little is known about correlations and heterogeneity 3

5 Our Approach Tests on parameter β : t statistic in group estimators Assume data can be classified in a finite number q of groups that allow asymptotically independent normal inference about the (scalar) parameter of interest β, so that ˆβ a j idn (β, v 2 j ) for j = 1,, q. Time series example: Divide data into q = 4 consecutive blocks, and estimate the model 4 times. Treat ˆβ j as observations for the usual t-statistic, and reject a 5% level test if t-statistic is larger than usual critical value for q 1 degrees of freedom. Results in valid inference by small sample result. Exploits information ˆβ j a idn (β, v 2 j ) in an efficient way. Does not rely on single asymptotic model of sampling variability for estimated standard deviation Important precursor: Fama-MacBeth (1973) method

6 Our Approach Tests on equality of parameters β 1 = β 2 : Behrens-Fisher statistic in group estimators Partition data into q 1 + q 2 groups; estimate β 1 and β 2 with q 1 and q 2 groups: ˆβij a idn (βi, v 2 ij ) for j = 1,, q i, i = 1, 2. Treat ˆβ ij as observations for the usual Behrens-Fisher (BF) statistic, and reject a 1% or 5% level test if BF statistic is larger than critical value of Student-t with min(q 1, q 2 ) 1 degrees of freedom. Valid inference by new small sample result. Exploits information ˆβ a ij idn (βi, vij 2 ) in an efficient way. Applications: robust analysis of treatment effects, structural breaks, inference under heavy tails, etc.

7 Plan of Talk Introduction The Small Sample t- and Behrens-Fisher Tests Conservativeness for Heterogenous Variances Optimality t-statistic & Behrens-Fisher Statistic Based Large Sample Robust Inference Basic Idea and Properties Comparison with Standard Inference and Known Asymptotic Variance Applications: Panel data, Time Series, Inequality Measures (on-going work with Paul Kattuman, Univ. of Cambridge) Conclusions

8 t statistic based robust inference: small samples Bakirov and Székely (2005), Ibragimov and Müller (2006): Usual small sample t test of level α 5% : conservative for independent heterogeneous Gaussian observations (not α = 10%) X j N (µ, σ 2 j ), j = 1,, q : H 0 : µ = 0 against H 1 : µ 0 t-statistic t = q X sx X = q 1 q j=1 X j, s 2 X = (q 1) 1 q j=1 (X j X) 2 cv q (α) = critical value of T q 1 : P ( T q 1 > cv q (α)) = α P ( t > cv(α) H 0 ) P ( t > cv(α) H 0, σ 2 1 =... = σ2 q) = P ( T q 1 > cv(α)) = α Holds under heavy tails: mixtures of normals (stable, Student-t)

9 Conservativeness Theorem (Bakirov and Székely 2005): Let cv q (α) be the critical value of the usual two-sided t-test based on t of level α, i.e. P ( T q 1 > cv q (α)) = α, wheret k is student-t distributed with k degrees of freedom, and let Φ denote the cumulative density function of a standard normal random variable. (i) If α 2Φ( 3) = , thenforallq 2, sup P ( t > cv q (α) H 0 )=P( T q 1 > cv q (α)) = α. (1) {σ 2 1,,σ2 q} (ii) Equation (1) also holds true for 2 q 14 if α α 1 = 0.1, and for q {2, 3} if α α 2 = 0.2. Moreover, define fcv q (α i ) = q ki (q 1) cv ki (α i ) 2 / q q(k i 1) + (q k i )cv ki (α i ) 2, i {1, 2}, where k 1 =14andk 2 =3. Thenforq k i +1, sup P ( t > fcv q (α i ) H 0 )=α i. {σ 2 1,,σ2 q} 8

10 Sketch of Proof I Rewrite t 2 = (q 1)(P q j=1 X j) 2 q P q j=1 (X j X) 2 > cv2 as X 0 AX > 0, where A =(1+ cv2 q 1 )ee0 q cv2 q 1 I q,ande =(1,, 1) 0. Let D =diag(σ 1,,σ q ). Then X 0 AX Z 0 BZ, whereb = DAD and Z N (0,I q ). Z 0 BZ P q j=1 λ jz 2 j,whereλ j are the eigenvalues of B. Observation: One eigenvalue (say, λ q )ofb is positive, and q 1 are negative. Thus, P (t 2 > cv 2 )=P (Z 2 q > q 1 X j=1 c j Z 2 j ) with c j = λ j /λ q. But: eigenvalues of B are a very messy function of {σ 2 1,,σ2 q}. 9

11 Sketch of Proof II Fact, already exploited in Bakirov (1989): for c j > 0, j =1,,q 1 P (Z 2 q > q 1 X j=1 c j Z 2 j )= Z 1 0 u (q 1)/2 (1 u) 1/2 r du Qq 1 π j=1 (u + c j) Q q 1 j=1 (u+c j) is a polynomial in u with roots c j = λ j /λ q. Eigenvalues λ j are roots of det(ui q B), and so after a scale normalization of Z 0 BZ > 0toensureλ q =1,wehave q 1 Y j=1 (u + c j )= det(ui q B) u 1 Obtain expression for P (t 2 > cv 2 )asanintegralwithanintegrand that is a reasonably tractable function of {σ 2 1,,σ2 q}. Argue why it cannot be maximized for positive but unequal variances... 10

12 14 12 Sharpe ratio density: Normal vs. Cauchy Normal Cauchy

13 Behrens-Fisher statistic robust inference: small sample results X ij N (µ i, σ 2 j ), j = 1,..., q i, i = 1, 2 H 0 : µ 1 = µ 2 vs. H 1 : µ 1 µ 2 Behrens-Fisher statistic BF = X 1 X 2 s s2 2 n 1 n2 X i = q 1 qi i j=1 X i,j s 2 i = (q i 1) 1 q i j=1 (X i,j X i ) 2, i = 1, 2 P( BF cv m 1 (α)) P( T m 1 cv m 1 (α)) = α, m = min(q 1, q 2 ) for 2 q 1, q 2 50 and α {0.1%, 1%, 5%}

14 11 Effective Rejection Probabilities

15 Optimality of t-statistic Let X j, j =1,,q with q 2, be distributed independent N (μ, σ 2 j ), and consider the hypothesis test H 0 : μ =0and{σ 2 j }q j=1 arbitrary H 1 : μ 6= 0andσ 2 j = σ2 for all j Theorem: Let cv be such that P ( T q 1 > cv) = α For any q 2, a test that rejects the null hypothesis for t > cv is the uniformly most powerful scale invariant level α test. Proof: For equal variance case, t-test is UMP scale invariant. Conservativeness of t-test implies that equal variance case under the null hypothesis is least favorable. 12

16 Asymptotic t-statistic Based Inference Partition the data into q 2 groups, with n j observations in group j, and P q j=1 n j = n. Denote by ˆβ j the estimator of β using observations in group j only. Suppose the groups are chosen such that n(ˆβ j β) N (0,σ 2 j )forallj (where max 1 j q σ 2 j > 0). Satisfiedformanymodelsaslongasmin j n j, linear or nonlinear. n(ˆβ i β) and n(ˆβ j β) are asymptotically independent for i 6= j. this amounts to n(ˆβ 1 β,, ˆβ q β) 0 N (0, diag(σ 2 1,,σ2 q)) 13

17 Asymptotic t-statistic Based Inference Rejection of H 0 : β = β 0 against H 1 : β 6= β 0 if t β exceeds the (1 α/2) percentile of the student-t distribution with q 1 degrees of freedom, where t β is the usual t-statistic t β = q ˆβ β 0 sˆβ with ˆβ = q 1 P q j=1 ˆβ j and s 2ˆβ =(q 1) 1 P q j=1 (ˆβ j ˆβ) 2,isasymptotically valid inference by small sample t-test result and the Continuous Mapping Theorem. This way of conducting inference efficiently exploits the information n(ˆβ 1 β,, ˆβ q β) 0 N (0, diag(σ 2 1,,σ2 q)) as it maximizes asymptotic local power uniformly against all local alternatives where β = β n = β 0 + c/ n for some c 6= 0andσ 2 i = σ2 j for all i, j, among all scale invariant tests by optimality of t-statistic. 14

18 Asymptotic Behrens-Fisher Statistic Based Inference Robust tests of H 0 : β 1 = β 2 against H 1 : β 1 β 2 Partition the data into q 1 + q 2 groups ˆβij, j = 1,..., q i : estimators of β i, i = 1, 2, using observations in group j only Behrens-Fisher statistic BF = ˆβ 1 ˆβ 2 S S2 2 q 1 q2 ˆβ i = q 1 qi ˆβ i j=1 i,j Si 2 = (q i 1) 1 q i j=1 ( ˆβ i,j ˆβ i ) 2, i = 1, 2 Reject H 0 if BF is larger than the critical value of Student-t distribution with min(q 1, q 2 ) 1 degrees of freedom

19 15 Size Control under AR(1) Correlation

20 Comparison with Inference with Known Asymptotic Variance Typically, n(ˆβ 1 β,, ˆβ q β) 0 N (0, diag(σ 2 1,,σ2 q)) is weaker than what is required to consistently estimate the asymptotic variance. Consistent estimation not only requires more assumptions on correlation structure, but typically also higher moments. What are the efficiency cost of this additional robustness, i.e. how does the t-statistic approach compare to an approach based on consistent variance estimation (if the variance can indeed be consistently estimated)? Consider question in exactly identified GMM framework with k 1 parameter θ and moment condition E[g(θ, y i )] = 0. We are interested in conducting inference about the first element of θ, denoted by β. 16

21 Properties of Group Estimators Let G j the set of indices of group j observations. Suppose the estimator ˆθ j basedongroupj =1,,q data satisfies n(ˆθ j θ) =Γ 1 j Q j + o p (1) N (0, Γ 1 j Ω j Γ 0 1 j ) where n 1 P g(a,y i ) p i G j a a=ˆθ Γj and Q j j = n 1/2 P i G j g(θ, y i ) N (0, Ω j ), and (Q 0 1,,Q0 q) 0 N (0, diag(ω 1,, Ω q )). The simple average of the group estimators ˆθ then satisfies n(ˆθ θ) =q 1 where Σ q = q 2 P q j=1 Γ 1 j q X j=1 Ω j Γ 0 1 Γ 1 j Q j + o p (1) N (0, Σ q ) j. 17

22 Full Sample Estimator In contrast, full sample GMM estimator ˆθ (with first element ˆβ) that solves n 1 P n i=1 g(ˆθ, y i ) 0 g(ˆθ, y i )=0satisfies n(ˆθ θ) = 1 qx X q Γ j j=1 j=1 Q j + o p (1) N (0, Σ q ) where Σ q = ³ P q j=1 Γ j 1 ³ Pq j=1 Ω j ³ Pq j=1 Γ0 j 1. Estimator ˆθ not efficient under group heterogeneity. Efficient estimator would exploit q k moment conditions E[g(θ, y i )] = 0 for i G j, j =1,,q. Optimal weighting matrix depends on {Ω j } q j=1, which is assumed difficult to estimate. We thus focus on comparison of t-statistic approach with ˆβ in the numerator with inference based on ˆβ and known σ (the (1,1) element of Σ q ). 18

23 General Comparison I In general, Σ q and Σ q are not identical. t-statistic approach and inference based on ˆβ with σ 2 known differ not only in the denominator, but also in the numerator. Both tests are consistent against fixed alternatives and have power against the same local alternatives β = β 0 + c/ n. 19

24 General Comparison II Theorem: (i) Let ι be the k 1vectorwithaoneinthefirst row and zeros elsewhere. Then ι 0 Σ q ι ι 0 ( Σ q ι 1/q inf {Γ i } q i=1,{ω i} q ι 0 Σ =0 and inf i=1 q ι {Γ i } q i=1,{ω i} q ι 0 Σ i=1 q ι = 2 if k =1 0 if k 2 (ii) For any sequence of full rank matrices {Γ i } q i=1 there exists a positive definite sequence { Ω j } q j=1 so that Σ q Σ q is positive semidefinite for {Ω j } q j=1 = { Ω j } q j=1, and for any sequence of symmetric positive definite matrices {Γ i } q i=1 there exists a positive definite sequence {Ω j} q j=1 so that Σ q Σ q is negative semidefinite for {Ω j } q j=1 = {Ω j} q j=1. (iii) If Γ i = Γ for i =1,,q,then Σ q = Σ q for all {Ω j } q j=1. 20

25 Special Case Consider the special case Γ j = Γ for j =1,,q. This naturally arises when the groups have an equal number of observations n/q, and the average of the derivative of the moment condition is homogenous across groups (leading example: i.i.d. data). Then n(ˆθ θ) = n(ˆθ θ)+o p (1) and ˆβ and ˆβ are asymptotically equivalent to order n. The asymptotic local power of tests based on t β and ˆβ with σ 2 known simply reduces to the small sample power of the small sample t-statistic and z-statistic of H 0 : μ =0whenX i N (μ, σ 2 i ), that is z = P q i=1 X i q Pq i=1 σ2 i where σ 2 i is the (1,1) element of Γ 1 Ω i Γ

26 22 Numerical Comparison

27 Applications t-statistic approach requires n(ˆβ1 β,, ˆβq β) N (0, diag(σ 2 1,, σ2 q )). 1. Panel Data 2. Time Series Data 3. Inequality, poverty, risk & concentration measures Small sample results Large sample inference 28

28 Panel Data Panel with potential time series correlation and few individuals N y i,t = x 0 i,t θ + u i,t, i =1,,N, t =1,,T where {x i,t,u i,t } T t=1 are independent across i and E[x i,tu i,t ]=0for all i, t. t-statistic approach asymptotically valid if T 1 P T t=1 x i,t x 0 p i,t Γ i and T 1/2 P T t=1 x i,t u i,t N (0, Ω i )foralli as T and N fixed for some full rank matrices Γ i and Ω i. Hansen (2005) shows that usual t-statistic with Rogers (1993) standard errors converges under the null to a scaled t-statistic with q 1 degrees of freedom under asymptotic homogeneity, i.e. when Γ i = Γ and Ω i = Ω for all i. 27

29 Panel Data II For applications in Finance, concern about cross section correlation. Our results justify Fama MacBeth method where regression is run cross sectionally for each time period, and inference is based on t- statistic of resulting estimators ˆβ j, j =1,,T,evenforsmallT and potential heterogeneity of variances as long as no correlation across t. For corporate Finance applications, uncorrelatedness in time is often implausible. Rather than to try to consistently estimate the longrun variance with few observations, the approach taken here suggests forming groups of more than one unit in time to achieve approximate independence. Alternatively, assume independence in cross section dimension, say, across industries, as in Froot (1989). Combinations are possible. Same possibilities for long-run event studies, country panel data, city panel data and so forth. 29

30 Monte Carlo Results Same design as in Thompson (2006): N =50,T =25. Linear Regression, one nonconstant regressor, individual persistence : u i,t = ξ t + η i,t, η i,t = ρη i,t 1 + ε i,t common persistence : factor structure u i,t = h i f t +ε i,t, f t = ρf t 1 +ξ t, h i N(1, 0.25) 30 Individual Persistence Common Persistence ρ Size t-statistic q = t-statistic q = t-statistic q = FM with Newey-West cluster by i and t cluster by i and t + cp Size Adjusted Power t-statistic q = t-statistic q = t-statistic q = FM with Newey-West cluster by i and t cluster by i and t + cp

31 Time Series Data In absence of more specific knowledge, exploit the default assumption that correlations between observations become weaker the further apart in time they are: Divide the sample of size T into q (approximately) equal sized groups of consecutive observations. Under a wide range of assumptions on the underlying model and observations, exactly identified GMM inference satisfies brtc X sup T 1 0 r 1 t=1 T b T X c 1/2 t=1 Z g(a, y t ) r a Γ(λ)dλ p a=ˆθ 0 (1) 0 g(θ, y t ) Z 0 h(λ)dw (λ) (2) where Γ( ) isapositivedefinite k k matrix function and h( ) is nonzero. 16

32 Time Series Data With that convergence, t-statistic approach is asymptotically valid, since T ˆθ 1 θ ˆθ 2 θ. ˆθ q θ µ R 1/q 1 R 1/q 0 Γ(λ)dλ 0 h(λ)dw (λ) µ R 2/q 1 1/q Γ(λ)dλ R 2/q h(λ)dw (λ) 1/q. ( R 1 (q 1)/q Γ(λ)dλ) 1 R 1 (q 1)/q h(λ)dw (λ) In contrast, no other known way of conducting asymptotically valid inference under (1) and (2): Kiefer and Vogelsang (2002, 2005) approach requires Γ( ) and h( ) to be constant Müller (2004) shows that no long-run variance estimator can be consistent for Var[ R 1 0 h(λ)dw (λ)] for all processes that satisfy (2) 17

33 Monte Carlo Results Same design as in Andrews (1991): Linear Regression, 5 regressors, 4 nonconstant regressors are independent draws from stationary Gaussian AR(1), as are the disturbances, + heteroskedasticity. T = 128, 5% level test about coefficient of one nonconstant regressor. t-statistic (q) ˆω 2 QA ˆω2 PW ˆω 2 BT (b) ρ Size ρ Size Adjusted Power

34 Heavy-tailed inference y t = β + j= ψ js t j, S t i.i.d. stable domain of attraction, α (1, 2] Subsampling: works under strong mixing for y t, extensions for GARCH (McElroy and Politis, 2002) t statistic approach: symmetric stable (scale mixtures of normals)

35 Differences: Subsampling: first calculate block t statistic, then approximate cdf of full sample t statistic by empirical cdf s Our approach: first calculate estimators over blocks, then form their t statistic to conduct asymptotically valid inference

36 Monte Carlo Results y t = β + u t, S t : symm. stable with index α (0, 2) AR: u t = 0.5u t 1 + S t, MA: u t = 10 j=0 ψ js t j, {ψ j } 10 j=0 = {.03,.05,.07,.1,.15,.2,.15,.1,.07,.05,.03}

37 Robust inequality measurement (on-going work with Paul Kattuman, University of Cambridge) Economic & financial data: correlated, heterogeneous or heavy-tailed Income, wealth & financial returns: heterogeneity, dependence & outliers Heterogeneous observations & outliers heavy-tailed power laws P (I > x) C x ζ Tail index ζ : heavy-tailedness, likelihood of extreme observations & outliers Income: 1.5 < ζ < 3; wealth: ζ 1.5 Infinite variance Failure of standard OLS! Problematic inference on income & wealth inequality Tail index ζ inequality measures: Gini G = 1/(2ζ 1) (inequality in upper tails) Similar relations: Singh-Maddala power law distributions, log-normal Crises, emerging markets: more volatile, more extreme shocks more pronounced heterogeneity, outliers & heavy tails (e.g., IIK, 2009: income & wealth in CIS) Financial returns: 2 < ζ < 4 finite variance, infinite fourth moment 3

38 A tale of two tails Simulated data from Normal, Cauchy and Levy distributions, n=25 Normal Cauchy α=1 Levy α=1/ Figure: Heavy-tailed distributions: more extreme observations

39 A tale of two tails Light vs. heavy tails Cauchy distributiion Levy distribution α=1/2 Normal distribution α= Figure: Tails of Cauchy distributions are heavier than those of normal distributions. Tails of Lévy distributions are heavier than those of Cauchy or normal distributions.

40 Robust inequality measurement Many inequality measures: problematic under heterogeneity, heavy-tailedness & correlation Inequality, poverty & concentration measures: very sensitive to extremes & heavy tails (GE, Theil, MLD, Gini, HHI) Income & wealth: heavy-tailed (Singh-Maddala, Pareto, log-normal) poor performance (Mandelbrot, 1997, Cowell & Flachaire, 2007, Davidson & Flachaire, 2007) Robustness of measure choice, estimation & inference Asymptotic, standard & non-standard (moon) bootstrap Semiparametric bootstrap & asymptotic methods Improvement Computationally expensive (bootstrap), problematic s.e. s (asymptotic) Many studies: only point estimates of inequality Conclusions may need to be modified: standard errors & statistical significance 4

41 t statistic based robust inference Inference on income inequality measure I (Gini or Theil) Data: heavy tails, heterogeneity or dependence H 0 : I = I 0 against the alternative H a : I I 0 t statistic based robust test of level α = 5% : 1. Partition sample I 1, I 2,..., I n of observations on incomes into q 2 groups 2. Estimate income inequality measure I for each group empirical inequality measures Î j, j = 1,..., q 3. Compute the usual t statistic t I = q Î I 0 s Î with Î = q 1 q j=1 Îj and s 2 Î = (q 1) 1 q j=1 (Î j Î) 2 (t statistic treating q group estimators as observations) 4. Reject H 0 if t R exceeds the 97.5-percentile of Student t distribution T q 1 with q 1 degrees of freedom 5. 95% confidence interval for unknown I : (Î τ 0.05 sî, Î + τ 0.05 sî) where P ( T q 1 > τ 0.05 ) =

42 Connection to inequality measures Ex.: coefficient of variation for logs Y j = log X j of X 1,..., X q Ỹ 1,..., Ỹ q : i.i.d. standard normal r.v. s: X j N (0, 1) P ( T q 1 > τ 0.05 ) = 0.05 Empirical coefficient of variation ˆ CV X = s X /X = q/t Sharpe ratio, Herfindahl-Hirschman Index Analogues for logs If Y j N (0, σ 2 j ) or scale mixtures of normals (stable, etc.) P (0 < CV Y < y) P (0 < CVỸ < y) for y < 1/(τ 0.05 q) In general, does not hold for y < 1/(τ 0.1 q) Homogeneity and thin-tails (normality): Reduce the inequality in region of their small values Does not not hold poor measure of inequality for medium & high 10

43 t statistic based robust inference Asymptotically valid & efficient if empirical inequality measures Î j, j = 1,..., q : asymptotically independent, unbiased & Gaussian of possibly different variances Standard results on CLT for empirical income inequality measures (Gini, Theil, etc.): n j (Î j I) d N (0, σ 2 j ) Same conditions as for empirical inequality measure Î over full sample I 1, I 2,..., I N : N(Î I) d N (0, σ 2 ) Asymptotic validity: also holds when group estimators Î j converge (at arbitrary rate) to mixed normal Symmetric stable Conditionally normal variates, dependence through second moments Heavy tails, extremes and outliers Dependent models with common shocks 7

44 t statistic based robust inference First applications in inequality measurement Assumptions: Asymptotic normality & independence for group estimators 8 Asymptotic normality: From usual CLT for (full) sample Many applications: Natural choice of groups for asymptotic independence Time series: Divide into q = 4 consecutive blocks & estimate model 4 times Panel data, cross section correlation: Justification of Fama-MacBeth Regression is run cross sectionally for each time period & inference using t statistic if no correlation across t Other combinations Income & wealth: spatial dependence, dependence across states, common shocks, crises, etc. Groups by geographical regions asymptotically correlated estimators Groups to achieve asymptotic independence of group estimators? Independence condition: methodological problems for direct applications

45 Asymptotic independence through randomization Solution: randomization of initial samples I 1, I 2,..., I N of income observations Randomization stage: randomly assign each observation I i, i = 1,..., N, to one of q groups j = 1,..., q with equal probability 1/q t statistic stage: Apply t statistic approach to q groups of consecutive observations in randomized sample of incomes Ĩ 1, Ĩ 2,..., Ĩ N As in time series applications: observation Ĩ i in randomized sample is element of group j if (j 1)N/q < i jn/q Group estimators for randomized sample: independent by construction Both conditions for validity of t statistic inference: satisfied Randomization-based modification of t statistic robust inference: may also prove to be useful in other problems with natural dependence for group estimators 9

46 Risk, Inequality & Concentration: MC Results Cowell & Flachaire (2007): Theil measure I = (E[Y α ]/(E[Y ]) α 1)/(α(α 1)) Singh-Maddala: F (y) = 1 (1 + ay b ) c, a = 100, b = 2.8, c = 1.7 tail index ζ = bc = 4.8, I = 0.14 Pareto: F (y) = 1 (y l /y) ζ, y l = 0.1, ζ = 2.5 I = 1/(ζ 1) + log((ζ 1)/ζ) = 0.16 Lognormal: log(y ) N (µ, σ 2 ), µ = 2, σ = 1 I = σ 2 /2 = 0.5 Singh-Maddala Pareto Lognormal N Size t-statistic q = t-statistic q = Asymptotic Standard bootstrap Moon bootstrap Semiparametric bootstrap Asymptotic test with semiparametric measure % of cases semipar. measure cannot be computed % 6.3% 9.0% 3.3%

47 Risk, Inequality & Concentration: MC Results Cowell & Flachaire (2007): Mean logarithmic deviation I = log(e[y ]) E[log(Y )] Singh-Maddala: F (y) = 1 (1 + ay b ) c, a = 100, b = 2.8, c = 1.7 tail index ζ = bc = 4.8, I = 0.15 Pareto: F (y) = 1 (y l /y) ζ, y l = 0.1, ζ = 2.5 I = 1/ζ log((ζ 1)/ζ) = 0.11 Lognormal: log(y ) N (µ, σ 2 ), µ = 2, σ = 1 I = σ 2 /2 = 0.5 Singh-Maddala Pareto Lognormal N Size t-statistic q = t-statistic q = Asymptotic Standard bootstrap Moon bootstrap Semiparametric bootstrap Asymptotic test with semiparametric measure % of cases semipar. measure cannot be computed % 6.3% 9% 3.3%

48 Robust inference for risk, inequality, poverty & concentration: t-statistic based approach vs. alternative procedures Singh-Maddala:, a=100, b=2.8, c=1.7 ξ b c P( X. > y) = (1 + ay ) = bc = 4. 8

49 Tail index estimation & inequality in upper tails Inequality in upper tails, applications of semiparametric asymptotic and bootstrap inequality inference: Estimates of tail index ζ in power laws Hill s estimator OLS log-log rank-size: optimal shifts & correct s.e. s, Gabaix & Ibragimov (11) I (1) I (2)... I (n) I (n+1) : power law P (I > x) C x ζ OLS approach: regression log(t) = a b log(i (t) ) log(rank) = a b log(size) b : estimate of tail index: Simple & robust Biased in small samples One should use the Rank 1/2, & run log(rank 1/2) = a b log(size) Shift of 1/2: optimal, and reduces bias to a leading order S.e. on ζ : not OLS s.e., but is asymptotically 2ζ/ n Numerical results: advantage over standard OLS estimation procedures Performs well under dependence & deviations from power laws 11

50 Conclusion New approach to correlation and heterogeneity robust inference in large samples. Method imposes only a finite amount of independence through the assumption that estimators from different groups are independent. Approach exploits this assumption in an efficient way. Many potential applications, encouraging Monte Carlo results. Challenge to choose groups in practice. But inference requires some assumption on correlation structure, and other methods make more implicit and even less interpretable assumptions. 36

t statistic based correlation and heterogeneity robust inference

t statistic based correlation and heterogeneity robust inference t statistic based correlation and heterogeneity robust inference Rustam Ibragimov Harvard University Economics Department 1875 Cambridge Street Cambridge, MA 02138 Ulrich K. Müller Princeton University

More information

Reliable Inference in Conditions of Extreme Events. Adriana Cornea

Reliable Inference in Conditions of Extreme Events. Adriana Cornea Reliable Inference in Conditions of Extreme Events by Adriana Cornea University of Exeter Business School Department of Economics ExISta Early Career Event October 17, 2012 Outline of the talk Extreme

More information

Heteroskedasticity and Autocorrelation Consistent Standard Errors

Heteroskedasticity and Autocorrelation Consistent Standard Errors NBER Summer Institute Minicourse What s New in Econometrics: ime Series Lecture 9 July 6, 008 Heteroskedasticity and Autocorrelation Consistent Standard Errors Lecture 9, July, 008 Outline. What are HAC

More information

Economics 583: Econometric Theory I A Primer on Asymptotics

Economics 583: Econometric Theory I A Primer on Asymptotics Economics 583: Econometric Theory I A Primer on Asymptotics Eric Zivot January 14, 2013 The two main concepts in asymptotic theory that we will use are Consistency Asymptotic Normality Intuition consistency:

More information

The properties of L p -GMM estimators

The properties of L p -GMM estimators The properties of L p -GMM estimators Robert de Jong and Chirok Han Michigan State University February 2000 Abstract This paper considers Generalized Method of Moment-type estimators for which a criterion

More information

Long-Run Covariability

Long-Run Covariability Long-Run Covariability Ulrich K. Müller and Mark W. Watson Princeton University October 2016 Motivation Study the long-run covariability/relationship between economic variables great ratios, long-run Phillips

More information

11. Bootstrap Methods

11. Bootstrap Methods 11. Bootstrap Methods c A. Colin Cameron & Pravin K. Trivedi 2006 These transparencies were prepared in 20043. They can be used as an adjunct to Chapter 11 of our subsequent book Microeconometrics: Methods

More information

A Practitioner s Guide to Cluster-Robust Inference

A Practitioner s Guide to Cluster-Robust Inference A Practitioner s Guide to Cluster-Robust Inference A. C. Cameron and D. L. Miller presented by Federico Curci March 4, 2015 Cameron Miller Cluster Clinic II March 4, 2015 1 / 20 In the previous episode

More information

Linear Models and Estimation by Least Squares

Linear Models and Estimation by Least Squares Linear Models and Estimation by Least Squares Jin-Lung Lin 1 Introduction Causal relation investigation lies in the heart of economics. Effect (Dependent variable) cause (Independent variable) Example:

More information

Review of Classical Least Squares. James L. Powell Department of Economics University of California, Berkeley

Review of Classical Least Squares. James L. Powell Department of Economics University of California, Berkeley Review of Classical Least Squares James L. Powell Department of Economics University of California, Berkeley The Classical Linear Model The object of least squares regression methods is to model and estimate

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 6 Jakub Mućk Econometrics of Panel Data Meeting # 6 1 / 36 Outline 1 The First-Difference (FD) estimator 2 Dynamic panel data models 3 The Anderson and Hsiao

More information

An estimate of the long-run covariance matrix, Ω, is necessary to calculate asymptotic

An estimate of the long-run covariance matrix, Ω, is necessary to calculate asymptotic Chapter 6 ESTIMATION OF THE LONG-RUN COVARIANCE MATRIX An estimate of the long-run covariance matrix, Ω, is necessary to calculate asymptotic standard errors for the OLS and linear IV estimators presented

More information

Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics

Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics A short review of the principles of mathematical statistics (or, what you should have learned in EC 151).

More information

A Primer on Asymptotics

A Primer on Asymptotics A Primer on Asymptotics Eric Zivot Department of Economics University of Washington September 30, 2003 Revised: October 7, 2009 Introduction The two main concepts in asymptotic theory covered in these

More information

The Exact Distribution of the t-ratio with Robust and Clustered Standard Errors

The Exact Distribution of the t-ratio with Robust and Clustered Standard Errors The Exact Distribution of the t-ratio with Robust and Clustered Standard Errors by Bruce E. Hansen Department of Economics University of Wisconsin June 2017 Bruce Hansen (University of Wisconsin) Exact

More information

GARCH Models Estimation and Inference

GARCH Models Estimation and Inference GARCH Models Estimation and Inference Eduardo Rossi University of Pavia December 013 Rossi GARCH Financial Econometrics - 013 1 / 1 Likelihood function The procedure most often used in estimating θ 0 in

More information

Introduction to Econometrics

Introduction to Econometrics Introduction to Econometrics T H I R D E D I T I O N Global Edition James H. Stock Harvard University Mark W. Watson Princeton University Boston Columbus Indianapolis New York San Francisco Upper Saddle

More information

Closest Moment Estimation under General Conditions

Closest Moment Estimation under General Conditions Closest Moment Estimation under General Conditions Chirok Han and Robert de Jong January 28, 2002 Abstract This paper considers Closest Moment (CM) estimation with a general distance function, and avoids

More information

Does k-th Moment Exist?

Does k-th Moment Exist? Does k-th Moment Exist? Hitomi, K. 1 and Y. Nishiyama 2 1 Kyoto Institute of Technology, Japan 2 Institute of Economic Research, Kyoto University, Japan Email: hitomi@kit.ac.jp Keywords: Existence of moments,

More information

HAC Corrections for Strongly Autocorrelated Time Series

HAC Corrections for Strongly Autocorrelated Time Series HAC Corrections for Strongly Autocorrelated Time Series Ulrich K. Müller Princeton University Department of Economics Princeton, NJ, 08544 December 2012 Abstract Applied work routinely relies on heteroskedasticity

More information

Review of Econometrics

Review of Econometrics Review of Econometrics Zheng Tian June 5th, 2017 1 The Essence of the OLS Estimation Multiple regression model involves the models as follows Y i = β 0 + β 1 X 1i + β 2 X 2i + + β k X ki + u i, i = 1,...,

More information

Central Bank of Chile October 29-31, 2013 Bruce Hansen (University of Wisconsin) Structural Breaks October 29-31, / 91. Bruce E.

Central Bank of Chile October 29-31, 2013 Bruce Hansen (University of Wisconsin) Structural Breaks October 29-31, / 91. Bruce E. Forecasting Lecture 3 Structural Breaks Central Bank of Chile October 29-31, 2013 Bruce Hansen (University of Wisconsin) Structural Breaks October 29-31, 2013 1 / 91 Bruce E. Hansen Organization Detection

More information

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review STATS 200: Introduction to Statistical Inference Lecture 29: Course review Course review We started in Lecture 1 with a fundamental assumption: Data is a realization of a random process. The goal throughout

More information

Supplement to Randomization Tests under an Approximate Symmetry Assumption

Supplement to Randomization Tests under an Approximate Symmetry Assumption Supplement to Randomization Tests under an Approximate Symmetry Assumption Ivan A. Canay Department of Economics Northwestern University iacanay@northwestern.edu Joseph P. Romano Departments of Economics

More information

Economics 536 Lecture 7. Introduction to Specification Testing in Dynamic Econometric Models

Economics 536 Lecture 7. Introduction to Specification Testing in Dynamic Econometric Models University of Illinois Fall 2016 Department of Economics Roger Koenker Economics 536 Lecture 7 Introduction to Specification Testing in Dynamic Econometric Models In this lecture I want to briefly describe

More information

The Exact Distribution of the t-ratio with Robust and Clustered Standard Errors

The Exact Distribution of the t-ratio with Robust and Clustered Standard Errors The Exact Distribution of the t-ratio with Robust and Clustered Standard Errors by Bruce E. Hansen Department of Economics University of Wisconsin October 2018 Bruce Hansen (University of Wisconsin) Exact

More information

Generalized Method of Moments: I. Chapter 9, R. Davidson and J.G. MacKinnon, Econometric Theory and Methods, 2004, Oxford.

Generalized Method of Moments: I. Chapter 9, R. Davidson and J.G. MacKinnon, Econometric Theory and Methods, 2004, Oxford. Generalized Method of Moments: I References Chapter 9, R. Davidson and J.G. MacKinnon, Econometric heory and Methods, 2004, Oxford. Chapter 5, B. E. Hansen, Econometrics, 2006. http://www.ssc.wisc.edu/~bhansen/notes/notes.htm

More information

Casuality and Programme Evaluation

Casuality and Programme Evaluation Casuality and Programme Evaluation Lecture V: Difference-in-Differences II Dr Martin Karlsson University of Duisburg-Essen Summer Semester 2017 M Karlsson (University of Duisburg-Essen) Casuality and Programme

More information

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data Panel data Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data - possible to control for some unobserved heterogeneity - possible

More information

Introduction to Algorithmic Trading Strategies Lecture 10

Introduction to Algorithmic Trading Strategies Lecture 10 Introduction to Algorithmic Trading Strategies Lecture 10 Risk Management Haksun Li haksun.li@numericalmethod.com www.numericalmethod.com Outline Value at Risk (VaR) Extreme Value Theory (EVT) References

More information

A better way to bootstrap pairs

A better way to bootstrap pairs A better way to bootstrap pairs Emmanuel Flachaire GREQAM - Université de la Méditerranée CORE - Université Catholique de Louvain April 999 Abstract In this paper we are interested in heteroskedastic regression

More information

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A. 1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n

More information

AN EMPIRICAL LIKELIHOOD RATIO TEST FOR NORMALITY

AN EMPIRICAL LIKELIHOOD RATIO TEST FOR NORMALITY Econometrics Working Paper EWP0401 ISSN 1485-6441 Department of Economics AN EMPIRICAL LIKELIHOOD RATIO TEST FOR NORMALITY Lauren Bin Dong & David E. A. Giles Department of Economics, University of Victoria

More information

Problem 1 (20) Log-normal. f(x) Cauchy

Problem 1 (20) Log-normal. f(x) Cauchy ORF 245. Rigollet Date: 11/21/2008 Problem 1 (20) f(x) f(x) 0.0 0.1 0.2 0.3 0.4 0.0 0.2 0.4 0.6 0.8 4 2 0 2 4 Normal (with mean -1) 4 2 0 2 4 Negative-exponential x x f(x) f(x) 0.0 0.1 0.2 0.3 0.4 0.5

More information

GARCH Models Estimation and Inference. Eduardo Rossi University of Pavia

GARCH Models Estimation and Inference. Eduardo Rossi University of Pavia GARCH Models Estimation and Inference Eduardo Rossi University of Pavia Likelihood function The procedure most often used in estimating θ 0 in ARCH models involves the maximization of a likelihood function

More information

Economics 582 Random Effects Estimation

Economics 582 Random Effects Estimation Economics 582 Random Effects Estimation Eric Zivot May 29, 2013 Random Effects Model Hence, the model can be re-written as = x 0 β + + [x ] = 0 (no endogeneity) [ x ] = = + x 0 β + + [x ] = 0 [ x ] = 0

More information

Regression and Statistical Inference

Regression and Statistical Inference Regression and Statistical Inference Walid Mnif wmnif@uwo.ca Department of Applied Mathematics The University of Western Ontario, London, Canada 1 Elements of Probability 2 Elements of Probability CDF&PDF

More information

1 Estimation of Persistent Dynamic Panel Data. Motivation

1 Estimation of Persistent Dynamic Panel Data. Motivation 1 Estimation of Persistent Dynamic Panel Data. Motivation Consider the following Dynamic Panel Data (DPD) model y it = y it 1 ρ + x it β + µ i + v it (1.1) with i = {1, 2,..., N} denoting the individual

More information

Closest Moment Estimation under General Conditions

Closest Moment Estimation under General Conditions Closest Moment Estimation under General Conditions Chirok Han Victoria University of Wellington New Zealand Robert de Jong Ohio State University U.S.A October, 2003 Abstract This paper considers Closest

More information

Flexible Estimation of Treatment Effect Parameters

Flexible Estimation of Treatment Effect Parameters Flexible Estimation of Treatment Effect Parameters Thomas MaCurdy a and Xiaohong Chen b and Han Hong c Introduction Many empirical studies of program evaluations are complicated by the presence of both

More information

Spring 2012 Math 541B Exam 1

Spring 2012 Math 541B Exam 1 Spring 2012 Math 541B Exam 1 1. A sample of size n is drawn without replacement from an urn containing N balls, m of which are red and N m are black; the balls are otherwise indistinguishable. Let X denote

More information

Income distribution and inequality measurement: The problem of extreme values

Income distribution and inequality measurement: The problem of extreme values Income distribution and inequality measurement: The problem of extreme values Frank A. Cowell, Emmanuel Flachaire To cite this version: Frank A. Cowell, Emmanuel Flachaire. Income distribution and inequality

More information

Inference Based on the Wild Bootstrap

Inference Based on the Wild Bootstrap Inference Based on the Wild Bootstrap James G. MacKinnon Department of Economics Queen s University Kingston, Ontario, Canada K7L 3N6 email: jgm@econ.queensu.ca Ottawa September 14, 2012 The Wild Bootstrap

More information

Fixed Effects, Invariance, and Spatial Variation in Intergenerational Mobility

Fixed Effects, Invariance, and Spatial Variation in Intergenerational Mobility American Economic Review: Papers & Proceedings 2016, 106(5): 400 404 http://dx.doi.org/10.1257/aer.p20161082 Fixed Effects, Invariance, and Spatial Variation in Intergenerational Mobility By Gary Chamberlain*

More information

Economic modelling and forecasting

Economic modelling and forecasting Economic modelling and forecasting 2-6 February 2015 Bank of England he generalised method of moments Ole Rummel Adviser, CCBS at the Bank of England ole.rummel@bankofengland.co.uk Outline Classical estimation

More information

Bootstrapping Heteroskedasticity Consistent Covariance Matrix Estimator

Bootstrapping Heteroskedasticity Consistent Covariance Matrix Estimator Bootstrapping Heteroskedasticity Consistent Covariance Matrix Estimator by Emmanuel Flachaire Eurequa, University Paris I Panthéon-Sorbonne December 2001 Abstract Recent results of Cribari-Neto and Zarkos

More information

Introduction to Estimation Methods for Time Series models Lecture 2

Introduction to Estimation Methods for Time Series models Lecture 2 Introduction to Estimation Methods for Time Series models Lecture 2 Fulvio Corsi SNS Pisa Fulvio Corsi Introduction to Estimation () Methods for Time Series models Lecture 2 SNS Pisa 1 / 21 Estimators:

More information

Spatial Regression. 3. Review - OLS and 2SLS. Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved

Spatial Regression. 3. Review - OLS and 2SLS. Luc Anselin.   Copyright 2017 by Luc Anselin, All Rights Reserved Spatial Regression 3. Review - OLS and 2SLS Luc Anselin http://spatial.uchicago.edu OLS estimation (recap) non-spatial regression diagnostics endogeneity - IV and 2SLS OLS Estimation (recap) Linear Regression

More information

On the Power of Tests for Regime Switching

On the Power of Tests for Regime Switching On the Power of Tests for Regime Switching joint work with Drew Carter and Ben Hansen Douglas G. Steigerwald UC Santa Barbara May 2015 D. Steigerwald (UCSB) Regime Switching May 2015 1 / 42 Motivating

More information

IV Quantile Regression for Group-level Treatments, with an Application to the Distributional Effects of Trade

IV Quantile Regression for Group-level Treatments, with an Application to the Distributional Effects of Trade IV Quantile Regression for Group-level Treatments, with an Application to the Distributional Effects of Trade Denis Chetverikov Brad Larsen Christopher Palmer UCLA, Stanford and NBER, UC Berkeley September

More information

Approximate Distributions of the Likelihood Ratio Statistic in a Structural Equation with Many Instruments

Approximate Distributions of the Likelihood Ratio Statistic in a Structural Equation with Many Instruments CIRJE-F-466 Approximate Distributions of the Likelihood Ratio Statistic in a Structural Equation with Many Instruments Yukitoshi Matsushita CIRJE, Faculty of Economics, University of Tokyo February 2007

More information

Panel Data Models. James L. Powell Department of Economics University of California, Berkeley

Panel Data Models. James L. Powell Department of Economics University of California, Berkeley Panel Data Models James L. Powell Department of Economics University of California, Berkeley Overview Like Zellner s seemingly unrelated regression models, the dependent and explanatory variables for panel

More information

Appendix A: The time series behavior of employment growth

Appendix A: The time series behavior of employment growth Unpublished appendices from The Relationship between Firm Size and Firm Growth in the U.S. Manufacturing Sector Bronwyn H. Hall Journal of Industrial Economics 35 (June 987): 583-606. Appendix A: The time

More information

Short Questions (Do two out of three) 15 points each

Short Questions (Do two out of three) 15 points each Econometrics Short Questions Do two out of three) 5 points each ) Let y = Xβ + u and Z be a set of instruments for X When we estimate β with OLS we project y onto the space spanned by X along a path orthogonal

More information

Lecture 4: Heteroskedasticity

Lecture 4: Heteroskedasticity Lecture 4: Heteroskedasticity Econometric Methods Warsaw School of Economics (4) Heteroskedasticity 1 / 24 Outline 1 What is heteroskedasticity? 2 Testing for heteroskedasticity White Goldfeld-Quandt Breusch-Pagan

More information

Estimation of Dynamic Panel Data Models with Sample Selection

Estimation of Dynamic Panel Data Models with Sample Selection === Estimation of Dynamic Panel Data Models with Sample Selection Anastasia Semykina* Department of Economics Florida State University Tallahassee, FL 32306-2180 asemykina@fsu.edu Jeffrey M. Wooldridge

More information

BTRY 4090: Spring 2009 Theory of Statistics

BTRY 4090: Spring 2009 Theory of Statistics BTRY 4090: Spring 2009 Theory of Statistics Guozhang Wang September 25, 2010 1 Review of Probability We begin with a real example of using probability to solve computationally intensive (or infeasible)

More information

STA205 Probability: Week 8 R. Wolpert

STA205 Probability: Week 8 R. Wolpert INFINITE COIN-TOSS AND THE LAWS OF LARGE NUMBERS The traditional interpretation of the probability of an event E is its asymptotic frequency: the limit as n of the fraction of n repeated, similar, and

More information

One-Sample Numerical Data

One-Sample Numerical Data One-Sample Numerical Data quantiles, boxplot, histogram, bootstrap confidence intervals, goodness-of-fit tests University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/~eariasca/teaching.html

More information

Estimation theory. Parametric estimation. Properties of estimators. Minimum variance estimator. Cramer-Rao bound. Maximum likelihood estimators

Estimation theory. Parametric estimation. Properties of estimators. Minimum variance estimator. Cramer-Rao bound. Maximum likelihood estimators Estimation theory Parametric estimation Properties of estimators Minimum variance estimator Cramer-Rao bound Maximum likelihood estimators Confidence intervals Bayesian estimation 1 Random Variables Let

More information

GARCH Models Estimation and Inference

GARCH Models Estimation and Inference Università di Pavia GARCH Models Estimation and Inference Eduardo Rossi Likelihood function The procedure most often used in estimating θ 0 in ARCH models involves the maximization of a likelihood function

More information

A Course in Applied Econometrics Lecture 7: Cluster Sampling. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

A Course in Applied Econometrics Lecture 7: Cluster Sampling. Jeff Wooldridge IRP Lectures, UW Madison, August 2008 A Course in Applied Econometrics Lecture 7: Cluster Sampling Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. The Linear Model with Cluster Effects 2. Estimation with a Small Number of roups and

More information

The Slow Convergence of OLS Estimators of α, β and Portfolio. β and Portfolio Weights under Long Memory Stochastic Volatility

The Slow Convergence of OLS Estimators of α, β and Portfolio. β and Portfolio Weights under Long Memory Stochastic Volatility The Slow Convergence of OLS Estimators of α, β and Portfolio Weights under Long Memory Stochastic Volatility New York University Stern School of Business June 21, 2018 Introduction Bivariate long memory

More information

Predictive Regression and Robust Hypothesis Testing: Predictability Hidden by Anomalous Observations

Predictive Regression and Robust Hypothesis Testing: Predictability Hidden by Anomalous Observations Predictive Regression and Robust Hypothesis Testing: Predictability Hidden by Anomalous Observations Fabio Trojani University of Lugano and Swiss Finance Institute fabio.trojani@usi.ch Joint work with

More information

BOOTSTRAPPING DIFFERENCES-IN-DIFFERENCES ESTIMATES

BOOTSTRAPPING DIFFERENCES-IN-DIFFERENCES ESTIMATES BOOTSTRAPPING DIFFERENCES-IN-DIFFERENCES ESTIMATES Bertrand Hounkannounon Université de Montréal, CIREQ December 2011 Abstract This paper re-examines the analysis of differences-in-differences estimators

More information

Chapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE

Chapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE Chapter 6. Panel Data Joan Llull Quantitative Statistical Methods II Barcelona GSE Introduction Chapter 6. Panel Data 2 Panel data The term panel data refers to data sets with repeated observations over

More information

Lecture 7 Introduction to Statistical Decision Theory

Lecture 7 Introduction to Statistical Decision Theory Lecture 7 Introduction to Statistical Decision Theory I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw December 20, 2016 1 / 55 I-Hsiang Wang IT Lecture 7

More information

Advanced Econometrics

Advanced Econometrics Advanced Econometrics Dr. Andrea Beccarini Center for Quantitative Economics Winter 2013/2014 Andrea Beccarini (CQE) Econometrics Winter 2013/2014 1 / 156 General information Aims and prerequisites Objective:

More information

STAT 461/561- Assignments, Year 2015

STAT 461/561- Assignments, Year 2015 STAT 461/561- Assignments, Year 2015 This is the second set of assignment problems. When you hand in any problem, include the problem itself and its number. pdf are welcome. If so, use large fonts and

More information

Understanding Regressions with Observations Collected at High Frequency over Long Span

Understanding Regressions with Observations Collected at High Frequency over Long Span Understanding Regressions with Observations Collected at High Frequency over Long Span Yoosoon Chang Department of Economics, Indiana University Joon Y. Park Department of Economics, Indiana University

More information

Robust Standard Errors to Spatial and Time Dependence in Aggregate Panel Models

Robust Standard Errors to Spatial and Time Dependence in Aggregate Panel Models Robust Standard Errors to Spatial and Time Dependence in Aggregate Panel Models Lucciano Villacorta February 17, 2017 Abstract This paper studies alternative approaches to consider time and spatial dependence

More information

Bayesian Semiparametric GARCH Models

Bayesian Semiparametric GARCH Models Bayesian Semiparametric GARCH Models Xibin (Bill) Zhang and Maxwell L. King Department of Econometrics and Business Statistics Faculty of Business and Economics xibin.zhang@monash.edu Quantitative Methods

More information

Motivation Non-linear Rational Expectations The Permanent Income Hypothesis The Log of Gravity Non-linear IV Estimation Summary.

Motivation Non-linear Rational Expectations The Permanent Income Hypothesis The Log of Gravity Non-linear IV Estimation Summary. Econometrics I Department of Economics Universidad Carlos III de Madrid Master in Industrial Economics and Markets Outline Motivation 1 Motivation 2 3 4 5 Motivation Hansen's contributions GMM was developed

More information

Bayesian Semiparametric GARCH Models

Bayesian Semiparametric GARCH Models Bayesian Semiparametric GARCH Models Xibin (Bill) Zhang and Maxwell L. King Department of Econometrics and Business Statistics Faculty of Business and Economics xibin.zhang@monash.edu Quantitative Methods

More information

Inference with Dependent Data Using Cluster Covariance Estimators

Inference with Dependent Data Using Cluster Covariance Estimators Inference with Dependent Data Using Cluster Covariance Estimators C. Alan Bester, Timothy G. Conley, and Christian B. Hansen First Draft: February 2008. This Draft: November 2010. Abstract. This paper

More information

Cointegrating Regressions with Messy Regressors: J. Isaac Miller

Cointegrating Regressions with Messy Regressors: J. Isaac Miller NASMES 2008 June 21, 2008 Carnegie Mellon U. Cointegrating Regressions with Messy Regressors: Missingness, Mixed Frequency, and Measurement Error J. Isaac Miller University of Missouri 1 Messy Data Example

More information

Quick Review on Linear Multiple Regression

Quick Review on Linear Multiple Regression Quick Review on Linear Multiple Regression Mei-Yuan Chen Department of Finance National Chung Hsing University March 6, 2007 Introduction for Conditional Mean Modeling Suppose random variables Y, X 1,

More information

1 Appendix A: Matrix Algebra

1 Appendix A: Matrix Algebra Appendix A: Matrix Algebra. Definitions Matrix A =[ ]=[A] Symmetric matrix: = for all and Diagonal matrix: 6=0if = but =0if 6= Scalar matrix: the diagonal matrix of = Identity matrix: the scalar matrix

More information

Obtaining Critical Values for Test of Markov Regime Switching

Obtaining Critical Values for Test of Markov Regime Switching University of California, Santa Barbara From the SelectedWorks of Douglas G. Steigerwald November 1, 01 Obtaining Critical Values for Test of Markov Regime Switching Douglas G Steigerwald, University of

More information

Recitation 5. Inference and Power Calculations. Yiqing Xu. March 7, 2014 MIT

Recitation 5. Inference and Power Calculations. Yiqing Xu. March 7, 2014 MIT 17.802 Recitation 5 Inference and Power Calculations Yiqing Xu MIT March 7, 2014 1 Inference of Frequentists 2 Power Calculations Inference (mostly MHE Ch8) Inference in Asymptopia (and with Weak Null)

More information

High Dimensional Empirical Likelihood for Generalized Estimating Equations with Dependent Data

High Dimensional Empirical Likelihood for Generalized Estimating Equations with Dependent Data High Dimensional Empirical Likelihood for Generalized Estimating Equations with Dependent Data Song Xi CHEN Guanghua School of Management and Center for Statistical Science, Peking University Department

More information

LECTURE ON HAC COVARIANCE MATRIX ESTIMATION AND THE KVB APPROACH

LECTURE ON HAC COVARIANCE MATRIX ESTIMATION AND THE KVB APPROACH LECURE ON HAC COVARIANCE MARIX ESIMAION AND HE KVB APPROACH CHUNG-MING KUAN Institute of Economics Academia Sinica October 20, 2006 ckuan@econ.sinica.edu.tw www.sinica.edu.tw/ ckuan Outline C.-M. Kuan,

More information

Testing Overidentifying Restrictions with Many Instruments and Heteroskedasticity

Testing Overidentifying Restrictions with Many Instruments and Heteroskedasticity Testing Overidentifying Restrictions with Many Instruments and Heteroskedasticity John C. Chao, Department of Economics, University of Maryland, chao@econ.umd.edu. Jerry A. Hausman, Department of Economics,

More information

Inference with Dependent Data Using Cluster Covariance Estimators

Inference with Dependent Data Using Cluster Covariance Estimators Inference with Dependent Data Using Cluster Covariance Estimators C. Alan Bester, Timothy G. Conley, and Christian B. Hansen February 2008 Abstract. This paper presents a novel way to conduct inference

More information

Dealing With Endogeneity

Dealing With Endogeneity Dealing With Endogeneity Junhui Qian December 22, 2014 Outline Introduction Instrumental Variable Instrumental Variable Estimation Two-Stage Least Square Estimation Panel Data Endogeneity in Econometrics

More information

Short T Panels - Review

Short T Panels - Review Short T Panels - Review We have looked at methods for estimating parameters on time-varying explanatory variables consistently in panels with many cross-section observation units but a small number of

More information

Chapter 7. Hypothesis Tests and Confidence Intervals in Multiple Regression

Chapter 7. Hypothesis Tests and Confidence Intervals in Multiple Regression Chapter 7 Hypothesis Tests and Confidence Intervals in Multiple Regression Outline 1. Hypothesis tests and confidence intervals for a single coefficie. Joint hypothesis tests on multiple coefficients 3.

More information

Math 494: Mathematical Statistics

Math 494: Mathematical Statistics Math 494: Mathematical Statistics Instructor: Jimin Ding jmding@wustl.edu Department of Mathematics Washington University in St. Louis Class materials are available on course website (www.math.wustl.edu/

More information

41903: Group-Based Inference

41903: Group-Based Inference 41903: Clusterin and Fama-MacBeth Notes 4 Dependent Data in Economics Many real-world data sets plausibly exhibit complicated dependence structures Consider areate panel data model: y it = x itβ + α i

More information

Answers and expectations

Answers and expectations Answers and expectations For a function f(x) and distribution P(x), the expectation of f with respect to P is The expectation is the average of f, when x is drawn from the probability distribution P E

More information

Nonparametric Identification of a Binary Random Factor in Cross Section Data - Supplemental Appendix

Nonparametric Identification of a Binary Random Factor in Cross Section Data - Supplemental Appendix Nonparametric Identification of a Binary Random Factor in Cross Section Data - Supplemental Appendix Yingying Dong and Arthur Lewbel California State University Fullerton and Boston College July 2010 Abstract

More information

STOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods Course code: EC40 Examiner: Lena Nekby Number of credits: 7,5 credits Date of exam: Saturday, May 9, 008 Examination time: 3

More information

Supplemental Material for KERNEL-BASED INFERENCE IN TIME-VARYING COEFFICIENT COINTEGRATING REGRESSION. September 2017

Supplemental Material for KERNEL-BASED INFERENCE IN TIME-VARYING COEFFICIENT COINTEGRATING REGRESSION. September 2017 Supplemental Material for KERNEL-BASED INFERENCE IN TIME-VARYING COEFFICIENT COINTEGRATING REGRESSION By Degui Li, Peter C. B. Phillips, and Jiti Gao September 017 COWLES FOUNDATION DISCUSSION PAPER NO.

More information

Better Bootstrap Confidence Intervals

Better Bootstrap Confidence Intervals by Bradley Efron University of Washington, Department of Statistics April 12, 2012 An example Suppose we wish to make inference on some parameter θ T (F ) (e.g. θ = E F X ), based on data We might suppose

More information

Increasing the Power of Specification Tests. November 18, 2018

Increasing the Power of Specification Tests. November 18, 2018 Increasing the Power of Specification Tests T W J A. H U A MIT November 18, 2018 A. This paper shows how to increase the power of Hausman s (1978) specification test as well as the difference test in a

More information

DSGE Methods. Estimation of DSGE models: GMM and Indirect Inference. Willi Mutschler, M.Sc.

DSGE Methods. Estimation of DSGE models: GMM and Indirect Inference. Willi Mutschler, M.Sc. DSGE Methods Estimation of DSGE models: GMM and Indirect Inference Willi Mutschler, M.Sc. Institute of Econometrics and Economic Statistics University of Münster willi.mutschler@wiwi.uni-muenster.de Summer

More information

Cluster-Robust Inference

Cluster-Robust Inference Cluster-Robust Inference David Sovich Washington University in St. Louis Modern Empirical Corporate Finance Modern day ECF mainly focuses on obtaining unbiased or consistent point estimates (e.g. indentification!)

More information

A Guide to Modern Econometric:

A Guide to Modern Econometric: A Guide to Modern Econometric: 4th edition Marno Verbeek Rotterdam School of Management, Erasmus University, Rotterdam B 379887 )WILEY A John Wiley & Sons, Ltd., Publication Contents Preface xiii 1 Introduction

More information

Using all observations when forecasting under structural breaks

Using all observations when forecasting under structural breaks Using all observations when forecasting under structural breaks Stanislav Anatolyev New Economic School Victor Kitov Moscow State University December 2007 Abstract We extend the idea of the trade-off window

More information

HAR Inference: Recommendations for Practice

HAR Inference: Recommendations for Practice HAR Inference: Recommendations for Practice Eben Lazarus, Harvard Daniel Lewis, Harvard Mark Watson, Princeton & NBER James H. Stock, Harvard & NBER JBES Invited Session Sunday, Jan. 7, 2018, 1-3 pm Marriott

More information