41903: Group-Based Inference

Size: px
Start display at page:

Download "41903: Group-Based Inference"

Transcription

1 41903: Clusterin and Fama-MacBeth Notes 4

2 Dependent Data in Economics Many real-world data sets plausibly exhibit complicated dependence structures Consider areate panel data model: y it = x itβ + α i + δ t + ε it i represents somethin like firm, state, country seems likely that ε it will have a complicated structure e.. consider a productivity shock to IBM today likely related to future and past productivity shocks at IBM plausibly spills over to related firms (e.. Goole) today plausibly spills over to related firms in the future (and related to them in the past) Likely that E[ε it ε jt ] 0 even for i j and s t Failin to account for this dependence will lead to invalid inference for parameters of interest

3 Approaches Group-based inference approaches popular methods to estimatin standard errors robust to non-identically distributed and dependent data. Group-based inference procedures partition data into approximately uncorrelated roups clustered standard errors Lian and Zeer (1986), Arellano (1987) really popularized in panel data application by Bertrand, Duflo, and Mullainathan (2004) Fama-MacBeth (1973) popular in finance but much more enerally applicable Both approaches partition data into = 1,..., G roups I denotes the indices of observations in roup N observations per roup

4 Linear Model Aain Illustrate mechanics of clusterin and Fama-MacBeth in linear model: y i = x i β + ε i ; E[x i ε i ] = 0 E[x i x j ε i ε j ] 0 when i and j both belon to I E[x i x j ε i ε j ] = 0 when i I and j I h with h

5 Clusterin Mechanics Recall β a N(β, V /n) with V = Qn 1 Ω nqn 1 for and Ω n = Var[ 1 n Q n = 1 n n x i ε i ] = 1 n i=1 n i=1 E[x i x i ] 1 n =1 i I n [x i x i ] = Q i=1 G E[x i x j ε i ε j ] = 1 n j I G E[(X E )(E X )] for X the N p matrix formed by stackin observations x i for i I and E the N 1 vector formed by stackin ε i for i I Clustered standard errors based on estimatin Ω n with Ω = 1 n G (X Ê)(Ê X ) = 1 G =1 G =1 =1 N N (X Ê/ N )(Ê X / N ) for Ê the N 1 vector formed by stackin sample residuals ε i for i I

6 Fama-MacBeth (FM) Mechanics FM subset data into the G different roups: X the N p matrix formed by stackin observations x i for i I Y the N 1 vector formed by stackin observations y i for i I β = (X X ) 1 (X Y ) - Estimator of parameter of interest within roup FM Estimator: β = 1 G G β =1 FM Variance estimator: ˆV FM = 1 G 1 G 1 G ( β β)( β β) I.e. the β are just treated as observations. The estimator is their sample mean. The estimated variance is just the estimator of the variance of this sample mean. =1

7 Clustered Covariance Matrix Estimator: Lare G Asymptotics Traditional analysis considers lare G asymptotics (G ) have p Ω Ω n 0: Ê = E X ( β β) (X Ê)(Ê X ) = (X E )(E X ) (X E )(( β β)x X ) (X X ( β β))(e X ) + (X X ( β β))(( β β)x X ) G 1 G n G =1 (X E )(E X p ) Ω n 0 by LLN ( ) 1 vec G G =1 (X E )(( β β)x X ) = 1 G ( ) G =1 X X X E ( β β) = O p(1)o p(1) by LLN and consistency of β Other terms handled similarly Applies more enerally than OLS by replacin x i ε i by s i ( θ) (the element of the score vector for estimatin θ for observation i)

8 Comments on Lare G Clustered s.e. Comments: Consistency of s.e. estimator obtains with essentially no restrictions on within roup covariance structure Essentially arbitrary heteroskedasticity and dependence allowed Ω p.s.d. by construction Leveraes G - need a lare number of roups that can be taken to be approximately independent Appropriate notion of approximately independent does not require there be literally no dependence across roups Hansen (2007) verifies that no further restrictions are required on N when G if roups are literally independent Bester, Conley, and Hansen (2011) shows that N and G required if there is small correlation across roups Similar properties can be established for FM

9 MA(1) Example Suppose you want to estimate [ ] 1 n Ω n = Var η i, where E[η i ] = 0, Var[η i ] = σ 2, Cov(η i, η i 1 ) = θ n i=1 and all other covariances are 0. We have Ω n = σ 2 + 2θ n 1 n

10 MA(1) Example Consider clusterin estimators with G n roups of size N G (inore inteer problems) ˆΩ Gn = 1 n G n N G N G =1 s=( 1)N G +1 t=( 1)N G +1 E[ˆΩ Gn ] = 1 Gn(N n Gσ 2 + 2(N G 1)θ) = σ 2 + 2θ n Gn n Bias = 2θ Gn 1 n no asymptotic bias requires Gn n = 1 N G 0 Let ω st = η sη t E[η sη t]. Var[ˆΩ Gn ] = 1 n 2 Gn =1 NG q=( 1)N G +1 NG r=( 1)N G +1 η sη t Gn NG NG h=1 s=( 1)N G +1 t=( 1)N G +1 E[ωqr ωst] order G nng 2 non-zero terms even if observations were independent Usin this optimistic rate ives Var[ˆΩ Gn ] = C GnN2 G n 2 = C N G n for some C No asymptotic variance requires N G n = 1 G 0 Need G for consistency. With literally no dependence across roups, can et away with no restrictions on roup size. Otherwise, need lare roups.

11 Two-Way Clusterin Cameron, Gelbach, and Miller (2011) propose a multiway clusterin estimator Recall eneral HAC estimator for LS coefficient estimator: ˆΩ = 1 n W (d(i, j))x i x j e i e j i j Cluster estimator special case with W (d(i, j)) = 1((i) = (j)) where (i) returns the roup membership of observation i Consider two roupin variables (e.. state and year). Let (i) denote roup membership in first dimension and h(i) denote roup membership in second dimension. Two-way clusterin is HAC with W (d(i, j)) = 1((i) = (j) or h(i) = h(j)) Can obviously be eneralized to more than two roupin variables Simple computational trick - ˆΩ TW = ˆΩ Cluster, + ˆΩ Cluster,h ˆΩ Cluster, h Good properties will require minimum number of roups alon any dimension to be very lare Not necessarily p.s.d. (may return neative variance estimates) - in practice, common fix is to add a lare enouh p.d. matrix to the estimate to maintain positivity Rules out very sensible models of correlation across observations

12 Usin a Small Number of Lare Groups Takeaways from previous section: Usin many roups keeps variance of standard error estimator small Usin lare roups keeps bias of standard error estimator small These two forces work aainst each other - May lead to poor performance in finite samples Revisit testin problem for scalar parameter of interest ( ˆβ) with estimated standard error ( ˆV ) Assume n( ˆβ β) d A = V Z where Z N(0, 1) Assume ˆV d VW for some random variable W Traditional approach that relies on consistency takes W = 1 t = n( ˆβ β) ˆV d A VW = Z W Z Can use this as lon as distribution of can be obtained W Does not require consistent variance estimation - only a notion of low bias so V cancels We know how to keep bias small - Use a small G (and resultin lare roups)

13 Clustered Standard Errors with Small G Bester, Conley, and Hansen (2011) consider clusterin with eneral dependence usin small G Most useful approach relies on N /N h 1 for all, h Given this, simplify and assume N = N x 1x 1 Q 1 1 N. p. with Q = Q for all x Gx G Q G 1 N B 1. B G x 1ε 1. x Gε G d B 1. B G with Ω N 0, 0 Ω with Ω = Ω for all Ω G For inference about scalar parameter, can allow Ω to differ across roups

14 BCH Results (1) 1. Properties of OLS n( β β) = ( 1 G (x x /N)) 1 ( 1 (x ε / N)) G d ( 1 G Q ) 1 ( 1 B ) G = Q 1 1 G Λ Z where Ω = ΛΛ and Z N(0, I) iid across 2. Properties of ˆΩ: (Let Q = Q and S = ΛZ) ˆΩ d W = 1 G ( Λ Z Z Λ Q Q 1 SZ Λ ) Λ Z S Q 1 Q + Q Q 1 SS Q 1 Q [ ( ) ( )] = 1 G Λ Z Z 1 Z Z G Λ

15 BCH Results (2) 3. V = ( 1 G (x x /N)) 1 1 Ω( G (x x /N)) 1 d Q 1 WQ 1 For simplicity, look at Wald test of H 0 : β = β 0 : ˆF = n( β β) 1 V n( β β) [ [ ( ) ( d ( 1 Z )Λ Q 1 1 Q G G Λ Z Z 1 Z G QQ 1 Λ( 1 Z ) G [ [ ( ) ( = ( 1 Z ) 1 Z Z 1 Z G G G = F Z Z )] Λ ] 1 )]] 1 ( 1 Z ) G The asymptotic distribution is pivotal F Gp G p F p,g p (e.. Rao (2002), Linear Statistical Inference and its Application, Chapter 8b)

16 BCH Results (3) Main BCH Results: Under conditions above and for testin hypotheses of the form Rβ = c d 1. F Gq F G q q,g q where c is a q 1 vector 2. If q = 1, the usual t-statistic, t d G 1 t G G 1 Note that usin Ω = G ˆΩ d in formin the t-statistic ives t t G 1 G 1 - This is the scalin used in Stata When usin clustered standard errors, use critical values from a t or F distribution (rather than N(0,1) or χ 2 ) and rescale - super easy Main drawback: Stron homoeneity conditions employed

17 FM with Small G Ibraimov and Müller (2010) consider FM with a small number of roups Approach only allows testin a scalar hypothesis of the form H 0 : β 0,j = c for some j Extended to two-sample t-tests in Ibraimov and Müller (2016) Let ˆθ be the estimator of the parameter of interest obtained within the th subroup, and let n = G N where N = 1 G N. IM really need one condition: ( N(ˆθ1 θ),..., N(ˆθG θ)) d (b 1,..., b G ) N(0, Σ) where Σ is diaonal Note - no homoeneity imposed

18 IM Results (1) 1. t-statistic: t = θ θ n( θ θ) = VFM nvfm 2. FM Estimator: n( θ θ) = 1 N(ˆθ θ) G d 1 G b

19 IM Results (2) 3. FM Variance Estimator: n ˆV FM = n G = 1 G 1 = 1 G 1 1 G 1 G ( θ θ)( θ θ) =1 ( 2 N( θ θ)) d 1 G 1 Puttin 1., 2., and 3. toether ives ( N(ˆθ θ) 1 G (b b) 2 = sg 2 t d t = 1 G b s G / G 2 N(ˆθh θ)) i.e. the FM t-statistic is asymptotically the same as a t-statistic for testin H 0 : µ = 0 usin the sample mean estimated from independent normals with different variances usin the usual standard error estimator h

20 IM Results (3) How does this help? Variances are not homoeneous, so t does not follow a t-distribution. Simplified statement of theorem from Bakirov and Székely (2005): Let cv G 1 (α) be the usual two-sided critical value from a t G 1 random variable. If α <.083, then for all G 2, sup P( t > cv G 1 (α) H 0 ) = P( t G 1 > cv G 1 (α)) = α σ 2 1,σ2 2,...,σ2 G The t-test for a sample mean assumin common variances remains valid for sizes less than 8% or so reardless of heteroeneity in the variances. The FM-based t-test is asymptotically equivalent to a t-test for a sample mean so remains valid with essentially no homoeneity restrictions.

21 FM-based Permutation Inference Robustness of FM approach relies on test of sinle hypothesis bein willin to live with a conservative test Canay, Romano, and Shaikh (2014) consider a randomization test procedure that addresses both concerns Let ˆθ be the estimator of the parameter(s) of interest obtained within the th subroup. Will use ( N(ˆθ 1 θ),..., N(ˆθ G θ) ) d (b 1,..., b G) N(0, Σ) where Σ is block diaonal - i.e. E[b b h] = 0 for h Note - Good properties of CRS can be obtained under weaker conditions Aain no homoeneity imposed

22 CRS Mechanics Let T (X) be a test-statistic that takes data X and rejects if T (X) is lare (e.. a t-statistic) In our application, we will set X = ( N(ˆθ 1 θ),..., N(ˆθ G θ) ) where θ is the value of the parameter under the null hypothesis M = [ 1, 1] G be the set of sin-chanes for G variables mx be an element from M applied to X Note that as lon as the distribution of X is symmetric, mx has the same distribution as X Implementation: Compute T (mx) for all m M. (If G is lare, this will be infeasible - can instead randomly choose elements from M). Let M R be the set of m used where R denotes the number of elements in this set. Calculate p-value: ˆp = 1 R m M R 1(T (mx) T (X)) Reject hypothesized value at level α if ˆp < α

23 Intuition Basic idea for why this should work: Under H 0, distribution of X = ( N(ˆθ1 θ),..., N(ˆθG θ) ) is symmetric and elements of X are independent Multiplyin the elements X by 1 or 1 does not chane the distribution under H 0 but does chane the values of the test-statistic By lookin at the different values of the test-statistic produced we can then et the distribution of our test-statistic

24 Comments Thouhts on these procedures: All are potentially useful and none uniformly dominates in simulation CRS and IM require formally weaker conditions than clusterin CRS is not conservative under heteroeneity while IM is For small α, CRS will have no power unless a moderate number of roups are available Clusterin uses the full sample in formin its point estimator - FM (e.. IM and CRS) uses the averae of subsample estimators - finite-sample bias will behave as if it is from the smallest subsample size while variance will shrink as if parameter estimated from the full sample Use clusterin when finite-sample bias is a bier concern than heteroeneity Use CRS when a moderate number of clusters is available Use IM when usin very few clusters and heteroeneity is a concern All procedure rely on havin formed lare enouh roups to capture important sources of dependence Be careful of fixed effects that cross roup boundaries - e.. includin four-diit industry classification dummies and roupin by state may induce dependence across states

25 Example: Health Insurance Reform (Aain) Kaestner and Simon (ILLR 2002) Individual level data from CPS March Supplement Data: individuals in 51 states (includin DC) and 10 years Outcome: (lo(wae)) Variables of Interest (Treatments): fullr - full HI reform indicator partialr - partial HI reform indicator hihcost - number of mandates to cover hih cost procedures women - number of mandates for coverin larely female specific procedures other - number of mandates coverin other procedures Controls: ae: dummies for ae cateories (a2 a8) ender: dummy for male education: dummies for hih school, some collee, and collee maritalstatus: dummies for married and divorced race: dummies for African American and other race (white excluded cateory) children: number of children < 6 and number of children between 6 and 18 state effects, year effects, state specific trends

26 HI Reform Reression Estimated model: lo(wae) it = α s + κ t + δ st + treatment stγ + x itβ + ε it where treatment st is a 5 1 vector of the state level treatment variables applyin to individual i at time t. How should we roup?... state year? state? year? two-way state-year? 9 census-reion? 4 census-reion?

27 HI Reform Reression: Results OLS Estimation Results: β State 9 Reions 4 Reions Year State X Year Two-Way Full Reform Partial Reform # Hih Cost # Women # Other cv(5%) Fama-MacBeth Estimation Results: β State s.e. β 9Reions s.e β 4Reions s.e. β Year s.e. Full Reform Partial Reform # Hih Cost # Women # Other cv(5%) CRS lb State ub State lb 9Reions ub 9Reions lb 4Reions ub 4Reions lb Year ub Year Full Reform Partial Reform # Hih Cost # Women # Other

The Exact Distribution of the t-ratio with Robust and Clustered Standard Errors

The Exact Distribution of the t-ratio with Robust and Clustered Standard Errors The Exact Distribution of the t-ratio with Robust and Clustered Standard Errors by Bruce E. Hansen Department of Economics University of Wisconsin October 2018 Bruce Hansen (University of Wisconsin) Exact

More information

The Exact Distribution of the t-ratio with Robust and Clustered Standard Errors

The Exact Distribution of the t-ratio with Robust and Clustered Standard Errors The Exact Distribution of the t-ratio with Robust and Clustered Standard Errors by Bruce E. Hansen Department of Economics University of Wisconsin June 2017 Bruce Hansen (University of Wisconsin) Exact

More information

Inference with Dependent Data Using Cluster Covariance Estimators

Inference with Dependent Data Using Cluster Covariance Estimators Inference with Dependent Data Using Cluster Covariance Estimators C. Alan Bester, Timothy G. Conley, and Christian B. Hansen February 2008 Abstract. This paper presents a novel way to conduct inference

More information

Inference with Dependent Data Using Cluster Covariance Estimators

Inference with Dependent Data Using Cluster Covariance Estimators Inference with Dependent Data Using Cluster Covariance Estimators C. Alan Bester, Timothy G. Conley, and Christian B. Hansen First Draft: February 2008. This Draft: November 2010. Abstract. This paper

More information

Economics 582 Random Effects Estimation

Economics 582 Random Effects Estimation Economics 582 Random Effects Estimation Eric Zivot May 29, 2013 Random Effects Model Hence, the model can be re-written as = x 0 β + + [x ] = 0 (no endogeneity) [ x ] = = + x 0 β + + [x ] = 0 [ x ] = 0

More information

A Practitioner s Guide to Cluster-Robust Inference

A Practitioner s Guide to Cluster-Robust Inference A Practitioner s Guide to Cluster-Robust Inference A. C. Cameron and D. L. Miller presented by Federico Curci March 4, 2015 Cameron Miller Cluster Clinic II March 4, 2015 1 / 20 In the previous episode

More information

Stat260: Bayesian Modeling and Inference Lecture Date: March 10, 2010

Stat260: Bayesian Modeling and Inference Lecture Date: March 10, 2010 Stat60: Bayesian Modelin and Inference Lecture Date: March 10, 010 Bayes Factors, -priors, and Model Selection for Reression Lecturer: Michael I. Jordan Scribe: Tamara Broderick The readin for this lecture

More information

A Course in Applied Econometrics Lecture 7: Cluster Sampling. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

A Course in Applied Econometrics Lecture 7: Cluster Sampling. Jeff Wooldridge IRP Lectures, UW Madison, August 2008 A Course in Applied Econometrics Lecture 7: Cluster Sampling Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. The Linear Model with Cluster Effects 2. Estimation with a Small Number of roups and

More information

Solution to the take home exam for ECON 3150/4150

Solution to the take home exam for ECON 3150/4150 Solution to the tae home exam for ECO 350/450 Jia Zhiyan and Jo Thori Lind April 2004 General comments Most of the copies we ot were quite ood, and it seems most of you have done a real effort on the problem

More information

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data Panel data Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data - possible to control for some unobserved heterogeneity - possible

More information

LECTURE 5 HYPOTHESIS TESTING

LECTURE 5 HYPOTHESIS TESTING October 25, 2016 LECTURE 5 HYPOTHESIS TESTING Basic concepts In this lecture we continue to discuss the normal classical linear regression defined by Assumptions A1-A5. Let θ Θ R d be a parameter of interest.

More information

LECTURE ON HAC COVARIANCE MATRIX ESTIMATION AND THE KVB APPROACH

LECTURE ON HAC COVARIANCE MATRIX ESTIMATION AND THE KVB APPROACH LECURE ON HAC COVARIANCE MARIX ESIMAION AND HE KVB APPROACH CHUNG-MING KUAN Institute of Economics Academia Sinica October 20, 2006 ckuan@econ.sinica.edu.tw www.sinica.edu.tw/ ckuan Outline C.-M. Kuan,

More information

Chapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE

Chapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE Chapter 6. Panel Data Joan Llull Quantitative Statistical Methods II Barcelona GSE Introduction Chapter 6. Panel Data 2 Panel data The term panel data refers to data sets with repeated observations over

More information

Clustering as a Design Problem

Clustering as a Design Problem Clustering as a Design Problem Alberto Abadie, Susan Athey, Guido Imbens, & Jeffrey Wooldridge Harvard-MIT Econometrics Seminar Cambridge, February 4, 2016 Adjusting standard errors for clustering is common

More information

Short T Panels - Review

Short T Panels - Review Short T Panels - Review We have looked at methods for estimating parameters on time-varying explanatory variables consistently in panels with many cross-section observation units but a small number of

More information

Supplement to Randomization Tests under an Approximate Symmetry Assumption

Supplement to Randomization Tests under an Approximate Symmetry Assumption Supplement to Randomization Tests under an Approximate Symmetry Assumption Ivan A. Canay Department of Economics Northwestern University iacanay@northwestern.edu Joseph P. Romano Departments of Economics

More information

Casuality and Programme Evaluation

Casuality and Programme Evaluation Casuality and Programme Evaluation Lecture V: Difference-in-Differences II Dr Martin Karlsson University of Duisburg-Essen Summer Semester 2017 M Karlsson (University of Duisburg-Essen) Casuality and Programme

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Classical regression model b)

More information

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018 Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate

More information

Wild Bootstrap Inference for Wildly Dierent Cluster Sizes

Wild Bootstrap Inference for Wildly Dierent Cluster Sizes Wild Bootstrap Inference for Wildly Dierent Cluster Sizes Matthew D. Webb October 9, 2013 Introduction This paper is joint with: Contributions: James G. MacKinnon Department of Economics Queen's University

More information

Cluster-Robust Inference

Cluster-Robust Inference Cluster-Robust Inference David Sovich Washington University in St. Louis Modern Empirical Corporate Finance Modern day ECF mainly focuses on obtaining unbiased or consistent point estimates (e.g. indentification!)

More information

Central Bank of Chile October 29-31, 2013 Bruce Hansen (University of Wisconsin) Structural Breaks October 29-31, / 91. Bruce E.

Central Bank of Chile October 29-31, 2013 Bruce Hansen (University of Wisconsin) Structural Breaks October 29-31, / 91. Bruce E. Forecasting Lecture 3 Structural Breaks Central Bank of Chile October 29-31, 2013 Bruce Hansen (University of Wisconsin) Structural Breaks October 29-31, 2013 1 / 91 Bruce E. Hansen Organization Detection

More information

Increasing the Power of Specification Tests. November 18, 2018

Increasing the Power of Specification Tests. November 18, 2018 Increasing the Power of Specification Tests T W J A. H U A MIT November 18, 2018 A. This paper shows how to increase the power of Hausman s (1978) specification test as well as the difference test in a

More information

Review of Classical Least Squares. James L. Powell Department of Economics University of California, Berkeley

Review of Classical Least Squares. James L. Powell Department of Economics University of California, Berkeley Review of Classical Least Squares James L. Powell Department of Economics University of California, Berkeley The Classical Linear Model The object of least squares regression methods is to model and estimate

More information

Econ 582 Fixed Effects Estimation of Panel Data

Econ 582 Fixed Effects Estimation of Panel Data Econ 582 Fixed Effects Estimation of Panel Data Eric Zivot May 28, 2012 Panel Data Framework = x 0 β + = 1 (individuals); =1 (time periods) y 1 = X β ( ) ( 1) + ε Main question: Is x uncorrelated with?

More information

Spatial Regression. 3. Review - OLS and 2SLS. Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved

Spatial Regression. 3. Review - OLS and 2SLS. Luc Anselin.   Copyright 2017 by Luc Anselin, All Rights Reserved Spatial Regression 3. Review - OLS and 2SLS Luc Anselin http://spatial.uchicago.edu OLS estimation (recap) non-spatial regression diagnostics endogeneity - IV and 2SLS OLS Estimation (recap) Linear Regression

More information

Statistics and econometrics

Statistics and econometrics 1 / 36 Slides for the course Statistics and econometrics Part 10: Asymptotic hypothesis testing European University Institute Andrea Ichino September 8, 2014 2 / 36 Outline Why do we need large sample

More information

Asymptotic Behavior of a t Test Robust to Cluster Heterogeneity

Asymptotic Behavior of a t Test Robust to Cluster Heterogeneity Asymptotic Behavior of a t est Robust to Cluster Heteroeneity Andrew V. Carter Department of Statistics University of California, Santa Barbara Kevin. Schnepel and Doulas G. Steierwald Department of Economics

More information

Correlation and Heterogeneity Robust Inference using Conservativeness of Test Statistic. London School of Economics November 24, 2011

Correlation and Heterogeneity Robust Inference using Conservativeness of Test Statistic. London School of Economics November 24, 2011 Correlation and Heterogeneity Robust Inference using Conservativeness of Test Statistic Rustam Ibragimov Harvard University Ulrich K. Müller Princeton University London School of Economics November 24,

More information

Understanding Regressions with Observations Collected at High Frequency over Long Span

Understanding Regressions with Observations Collected at High Frequency over Long Span Understanding Regressions with Observations Collected at High Frequency over Long Span Yoosoon Chang Department of Economics, Indiana University Joon Y. Park Department of Economics, Indiana University

More information

MFE Financial Econometrics 2018 Final Exam Model Solutions

MFE Financial Econometrics 2018 Final Exam Model Solutions MFE Financial Econometrics 2018 Final Exam Model Solutions Tuesday 12 th March, 2019 1. If (X, ε) N (0, I 2 ) what is the distribution of Y = µ + β X + ε? Y N ( µ, β 2 + 1 ) 2. What is the Cramer-Rao lower

More information

Quick Review on Linear Multiple Regression

Quick Review on Linear Multiple Regression Quick Review on Linear Multiple Regression Mei-Yuan Chen Department of Finance National Chung Hsing University March 6, 2007 Introduction for Conditional Mean Modeling Suppose random variables Y, X 1,

More information

Answers to Problem Set #4

Answers to Problem Set #4 Answers to Problem Set #4 Problems. Suppose that, from a sample of 63 observations, the least squares estimates and the corresponding estimated variance covariance matrix are given by: bβ bβ 2 bβ 3 = 2

More information

Applied Microeconometrics (L5): Panel Data-Basics

Applied Microeconometrics (L5): Panel Data-Basics Applied Microeconometrics (L5): Panel Data-Basics Nicholas Giannakopoulos University of Patras Department of Economics ngias@upatras.gr November 10, 2015 Nicholas Giannakopoulos (UPatras) MSc Applied Economics

More information

Panel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43

Panel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43 Panel Data March 2, 212 () Applied Economoetrics: Topic March 2, 212 1 / 43 Overview Many economic applications involve panel data. Panel data has both cross-sectional and time series aspects. Regression

More information

Comprehensive Examination Quantitative Methods Spring, 2018

Comprehensive Examination Quantitative Methods Spring, 2018 Comprehensive Examination Quantitative Methods Spring, 2018 Instruction: This exam consists of three parts. You are required to answer all the questions in all the parts. 1 Grading policy: 1. Each part

More information

Econometrics - 30C00200

Econometrics - 30C00200 Econometrics - 30C00200 Lecture 11: Heteroskedasticity Antti Saastamoinen VATT Institute for Economic Research Fall 2015 30C00200 Lecture 11: Heteroskedasticity 12.10.2015 Aalto University School of Business

More information

Bootstrap-Based Improvements for Inference with Clustered Errors

Bootstrap-Based Improvements for Inference with Clustered Errors Bootstrap-Based Improvements for Inference with Clustered Errors Colin Cameron, Jonah Gelbach, Doug Miller U.C. - Davis, U. Maryland, U.C. - Davis May, 2008 May, 2008 1 / 41 1. Introduction OLS regression

More information

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) 1 EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) Taisuke Otsu London School of Economics Summer 2018 A.1. Summation operator (Wooldridge, App. A.1) 2 3 Summation operator For

More information

ECON 5350 Class Notes Functional Form and Structural Change

ECON 5350 Class Notes Functional Form and Structural Change ECON 5350 Class Notes Functional Form and Structural Change 1 Introduction Although OLS is considered a linear estimator, it does not mean that the relationship between Y and X needs to be linear. In this

More information

Small-Sample Methods for Cluster-Robust Inference in School-Based Experiments

Small-Sample Methods for Cluster-Robust Inference in School-Based Experiments Small-Sample Methods for Cluster-Robust Inference in School-Based Experiments James E. Pustejovsky UT Austin Educational Psychology Department Quantitative Methods Program pusto@austin.utexas.edu Elizabeth

More information

Inference with Correlated Clusters

Inference with Correlated Clusters WORKING PAPER Inference with Correlated Clusters David Powell RAND Labor & Population WR-37 May 207 This paper series made possible by the NIA funded RAND Center for the Study of Aging (P30AG0285) and

More information

Flexible Estimation of Treatment Effect Parameters

Flexible Estimation of Treatment Effect Parameters Flexible Estimation of Treatment Effect Parameters Thomas MaCurdy a and Xiaohong Chen b and Han Hong c Introduction Many empirical studies of program evaluations are complicated by the presence of both

More information

Political Science 236 Hypothesis Testing: Review and Bootstrapping

Political Science 236 Hypothesis Testing: Review and Bootstrapping Political Science 236 Hypothesis Testing: Review and Bootstrapping Rocío Titiunik Fall 2007 1 Hypothesis Testing Definition 1.1 Hypothesis. A hypothesis is a statement about a population parameter The

More information

Econometric Methods for Panel Data

Econometric Methods for Panel Data Based on the books by Baltagi: Econometric Analysis of Panel Data and by Hsiao: Analysis of Panel Data Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies

More information

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3 Hypothesis Testing CB: chapter 8; section 0.3 Hypothesis: statement about an unknown population parameter Examples: The average age of males in Sweden is 7. (statement about population mean) The lowest

More information

Inference Based on the Wild Bootstrap

Inference Based on the Wild Bootstrap Inference Based on the Wild Bootstrap James G. MacKinnon Department of Economics Queen s University Kingston, Ontario, Canada K7L 3N6 email: jgm@econ.queensu.ca Ottawa September 14, 2012 The Wild Bootstrap

More information

δ -method and M-estimation

δ -method and M-estimation Econ 2110, fall 2016, Part IVb Asymptotic Theory: δ -method and M-estimation Maximilian Kasy Department of Economics, Harvard University 1 / 40 Example Suppose we estimate the average effect of class size

More information

Econ 583 Final Exam Fall 2008

Econ 583 Final Exam Fall 2008 Econ 583 Final Exam Fall 2008 Eric Zivot December 11, 2008 Exam is due at 9:00 am in my office on Friday, December 12. 1 Maximum Likelihood Estimation and Asymptotic Theory Let X 1,...,X n be iid random

More information

Generalized Method of Moment

Generalized Method of Moment Generalized Method of Moment CHUNG-MING KUAN Department of Finance & CRETA National Taiwan University June 16, 2010 C.-M. Kuan (Finance & CRETA, NTU Generalized Method of Moment June 16, 2010 1 / 32 Lecture

More information

Ordinary Least Squares Regression

Ordinary Least Squares Regression Ordinary Least Squares Regression Goals for this unit More on notation and terminology OLS scalar versus matrix derivation Some Preliminaries In this class we will be learning to analyze Cross Section

More information

An overview of applied econometrics

An overview of applied econometrics An overview of applied econometrics Jo Thori Lind September 4, 2011 1 Introduction This note is intended as a brief overview of what is necessary to read and understand journal articles with empirical

More information

We can relax the assumption that observations are independent over i = firms or plants which operate in the same industries/sectors

We can relax the assumption that observations are independent over i = firms or plants which operate in the same industries/sectors Cluster-robust inference We can relax the assumption that observations are independent over i = 1, 2,..., n in various limited ways One example with cross-section data occurs when the individual units

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 3 Jakub Mućk Econometrics of Panel Data Meeting # 3 1 / 21 Outline 1 Fixed or Random Hausman Test 2 Between Estimator 3 Coefficient of determination (R 2

More information

Advanced Econometrics

Advanced Econometrics Based on the textbook by Verbeek: A Guide to Modern Econometrics Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies Vienna May 16, 2013 Outline Univariate

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 15: Examples of hypothesis tests (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 32 The recipe 2 / 32 The hypothesis testing recipe In this lecture we repeatedly apply the

More information

Problem Set 2

Problem Set 2 14.382 Problem Set 2 Paul Schrimpf Due: Friday, March 14, 2008 Problem 1 IV Basics Consider the model: y = Xβ + ɛ where X is T K. You are concerned that E[ɛ X] 0. Luckily, you have M > K variables, Z,

More information

Dealing With Endogeneity

Dealing With Endogeneity Dealing With Endogeneity Junhui Qian December 22, 2014 Outline Introduction Instrumental Variable Instrumental Variable Estimation Two-Stage Least Square Estimation Panel Data Endogeneity in Econometrics

More information

Econ 836 Final Exam. 2 w N 2 u N 2. 2 v N

Econ 836 Final Exam. 2 w N 2 u N 2. 2 v N 1) [4 points] Let Econ 836 Final Exam Y Xβ+ ε, X w+ u, w N w~ N(, σi ), u N u~ N(, σi ), ε N ε~ Nu ( γσ, I ), where X is a just one column. Let denote the OLS estimator, and define residuals e as e Y X.

More information

Chapter 7. Hypothesis Testing

Chapter 7. Hypothesis Testing Chapter 7. Hypothesis Testing Joonpyo Kim June 24, 2017 Joonpyo Kim Ch7 June 24, 2017 1 / 63 Basic Concepts of Testing Suppose that our interest centers on a random variable X which has density function

More information

Applied Quantitative Methods II

Applied Quantitative Methods II Applied Quantitative Methods II Lecture 10: Panel Data Klára Kaĺıšková Klára Kaĺıšková AQM II - Lecture 10 VŠE, SS 2016/17 1 / 38 Outline 1 Introduction 2 Pooled OLS 3 First differences 4 Fixed effects

More information

Improving Weighted Least Squares Inference

Improving Weighted Least Squares Inference University of Zurich Department of Economics Working Paper Series ISSN 664-704 (print ISSN 664-705X (online Working Paper No. 232 Improving Weighted Least Squares Inference Cyrus J. DiCiccio, Joseph P.

More information

Heteroskedasticity and Autocorrelation Consistent Standard Errors

Heteroskedasticity and Autocorrelation Consistent Standard Errors NBER Summer Institute Minicourse What s New in Econometrics: ime Series Lecture 9 July 6, 008 Heteroskedasticity and Autocorrelation Consistent Standard Errors Lecture 9, July, 008 Outline. What are HAC

More information

SIMILAR-ON-THE-BOUNDARY TESTS FOR MOMENT INEQUALITIES EXIST, BUT HAVE POOR POWER. Donald W. K. Andrews. August 2011

SIMILAR-ON-THE-BOUNDARY TESTS FOR MOMENT INEQUALITIES EXIST, BUT HAVE POOR POWER. Donald W. K. Andrews. August 2011 SIMILAR-ON-THE-BOUNDARY TESTS FOR MOMENT INEQUALITIES EXIST, BUT HAVE POOR POWER By Donald W. K. Andrews August 2011 COWLES FOUNDATION DISCUSSION PAPER NO. 1815 COWLES FOUNDATION FOR RESEARCH IN ECONOMICS

More information

large number of i.i.d. observations from P. For concreteness, suppose

large number of i.i.d. observations from P. For concreteness, suppose 1 Subsampling Suppose X i, i = 1,..., n is an i.i.d. sequence of random variables with distribution P. Let θ(p ) be some real-valued parameter of interest, and let ˆθ n = ˆθ n (X 1,..., X n ) be some estimate

More information

11. Bootstrap Methods

11. Bootstrap Methods 11. Bootstrap Methods c A. Colin Cameron & Pravin K. Trivedi 2006 These transparencies were prepared in 20043. They can be used as an adjunct to Chapter 11 of our subsequent book Microeconometrics: Methods

More information

1 Motivation for Instrumental Variable (IV) Regression

1 Motivation for Instrumental Variable (IV) Regression ECON 370: IV & 2SLS 1 Instrumental Variables Estimation and Two Stage Least Squares Econometric Methods, ECON 370 Let s get back to the thiking in terms of cross sectional (or pooled cross sectional) data

More information

WILD CLUSTER BOOTSTRAP CONFIDENCE INTERVALS*

WILD CLUSTER BOOTSTRAP CONFIDENCE INTERVALS* L Actualité économique, Revue d analyse économique, vol 91, n os 1-2, mars-juin 2015 WILD CLUSTER BOOTSTRAP CONFIDENCE INTERVALS* James G MacKinnon Department of Economics Queen s University Abstract Confidence

More information

Robust Inference with Multi-way Clustering

Robust Inference with Multi-way Clustering Robust Inference with Multi-way Clustering Colin Cameron, Jonah Gelbach, Doug Miller U.C. - Davis, U. Arizona, U.C. - Davis February 2010 February 2010 1 / 44 1.1 Introduction Moulton (1986, 1990) and

More information

Fixed-b Asymptotics for Spatially Dependent Robust Nonparametric Covariance Matrix Estimators

Fixed-b Asymptotics for Spatially Dependent Robust Nonparametric Covariance Matrix Estimators Fixed-b Asymptotics for Spatially Dependent Robust Nonparametric Covariance Matrix Estimators C. Alan Bester, Timothy G. Conley, Christian B. Hansen, and Timothy J. Vogelsang August 2008 Abstract. This

More information

Panel Data Models. James L. Powell Department of Economics University of California, Berkeley

Panel Data Models. James L. Powell Department of Economics University of California, Berkeley Panel Data Models James L. Powell Department of Economics University of California, Berkeley Overview Like Zellner s seemingly unrelated regression models, the dependent and explanatory variables for panel

More information

Reliability of inference (1 of 2 lectures)

Reliability of inference (1 of 2 lectures) Reliability of inference (1 of 2 lectures) Ragnar Nymoen University of Oslo 5 March 2013 1 / 19 This lecture (#13 and 14): I The optimality of the OLS estimators and tests depend on the assumptions of

More information

Estimation and Testing for Common Cycles

Estimation and Testing for Common Cycles Estimation and esting for Common Cycles Anders Warne February 27, 2008 Abstract: his note discusses estimation and testing for the presence of common cycles in cointegrated vector autoregressions A simple

More information

Statistics Hotelling s T Gary W. Oehlert School of Statistics 313B Ford Hall

Statistics Hotelling s T Gary W. Oehlert School of Statistics 313B Ford Hall \ C f C E A tatistics 5041 11 Hotellin s T Gary W ehlert chool of tatistics 313B Ford Hall 612-625-155 ary@statumnedu Let s think about the univariate -test First recall that there are one-sample tests,

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 6: Model complexity scores (v3) Ramesh Johari ramesh.johari@stanford.edu Fall 2015 1 / 34 Estimating prediction error 2 / 34 Estimating prediction error We saw how we can estimate

More information

Chapter 8 Heteroskedasticity

Chapter 8 Heteroskedasticity Chapter 8 Walter R. Paczkowski Rutgers University Page 1 Chapter Contents 8.1 The Nature of 8. Detecting 8.3 -Consistent Standard Errors 8.4 Generalized Least Squares: Known Form of Variance 8.5 Generalized

More information

Topic 7: Heteroskedasticity

Topic 7: Heteroskedasticity Topic 7: Heteroskedasticity Advanced Econometrics (I Dong Chen School of Economics, Peking University Introduction If the disturbance variance is not constant across observations, the regression is heteroskedastic

More information

Economics 583: Econometric Theory I A Primer on Asymptotics: Hypothesis Testing

Economics 583: Econometric Theory I A Primer on Asymptotics: Hypothesis Testing Economics 583: Econometric Theory I A Primer on Asymptotics: Hypothesis Testing Eric Zivot October 12, 2011 Hypothesis Testing 1. Specify hypothesis to be tested H 0 : null hypothesis versus. H 1 : alternative

More information

Recitation 5. Inference and Power Calculations. Yiqing Xu. March 7, 2014 MIT

Recitation 5. Inference and Power Calculations. Yiqing Xu. March 7, 2014 MIT 17.802 Recitation 5 Inference and Power Calculations Yiqing Xu MIT March 7, 2014 1 Inference of Frequentists 2 Power Calculations Inference (mostly MHE Ch8) Inference in Asymptopia (and with Weak Null)

More information

Basic Regressions and Panel Data in Stata

Basic Regressions and Panel Data in Stata Developing Trade Consultants Policy Research Capacity Building Basic Regressions and Panel Data in Stata Ben Shepherd Principal, Developing Trade Consultants 1 Basic regressions } Stata s regress command

More information

Instrumental Variables, Simultaneous and Systems of Equations

Instrumental Variables, Simultaneous and Systems of Equations Chapter 6 Instrumental Variables, Simultaneous and Systems of Equations 61 Instrumental variables In the linear regression model y i = x iβ + ε i (61) we have been assuming that bf x i and ε i are uncorrelated

More information

Non-Spherical Errors

Non-Spherical Errors Non-Spherical Errors Krishna Pendakur February 15, 2016 1 Efficient OLS 1. Consider the model Y = Xβ + ε E [X ε = 0 K E [εε = Ω = σ 2 I N. 2. Consider the estimated OLS parameter vector ˆβ OLS = (X X)

More information

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Econometrics Week 8 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 25 Recommended Reading For the today Instrumental Variables Estimation and Two Stage

More information

QED. Queen s Economics Department Working Paper No. 1355

QED. Queen s Economics Department Working Paper No. 1355 QED Queen s Economics Department Working Paper No 1355 Randomization Inference for Difference-in-Differences with Few Treated Clusters James G MacKinnon Queen s University Matthew D Webb Carleton University

More information

Statistical Inference of Covariate-Adjusted Randomized Experiments

Statistical Inference of Covariate-Adjusted Randomized Experiments 1 Statistical Inference of Covariate-Adjusted Randomized Experiments Feifang Hu Department of Statistics George Washington University Joint research with Wei Ma, Yichen Qin and Yang Li Email: feifang@gwu.edu

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 5 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 44 Outline of Lecture 5 Now that we know the sampling distribution

More information

Lecture 4: Linear panel models

Lecture 4: Linear panel models Lecture 4: Linear panel models Luc Behaghel PSE February 2009 Luc Behaghel (PSE) Lecture 4 February 2009 1 / 47 Introduction Panel = repeated observations of the same individuals (e.g., rms, workers, countries)

More information

Final Exam. 1. (6 points) True/False. Please read the statements carefully, as no partial credit will be given.

Final Exam. 1. (6 points) True/False. Please read the statements carefully, as no partial credit will be given. 1. (6 points) True/False. Please read the statements carefully, as no partial credit will be given. (a) If X and Y are independent, Corr(X, Y ) = 0. (b) (c) (d) (e) A consistent estimator must be asymptotically

More information

Simple and Trustworthy Cluster-Robust GMM Inference

Simple and Trustworthy Cluster-Robust GMM Inference Simple and Trustworthy Cluster-Robust MM Inference Jungbin Hwang Department of Economics, University of Connecticut August 30, 207 Abstract This paper develops a new asymptotic theory for two-step MM estimation

More information

Testing Hypothesis. Maura Mezzetti. Department of Economics and Finance Università Tor Vergata

Testing Hypothesis. Maura Mezzetti. Department of Economics and Finance Università Tor Vergata Maura Department of Economics and Finance Università Tor Vergata Hypothesis Testing Outline It is a mistake to confound strangeness with mystery Sherlock Holmes A Study in Scarlet Outline 1 The Power Function

More information

A Simple, Graphical Procedure for Comparing. Multiple Treatment Effects

A Simple, Graphical Procedure for Comparing. Multiple Treatment Effects A Simple, Graphical Procedure for Comparing Multiple Treatment Effects Brennan S. Thompson and Matthew D. Webb October 26, 2015 > Abstract In this paper, we utilize a new

More information

Contest Quiz 3. Question Sheet. In this quiz we will review concepts of linear regression covered in lecture 2.

Contest Quiz 3. Question Sheet. In this quiz we will review concepts of linear regression covered in lecture 2. Updated: November 17, 2011 Lecturer: Thilo Klein Contact: tk375@cam.ac.uk Contest Quiz 3 Question Sheet In this quiz we will review concepts of linear regression covered in lecture 2. NOTE: Please round

More information

Exam D0M61A Advanced econometrics

Exam D0M61A Advanced econometrics Exam D0M61A Advanced econometrics 19 January 2009, 9 12am Question 1 (5 pts.) Consider the wage function w i = β 0 + β 1 S i + β 2 E i + β 0 3h i + ε i, where w i is the log-wage of individual i, S i is

More information

Panel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63

Panel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63 1 / 63 Panel Data Models Chapter 5 Financial Econometrics Michael Hauser WS17/18 2 / 63 Content Data structures: Times series, cross sectional, panel data, pooled data Static linear panel data models:

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Endogeneity b) Instrumental

More information

Lecture 4: Heteroskedasticity

Lecture 4: Heteroskedasticity Lecture 4: Heteroskedasticity Econometric Methods Warsaw School of Economics (4) Heteroskedasticity 1 / 24 Outline 1 What is heteroskedasticity? 2 Testing for heteroskedasticity White Goldfeld-Quandt Breusch-Pagan

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 6 Jakub Mućk Econometrics of Panel Data Meeting # 6 1 / 36 Outline 1 The First-Difference (FD) estimator 2 Dynamic panel data models 3 The Anderson and Hsiao

More information

8. Hypothesis Testing

8. Hypothesis Testing FE661 - Statistical Methods for Financial Engineering 8. Hypothesis Testing Jitkomut Songsiri introduction Wald test likelihood-based tests significance test for linear regression 8-1 Introduction elements

More information

Brief Suggested Solutions

Brief Suggested Solutions DEPARTMENT OF ECONOMICS UNIVERSITY OF VICTORIA ECONOMICS 366: ECONOMETRICS II SPRING TERM 5: ASSIGNMENT TWO Brief Suggested Solutions Question One: Consider the classical T-observation, K-regressor linear

More information

Multiple Equation GMM with Common Coefficients: Panel Data

Multiple Equation GMM with Common Coefficients: Panel Data Multiple Equation GMM with Common Coefficients: Panel Data Eric Zivot Winter 2013 Multi-equation GMM with common coefficients Example (panel wage equation) 69 = + 69 + + 69 + 1 80 = + 80 + + 80 + 2 Note:

More information