ArCo: An Artificial Counterfactual Approach for High-Dimensional Data
|
|
- Merryl Cameron
- 6 years ago
- Views:
Transcription
1 ArCo: An Artificial Counterfactual Approach for High-Dimensional Data Carlos V. de Carvalho BCB and PUC-Rio Marcelo C. Medeiros PUC-Rio Ricardo Masini São Paulo School of Economics BIG DATA, MACHINE LEARNING AND THE MACROECONOMY Norges Bank, Oslo, 2-3 October 2017
2 Overview of the Method Observe aggregated time series data from t = 1 to T time
3 Overview of the Method Intervention occurs at t = T time T 0
4 Overview of the Method No clear controls. Observed variables from untreated peers time T 0
5 Overview of the Method Counterfactual estimation in-sample (before intervention) time T 0
6 Overview of the Method Counterfactual extrapolation (after the intervention) time T 0
7 One-page Summary Artificial Counterfactual (ArCo): alternative framework to measure the impact of an intervention in aggregate data when a control group is not readily available.
8 One-page Summary Artificial Counterfactual (ArCo): alternative framework to measure the impact of an intervention in aggregate data when a control group is not readily available. Generalization of the Synthetic Control (SC) method of Abadie and Gardeazabal (2003, AER) and the panel approach of Hsiao, Ching and Wan (2012, JAE).
9 One-page Summary Artificial Counterfactual (ArCo): alternative framework to measure the impact of an intervention in aggregate data when a control group is not readily available. Generalization of the Synthetic Control (SC) method of Abadie and Gardeazabal (2003, AER) and the panel approach of Hsiao, Ching and Wan (2012, JAE). Our contribution: estimator with ❶ asymptotic theory, ❷ rigorous inference, ❸ high-dimensionality, and ❹ new tests such as detection of the time of the intervention, contamination effects and effects on multiple moments.
10 One-page Summary Artificial Counterfactual (ArCo): alternative framework to measure the impact of an intervention in aggregate data when a control group is not readily available. Generalization of the Synthetic Control (SC) method of Abadie and Gardeazabal (2003, AER) and the panel approach of Hsiao, Ching and Wan (2012, JAE). Our contribution: estimator with ❶ asymptotic theory, ❷ rigorous inference, ❸ high-dimensionality, and ❹ new tests such as detection of the time of the intervention, contamination effects and effects on multiple moments. Key hypothesis: ❶ Interventions takes place in variables that can be modelled as trend-stationary processes. ❷ The intervention affects only the unit of interest
11 One-page Summary Artificial Counterfactual (ArCo): alternative framework to measure the impact of an intervention in aggregate data when a control group is not readily available. Generalization of the Synthetic Control (SC) method of Abadie and Gardeazabal (2003, AER) and the panel approach of Hsiao, Ching and Wan (2012, JAE). Our contribution: estimator with ❶ asymptotic theory, ❷ rigorous inference, ❸ high-dimensionality, and ❹ new tests such as detection of the time of the intervention, contamination effects and effects on multiple moments. Key hypothesis: ❶ Interventions takes place in variables that can be modelled as trend-stationary processes. ❷ The intervention affects only the unit of interest
12 The road map 1. The setup 2. The counterfactual estimation 3. Estimator properties 4. Inference 5. Extensions 6. Simulations 7. Empirical example: Nota Fiscal Paulista 8. Research agenda 9. Concluding remarks
13 Setup Observe q i > 0 variables for i = 1,... n units for t = 1,... T periods (Panel structure): z it = (z 1 it,..., zq i it ).
14 Setup Observe q i > 0 variables for i = 1,... n units for t = 1,... T periods (Panel structure): z it = (z 1 it,..., zq i it ). Unit 1 is exposed to the intervention at known T 0.
15 Setup Observe q i > 0 variables for i = 1,... n units for t = 1,... T periods (Panel structure): z it = (z 1 it,..., zq i it ). Unit 1 is exposed to the intervention at known T 0. The remanning n 1 units z 0t (z 2t,..., z nt ) are an untreated potential control group (donor pool).
16 Setup Observe q i > 0 variables for i = 1,... n units for t = 1,... T periods (Panel structure): z it = (z 1 it,..., zq i it ). Unit 1 is exposed to the intervention at known T 0. The remanning n 1 units z 0t (z 2t,..., z nt ) are an untreated potential control group (donor pool). Potential Outcome notation: z 1t = d t z (1) 1t + (1 d t )z (0) 1t ; d t = z 0t = z (0) 0t where z (1) it z (0) it { 1 if t T 0 0 otherwise is potential outcome under the intervention and the potential outcome with no intervention.
17 Setup Effects on functions of z 1t : h (z 1t ) : R q 1 R q { v p y t = h (z 1t ) e.g. h(v t ) = t v t y t = d t y (1) t + (1 d t )y (0) t
18 Setup Effects on functions of z 1t : h (z 1t ) : R q 1 R q { v p y t = h (z 1t ) e.g. h(v t ) = t v t y t = d t y (1) t + (1 d t )y (0) t Hypothesis of interest: y (1) t = δ t + y (0) t, t = T 0..., T, H 0 : T = 1 T T T [ ] y (1) t y (0) t = 0. }{{} δ t t=t 0
19 Setup Effects on functions of z 1t : h (z 1t ) : R q 1 R q { v p y t = h (z 1t ) e.g. h(v t ) = t v t y t = d t y (1) t + (1 d t )y (0) t Hypothesis of interest: y (1) t = δ t + y (0) t, t = T 0..., T, H 0 : T = 1 T T T [ ] y (1) t y (0) t = 0. }{{} δ t t=t 0 We do not observe the counterfactual y (0) t. Therefore, we construct an estimate ŷ (0) t such that: δ t y (1) t ŷ (0) t for t = T 0,..., T
20 Counterfactual Estimation How should we construct ŷ (0) t?
21 Counterfactual Estimation How should we construct ŷ (0) t? Ideally (in the MSE sense) we would set ŷ (0) t = E(y (0) t z 0t, z 0t 1,..., z 0t p )
22 Counterfactual Estimation How should we construct ŷ (0) t? Ideally (in the MSE sense) we would set ŷ (0) t = E(y (0) t z 0t, z 0t 1,..., z 0t p ) In practice, we choose a (parametric) specification (could be linear or not). Let x t = (z 0t, z 0t 1,..., z 0t p ) and such that E(ν t ) = 0 and y (0) t = M(x t ) + ν t, ŷ (0) t = M(x t ).
23 ArCo estimator The Artificial Counterfactual (ArCo) estimator is then simply given by T = 1 T T T t=t 0 δt, where δ t y t ŷ (0) t, for t = T 0,..., T.
24 ArCo estimator The Artificial Counterfactual (ArCo) estimator is then simply given by T = 1 T T T t=t 0 δt, where δ t y t ŷ (0) t, for t = T 0,..., T. ArCo estimator is a two-step estimator: 1. First step: estimation of M with the pre-intervention sample; 2. Second step: extrapolate M with actual data for x t and compute T.
25 ArCo estimator Estimation sample y t Δ T y t = M(x t )
26 ArCo and the literature Hsiao, Ching and Wan (2012, JAE) Two-step method where M(x t) is a linear and scalar function of a small set of variables from the peers. Correct specification (M is the conditional expectation). Selection of peers by information criteria. Date of the intervention known.
27 ArCo and the literature Hsiao, Ching and Wan (2012, JAE) Two-step method where M(x t) is a linear and scalar function of a small set of variables from the peers. Correct specification (M is the conditional expectation). Selection of peers by information criteria. Date of the intervention known. Differences-in-Differences (DiD) Number of treated units must grow. Parallel trends hypothesis. Similar control group.
28 ArCo and the literature Hsiao, Ching and Wan (2012, JAE) Two-step method where M(x t) is a linear and scalar function of a small set of variables from the peers. Correct specification (M is the conditional expectation). Selection of peers by information criteria. Date of the intervention known. Differences-in-Differences (DiD) Number of treated units must grow. Parallel trends hypothesis. Similar control group. Gobillon and Magnac (2016, REStat) Generalize the above authors by explicitly considering a factor model. interactive fixed effects with strictly exogenous regressors. Asymptotics both on the cross-section and time dimensions.
29 ArCo and the literature Abadie and Gardeazabal (2003, AER) Convex combination of peers. The weights are estimated using time averages of the observed variables. No time-series dynamics.
30 ArCo and the literature Abadie and Gardeazabal (2003, AER) Convex combination of peers. The weights are estimated using time averages of the observed variables. No time-series dynamics. Pesaran et al. (2004, JBES); Dees et al. (2007, JAE) Counterfactual based on variables that belong to the treated unit. No donor pool. Their key assumption is that a subset of variables of the treated unit is invariant to the intervention.
31 ArCo and the literature Abadie and Gardeazabal (2003, AER) Convex combination of peers. The weights are estimated using time averages of the observed variables. No time-series dynamics. Pesaran et al. (2004, JBES); Dees et al. (2007, JAE) Counterfactual based on variables that belong to the treated unit. No donor pool. Their key assumption is that a subset of variables of the treated unit is invariant to the intervention. Angrist, Jordà and Kuersteiner (2016, JBES) Information only on the treated unit and no donor pool is available.
32 ArCo estimator Key Assumption Independence Let z 0t = (z 2t,..., z nt ) denotes the vector of all the observable variables for the untreated units. Then, d s, for all t, s. z 0t =
33 ArCo estimator Key Assumption Independence Let z 0t = (z 2t,..., z nt ) denotes the vector of all the observable variables for the untreated units. Then, d s, for all t, s. z 0t = The independence condition donors are untreated.
34 ArCo estimator Key Assumption Independence Let z 0t = (z 2t,..., z nt ) denotes the vector of all the observable variables for the untreated units. Then, d s, for all t, s. z 0t = The independence condition donors are untreated. Examples of interventions (treatments): Natural disasters: Belasen and Polachek (2008, AER P&P), Cavallo, Galiani, Noy, and Pantano (2013, ReStat), Fujiki and Hsiao (2015, JoE),...
35 ArCo estimator Key Assumption Independence Let z 0t = (z 2t,..., z nt ) denotes the vector of all the observable variables for the untreated units. Then, d s, for all t, s. z 0t = The independence condition donors are untreated. Examples of interventions (treatments): Natural disasters: Belasen and Polachek (2008, AER P&P), Cavallo, Galiani, Noy, and Pantano (2013, ReStat), Fujiki and Hsiao (2015, JoE),... Region specific policies (laws): Hsiao, Ching, and Wan (2012, JAE), Abadie, Diamond, and Hainmueller (AJPS, 2015), Gobillon and Magnac (ReStat, 2016),...
36 ArCo estimator Key Assumption Independence Let z 0t = (z 2t,..., z nt ) denotes the vector of all the observable variables for the untreated units. Then, d s, for all t, s. z 0t = The independence condition donors are untreated. Examples of interventions (treatments): Natural disasters: Belasen and Polachek (2008, AER P&P), Cavallo, Galiani, Noy, and Pantano (2013, ReStat), Fujiki and Hsiao (2015, JoE),... Region specific policies (laws): Hsiao, Ching, and Wan (2012, JAE), Abadie, Diamond, and Hainmueller (AJPS, 2015), Gobillon and Magnac (ReStat, 2016),... New government or political regime: Grier and Maynard (2013, JEBO)
37 ArCo estimator Key Assumption Independence Let z 0t = (z 2t,..., z nt ) denotes the vector of all the observable variables for the untreated units. Then, d s, for all t, s. z 0t = The independence condition donors are untreated.
38 ArCo estimator Key Assumption Independence Let z 0t = (z 2t,..., z nt ) denotes the vector of all the observable variables for the untreated units. Then, d s, for all t, s. z 0t = The independence condition donors are untreated. Examples of interventions (treatments): Abadie and Diamond (2004, AER) and Abadie, Diamond, and Hainmueller (2010, JASA) have, respectively, 1712 and 1280 Google cites as measured on August 2, 2017.
39 ArCo estimator Key Assumption Independence Let z 0t = (z 2t,..., z nt ) denotes the vector of all the observable variables for the untreated units. Then, d s, for all t, s. z 0t = The independence condition donors are untreated. Examples of interventions (treatments): Abadie and Diamond (2004, AER) and Abadie, Diamond, and Hainmueller (2010, JASA) have, respectively, 1712 and 1280 Google cites as measured on August 2, Arguably the most important innovation in the evaluation literature in the last fifteen years is the synthetic control method. Athey and Imbens (2016)
40 Counterfactual Estimation Why do we expect the method to work?
41 Counterfactual Estimation Why do we expect the method to work? Possible data generating mechanism (in reduced form): z (0) it = µ i + Ψ ij ε it j j=0 ε it = Λ i f t + η it where f t (f 1) (0, Q) is a vector of common unobserved factors. Λ i (q i f ) are matrices of factor loadings; η it (q i 1) (0, R i ) is idiosyncratic error term.
42 Counterfactual Estimation Why do we expect the method to work? Possible data generating mechanism (in reduced form): z (0) it = µ i + Ψ ij ε it j j=0 ε it = Λ i f t + η it where f t (f 1) (0, Q) is a vector of common unobserved factors. Λ i (q i f ) are matrices of factor loadings; η it (q i 1) (0, R i ) is idiosyncratic error term. Quality of the pool of donors comes from the factor structure.
43 Counterfactual estimation Least Absolute Shrinkage and Selection Operator (LASSO) Set X t = (1, x t ) R d and consider M(x t ) linear: y (0) t = α + β x t + ν t = θ X t + ν t.
44 Counterfactual estimation Least Absolute Shrinkage and Selection Operator (LASSO) Set X t = (1, x t ) R d and consider M(x t ) linear: y (0) t Estimation: θ = arg min T 0 1 t=1 = α + β x t + ν t = θ X t + ν t. ( y (0) t ) 2 d θ X t + ς β j. j=1
45 Counterfactual estimation Least Absolute Shrinkage and Selection Operator (LASSO) Set X t = (1, x t ) R d and consider M(x t ) linear: y (0) t Estimation: θ = arg min T 0 1 t=1 = α + β x t + ν t = θ X t + ν t. ( y (0) t ) 2 d θ X t + ς β j. Why LASSO? Avoid overfitting. Large dataset compared to the sample size. Automatic model selection. j=1
46 Counterfactual estimation LASSO Catalog of hypotheses Design Let Σ 1 T1 T 1 t=1 E(x tx t ) and S 0 = {i : θ 0,1 0} (set of non-zero parameters). There exists a constant ψ 0 > 0 such that θ[s 0 ] 2 1 θσθs 0 ψ0 2, for all θ[s c 0 ] 1 3 θ[s 0 ] 1.
47 Counterfactual estimation LASSO Catalog of hypotheses Design Let Σ 1 T1 T 1 t=1 E(x tx t ) and S 0 = {i : θ 0,1 0} (set of non-zero parameters). There exists a constant ψ 0 > 0 such that θ[s 0 ] 2 1 θσθs 0 ψ0 2, for all θ[s c 0 ] 1 3 θ[s 0 ] 1. Compatibility condition of Bülhmann and van der Geer (2011).
48 Counterfactual estimation LASSO Catalog of hypotheses Design Let Σ 1 T1 T 1 t=1 E(x tx t ) and S 0 = {i : θ 0,1 0} (set of non-zero parameters). There exists a constant ψ 0 > 0 such that θ[s 0 ] 2 1 θσθs 0 ψ0 2, for all θ[s c 0 ] 1 3 θ[s 0 ] 1. Compatibility condition of Bülhmann and van der Geer (2011). Similar to the restriction of the smallest eigenvalue of Σ.
49 Counterfactual estimation LASSO Catalog of hypotheses Design Let Σ 1 T1 T 1 t=1 E(x tx t ) and S 0 = {i : θ 0,1 0} (set of non-zero parameters). There exists a constant ψ 0 > 0 such that θ[s 0 ] 2 1 θσθs 0 ψ0 2, for all θ[s c 0 ] 1 3 θ[s 0 ] 1. Compatibility condition of Bülhmann and van der Geer (2011). Similar to the restriction of the smallest eigenvalue of Σ. Important for prediction consistency and l 1 -consistency of the LASSO.
50 Counterfactual estimation LASSO Catalog of hypotheses Heterogeneity and dynamics Let w t (ν t, x t ), then: (a) {w t } is strong mixing with α(m) = exp( cm) for some c c > 0 (b) E w it 2γ+δ c γ for some γ > 2 and δ > 0 for all 1 i d, 1 t T and T 1, (c) E(νt 2 ) ϵ > 0, for all 1 t T and T 1.
51 Counterfactual estimation LASSO Catalog of hypotheses Heterogeneity and dynamics Let w t (ν t, x t ), then: (a) {w t } is strong mixing with α(m) = exp( cm) for some c c > 0 (b) E w it 2γ+δ c γ for some γ > 2 and δ > 0 for all 1 i d, 1 t T and T 1, (c) E(νt 2 ) ϵ > 0, for all 1 t T and T 1. w t is an α-mixing process with exponential decay.
52 Counterfactual estimation LASSO Catalog of hypotheses Heterogeneity and dynamics Let w t (ν t, x t ), then: (a) {w t } is strong mixing with α(m) = exp( cm) for some c c > 0 (b) E w it 2γ+δ c γ for some γ > 2 and δ > 0 for all 1 i d, 1 t T and T 1, (c) E(νt 2 ) ϵ > 0, for all 1 t T and T 1. w t is an α-mixing process with exponential decay. Part (b) bounds uniformly some higher moments Law of Large Numbers.
53 Counterfactual estimation LASSO Catalog of hypotheses Heterogeneity and dynamics Let w t (ν t, x t ), then: (a) {w t } is strong mixing with α(m) = exp( cm) for some c c > 0 (b) E w it 2γ+δ c γ for some γ > 2 and δ > 0 for all 1 i d, 1 t T and T 1, (c) E(νt 2 ) ϵ > 0, for all 1 t T and T 1. w t is an α-mixing process with exponential decay. Part (b) bounds uniformly some higher moments Law of Large Numbers. Part (c) Sufficient condition for the Central Limit Theorem.
54 Counterfactual estimation LASSO Catalog of hypotheses Regularity ( ) (a) ς = O d 1/γ T (b) s 0 d 2/γ T = o(1)
55 Counterfactual estimation LASSO Catalog of hypotheses Regularity ( ) (a) ς = O d 1/γ T (b) s 0 d 2/γ T = o(1) Part (a) puts discipline on the growth rate of the regularization parameter.
56 Counterfactual estimation LASSO Catalog of hypotheses Regularity ( ) (a) ς = O d 1/γ T (b) s 0 d 2/γ T = o(1) Part (a) puts discipline on the growth rate of the regularization parameter. Part (b) bounds the number of (total/relevant) parameters.
57 Counterfactual estimation LASSO Catalog of hypotheses Regularity ( ) (a) ς = O d 1/γ T (b) s 0 d 2/γ T = o(1) Part (a) puts discipline on the growth rate of the regularization parameter. Part (b) bounds the number of (total/relevant) parameters. Both conditions can be relaxed if normality is assumed.
58 Counterfactual estimation LASSO Results Consistency and Asymptotic Normality Let M be the model defined as before, whose parameters are estimated by LASSO, then under previous assumptions and as T : sup [ 1/2 sup P P TΩ P P a R q T ( ] T T ) a Φ(a) 0, where Ω T is defined in the previous proposition and Φ( ) is the cumulative distribution function of a zero-mean normal random vector with identity covariance matrix. The inequality is defined element-wise.
59 Counterfactual estimation LASSO Results Uniform Confidence Interval Let Ω T be a consistent estimator for Ω T uniformly in P P. Under the same conditions as before: [ I α j,t ± ω ] j Φ 1 (1 α/2) T for each j = 1,..., q, where ω j = [ Ω] jj and Φ 1 ( ) is the quantile function of a standard normal distribution. I α is uniformly valid (honest) in the sense that for a given ϵ > 0, there exists a T ϵ such that for all T > T ϵ : sup P P ( j,t I α ) (1 α) < ϵ. P P
60 Counterfactual estimation LASSO Results Uniform Hypothesis Test Let Ω T be a consistent estimator for Ω T uniformly in P P. Under the same conditions as before, for a given ϵ > 0, there exists a T ϵ such that for all T > T ϵ : sup P P (W T c α ) (1 α) < ϵ, P P where W T T T T, P(χ 2 q c α ) = 1 α and χ 2 q is a chi-square distributed random variable with q degrees of freedom. Ω 1 T
61 Additional Results 1. Tests under unknown intervention date. Results are not altered when the intervention date is estimated. λ is super-consistent.
62 Additional Results 1. Tests under unknown intervention date. Results are not altered when the intervention date is estimated. λ is super-consistent. 2. Multiple interventions.
63 Additional Results 1. Tests under unknown intervention date. Results are not altered when the intervention date is estimated. λ is super-consistent. 2. Multiple interventions. 3. Test for the untreated peers and /or treated unit.
64 Additional Results 1. Tests under unknown intervention date. Results are not altered when the intervention date is estimated. λ is super-consistent. 2. Multiple interventions. 3. Test for the untreated peers and /or treated unit. 4. The framework is extended to conduct joint test (e.g., affecting the mean and the variance simultaneously)
65 Additional Results 1. Tests under unknown intervention date. Results are not altered when the intervention date is estimated. λ is super-consistent. 2. Multiple interventions. 3. Test for the untreated peers and /or treated unit. 4. The framework is extended to conduct joint test (e.g., affecting the mean and the variance simultaneously) 5. R package: Fonseca, Masini, Medeiros, and Vasconcelos (2017a)
66 Additional Results 1. Tests under unknown intervention date. Results are not altered when the intervention date is estimated. λ is super-consistent. 2. Multiple interventions. 3. Test for the untreated peers and /or treated unit. 4. The framework is extended to conduct joint test (e.g., affecting the mean and the variance simultaneously) 5. R package: Fonseca, Masini, Medeiros, and Vasconcelos (2017a) 6. Carvalho, Masini and Medeiros (2017): Integrated data
67 Additional Results 1. Tests under unknown intervention date. Results are not altered when the intervention date is estimated. λ is super-consistent. 2. Multiple interventions. 3. Test for the untreated peers and /or treated unit. 4. The framework is extended to conduct joint test (e.g., affecting the mean and the variance simultaneously) 5. R package: Fonseca, Masini, Medeiros, and Vasconcelos (2017a) 6. Carvalho, Masini and Medeiros (2017): Integrated data 7. Fonseca, Masini, Medeiros, and Vasconcelos (2017b): Random Forests and other machine learning techniques.
68 Monte Carlo results Data Generating Process (DGP) Consider the following model for i {1,..., n} and t 1: where: ε it = Λ i f t + η it, z (0) it = ρa i z (0) it 1 + ε it, f t = [1, (t/t) φ, v t ], z it R q, ρ [0, 1), φ > 0, A i (q q) is a diagonal matrix with diagonal elements strictly between 1 and 1, iid N(0, 1), η it N(0, rf 2I nq), and Λ i is a (q 3) matrix of factor loadings. v t iid
69 Monte Carlo results Rejection Rates under the Null (1/2) Bias Var ŝ 0 α = Innovation Distribution T = 100, d = 100, s 0 = 5, φ = 0, ρ = 0 Normal χ 2 (1) t-stud(3) Mixed-Normal Sample Size normal dist., d = 100, s 0 = 5, φ = 0, ρ = 0 T = Number of Total Covariates normal dist., T = 100, s 0 = 5, φ = 0, ρ = 0 d =
70 Monte Carlo results Rejection Rates under the Null (2/2) Bias Var ŝ 0 α = Number of Relevant (non-zero) Covariates normal dist., T = 100, d = 100, φ = 0, ρ = 0 s 0 = Determinist Trend (t/t) φ normal dist., T = 100, d = 100, s 0 = 5, ρ = 0 φ = Serial Correlation normal dist., T = 100, d = 100, s 0 = 5, φ = 0 ρ =
71 Monte Carlo results Rejection rates under the alternative α = Step Intervention δ t = c σ 1 1{t T 0 } c = Linear Increasing δ t = c σ 1 t T 0 +1 T T {t T 0} c = T t+1 Linear Decreasing δ t = c σ 1 T T {t T 0} c =
72 Monte Carlo results Horse race (1/2) BA SC DiD* DiD GM* GM ArCo* ArCo No Time Trend (φ = 0) and No Serial Correlation (ρ = 0) Bias Var MSE No Time Trend (φ = 0) Bias Var MSE Common Linear Time Trend (φ = 1) Bias Var MSE Idiosyncratic Linear Time Trend (φ = 1) Bias Var MSE
73 Monte Carlo results Horse race (2/2) BA SC DiD* DiD GM* GM ArCo* ArCo Common Quadratic Time Trend (φ = 2) Bias Var MSE Idiosyncratic Quadratic Time Trend (φ = 2) Bias Var MSE
74 Kernel Normal Monte Carlo results No trend and no serial correlation BA SC DiD* DiD GM* GM ArCo* ArCo
75 Kernel Normal Monte Carlo results No trend BA SC DiD* DiD GM* GM ArCo* ArCo
76 Kernel Normal Monte Carlo results Common linear trend BA SC DiD* DiD GM* GM ArCo* ArCo
77 Kernel Normal Monte Carlo results Heterogeneous linear trend BA SC DiD* DiD GM* GM ArCo* ArCo
78 Kernel Normal Monte Carlo results Common quadratic trend BA SC DiD* DiD GM* GM ArCo* ArCo
79 Kernel Normal Monte Carlo results Heterogeneous quadratic trend BA SC DiD* DiD GM* GM ArCo* ArCo
80 Application In October 2007, the state government of São Paulo in Brazil implemented a anti tax evasion scheme called Nota Fiscal Paulista (NFP).
81 Application In October 2007, the state government of São Paulo in Brazil implemented a anti tax evasion scheme called Nota Fiscal Paulista (NFP). The NFP consists of a tax rebate from a state tax named ICMS (tax on circulation of products and services).
82 Application In October 2007, the state government of São Paulo in Brazil implemented a anti tax evasion scheme called Nota Fiscal Paulista (NFP). The NFP consists of a tax rebate from a state tax named ICMS (tax on circulation of products and services). Incentive to the consumer to ask for sales receipts.
83 Application In October 2007, the state government of São Paulo in Brazil implemented a anti tax evasion scheme called Nota Fiscal Paulista (NFP). The NFP consists of a tax rebate from a state tax named ICMS (tax on circulation of products and services). Incentive to the consumer to ask for sales receipts. Additionally, the registered sales receipts give the consumer the right to participate in monthly lotteries promoted by the government.
84 Application Under the premisses that: 1. a certain degree of tax evasion was occurring before the intervention, 2. the sellers has some degree of market power and 3. the penalty for tax-evasion is large enough to alter the seller behaviour, one is expected to see an upwards movements in prices due to an increase in marginal cost. Hence, we would like to investigate whether the NFP had an impact on consumer prices in São Paulo.
85 Application The NFP was not implemented throughout the sectors in the economy at once.
86 Application The NFP was not implemented throughout the sectors in the economy at once. The first sector were restaurants, followed by bakeries, bar and other food service retailers.
87 Application The NFP was not implemented throughout the sectors in the economy at once. The first sector were restaurants, followed by bakeries, bar and other food service retailers. The sample then consists of monthly inflation (food outside home) index for 10 metropolitan areas including São Paulo from Jan-05 to Sep-09.
88 Application 1200 Dec 07 Jan 08 Feb 08 Mar 08 Apr 08 May 08 Jun 08 Jul 08 Aug 08 Sep 08 Oct 08 Nov 08 Dec 08 Jan 09 Feb 09 Mar 09 Apr 09 May 09 Jun 09 Jul 09 Aug 09 Sep 09 # of participants (millions) Distributed Value (millions R$)
89 Application Panel (a): ArCo Estimates (1) (2) (3) (4) (5) (6) (7) (0.1704) (0.1486) (0.1432) (0.1480) (0.2010) (0.1600) (0.1539) Inflation Yes No No No Yes Yes Yes GDP No Yes No No Yes Yes Yes Retail Sales No No Yes No No Yes Yes Credit No No No Yes No No Yes R-squared d s T T DiD (0.1466) GM (0.1234) Panel (b): Alternative Estimates (1) (2) (3) (4) (5) (6) (0.1456) (0.1467) (0.1556) (0.1457) (0.1466) (0.1243) (0.1246) (0.1227) (0.1228) GDP Yes No No Yes Yes No Retail Sales No Yes No Yes Yes No Credit No No Yes No Yes No
90 Application ArCo estimates: CPI inflation (food outside home) Jan-05 Jan-06 Jan-07 Jan-08 Jan-09 Pre-intervention fit Counterfactual Actual index
91 Application ArCo estimates: CPI (food outside home) Jan-05 Jan-06 Jan-07 Jan-08 Jan-09 Pre-intervention fit Counterfactual Actual index
92 Concluding remarks Important to note that the ArCo is more than just another structural break estimator. By controlling for common shocks that might have occurred in all units after the intervention, it provides a effective methodology to isolate the effect of the intervention of interest. Providing compelling evidence for the causality of the structural break observed in the unit of interest It is almost a model-free methodology. It implies no structure in the stationary process (no # of lags to be determined nor # MA terms)
TEXTO PARA DISCUSSÃO. No ARCO: an artificial counterfactual approach for high-dimensional panel time-series data
EXO PARA DISCUSSÃO No. 653 ARCO: an artificial counterfactual approach for high-dimensional panel time-series data Carlos C. Carvalho Ricardo Masini Marcelo C. Medeiros DEPARAMENO DE ECONOMIA www.econ.puc-rio.br
More informationIntervention Impact Evaluation on Aggregated Data: The Artificial Counterfactual Approach for Stationary Processes
Intervention Impact Evaluation on Aggregated Data: he Artificial Counterfactual Approach for Stationary Processes Carlos V. Carvalho, Ricardo Masini and Marcelo C. Medeiros January 30, 2015 Abstract In
More informationRevisiting the Synthetic Control Estimator
Revisiting the Synthetic Control Estimator Bruno Ferman Cristine Pinto Sao Paulo School of Economics - FGV First Draft: June, 26 This Draft: October, 27 Please click here for the most recent version Abstract
More informationRevisiting the Synthetic Control Estimator
MPRA Munich Personal RePEc Archive Revisiting the Synthetic Control Estimator Bruno Ferman and Cristine Pinto Sao Paulo School of Economics - FGV 23 November 2016 Online at https://mpra.ub.uni-muenchen.de/77930/
More informationRevisiting the Synthetic Control Estimator
Revisiting the Synthetic Control Estimator Bruno Ferman Cristine Pinto Sao Paulo School of Economics - FGV First Draft: June, 2016 This Draft: August, 2017 Please click here for the most recent version
More informationArCo: An R package to Estimate Artificial Counterfactuals by Yuri R. Fonseca, Ricardo P. Masini, Marcelo C. Medeiros and Gabriel F. R.
CONTRIBUTED RESEARCH ARTICLE 91 ArCo: An R package to Estimate Artificial Counterfactuals by Yuri R. Fonseca, Ricardo P. Masini, Marcelo C. Medeiros and Gabriel F. R. Vasconcelos Abstract In this paper
More informationWhen Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?
When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University
More informationOutside the Box: Using Synthetic Control Methods as a Forecasting Technique
Outside the Box: Using Synthetic Control Methods as a Forecasting Technique Stefan Klößner (Saarland University) Gregor Pfeifer (University of Hohenheim) October 6, 2016 Abstract We introduce synthetic
More informationComparing Forecast Accuracy of Different Models for Prices of Metal Commodities
Comparing Forecast Accuracy of Different Models for Prices of Metal Commodities João Victor Issler (FGV) and Claudia F. Rodrigues (VALE) August, 2012 J.V. Issler and C.F. Rodrigues () Forecast Models for
More informationInference for Factor Model Based Average Treatment
Inference for Factor Model Based Average Treatment Effects Kathleen T. Li Marketing Department The Wharton School, the University of Pennsylvania January, 208 Abstract In this paper we consider the problem
More informationWhen Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?
When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Princeton University Asian Political Methodology Conference University of Sydney Joint
More informationApplied Microeconometrics (L5): Panel Data-Basics
Applied Microeconometrics (L5): Panel Data-Basics Nicholas Giannakopoulos University of Patras Department of Economics ngias@upatras.gr November 10, 2015 Nicholas Giannakopoulos (UPatras) MSc Applied Economics
More informationNon-linear panel data modeling
Non-linear panel data modeling Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini May 2010 Laura Magazzini (@univr.it) Non-linear panel data modeling May 2010 1
More informationApproximating Fixed-Horizon Forecasts Using Fixed-Event Forecasts
Approximating Fixed-Horizon Forecasts Using Fixed-Event Forecasts Malte Knüppel and Andreea L. Vladu Deutsche Bundesbank 9th ECB Workshop on Forecasting Techniques 4 June 216 This work represents the authors
More informationOn the maximum and minimum response to an impulse in Svars
On the maximum and minimum response to an impulse in Svars Bulat Gafarov (PSU), Matthias Meier (UBonn), and José-Luis Montiel-Olea (NYU) September 23, 2015 1 / 58 Introduction Structural VAR: Theoretical
More informationCentral Bank of Chile October 29-31, 2013 Bruce Hansen (University of Wisconsin) Structural Breaks October 29-31, / 91. Bruce E.
Forecasting Lecture 3 Structural Breaks Central Bank of Chile October 29-31, 2013 Bruce Hansen (University of Wisconsin) Structural Breaks October 29-31, 2013 1 / 91 Bruce E. Hansen Organization Detection
More informationShort T Panels - Review
Short T Panels - Review We have looked at methods for estimating parameters on time-varying explanatory variables consistently in panels with many cross-section observation units but a small number of
More informationAssessing the eect of 2008 crisis on Brazilian Economy.
Assessing the eect of 2008 crisis on Brazilian Economy. Emerson Fernandes Marçal Ronan Cunha Giovanni Merlin Oscar Simões CEMAP-EESP-FGV December 5, 2016 Talking points Motivation Objectives Literature
More informationTIME SERIES ANALYSIS AND FORECASTING USING THE STATISTICAL MODEL ARIMA
CHAPTER 6 TIME SERIES ANALYSIS AND FORECASTING USING THE STATISTICAL MODEL ARIMA 6.1. Introduction A time series is a sequence of observations ordered in time. A basic assumption in the time series analysis
More informationChapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE
Chapter 6. Panel Data Joan Llull Quantitative Statistical Methods II Barcelona GSE Introduction Chapter 6. Panel Data 2 Panel data The term panel data refers to data sets with repeated observations over
More informationVector Auto-Regressive Models
Vector Auto-Regressive Models Laurent Ferrara 1 1 University of Paris Nanterre M2 Oct. 2018 Overview of the presentation 1. Vector Auto-Regressions Definition Estimation Testing 2. Impulse responses functions
More informationVAR Models and Applications
VAR Models and Applications Laurent Ferrara 1 1 University of Paris West M2 EIPMC Oct. 2016 Overview of the presentation 1. Vector Auto-Regressions Definition Estimation Testing 2. Impulse responses functions
More information= observed volume on day l for bin j = base volume in jth bin, and = residual error, assumed independent with mean zero.
QB research September 4, 06 Page -Minute Bin Volume Forecast Model Overview In response to strong client demand, Quantitative Brokers (QB) has developed a new algorithm called Closer that specifically
More informationDEPARTMENT OF ECONOMICS
ISSN 0819-64 ISBN 0 7340 616 1 THE UNIVERSITY OF MELBOURNE DEPARTMENT OF ECONOMICS RESEARCH PAPER NUMBER 959 FEBRUARY 006 TESTING FOR RATE-DEPENDENCE AND ASYMMETRY IN INFLATION UNCERTAINTY: EVIDENCE FROM
More informationIntroduction to Forecasting
Introduction to Forecasting Introduction to Forecasting Predicting the future Not an exact science but instead consists of a set of statistical tools and techniques that are supported by human judgment
More informationGAMINGRE 8/1/ of 7
FYE 09/30/92 JULY 92 0.00 254,550.00 0.00 0 0 0 0 0 0 0 0 0 254,550.00 0.00 0.00 0.00 0.00 254,550.00 AUG 10,616,710.31 5,299.95 845,656.83 84,565.68 61,084.86 23,480.82 339,734.73 135,893.89 67,946.95
More informationApproximating fixed-horizon forecasts using fixed-event forecasts
Approximating fixed-horizon forecasts using fixed-event forecasts Comments by Simon Price Essex Business School June 2016 Redundant disclaimer The views in this presentation are solely those of the presenter
More informationIndependent and conditionally independent counterfactual distributions
Independent and conditionally independent counterfactual distributions Marcin Wolski European Investment Bank M.Wolski@eib.org Society for Nonlinear Dynamics and Econometrics Tokyo March 19, 2018 Views
More informationVolume 03, Issue 6. Comparison of Panel Cointegration Tests
Volume 03, Issue 6 Comparison of Panel Cointegration Tests Deniz Dilan Karaman Örsal Humboldt University Berlin Abstract The main aim of this paper is to compare the size and size-adjusted power properties
More informationWhen Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?
When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University Joint
More informationA look into the factor model black box Publication lags and the role of hard and soft data in forecasting GDP
A look into the factor model black box Publication lags and the role of hard and soft data in forecasting GDP Marta Bańbura and Gerhard Rünstler Directorate General Research European Central Bank November
More informationCross-Validating Synthetic Controls
Cross-Validating Synthetic Controls Martin Becker, Saarland University Stefan Klößner, Saarland University Gregor Pfeifer, University of Hohenheim October 12, 2017 Abstract While the literature on synthetic
More informationAn exact and robust conformal inference method for counterfactual and synthetic controls
An exact and robust conformal inference method for counterfactual and synthetic controls Victor Chernozhukov Kaspar Wüthrich Yu Zhu The Institute for Fiscal Studies Department of Economics, UCL cemmap
More informationIdentifying Aggregate Liquidity Shocks with Monetary Policy Shocks: An Application using UK Data
Identifying Aggregate Liquidity Shocks with Monetary Policy Shocks: An Application using UK Data Michael Ellington and Costas Milas Financial Services, Liquidity and Economic Activity Bank of England May
More informationWhat s New in Econometrics? Lecture 14 Quantile Methods
What s New in Econometrics? Lecture 14 Quantile Methods Jeff Wooldridge NBER Summer Institute, 2007 1. Reminders About Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile Regression
More informationFinal Overview. Introduction to ML. Marek Petrik 4/25/2017
Final Overview Introduction to ML Marek Petrik 4/25/2017 This Course: Introduction to Machine Learning Build a foundation for practice and research in ML Basic machine learning concepts: max likelihood,
More informationAn economic application of machine learning: Nowcasting Thai exports using global financial market data and time-lag lasso
An economic application of machine learning: Nowcasting Thai exports using global financial market data and time-lag lasso PIER Exchange Nov. 17, 2016 Thammarak Moenjak What is machine learning? Wikipedia
More informationIntroduction to Econometrics
Introduction to Econometrics T H I R D E D I T I O N Global Edition James H. Stock Harvard University Mark W. Watson Princeton University Boston Columbus Indianapolis New York San Francisco Upper Saddle
More informationComprehensive Examination Quantitative Methods Spring, 2018
Comprehensive Examination Quantitative Methods Spring, 2018 Instruction: This exam consists of three parts. You are required to answer all the questions in all the parts. 1 Grading policy: 1. Each part
More informationRepeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data
Panel data Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data - possible to control for some unobserved heterogeneity - possible
More informationNew Developments in Econometrics Lecture 16: Quantile Estimation
New Developments in Econometrics Lecture 16: Quantile Estimation Jeff Wooldridge Cemmap Lectures, UCL, June 2009 1. Review of Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile
More informationIV Quantile Regression for Group-level Treatments, with an Application to the Distributional Effects of Trade
IV Quantile Regression for Group-level Treatments, with an Application to the Distributional Effects of Trade Denis Chetverikov Brad Larsen Christopher Palmer UCLA, Stanford and NBER, UC Berkeley September
More informationDifference-in-Differences Estimation
Difference-in-Differences Estimation Jeff Wooldridge Michigan State University Programme Evaluation for Policy Analysis Institute for Fiscal Studies June 2012 1. The Basic Methodology 2. How Should We
More informationInterpreting Regression Results
Interpreting Regression Results Carlo Favero Favero () Interpreting Regression Results 1 / 42 Interpreting Regression Results Interpreting regression results is not a simple exercise. We propose to split
More informationIdentifying Causal Effects in Time Series Models
Identifying Causal Effects in Time Series Models Aaron Smith UC Davis October 20, 2015 Slides available at http://asmith.ucdavis.edu 1 Identifying Causal Effects in Time Series Models If all the Metrics
More informationDepartment of Economics, UCSB UC Santa Barbara
Department of Economics, UCSB UC Santa Barbara Title: Past trend versus future expectation: test of exchange rate volatility Author: Sengupta, Jati K., University of California, Santa Barbara Sfeir, Raymond,
More informationIdentifying the Monetary Policy Shock Christiano et al. (1999)
Identifying the Monetary Policy Shock Christiano et al. (1999) The question we are asking is: What are the consequences of a monetary policy shock a shock which is purely related to monetary conditions
More informationarxiv: v3 [econ.em] 17 Apr 2018
An Exact and Robust Conformal Inference Method for Counterfactual and Synthetic Controls Victor Chernozhukov Kaspar Wüthrich Yinchu Zhu arxiv:1712.09089v3 [econ.em] 17 Apr 2018 April 16, 2018 Abstract
More informationNew Developments in Econometrics Lecture 11: Difference-in-Differences Estimation
New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation Jeff Wooldridge Cemmap Lectures, UCL, June 2009 1. The Basic Methodology 2. How Should We View Uncertainty in DD Settings?
More informationEconometrics of Panel Data
Econometrics of Panel Data Jakub Mućk Meeting # 6 Jakub Mućk Econometrics of Panel Data Meeting # 6 1 / 36 Outline 1 The First-Difference (FD) estimator 2 Dynamic panel data models 3 The Anderson and Hsiao
More informationCONTAGION VERSUS FLIGHT TO QUALITY IN FINANCIAL MARKETS
EVA IV, CONTAGION VERSUS FLIGHT TO QUALITY IN FINANCIAL MARKETS Jose Olmo Department of Economics City University, London (joint work with Jesús Gonzalo, Universidad Carlos III de Madrid) 4th Conference
More informationSingle Equation Linear GMM with Serially Correlated Moment Conditions
Single Equation Linear GMM with Serially Correlated Moment Conditions Eric Zivot October 28, 2009 Univariate Time Series Let {y t } be an ergodic-stationary time series with E[y t ]=μ and var(y t )
More informationReview of Classical Least Squares. James L. Powell Department of Economics University of California, Berkeley
Review of Classical Least Squares James L. Powell Department of Economics University of California, Berkeley The Classical Linear Model The object of least squares regression methods is to model and estimate
More informationEconometric Analysis of Cross Section and Panel Data
Econometric Analysis of Cross Section and Panel Data Jeffrey M. Wooldridge / The MIT Press Cambridge, Massachusetts London, England Contents Preface Acknowledgments xvii xxiii I INTRODUCTION AND BACKGROUND
More informationShortfalls of Panel Unit Root Testing. Jack Strauss Saint Louis University. And. Taner Yigit Bilkent University. Abstract
Shortfalls of Panel Unit Root Testing Jack Strauss Saint Louis University And Taner Yigit Bilkent University Abstract This paper shows that (i) magnitude and variation of contemporaneous correlation are
More informationBUSI 460 Suggested Answers to Selected Review and Discussion Questions Lesson 7
BUSI 460 Suggested Answers to Selected Review and Discussion Questions Lesson 7 1. The definitions follow: (a) Time series: Time series data, also known as a data series, consists of observations on a
More informationOil price and macroeconomy in Russia. Abstract
Oil price and macroeconomy in Russia Katsuya Ito Fukuoka University Abstract In this note, using the VEC model we attempt to empirically investigate the effects of oil price and monetary shocks on the
More information1 Estimation of Persistent Dynamic Panel Data. Motivation
1 Estimation of Persistent Dynamic Panel Data. Motivation Consider the following Dynamic Panel Data (DPD) model y it = y it 1 ρ + x it β + µ i + v it (1.1) with i = {1, 2,..., N} denoting the individual
More informationSingle Equation Linear GMM with Serially Correlated Moment Conditions
Single Equation Linear GMM with Serially Correlated Moment Conditions Eric Zivot November 2, 2011 Univariate Time Series Let {y t } be an ergodic-stationary time series with E[y t ]=μ and var(y t )
More informationStability-based generation of scenario trees for multistage stochastic programs
Stability-based generation of scenario trees for multistage stochastic programs H. Heitsch and W. Römisch Humboldt-University Berlin Institute of Mathematics 10099 Berlin, Germany http://www.math.hu-berlin.de/~romisch
More informationGov 2002: 9. Differences in Differences
Gov 2002: 9. Differences in Differences Matthew Blackwell October 30, 2015 1 / 40 1. Basic differences-in-differences model 2. Conditional DID 3. Standard error issues 4. Other DID approaches 2 / 40 Where
More informationBacktesting Marginal Expected Shortfall and Related Systemic Risk Measures
Backtesting Marginal Expected Shortfall and Related Systemic Risk Measures Denisa Banulescu 1 Christophe Hurlin 1 Jérémy Leymarie 1 Olivier Scaillet 2 1 University of Orleans 2 University of Geneva & Swiss
More informationLIKELIHOOD RATIO INFERENCE FOR MISSING DATA MODELS
LIKELIHOOD RATIO IFERECE FOR MISSIG DATA MODELS KARU ADUSUMILLI AD TAISUKE OTSU Abstract. Missing or incomplete outcome data is a ubiquitous problem in biomedical and social sciences. Under the missing
More informationTesting Random Effects in Two-Way Spatial Panel Data Models
Testing Random Effects in Two-Way Spatial Panel Data Models Nicolas Debarsy May 27, 2010 Abstract This paper proposes an alternative testing procedure to the Hausman test statistic to help the applied
More informationStatistical Inference for Average Treatment Effects Estimated by Synthetic Control Methods
Statistical Inference for Average Treatment Effects Estimated by Synthetic Control Methods Kathleen T. Li Marketing Department The Wharton School, the University of Pennsylvania March 2018 Abstract The
More informationReview. DS GA 1002 Statistical and Mathematical Models. Carlos Fernandez-Granda
Review DS GA 1002 Statistical and Mathematical Models http://www.cims.nyu.edu/~cfgranda/pages/dsga1002_fall16 Carlos Fernandez-Granda Probability and statistics Probability: Framework for dealing with
More informationPredicting bond returns using the output gap in expansions and recessions
Erasmus university Rotterdam Erasmus school of economics Bachelor Thesis Quantitative finance Predicting bond returns using the output gap in expansions and recessions Author: Martijn Eertman Studentnumber:
More informationVector autoregressions, VAR
1 / 45 Vector autoregressions, VAR Chapter 2 Financial Econometrics Michael Hauser WS17/18 2 / 45 Content Cross-correlations VAR model in standard/reduced form Properties of VAR(1), VAR(p) Structural VAR,
More informationModel Selection and Geometry
Model Selection and Geometry Pascal Massart Université Paris-Sud, Orsay Leipzig, February Purpose of the talk! Concentration of measure plays a fundamental role in the theory of model selection! Model
More informationEconometría 2: Análisis de series de Tiempo
Econometría 2: Análisis de series de Tiempo Karoll GOMEZ kgomezp@unal.edu.co http://karollgomez.wordpress.com Segundo semestre 2016 IX. Vector Time Series Models VARMA Models A. 1. Motivation: The vector
More informationThe Power of the Synthetic Control Method
Working papers in transport, tourism, information technology and microdata analysis The Power of the Synthetic Control Method Kenneth Carling Yujiao Li Editor: Hasan Fleyeh Nr: 2016:10 Working papers in
More informationabstract sergio firpo and vítor possebom april 2 nd, 2016
SY N T H E T I C C O N T R O L E S T I M AT O R : A G E N E R A L I Z E D I N F E R E N C E P R O C E D U R E A N D C O N F I D E N C E S E T S * sergio firpo and vítor possebom april 2 nd, 2016 abstract
More informationGenerated Covariates in Nonparametric Estimation: A Short Review.
Generated Covariates in Nonparametric Estimation: A Short Review. Enno Mammen, Christoph Rothe, and Melanie Schienle Abstract In many applications, covariates are not observed but have to be estimated
More informationEconometrics in a nutshell: Variation and Identification Linear Regression Model in STATA. Research Methods. Carlos Noton.
1/17 Research Methods Carlos Noton Term 2-2012 Outline 2/17 1 Econometrics in a nutshell: Variation and Identification 2 Main Assumptions 3/17 Dependent variable or outcome Y is the result of two forces:
More informationAppendix A: The time series behavior of employment growth
Unpublished appendices from The Relationship between Firm Size and Firm Growth in the U.S. Manufacturing Sector Bronwyn H. Hall Journal of Industrial Economics 35 (June 987): 583-606. Appendix A: The time
More informationEconometric Methods for Panel Data
Based on the books by Baltagi: Econometric Analysis of Panel Data and by Hsiao: Analysis of Panel Data Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies
More informationThe risk of machine learning
/ 33 The risk of machine learning Alberto Abadie Maximilian Kasy July 27, 27 2 / 33 Two key features of machine learning procedures Regularization / shrinkage: Improve prediction or estimation performance
More informationEuro-indicators Working Group
Euro-indicators Working Group Luxembourg, 9 th & 10 th June 2011 Item 9.4 of the Agenda New developments in EuroMIND estimates Rosa Ruggeri Cannata Doc 309/11 What is EuroMIND? EuroMIND is a Monthly INDicator
More informationA Significance Test for the Lasso
A Significance Test for the Lasso Lockhart R, Taylor J, Tibshirani R, and Tibshirani R Ashley Petersen May 14, 2013 1 Last time Problem: Many clinical covariates which are important to a certain medical
More informationTime-Varying Vector Autoregressive Models with Structural Dynamic Factors
Time-Varying Vector Autoregressive Models with Structural Dynamic Factors Paolo Gorgi, Siem Jan Koopman, Julia Schaumburg http://sjkoopman.net Vrije Universiteit Amsterdam School of Business and Economics
More informationNews Shocks: Different Effects in Boom and Recession?
News Shocks: Different Effects in Boom and Recession? Maria Bolboaca, Sarah Fischer University of Bern Study Center Gerzensee June 7, 5 / Introduction News are defined in the literature as exogenous changes
More informationStatistics for IT Managers
Statistics for IT Managers 95-796, Fall 2012 Module 2: Hypothesis Testing and Statistical Inference (5 lectures) Reading: Statistics for Business and Economics, Ch. 5-7 Confidence intervals Given the sample
More informationLimited Dependent Variables and Panel Data
and Panel Data June 24 th, 2009 Structure 1 2 Many economic questions involve the explanation of binary variables, e.g.: explaining the participation of women in the labor market explaining retirement
More informationB y t = γ 0 + Γ 1 y t + ε t B(L) y t = γ 0 + ε t ε t iid (0, D) D is diagonal
Structural VAR Modeling for I(1) Data that is Not Cointegrated Assume y t =(y 1t,y 2t ) 0 be I(1) and not cointegrated. That is, y 1t and y 2t are both I(1) and there is no linear combination of y 1t and
More informationMachine learning, shrinkage estimation, and economic theory
Machine learning, shrinkage estimation, and economic theory Maximilian Kasy December 14, 2018 1 / 43 Introduction Recent years saw a boom of machine learning methods. Impressive advances in domains such
More informationStructural VAR Models and Applications
Structural VAR Models and Applications Laurent Ferrara 1 1 University of Paris Nanterre M2 Oct. 2018 SVAR: Objectives Whereas the VAR model is able to capture efficiently the interactions between the different
More informationGravity Models, PPML Estimation and the Bias of the Robust Standard Errors
Gravity Models, PPML Estimation and the Bias of the Robust Standard Errors Michael Pfaffermayr August 23, 2018 Abstract In gravity models with exporter and importer dummies the robust standard errors of
More informationEconometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague
Econometrics Week 4 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 23 Recommended Reading For the today Serial correlation and heteroskedasticity in
More informationClustering as a Design Problem
Clustering as a Design Problem Alberto Abadie, Susan Athey, Guido Imbens, & Jeffrey Wooldridge Harvard-MIT Econometrics Seminar Cambridge, February 4, 2016 Adjusting standard errors for clustering is common
More informationExogeneity and Causality
Università di Pavia Exogeneity and Causality Eduardo Rossi University of Pavia Factorization of the density DGP: D t (x t χ t 1, d t ; Ψ) x t represent all the variables in the economy. The econometric
More informationPart II. Time Series
Part II Time Series 12 Introduction This Part is mainly a summary of the book of Brockwell and Davis (2002). Additionally the textbook Shumway and Stoffer (2010) can be recommended. 1 Our purpose is to
More informationEstimating Average Treatment Effects Using a Modified Synthetic Control Method: Theory and Applications
Estimating Average Treatment Effects Using a Modified Synthetic Control Method: Theory and Applications Kathleen T. Li Marketing Department The Wharton School, the University of Pennsylvania June 27, 2017
More informationEconometrics of Panel Data
Econometrics of Panel Data Jakub Mućk Meeting # 9 Jakub Mućk Econometrics of Panel Data Meeting # 9 1 / 22 Outline 1 Time series analysis Stationarity Unit Root Tests for Nonstationarity 2 Panel Unit Root
More informationLeast Squares Estimation of a Panel Data Model with Multifactor Error Structure and Endogenous Covariates
Least Squares Estimation of a Panel Data Model with Multifactor Error Structure and Endogenous Covariates Matthew Harding and Carlos Lamarche January 12, 2011 Abstract We propose a method for estimating
More informationIntroduction to Algorithmic Trading Strategies Lecture 10
Introduction to Algorithmic Trading Strategies Lecture 10 Risk Management Haksun Li haksun.li@numericalmethod.com www.numericalmethod.com Outline Value at Risk (VaR) Extreme Value Theory (EVT) References
More informationCausal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies
Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Kosuke Imai Department of Politics Princeton University November 13, 2013 So far, we have essentially assumed
More informationRegional Policy Evaluation: Interactive Fixed Effects and Synthetic Controls
DISCUSSION PAPER SERIES IZA DP No. 7493 Regional Policy Evaluation: Interactive Fixed Effects and Synthetic Controls Laurent Gobillon Thierry Magnac July 2013 Forschungsinstitut zur Zukunft der Arbeit
More informationTesting Error Correction in Panel data
University of Vienna, Dept. of Economics Master in Economics Vienna 2010 The Model (1) Westerlund (2007) consider the following DGP: y it = φ 1i + φ 2i t + z it (1) x it = x it 1 + υ it (2) where the stochastic
More informationRecent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data
Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Classical regression model b)
More informationEvaluating Value-at-Risk models via Quantile Regression
Evaluating Value-at-Risk models via Quantile Regression Luiz Renato Lima (University of Tennessee, Knoxville) Wagner Gaglianone, Oliver Linton, Daniel Smith. NASM-2009 05/31/2009 Motivation Recent nancial
More information