ArCo: An Artificial Counterfactual Approach for High-Dimensional Data

Size: px
Start display at page:

Download "ArCo: An Artificial Counterfactual Approach for High-Dimensional Data"

Transcription

1 ArCo: An Artificial Counterfactual Approach for High-Dimensional Data Carlos V. de Carvalho BCB and PUC-Rio Marcelo C. Medeiros PUC-Rio Ricardo Masini São Paulo School of Economics BIG DATA, MACHINE LEARNING AND THE MACROECONOMY Norges Bank, Oslo, 2-3 October 2017

2 Overview of the Method Observe aggregated time series data from t = 1 to T time

3 Overview of the Method Intervention occurs at t = T time T 0

4 Overview of the Method No clear controls. Observed variables from untreated peers time T 0

5 Overview of the Method Counterfactual estimation in-sample (before intervention) time T 0

6 Overview of the Method Counterfactual extrapolation (after the intervention) time T 0

7 One-page Summary Artificial Counterfactual (ArCo): alternative framework to measure the impact of an intervention in aggregate data when a control group is not readily available.

8 One-page Summary Artificial Counterfactual (ArCo): alternative framework to measure the impact of an intervention in aggregate data when a control group is not readily available. Generalization of the Synthetic Control (SC) method of Abadie and Gardeazabal (2003, AER) and the panel approach of Hsiao, Ching and Wan (2012, JAE).

9 One-page Summary Artificial Counterfactual (ArCo): alternative framework to measure the impact of an intervention in aggregate data when a control group is not readily available. Generalization of the Synthetic Control (SC) method of Abadie and Gardeazabal (2003, AER) and the panel approach of Hsiao, Ching and Wan (2012, JAE). Our contribution: estimator with ❶ asymptotic theory, ❷ rigorous inference, ❸ high-dimensionality, and ❹ new tests such as detection of the time of the intervention, contamination effects and effects on multiple moments.

10 One-page Summary Artificial Counterfactual (ArCo): alternative framework to measure the impact of an intervention in aggregate data when a control group is not readily available. Generalization of the Synthetic Control (SC) method of Abadie and Gardeazabal (2003, AER) and the panel approach of Hsiao, Ching and Wan (2012, JAE). Our contribution: estimator with ❶ asymptotic theory, ❷ rigorous inference, ❸ high-dimensionality, and ❹ new tests such as detection of the time of the intervention, contamination effects and effects on multiple moments. Key hypothesis: ❶ Interventions takes place in variables that can be modelled as trend-stationary processes. ❷ The intervention affects only the unit of interest

11 One-page Summary Artificial Counterfactual (ArCo): alternative framework to measure the impact of an intervention in aggregate data when a control group is not readily available. Generalization of the Synthetic Control (SC) method of Abadie and Gardeazabal (2003, AER) and the panel approach of Hsiao, Ching and Wan (2012, JAE). Our contribution: estimator with ❶ asymptotic theory, ❷ rigorous inference, ❸ high-dimensionality, and ❹ new tests such as detection of the time of the intervention, contamination effects and effects on multiple moments. Key hypothesis: ❶ Interventions takes place in variables that can be modelled as trend-stationary processes. ❷ The intervention affects only the unit of interest

12 The road map 1. The setup 2. The counterfactual estimation 3. Estimator properties 4. Inference 5. Extensions 6. Simulations 7. Empirical example: Nota Fiscal Paulista 8. Research agenda 9. Concluding remarks

13 Setup Observe q i > 0 variables for i = 1,... n units for t = 1,... T periods (Panel structure): z it = (z 1 it,..., zq i it ).

14 Setup Observe q i > 0 variables for i = 1,... n units for t = 1,... T periods (Panel structure): z it = (z 1 it,..., zq i it ). Unit 1 is exposed to the intervention at known T 0.

15 Setup Observe q i > 0 variables for i = 1,... n units for t = 1,... T periods (Panel structure): z it = (z 1 it,..., zq i it ). Unit 1 is exposed to the intervention at known T 0. The remanning n 1 units z 0t (z 2t,..., z nt ) are an untreated potential control group (donor pool).

16 Setup Observe q i > 0 variables for i = 1,... n units for t = 1,... T periods (Panel structure): z it = (z 1 it,..., zq i it ). Unit 1 is exposed to the intervention at known T 0. The remanning n 1 units z 0t (z 2t,..., z nt ) are an untreated potential control group (donor pool). Potential Outcome notation: z 1t = d t z (1) 1t + (1 d t )z (0) 1t ; d t = z 0t = z (0) 0t where z (1) it z (0) it { 1 if t T 0 0 otherwise is potential outcome under the intervention and the potential outcome with no intervention.

17 Setup Effects on functions of z 1t : h (z 1t ) : R q 1 R q { v p y t = h (z 1t ) e.g. h(v t ) = t v t y t = d t y (1) t + (1 d t )y (0) t

18 Setup Effects on functions of z 1t : h (z 1t ) : R q 1 R q { v p y t = h (z 1t ) e.g. h(v t ) = t v t y t = d t y (1) t + (1 d t )y (0) t Hypothesis of interest: y (1) t = δ t + y (0) t, t = T 0..., T, H 0 : T = 1 T T T [ ] y (1) t y (0) t = 0. }{{} δ t t=t 0

19 Setup Effects on functions of z 1t : h (z 1t ) : R q 1 R q { v p y t = h (z 1t ) e.g. h(v t ) = t v t y t = d t y (1) t + (1 d t )y (0) t Hypothesis of interest: y (1) t = δ t + y (0) t, t = T 0..., T, H 0 : T = 1 T T T [ ] y (1) t y (0) t = 0. }{{} δ t t=t 0 We do not observe the counterfactual y (0) t. Therefore, we construct an estimate ŷ (0) t such that: δ t y (1) t ŷ (0) t for t = T 0,..., T

20 Counterfactual Estimation How should we construct ŷ (0) t?

21 Counterfactual Estimation How should we construct ŷ (0) t? Ideally (in the MSE sense) we would set ŷ (0) t = E(y (0) t z 0t, z 0t 1,..., z 0t p )

22 Counterfactual Estimation How should we construct ŷ (0) t? Ideally (in the MSE sense) we would set ŷ (0) t = E(y (0) t z 0t, z 0t 1,..., z 0t p ) In practice, we choose a (parametric) specification (could be linear or not). Let x t = (z 0t, z 0t 1,..., z 0t p ) and such that E(ν t ) = 0 and y (0) t = M(x t ) + ν t, ŷ (0) t = M(x t ).

23 ArCo estimator The Artificial Counterfactual (ArCo) estimator is then simply given by T = 1 T T T t=t 0 δt, where δ t y t ŷ (0) t, for t = T 0,..., T.

24 ArCo estimator The Artificial Counterfactual (ArCo) estimator is then simply given by T = 1 T T T t=t 0 δt, where δ t y t ŷ (0) t, for t = T 0,..., T. ArCo estimator is a two-step estimator: 1. First step: estimation of M with the pre-intervention sample; 2. Second step: extrapolate M with actual data for x t and compute T.

25 ArCo estimator Estimation sample y t Δ T y t = M(x t )

26 ArCo and the literature Hsiao, Ching and Wan (2012, JAE) Two-step method where M(x t) is a linear and scalar function of a small set of variables from the peers. Correct specification (M is the conditional expectation). Selection of peers by information criteria. Date of the intervention known.

27 ArCo and the literature Hsiao, Ching and Wan (2012, JAE) Two-step method where M(x t) is a linear and scalar function of a small set of variables from the peers. Correct specification (M is the conditional expectation). Selection of peers by information criteria. Date of the intervention known. Differences-in-Differences (DiD) Number of treated units must grow. Parallel trends hypothesis. Similar control group.

28 ArCo and the literature Hsiao, Ching and Wan (2012, JAE) Two-step method where M(x t) is a linear and scalar function of a small set of variables from the peers. Correct specification (M is the conditional expectation). Selection of peers by information criteria. Date of the intervention known. Differences-in-Differences (DiD) Number of treated units must grow. Parallel trends hypothesis. Similar control group. Gobillon and Magnac (2016, REStat) Generalize the above authors by explicitly considering a factor model. interactive fixed effects with strictly exogenous regressors. Asymptotics both on the cross-section and time dimensions.

29 ArCo and the literature Abadie and Gardeazabal (2003, AER) Convex combination of peers. The weights are estimated using time averages of the observed variables. No time-series dynamics.

30 ArCo and the literature Abadie and Gardeazabal (2003, AER) Convex combination of peers. The weights are estimated using time averages of the observed variables. No time-series dynamics. Pesaran et al. (2004, JBES); Dees et al. (2007, JAE) Counterfactual based on variables that belong to the treated unit. No donor pool. Their key assumption is that a subset of variables of the treated unit is invariant to the intervention.

31 ArCo and the literature Abadie and Gardeazabal (2003, AER) Convex combination of peers. The weights are estimated using time averages of the observed variables. No time-series dynamics. Pesaran et al. (2004, JBES); Dees et al. (2007, JAE) Counterfactual based on variables that belong to the treated unit. No donor pool. Their key assumption is that a subset of variables of the treated unit is invariant to the intervention. Angrist, Jordà and Kuersteiner (2016, JBES) Information only on the treated unit and no donor pool is available.

32 ArCo estimator Key Assumption Independence Let z 0t = (z 2t,..., z nt ) denotes the vector of all the observable variables for the untreated units. Then, d s, for all t, s. z 0t =

33 ArCo estimator Key Assumption Independence Let z 0t = (z 2t,..., z nt ) denotes the vector of all the observable variables for the untreated units. Then, d s, for all t, s. z 0t = The independence condition donors are untreated.

34 ArCo estimator Key Assumption Independence Let z 0t = (z 2t,..., z nt ) denotes the vector of all the observable variables for the untreated units. Then, d s, for all t, s. z 0t = The independence condition donors are untreated. Examples of interventions (treatments): Natural disasters: Belasen and Polachek (2008, AER P&P), Cavallo, Galiani, Noy, and Pantano (2013, ReStat), Fujiki and Hsiao (2015, JoE),...

35 ArCo estimator Key Assumption Independence Let z 0t = (z 2t,..., z nt ) denotes the vector of all the observable variables for the untreated units. Then, d s, for all t, s. z 0t = The independence condition donors are untreated. Examples of interventions (treatments): Natural disasters: Belasen and Polachek (2008, AER P&P), Cavallo, Galiani, Noy, and Pantano (2013, ReStat), Fujiki and Hsiao (2015, JoE),... Region specific policies (laws): Hsiao, Ching, and Wan (2012, JAE), Abadie, Diamond, and Hainmueller (AJPS, 2015), Gobillon and Magnac (ReStat, 2016),...

36 ArCo estimator Key Assumption Independence Let z 0t = (z 2t,..., z nt ) denotes the vector of all the observable variables for the untreated units. Then, d s, for all t, s. z 0t = The independence condition donors are untreated. Examples of interventions (treatments): Natural disasters: Belasen and Polachek (2008, AER P&P), Cavallo, Galiani, Noy, and Pantano (2013, ReStat), Fujiki and Hsiao (2015, JoE),... Region specific policies (laws): Hsiao, Ching, and Wan (2012, JAE), Abadie, Diamond, and Hainmueller (AJPS, 2015), Gobillon and Magnac (ReStat, 2016),... New government or political regime: Grier and Maynard (2013, JEBO)

37 ArCo estimator Key Assumption Independence Let z 0t = (z 2t,..., z nt ) denotes the vector of all the observable variables for the untreated units. Then, d s, for all t, s. z 0t = The independence condition donors are untreated.

38 ArCo estimator Key Assumption Independence Let z 0t = (z 2t,..., z nt ) denotes the vector of all the observable variables for the untreated units. Then, d s, for all t, s. z 0t = The independence condition donors are untreated. Examples of interventions (treatments): Abadie and Diamond (2004, AER) and Abadie, Diamond, and Hainmueller (2010, JASA) have, respectively, 1712 and 1280 Google cites as measured on August 2, 2017.

39 ArCo estimator Key Assumption Independence Let z 0t = (z 2t,..., z nt ) denotes the vector of all the observable variables for the untreated units. Then, d s, for all t, s. z 0t = The independence condition donors are untreated. Examples of interventions (treatments): Abadie and Diamond (2004, AER) and Abadie, Diamond, and Hainmueller (2010, JASA) have, respectively, 1712 and 1280 Google cites as measured on August 2, Arguably the most important innovation in the evaluation literature in the last fifteen years is the synthetic control method. Athey and Imbens (2016)

40 Counterfactual Estimation Why do we expect the method to work?

41 Counterfactual Estimation Why do we expect the method to work? Possible data generating mechanism (in reduced form): z (0) it = µ i + Ψ ij ε it j j=0 ε it = Λ i f t + η it where f t (f 1) (0, Q) is a vector of common unobserved factors. Λ i (q i f ) are matrices of factor loadings; η it (q i 1) (0, R i ) is idiosyncratic error term.

42 Counterfactual Estimation Why do we expect the method to work? Possible data generating mechanism (in reduced form): z (0) it = µ i + Ψ ij ε it j j=0 ε it = Λ i f t + η it where f t (f 1) (0, Q) is a vector of common unobserved factors. Λ i (q i f ) are matrices of factor loadings; η it (q i 1) (0, R i ) is idiosyncratic error term. Quality of the pool of donors comes from the factor structure.

43 Counterfactual estimation Least Absolute Shrinkage and Selection Operator (LASSO) Set X t = (1, x t ) R d and consider M(x t ) linear: y (0) t = α + β x t + ν t = θ X t + ν t.

44 Counterfactual estimation Least Absolute Shrinkage and Selection Operator (LASSO) Set X t = (1, x t ) R d and consider M(x t ) linear: y (0) t Estimation: θ = arg min T 0 1 t=1 = α + β x t + ν t = θ X t + ν t. ( y (0) t ) 2 d θ X t + ς β j. j=1

45 Counterfactual estimation Least Absolute Shrinkage and Selection Operator (LASSO) Set X t = (1, x t ) R d and consider M(x t ) linear: y (0) t Estimation: θ = arg min T 0 1 t=1 = α + β x t + ν t = θ X t + ν t. ( y (0) t ) 2 d θ X t + ς β j. Why LASSO? Avoid overfitting. Large dataset compared to the sample size. Automatic model selection. j=1

46 Counterfactual estimation LASSO Catalog of hypotheses Design Let Σ 1 T1 T 1 t=1 E(x tx t ) and S 0 = {i : θ 0,1 0} (set of non-zero parameters). There exists a constant ψ 0 > 0 such that θ[s 0 ] 2 1 θσθs 0 ψ0 2, for all θ[s c 0 ] 1 3 θ[s 0 ] 1.

47 Counterfactual estimation LASSO Catalog of hypotheses Design Let Σ 1 T1 T 1 t=1 E(x tx t ) and S 0 = {i : θ 0,1 0} (set of non-zero parameters). There exists a constant ψ 0 > 0 such that θ[s 0 ] 2 1 θσθs 0 ψ0 2, for all θ[s c 0 ] 1 3 θ[s 0 ] 1. Compatibility condition of Bülhmann and van der Geer (2011).

48 Counterfactual estimation LASSO Catalog of hypotheses Design Let Σ 1 T1 T 1 t=1 E(x tx t ) and S 0 = {i : θ 0,1 0} (set of non-zero parameters). There exists a constant ψ 0 > 0 such that θ[s 0 ] 2 1 θσθs 0 ψ0 2, for all θ[s c 0 ] 1 3 θ[s 0 ] 1. Compatibility condition of Bülhmann and van der Geer (2011). Similar to the restriction of the smallest eigenvalue of Σ.

49 Counterfactual estimation LASSO Catalog of hypotheses Design Let Σ 1 T1 T 1 t=1 E(x tx t ) and S 0 = {i : θ 0,1 0} (set of non-zero parameters). There exists a constant ψ 0 > 0 such that θ[s 0 ] 2 1 θσθs 0 ψ0 2, for all θ[s c 0 ] 1 3 θ[s 0 ] 1. Compatibility condition of Bülhmann and van der Geer (2011). Similar to the restriction of the smallest eigenvalue of Σ. Important for prediction consistency and l 1 -consistency of the LASSO.

50 Counterfactual estimation LASSO Catalog of hypotheses Heterogeneity and dynamics Let w t (ν t, x t ), then: (a) {w t } is strong mixing with α(m) = exp( cm) for some c c > 0 (b) E w it 2γ+δ c γ for some γ > 2 and δ > 0 for all 1 i d, 1 t T and T 1, (c) E(νt 2 ) ϵ > 0, for all 1 t T and T 1.

51 Counterfactual estimation LASSO Catalog of hypotheses Heterogeneity and dynamics Let w t (ν t, x t ), then: (a) {w t } is strong mixing with α(m) = exp( cm) for some c c > 0 (b) E w it 2γ+δ c γ for some γ > 2 and δ > 0 for all 1 i d, 1 t T and T 1, (c) E(νt 2 ) ϵ > 0, for all 1 t T and T 1. w t is an α-mixing process with exponential decay.

52 Counterfactual estimation LASSO Catalog of hypotheses Heterogeneity and dynamics Let w t (ν t, x t ), then: (a) {w t } is strong mixing with α(m) = exp( cm) for some c c > 0 (b) E w it 2γ+δ c γ for some γ > 2 and δ > 0 for all 1 i d, 1 t T and T 1, (c) E(νt 2 ) ϵ > 0, for all 1 t T and T 1. w t is an α-mixing process with exponential decay. Part (b) bounds uniformly some higher moments Law of Large Numbers.

53 Counterfactual estimation LASSO Catalog of hypotheses Heterogeneity and dynamics Let w t (ν t, x t ), then: (a) {w t } is strong mixing with α(m) = exp( cm) for some c c > 0 (b) E w it 2γ+δ c γ for some γ > 2 and δ > 0 for all 1 i d, 1 t T and T 1, (c) E(νt 2 ) ϵ > 0, for all 1 t T and T 1. w t is an α-mixing process with exponential decay. Part (b) bounds uniformly some higher moments Law of Large Numbers. Part (c) Sufficient condition for the Central Limit Theorem.

54 Counterfactual estimation LASSO Catalog of hypotheses Regularity ( ) (a) ς = O d 1/γ T (b) s 0 d 2/γ T = o(1)

55 Counterfactual estimation LASSO Catalog of hypotheses Regularity ( ) (a) ς = O d 1/γ T (b) s 0 d 2/γ T = o(1) Part (a) puts discipline on the growth rate of the regularization parameter.

56 Counterfactual estimation LASSO Catalog of hypotheses Regularity ( ) (a) ς = O d 1/γ T (b) s 0 d 2/γ T = o(1) Part (a) puts discipline on the growth rate of the regularization parameter. Part (b) bounds the number of (total/relevant) parameters.

57 Counterfactual estimation LASSO Catalog of hypotheses Regularity ( ) (a) ς = O d 1/γ T (b) s 0 d 2/γ T = o(1) Part (a) puts discipline on the growth rate of the regularization parameter. Part (b) bounds the number of (total/relevant) parameters. Both conditions can be relaxed if normality is assumed.

58 Counterfactual estimation LASSO Results Consistency and Asymptotic Normality Let M be the model defined as before, whose parameters are estimated by LASSO, then under previous assumptions and as T : sup [ 1/2 sup P P TΩ P P a R q T ( ] T T ) a Φ(a) 0, where Ω T is defined in the previous proposition and Φ( ) is the cumulative distribution function of a zero-mean normal random vector with identity covariance matrix. The inequality is defined element-wise.

59 Counterfactual estimation LASSO Results Uniform Confidence Interval Let Ω T be a consistent estimator for Ω T uniformly in P P. Under the same conditions as before: [ I α j,t ± ω ] j Φ 1 (1 α/2) T for each j = 1,..., q, where ω j = [ Ω] jj and Φ 1 ( ) is the quantile function of a standard normal distribution. I α is uniformly valid (honest) in the sense that for a given ϵ > 0, there exists a T ϵ such that for all T > T ϵ : sup P P ( j,t I α ) (1 α) < ϵ. P P

60 Counterfactual estimation LASSO Results Uniform Hypothesis Test Let Ω T be a consistent estimator for Ω T uniformly in P P. Under the same conditions as before, for a given ϵ > 0, there exists a T ϵ such that for all T > T ϵ : sup P P (W T c α ) (1 α) < ϵ, P P where W T T T T, P(χ 2 q c α ) = 1 α and χ 2 q is a chi-square distributed random variable with q degrees of freedom. Ω 1 T

61 Additional Results 1. Tests under unknown intervention date. Results are not altered when the intervention date is estimated. λ is super-consistent.

62 Additional Results 1. Tests under unknown intervention date. Results are not altered when the intervention date is estimated. λ is super-consistent. 2. Multiple interventions.

63 Additional Results 1. Tests under unknown intervention date. Results are not altered when the intervention date is estimated. λ is super-consistent. 2. Multiple interventions. 3. Test for the untreated peers and /or treated unit.

64 Additional Results 1. Tests under unknown intervention date. Results are not altered when the intervention date is estimated. λ is super-consistent. 2. Multiple interventions. 3. Test for the untreated peers and /or treated unit. 4. The framework is extended to conduct joint test (e.g., affecting the mean and the variance simultaneously)

65 Additional Results 1. Tests under unknown intervention date. Results are not altered when the intervention date is estimated. λ is super-consistent. 2. Multiple interventions. 3. Test for the untreated peers and /or treated unit. 4. The framework is extended to conduct joint test (e.g., affecting the mean and the variance simultaneously) 5. R package: Fonseca, Masini, Medeiros, and Vasconcelos (2017a)

66 Additional Results 1. Tests under unknown intervention date. Results are not altered when the intervention date is estimated. λ is super-consistent. 2. Multiple interventions. 3. Test for the untreated peers and /or treated unit. 4. The framework is extended to conduct joint test (e.g., affecting the mean and the variance simultaneously) 5. R package: Fonseca, Masini, Medeiros, and Vasconcelos (2017a) 6. Carvalho, Masini and Medeiros (2017): Integrated data

67 Additional Results 1. Tests under unknown intervention date. Results are not altered when the intervention date is estimated. λ is super-consistent. 2. Multiple interventions. 3. Test for the untreated peers and /or treated unit. 4. The framework is extended to conduct joint test (e.g., affecting the mean and the variance simultaneously) 5. R package: Fonseca, Masini, Medeiros, and Vasconcelos (2017a) 6. Carvalho, Masini and Medeiros (2017): Integrated data 7. Fonseca, Masini, Medeiros, and Vasconcelos (2017b): Random Forests and other machine learning techniques.

68 Monte Carlo results Data Generating Process (DGP) Consider the following model for i {1,..., n} and t 1: where: ε it = Λ i f t + η it, z (0) it = ρa i z (0) it 1 + ε it, f t = [1, (t/t) φ, v t ], z it R q, ρ [0, 1), φ > 0, A i (q q) is a diagonal matrix with diagonal elements strictly between 1 and 1, iid N(0, 1), η it N(0, rf 2I nq), and Λ i is a (q 3) matrix of factor loadings. v t iid

69 Monte Carlo results Rejection Rates under the Null (1/2) Bias Var ŝ 0 α = Innovation Distribution T = 100, d = 100, s 0 = 5, φ = 0, ρ = 0 Normal χ 2 (1) t-stud(3) Mixed-Normal Sample Size normal dist., d = 100, s 0 = 5, φ = 0, ρ = 0 T = Number of Total Covariates normal dist., T = 100, s 0 = 5, φ = 0, ρ = 0 d =

70 Monte Carlo results Rejection Rates under the Null (2/2) Bias Var ŝ 0 α = Number of Relevant (non-zero) Covariates normal dist., T = 100, d = 100, φ = 0, ρ = 0 s 0 = Determinist Trend (t/t) φ normal dist., T = 100, d = 100, s 0 = 5, ρ = 0 φ = Serial Correlation normal dist., T = 100, d = 100, s 0 = 5, φ = 0 ρ =

71 Monte Carlo results Rejection rates under the alternative α = Step Intervention δ t = c σ 1 1{t T 0 } c = Linear Increasing δ t = c σ 1 t T 0 +1 T T {t T 0} c = T t+1 Linear Decreasing δ t = c σ 1 T T {t T 0} c =

72 Monte Carlo results Horse race (1/2) BA SC DiD* DiD GM* GM ArCo* ArCo No Time Trend (φ = 0) and No Serial Correlation (ρ = 0) Bias Var MSE No Time Trend (φ = 0) Bias Var MSE Common Linear Time Trend (φ = 1) Bias Var MSE Idiosyncratic Linear Time Trend (φ = 1) Bias Var MSE

73 Monte Carlo results Horse race (2/2) BA SC DiD* DiD GM* GM ArCo* ArCo Common Quadratic Time Trend (φ = 2) Bias Var MSE Idiosyncratic Quadratic Time Trend (φ = 2) Bias Var MSE

74 Kernel Normal Monte Carlo results No trend and no serial correlation BA SC DiD* DiD GM* GM ArCo* ArCo

75 Kernel Normal Monte Carlo results No trend BA SC DiD* DiD GM* GM ArCo* ArCo

76 Kernel Normal Monte Carlo results Common linear trend BA SC DiD* DiD GM* GM ArCo* ArCo

77 Kernel Normal Monte Carlo results Heterogeneous linear trend BA SC DiD* DiD GM* GM ArCo* ArCo

78 Kernel Normal Monte Carlo results Common quadratic trend BA SC DiD* DiD GM* GM ArCo* ArCo

79 Kernel Normal Monte Carlo results Heterogeneous quadratic trend BA SC DiD* DiD GM* GM ArCo* ArCo

80 Application In October 2007, the state government of São Paulo in Brazil implemented a anti tax evasion scheme called Nota Fiscal Paulista (NFP).

81 Application In October 2007, the state government of São Paulo in Brazil implemented a anti tax evasion scheme called Nota Fiscal Paulista (NFP). The NFP consists of a tax rebate from a state tax named ICMS (tax on circulation of products and services).

82 Application In October 2007, the state government of São Paulo in Brazil implemented a anti tax evasion scheme called Nota Fiscal Paulista (NFP). The NFP consists of a tax rebate from a state tax named ICMS (tax on circulation of products and services). Incentive to the consumer to ask for sales receipts.

83 Application In October 2007, the state government of São Paulo in Brazil implemented a anti tax evasion scheme called Nota Fiscal Paulista (NFP). The NFP consists of a tax rebate from a state tax named ICMS (tax on circulation of products and services). Incentive to the consumer to ask for sales receipts. Additionally, the registered sales receipts give the consumer the right to participate in monthly lotteries promoted by the government.

84 Application Under the premisses that: 1. a certain degree of tax evasion was occurring before the intervention, 2. the sellers has some degree of market power and 3. the penalty for tax-evasion is large enough to alter the seller behaviour, one is expected to see an upwards movements in prices due to an increase in marginal cost. Hence, we would like to investigate whether the NFP had an impact on consumer prices in São Paulo.

85 Application The NFP was not implemented throughout the sectors in the economy at once.

86 Application The NFP was not implemented throughout the sectors in the economy at once. The first sector were restaurants, followed by bakeries, bar and other food service retailers.

87 Application The NFP was not implemented throughout the sectors in the economy at once. The first sector were restaurants, followed by bakeries, bar and other food service retailers. The sample then consists of monthly inflation (food outside home) index for 10 metropolitan areas including São Paulo from Jan-05 to Sep-09.

88 Application 1200 Dec 07 Jan 08 Feb 08 Mar 08 Apr 08 May 08 Jun 08 Jul 08 Aug 08 Sep 08 Oct 08 Nov 08 Dec 08 Jan 09 Feb 09 Mar 09 Apr 09 May 09 Jun 09 Jul 09 Aug 09 Sep 09 # of participants (millions) Distributed Value (millions R$)

89 Application Panel (a): ArCo Estimates (1) (2) (3) (4) (5) (6) (7) (0.1704) (0.1486) (0.1432) (0.1480) (0.2010) (0.1600) (0.1539) Inflation Yes No No No Yes Yes Yes GDP No Yes No No Yes Yes Yes Retail Sales No No Yes No No Yes Yes Credit No No No Yes No No Yes R-squared d s T T DiD (0.1466) GM (0.1234) Panel (b): Alternative Estimates (1) (2) (3) (4) (5) (6) (0.1456) (0.1467) (0.1556) (0.1457) (0.1466) (0.1243) (0.1246) (0.1227) (0.1228) GDP Yes No No Yes Yes No Retail Sales No Yes No Yes Yes No Credit No No Yes No Yes No

90 Application ArCo estimates: CPI inflation (food outside home) Jan-05 Jan-06 Jan-07 Jan-08 Jan-09 Pre-intervention fit Counterfactual Actual index

91 Application ArCo estimates: CPI (food outside home) Jan-05 Jan-06 Jan-07 Jan-08 Jan-09 Pre-intervention fit Counterfactual Actual index

92 Concluding remarks Important to note that the ArCo is more than just another structural break estimator. By controlling for common shocks that might have occurred in all units after the intervention, it provides a effective methodology to isolate the effect of the intervention of interest. Providing compelling evidence for the causality of the structural break observed in the unit of interest It is almost a model-free methodology. It implies no structure in the stationary process (no # of lags to be determined nor # MA terms)

TEXTO PARA DISCUSSÃO. No ARCO: an artificial counterfactual approach for high-dimensional panel time-series data

TEXTO PARA DISCUSSÃO. No ARCO: an artificial counterfactual approach for high-dimensional panel time-series data EXO PARA DISCUSSÃO No. 653 ARCO: an artificial counterfactual approach for high-dimensional panel time-series data Carlos C. Carvalho Ricardo Masini Marcelo C. Medeiros DEPARAMENO DE ECONOMIA www.econ.puc-rio.br

More information

Intervention Impact Evaluation on Aggregated Data: The Artificial Counterfactual Approach for Stationary Processes

Intervention Impact Evaluation on Aggregated Data: The Artificial Counterfactual Approach for Stationary Processes Intervention Impact Evaluation on Aggregated Data: he Artificial Counterfactual Approach for Stationary Processes Carlos V. Carvalho, Ricardo Masini and Marcelo C. Medeiros January 30, 2015 Abstract In

More information

Revisiting the Synthetic Control Estimator

Revisiting the Synthetic Control Estimator Revisiting the Synthetic Control Estimator Bruno Ferman Cristine Pinto Sao Paulo School of Economics - FGV First Draft: June, 26 This Draft: October, 27 Please click here for the most recent version Abstract

More information

Revisiting the Synthetic Control Estimator

Revisiting the Synthetic Control Estimator MPRA Munich Personal RePEc Archive Revisiting the Synthetic Control Estimator Bruno Ferman and Cristine Pinto Sao Paulo School of Economics - FGV 23 November 2016 Online at https://mpra.ub.uni-muenchen.de/77930/

More information

Revisiting the Synthetic Control Estimator

Revisiting the Synthetic Control Estimator Revisiting the Synthetic Control Estimator Bruno Ferman Cristine Pinto Sao Paulo School of Economics - FGV First Draft: June, 2016 This Draft: August, 2017 Please click here for the most recent version

More information

ArCo: An R package to Estimate Artificial Counterfactuals by Yuri R. Fonseca, Ricardo P. Masini, Marcelo C. Medeiros and Gabriel F. R.

ArCo: An R package to Estimate Artificial Counterfactuals by Yuri R. Fonseca, Ricardo P. Masini, Marcelo C. Medeiros and Gabriel F. R. CONTRIBUTED RESEARCH ARTICLE 91 ArCo: An R package to Estimate Artificial Counterfactuals by Yuri R. Fonseca, Ricardo P. Masini, Marcelo C. Medeiros and Gabriel F. R. Vasconcelos Abstract In this paper

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University

More information

Outside the Box: Using Synthetic Control Methods as a Forecasting Technique

Outside the Box: Using Synthetic Control Methods as a Forecasting Technique Outside the Box: Using Synthetic Control Methods as a Forecasting Technique Stefan Klößner (Saarland University) Gregor Pfeifer (University of Hohenheim) October 6, 2016 Abstract We introduce synthetic

More information

Comparing Forecast Accuracy of Different Models for Prices of Metal Commodities

Comparing Forecast Accuracy of Different Models for Prices of Metal Commodities Comparing Forecast Accuracy of Different Models for Prices of Metal Commodities João Victor Issler (FGV) and Claudia F. Rodrigues (VALE) August, 2012 J.V. Issler and C.F. Rodrigues () Forecast Models for

More information

Inference for Factor Model Based Average Treatment

Inference for Factor Model Based Average Treatment Inference for Factor Model Based Average Treatment Effects Kathleen T. Li Marketing Department The Wharton School, the University of Pennsylvania January, 208 Abstract In this paper we consider the problem

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Princeton University Asian Political Methodology Conference University of Sydney Joint

More information

Applied Microeconometrics (L5): Panel Data-Basics

Applied Microeconometrics (L5): Panel Data-Basics Applied Microeconometrics (L5): Panel Data-Basics Nicholas Giannakopoulos University of Patras Department of Economics ngias@upatras.gr November 10, 2015 Nicholas Giannakopoulos (UPatras) MSc Applied Economics

More information

Non-linear panel data modeling

Non-linear panel data modeling Non-linear panel data modeling Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini May 2010 Laura Magazzini (@univr.it) Non-linear panel data modeling May 2010 1

More information

Approximating Fixed-Horizon Forecasts Using Fixed-Event Forecasts

Approximating Fixed-Horizon Forecasts Using Fixed-Event Forecasts Approximating Fixed-Horizon Forecasts Using Fixed-Event Forecasts Malte Knüppel and Andreea L. Vladu Deutsche Bundesbank 9th ECB Workshop on Forecasting Techniques 4 June 216 This work represents the authors

More information

On the maximum and minimum response to an impulse in Svars

On the maximum and minimum response to an impulse in Svars On the maximum and minimum response to an impulse in Svars Bulat Gafarov (PSU), Matthias Meier (UBonn), and José-Luis Montiel-Olea (NYU) September 23, 2015 1 / 58 Introduction Structural VAR: Theoretical

More information

Central Bank of Chile October 29-31, 2013 Bruce Hansen (University of Wisconsin) Structural Breaks October 29-31, / 91. Bruce E.

Central Bank of Chile October 29-31, 2013 Bruce Hansen (University of Wisconsin) Structural Breaks October 29-31, / 91. Bruce E. Forecasting Lecture 3 Structural Breaks Central Bank of Chile October 29-31, 2013 Bruce Hansen (University of Wisconsin) Structural Breaks October 29-31, 2013 1 / 91 Bruce E. Hansen Organization Detection

More information

Short T Panels - Review

Short T Panels - Review Short T Panels - Review We have looked at methods for estimating parameters on time-varying explanatory variables consistently in panels with many cross-section observation units but a small number of

More information

Assessing the eect of 2008 crisis on Brazilian Economy.

Assessing the eect of 2008 crisis on Brazilian Economy. Assessing the eect of 2008 crisis on Brazilian Economy. Emerson Fernandes Marçal Ronan Cunha Giovanni Merlin Oscar Simões CEMAP-EESP-FGV December 5, 2016 Talking points Motivation Objectives Literature

More information

TIME SERIES ANALYSIS AND FORECASTING USING THE STATISTICAL MODEL ARIMA

TIME SERIES ANALYSIS AND FORECASTING USING THE STATISTICAL MODEL ARIMA CHAPTER 6 TIME SERIES ANALYSIS AND FORECASTING USING THE STATISTICAL MODEL ARIMA 6.1. Introduction A time series is a sequence of observations ordered in time. A basic assumption in the time series analysis

More information

Chapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE

Chapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE Chapter 6. Panel Data Joan Llull Quantitative Statistical Methods II Barcelona GSE Introduction Chapter 6. Panel Data 2 Panel data The term panel data refers to data sets with repeated observations over

More information

Vector Auto-Regressive Models

Vector Auto-Regressive Models Vector Auto-Regressive Models Laurent Ferrara 1 1 University of Paris Nanterre M2 Oct. 2018 Overview of the presentation 1. Vector Auto-Regressions Definition Estimation Testing 2. Impulse responses functions

More information

VAR Models and Applications

VAR Models and Applications VAR Models and Applications Laurent Ferrara 1 1 University of Paris West M2 EIPMC Oct. 2016 Overview of the presentation 1. Vector Auto-Regressions Definition Estimation Testing 2. Impulse responses functions

More information

= observed volume on day l for bin j = base volume in jth bin, and = residual error, assumed independent with mean zero.

= observed volume on day l for bin j = base volume in jth bin, and = residual error, assumed independent with mean zero. QB research September 4, 06 Page -Minute Bin Volume Forecast Model Overview In response to strong client demand, Quantitative Brokers (QB) has developed a new algorithm called Closer that specifically

More information

DEPARTMENT OF ECONOMICS

DEPARTMENT OF ECONOMICS ISSN 0819-64 ISBN 0 7340 616 1 THE UNIVERSITY OF MELBOURNE DEPARTMENT OF ECONOMICS RESEARCH PAPER NUMBER 959 FEBRUARY 006 TESTING FOR RATE-DEPENDENCE AND ASYMMETRY IN INFLATION UNCERTAINTY: EVIDENCE FROM

More information

Introduction to Forecasting

Introduction to Forecasting Introduction to Forecasting Introduction to Forecasting Predicting the future Not an exact science but instead consists of a set of statistical tools and techniques that are supported by human judgment

More information

GAMINGRE 8/1/ of 7

GAMINGRE 8/1/ of 7 FYE 09/30/92 JULY 92 0.00 254,550.00 0.00 0 0 0 0 0 0 0 0 0 254,550.00 0.00 0.00 0.00 0.00 254,550.00 AUG 10,616,710.31 5,299.95 845,656.83 84,565.68 61,084.86 23,480.82 339,734.73 135,893.89 67,946.95

More information

Approximating fixed-horizon forecasts using fixed-event forecasts

Approximating fixed-horizon forecasts using fixed-event forecasts Approximating fixed-horizon forecasts using fixed-event forecasts Comments by Simon Price Essex Business School June 2016 Redundant disclaimer The views in this presentation are solely those of the presenter

More information

Independent and conditionally independent counterfactual distributions

Independent and conditionally independent counterfactual distributions Independent and conditionally independent counterfactual distributions Marcin Wolski European Investment Bank M.Wolski@eib.org Society for Nonlinear Dynamics and Econometrics Tokyo March 19, 2018 Views

More information

Volume 03, Issue 6. Comparison of Panel Cointegration Tests

Volume 03, Issue 6. Comparison of Panel Cointegration Tests Volume 03, Issue 6 Comparison of Panel Cointegration Tests Deniz Dilan Karaman Örsal Humboldt University Berlin Abstract The main aim of this paper is to compare the size and size-adjusted power properties

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University Joint

More information

A look into the factor model black box Publication lags and the role of hard and soft data in forecasting GDP

A look into the factor model black box Publication lags and the role of hard and soft data in forecasting GDP A look into the factor model black box Publication lags and the role of hard and soft data in forecasting GDP Marta Bańbura and Gerhard Rünstler Directorate General Research European Central Bank November

More information

Cross-Validating Synthetic Controls

Cross-Validating Synthetic Controls Cross-Validating Synthetic Controls Martin Becker, Saarland University Stefan Klößner, Saarland University Gregor Pfeifer, University of Hohenheim October 12, 2017 Abstract While the literature on synthetic

More information

An exact and robust conformal inference method for counterfactual and synthetic controls

An exact and robust conformal inference method for counterfactual and synthetic controls An exact and robust conformal inference method for counterfactual and synthetic controls Victor Chernozhukov Kaspar Wüthrich Yu Zhu The Institute for Fiscal Studies Department of Economics, UCL cemmap

More information

Identifying Aggregate Liquidity Shocks with Monetary Policy Shocks: An Application using UK Data

Identifying Aggregate Liquidity Shocks with Monetary Policy Shocks: An Application using UK Data Identifying Aggregate Liquidity Shocks with Monetary Policy Shocks: An Application using UK Data Michael Ellington and Costas Milas Financial Services, Liquidity and Economic Activity Bank of England May

More information

What s New in Econometrics? Lecture 14 Quantile Methods

What s New in Econometrics? Lecture 14 Quantile Methods What s New in Econometrics? Lecture 14 Quantile Methods Jeff Wooldridge NBER Summer Institute, 2007 1. Reminders About Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile Regression

More information

Final Overview. Introduction to ML. Marek Petrik 4/25/2017

Final Overview. Introduction to ML. Marek Petrik 4/25/2017 Final Overview Introduction to ML Marek Petrik 4/25/2017 This Course: Introduction to Machine Learning Build a foundation for practice and research in ML Basic machine learning concepts: max likelihood,

More information

An economic application of machine learning: Nowcasting Thai exports using global financial market data and time-lag lasso

An economic application of machine learning: Nowcasting Thai exports using global financial market data and time-lag lasso An economic application of machine learning: Nowcasting Thai exports using global financial market data and time-lag lasso PIER Exchange Nov. 17, 2016 Thammarak Moenjak What is machine learning? Wikipedia

More information

Introduction to Econometrics

Introduction to Econometrics Introduction to Econometrics T H I R D E D I T I O N Global Edition James H. Stock Harvard University Mark W. Watson Princeton University Boston Columbus Indianapolis New York San Francisco Upper Saddle

More information

Comprehensive Examination Quantitative Methods Spring, 2018

Comprehensive Examination Quantitative Methods Spring, 2018 Comprehensive Examination Quantitative Methods Spring, 2018 Instruction: This exam consists of three parts. You are required to answer all the questions in all the parts. 1 Grading policy: 1. Each part

More information

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data Panel data Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data - possible to control for some unobserved heterogeneity - possible

More information

New Developments in Econometrics Lecture 16: Quantile Estimation

New Developments in Econometrics Lecture 16: Quantile Estimation New Developments in Econometrics Lecture 16: Quantile Estimation Jeff Wooldridge Cemmap Lectures, UCL, June 2009 1. Review of Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile

More information

IV Quantile Regression for Group-level Treatments, with an Application to the Distributional Effects of Trade

IV Quantile Regression for Group-level Treatments, with an Application to the Distributional Effects of Trade IV Quantile Regression for Group-level Treatments, with an Application to the Distributional Effects of Trade Denis Chetverikov Brad Larsen Christopher Palmer UCLA, Stanford and NBER, UC Berkeley September

More information

Difference-in-Differences Estimation

Difference-in-Differences Estimation Difference-in-Differences Estimation Jeff Wooldridge Michigan State University Programme Evaluation for Policy Analysis Institute for Fiscal Studies June 2012 1. The Basic Methodology 2. How Should We

More information

Interpreting Regression Results

Interpreting Regression Results Interpreting Regression Results Carlo Favero Favero () Interpreting Regression Results 1 / 42 Interpreting Regression Results Interpreting regression results is not a simple exercise. We propose to split

More information

Identifying Causal Effects in Time Series Models

Identifying Causal Effects in Time Series Models Identifying Causal Effects in Time Series Models Aaron Smith UC Davis October 20, 2015 Slides available at http://asmith.ucdavis.edu 1 Identifying Causal Effects in Time Series Models If all the Metrics

More information

Department of Economics, UCSB UC Santa Barbara

Department of Economics, UCSB UC Santa Barbara Department of Economics, UCSB UC Santa Barbara Title: Past trend versus future expectation: test of exchange rate volatility Author: Sengupta, Jati K., University of California, Santa Barbara Sfeir, Raymond,

More information

Identifying the Monetary Policy Shock Christiano et al. (1999)

Identifying the Monetary Policy Shock Christiano et al. (1999) Identifying the Monetary Policy Shock Christiano et al. (1999) The question we are asking is: What are the consequences of a monetary policy shock a shock which is purely related to monetary conditions

More information

arxiv: v3 [econ.em] 17 Apr 2018

arxiv: v3 [econ.em] 17 Apr 2018 An Exact and Robust Conformal Inference Method for Counterfactual and Synthetic Controls Victor Chernozhukov Kaspar Wüthrich Yinchu Zhu arxiv:1712.09089v3 [econ.em] 17 Apr 2018 April 16, 2018 Abstract

More information

New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation

New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation Jeff Wooldridge Cemmap Lectures, UCL, June 2009 1. The Basic Methodology 2. How Should We View Uncertainty in DD Settings?

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 6 Jakub Mućk Econometrics of Panel Data Meeting # 6 1 / 36 Outline 1 The First-Difference (FD) estimator 2 Dynamic panel data models 3 The Anderson and Hsiao

More information

CONTAGION VERSUS FLIGHT TO QUALITY IN FINANCIAL MARKETS

CONTAGION VERSUS FLIGHT TO QUALITY IN FINANCIAL MARKETS EVA IV, CONTAGION VERSUS FLIGHT TO QUALITY IN FINANCIAL MARKETS Jose Olmo Department of Economics City University, London (joint work with Jesús Gonzalo, Universidad Carlos III de Madrid) 4th Conference

More information

Single Equation Linear GMM with Serially Correlated Moment Conditions

Single Equation Linear GMM with Serially Correlated Moment Conditions Single Equation Linear GMM with Serially Correlated Moment Conditions Eric Zivot October 28, 2009 Univariate Time Series Let {y t } be an ergodic-stationary time series with E[y t ]=μ and var(y t )

More information

Review of Classical Least Squares. James L. Powell Department of Economics University of California, Berkeley

Review of Classical Least Squares. James L. Powell Department of Economics University of California, Berkeley Review of Classical Least Squares James L. Powell Department of Economics University of California, Berkeley The Classical Linear Model The object of least squares regression methods is to model and estimate

More information

Econometric Analysis of Cross Section and Panel Data

Econometric Analysis of Cross Section and Panel Data Econometric Analysis of Cross Section and Panel Data Jeffrey M. Wooldridge / The MIT Press Cambridge, Massachusetts London, England Contents Preface Acknowledgments xvii xxiii I INTRODUCTION AND BACKGROUND

More information

Shortfalls of Panel Unit Root Testing. Jack Strauss Saint Louis University. And. Taner Yigit Bilkent University. Abstract

Shortfalls of Panel Unit Root Testing. Jack Strauss Saint Louis University. And. Taner Yigit Bilkent University. Abstract Shortfalls of Panel Unit Root Testing Jack Strauss Saint Louis University And Taner Yigit Bilkent University Abstract This paper shows that (i) magnitude and variation of contemporaneous correlation are

More information

BUSI 460 Suggested Answers to Selected Review and Discussion Questions Lesson 7

BUSI 460 Suggested Answers to Selected Review and Discussion Questions Lesson 7 BUSI 460 Suggested Answers to Selected Review and Discussion Questions Lesson 7 1. The definitions follow: (a) Time series: Time series data, also known as a data series, consists of observations on a

More information

Oil price and macroeconomy in Russia. Abstract

Oil price and macroeconomy in Russia. Abstract Oil price and macroeconomy in Russia Katsuya Ito Fukuoka University Abstract In this note, using the VEC model we attempt to empirically investigate the effects of oil price and monetary shocks on the

More information

1 Estimation of Persistent Dynamic Panel Data. Motivation

1 Estimation of Persistent Dynamic Panel Data. Motivation 1 Estimation of Persistent Dynamic Panel Data. Motivation Consider the following Dynamic Panel Data (DPD) model y it = y it 1 ρ + x it β + µ i + v it (1.1) with i = {1, 2,..., N} denoting the individual

More information

Single Equation Linear GMM with Serially Correlated Moment Conditions

Single Equation Linear GMM with Serially Correlated Moment Conditions Single Equation Linear GMM with Serially Correlated Moment Conditions Eric Zivot November 2, 2011 Univariate Time Series Let {y t } be an ergodic-stationary time series with E[y t ]=μ and var(y t )

More information

Stability-based generation of scenario trees for multistage stochastic programs

Stability-based generation of scenario trees for multistage stochastic programs Stability-based generation of scenario trees for multistage stochastic programs H. Heitsch and W. Römisch Humboldt-University Berlin Institute of Mathematics 10099 Berlin, Germany http://www.math.hu-berlin.de/~romisch

More information

Gov 2002: 9. Differences in Differences

Gov 2002: 9. Differences in Differences Gov 2002: 9. Differences in Differences Matthew Blackwell October 30, 2015 1 / 40 1. Basic differences-in-differences model 2. Conditional DID 3. Standard error issues 4. Other DID approaches 2 / 40 Where

More information

Backtesting Marginal Expected Shortfall and Related Systemic Risk Measures

Backtesting Marginal Expected Shortfall and Related Systemic Risk Measures Backtesting Marginal Expected Shortfall and Related Systemic Risk Measures Denisa Banulescu 1 Christophe Hurlin 1 Jérémy Leymarie 1 Olivier Scaillet 2 1 University of Orleans 2 University of Geneva & Swiss

More information

LIKELIHOOD RATIO INFERENCE FOR MISSING DATA MODELS

LIKELIHOOD RATIO INFERENCE FOR MISSING DATA MODELS LIKELIHOOD RATIO IFERECE FOR MISSIG DATA MODELS KARU ADUSUMILLI AD TAISUKE OTSU Abstract. Missing or incomplete outcome data is a ubiquitous problem in biomedical and social sciences. Under the missing

More information

Testing Random Effects in Two-Way Spatial Panel Data Models

Testing Random Effects in Two-Way Spatial Panel Data Models Testing Random Effects in Two-Way Spatial Panel Data Models Nicolas Debarsy May 27, 2010 Abstract This paper proposes an alternative testing procedure to the Hausman test statistic to help the applied

More information

Statistical Inference for Average Treatment Effects Estimated by Synthetic Control Methods

Statistical Inference for Average Treatment Effects Estimated by Synthetic Control Methods Statistical Inference for Average Treatment Effects Estimated by Synthetic Control Methods Kathleen T. Li Marketing Department The Wharton School, the University of Pennsylvania March 2018 Abstract The

More information

Review. DS GA 1002 Statistical and Mathematical Models. Carlos Fernandez-Granda

Review. DS GA 1002 Statistical and Mathematical Models.   Carlos Fernandez-Granda Review DS GA 1002 Statistical and Mathematical Models http://www.cims.nyu.edu/~cfgranda/pages/dsga1002_fall16 Carlos Fernandez-Granda Probability and statistics Probability: Framework for dealing with

More information

Predicting bond returns using the output gap in expansions and recessions

Predicting bond returns using the output gap in expansions and recessions Erasmus university Rotterdam Erasmus school of economics Bachelor Thesis Quantitative finance Predicting bond returns using the output gap in expansions and recessions Author: Martijn Eertman Studentnumber:

More information

Vector autoregressions, VAR

Vector autoregressions, VAR 1 / 45 Vector autoregressions, VAR Chapter 2 Financial Econometrics Michael Hauser WS17/18 2 / 45 Content Cross-correlations VAR model in standard/reduced form Properties of VAR(1), VAR(p) Structural VAR,

More information

Model Selection and Geometry

Model Selection and Geometry Model Selection and Geometry Pascal Massart Université Paris-Sud, Orsay Leipzig, February Purpose of the talk! Concentration of measure plays a fundamental role in the theory of model selection! Model

More information

Econometría 2: Análisis de series de Tiempo

Econometría 2: Análisis de series de Tiempo Econometría 2: Análisis de series de Tiempo Karoll GOMEZ kgomezp@unal.edu.co http://karollgomez.wordpress.com Segundo semestre 2016 IX. Vector Time Series Models VARMA Models A. 1. Motivation: The vector

More information

The Power of the Synthetic Control Method

The Power of the Synthetic Control Method Working papers in transport, tourism, information technology and microdata analysis The Power of the Synthetic Control Method Kenneth Carling Yujiao Li Editor: Hasan Fleyeh Nr: 2016:10 Working papers in

More information

abstract sergio firpo and vítor possebom april 2 nd, 2016

abstract sergio firpo and vítor possebom april 2 nd, 2016 SY N T H E T I C C O N T R O L E S T I M AT O R : A G E N E R A L I Z E D I N F E R E N C E P R O C E D U R E A N D C O N F I D E N C E S E T S * sergio firpo and vítor possebom april 2 nd, 2016 abstract

More information

Generated Covariates in Nonparametric Estimation: A Short Review.

Generated Covariates in Nonparametric Estimation: A Short Review. Generated Covariates in Nonparametric Estimation: A Short Review. Enno Mammen, Christoph Rothe, and Melanie Schienle Abstract In many applications, covariates are not observed but have to be estimated

More information

Econometrics in a nutshell: Variation and Identification Linear Regression Model in STATA. Research Methods. Carlos Noton.

Econometrics in a nutshell: Variation and Identification Linear Regression Model in STATA. Research Methods. Carlos Noton. 1/17 Research Methods Carlos Noton Term 2-2012 Outline 2/17 1 Econometrics in a nutshell: Variation and Identification 2 Main Assumptions 3/17 Dependent variable or outcome Y is the result of two forces:

More information

Appendix A: The time series behavior of employment growth

Appendix A: The time series behavior of employment growth Unpublished appendices from The Relationship between Firm Size and Firm Growth in the U.S. Manufacturing Sector Bronwyn H. Hall Journal of Industrial Economics 35 (June 987): 583-606. Appendix A: The time

More information

Econometric Methods for Panel Data

Econometric Methods for Panel Data Based on the books by Baltagi: Econometric Analysis of Panel Data and by Hsiao: Analysis of Panel Data Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies

More information

The risk of machine learning

The risk of machine learning / 33 The risk of machine learning Alberto Abadie Maximilian Kasy July 27, 27 2 / 33 Two key features of machine learning procedures Regularization / shrinkage: Improve prediction or estimation performance

More information

Euro-indicators Working Group

Euro-indicators Working Group Euro-indicators Working Group Luxembourg, 9 th & 10 th June 2011 Item 9.4 of the Agenda New developments in EuroMIND estimates Rosa Ruggeri Cannata Doc 309/11 What is EuroMIND? EuroMIND is a Monthly INDicator

More information

A Significance Test for the Lasso

A Significance Test for the Lasso A Significance Test for the Lasso Lockhart R, Taylor J, Tibshirani R, and Tibshirani R Ashley Petersen May 14, 2013 1 Last time Problem: Many clinical covariates which are important to a certain medical

More information

Time-Varying Vector Autoregressive Models with Structural Dynamic Factors

Time-Varying Vector Autoregressive Models with Structural Dynamic Factors Time-Varying Vector Autoregressive Models with Structural Dynamic Factors Paolo Gorgi, Siem Jan Koopman, Julia Schaumburg http://sjkoopman.net Vrije Universiteit Amsterdam School of Business and Economics

More information

News Shocks: Different Effects in Boom and Recession?

News Shocks: Different Effects in Boom and Recession? News Shocks: Different Effects in Boom and Recession? Maria Bolboaca, Sarah Fischer University of Bern Study Center Gerzensee June 7, 5 / Introduction News are defined in the literature as exogenous changes

More information

Statistics for IT Managers

Statistics for IT Managers Statistics for IT Managers 95-796, Fall 2012 Module 2: Hypothesis Testing and Statistical Inference (5 lectures) Reading: Statistics for Business and Economics, Ch. 5-7 Confidence intervals Given the sample

More information

Limited Dependent Variables and Panel Data

Limited Dependent Variables and Panel Data and Panel Data June 24 th, 2009 Structure 1 2 Many economic questions involve the explanation of binary variables, e.g.: explaining the participation of women in the labor market explaining retirement

More information

B y t = γ 0 + Γ 1 y t + ε t B(L) y t = γ 0 + ε t ε t iid (0, D) D is diagonal

B y t = γ 0 + Γ 1 y t + ε t B(L) y t = γ 0 + ε t ε t iid (0, D) D is diagonal Structural VAR Modeling for I(1) Data that is Not Cointegrated Assume y t =(y 1t,y 2t ) 0 be I(1) and not cointegrated. That is, y 1t and y 2t are both I(1) and there is no linear combination of y 1t and

More information

Machine learning, shrinkage estimation, and economic theory

Machine learning, shrinkage estimation, and economic theory Machine learning, shrinkage estimation, and economic theory Maximilian Kasy December 14, 2018 1 / 43 Introduction Recent years saw a boom of machine learning methods. Impressive advances in domains such

More information

Structural VAR Models and Applications

Structural VAR Models and Applications Structural VAR Models and Applications Laurent Ferrara 1 1 University of Paris Nanterre M2 Oct. 2018 SVAR: Objectives Whereas the VAR model is able to capture efficiently the interactions between the different

More information

Gravity Models, PPML Estimation and the Bias of the Robust Standard Errors

Gravity Models, PPML Estimation and the Bias of the Robust Standard Errors Gravity Models, PPML Estimation and the Bias of the Robust Standard Errors Michael Pfaffermayr August 23, 2018 Abstract In gravity models with exporter and importer dummies the robust standard errors of

More information

Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Econometrics Week 4 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 23 Recommended Reading For the today Serial correlation and heteroskedasticity in

More information

Clustering as a Design Problem

Clustering as a Design Problem Clustering as a Design Problem Alberto Abadie, Susan Athey, Guido Imbens, & Jeffrey Wooldridge Harvard-MIT Econometrics Seminar Cambridge, February 4, 2016 Adjusting standard errors for clustering is common

More information

Exogeneity and Causality

Exogeneity and Causality Università di Pavia Exogeneity and Causality Eduardo Rossi University of Pavia Factorization of the density DGP: D t (x t χ t 1, d t ; Ψ) x t represent all the variables in the economy. The econometric

More information

Part II. Time Series

Part II. Time Series Part II Time Series 12 Introduction This Part is mainly a summary of the book of Brockwell and Davis (2002). Additionally the textbook Shumway and Stoffer (2010) can be recommended. 1 Our purpose is to

More information

Estimating Average Treatment Effects Using a Modified Synthetic Control Method: Theory and Applications

Estimating Average Treatment Effects Using a Modified Synthetic Control Method: Theory and Applications Estimating Average Treatment Effects Using a Modified Synthetic Control Method: Theory and Applications Kathleen T. Li Marketing Department The Wharton School, the University of Pennsylvania June 27, 2017

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 9 Jakub Mućk Econometrics of Panel Data Meeting # 9 1 / 22 Outline 1 Time series analysis Stationarity Unit Root Tests for Nonstationarity 2 Panel Unit Root

More information

Least Squares Estimation of a Panel Data Model with Multifactor Error Structure and Endogenous Covariates

Least Squares Estimation of a Panel Data Model with Multifactor Error Structure and Endogenous Covariates Least Squares Estimation of a Panel Data Model with Multifactor Error Structure and Endogenous Covariates Matthew Harding and Carlos Lamarche January 12, 2011 Abstract We propose a method for estimating

More information

Introduction to Algorithmic Trading Strategies Lecture 10

Introduction to Algorithmic Trading Strategies Lecture 10 Introduction to Algorithmic Trading Strategies Lecture 10 Risk Management Haksun Li haksun.li@numericalmethod.com www.numericalmethod.com Outline Value at Risk (VaR) Extreme Value Theory (EVT) References

More information

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Kosuke Imai Department of Politics Princeton University November 13, 2013 So far, we have essentially assumed

More information

Regional Policy Evaluation: Interactive Fixed Effects and Synthetic Controls

Regional Policy Evaluation: Interactive Fixed Effects and Synthetic Controls DISCUSSION PAPER SERIES IZA DP No. 7493 Regional Policy Evaluation: Interactive Fixed Effects and Synthetic Controls Laurent Gobillon Thierry Magnac July 2013 Forschungsinstitut zur Zukunft der Arbeit

More information

Testing Error Correction in Panel data

Testing Error Correction in Panel data University of Vienna, Dept. of Economics Master in Economics Vienna 2010 The Model (1) Westerlund (2007) consider the following DGP: y it = φ 1i + φ 2i t + z it (1) x it = x it 1 + υ it (2) where the stochastic

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Classical regression model b)

More information

Evaluating Value-at-Risk models via Quantile Regression

Evaluating Value-at-Risk models via Quantile Regression Evaluating Value-at-Risk models via Quantile Regression Luiz Renato Lima (University of Tennessee, Knoxville) Wagner Gaglianone, Oliver Linton, Daniel Smith. NASM-2009 05/31/2009 Motivation Recent nancial

More information