The Role of Social Comparison in Reducing Residential Water Consumption: Evidence from a Randomized Controlled Trial

Size: px
Start display at page:

Download "The Role of Social Comparison in Reducing Residential Water Consumption: Evidence from a Randomized Controlled Trial"

Transcription

1 The Role of Social Comparison in Reducing Residential Water Consumption: Evidence from a Randomized Controlled Trial Salvador Lurbé, Jesse Burkhardt, Dale T. Manning, Chris Goemans May 23, 2018 Abstract Water and Electricity utilities typically use a portfolio approach to manage demand, increasingly relying on non-pecuniary interventions to influence behavior. Despite demonstrated effectiveness, it remains unclear whether observed treatment effects in Home Water and Energy Report (HWER) studies are due to increased awareness of water use (e.g., simply receiving a letter in the mail) or due to social comparison (e.g., being categorized as "Great", "Good", or "Take Action" relative to one s peers). In light of this uncertainty, we investigate if household responses depend on the social comparison they receive. In this paper, we evaluate a randomized controlled trial (RCT) of a home water report (HWR) conducted by a city in the Western US. We estimate the average and heterogeneous treatment effect of the HWR on household water consumption after two years of treatment and, to assess whether responses depend on social comparison, we evaluate if being classified in different normative categories has an effect on the responses to the treatment. We find an average treatment effect (ATE) of 2.4% and, based on the quantiles of the consumption net of pre-treatment household fixed effects, we find that treatment effects are negative for all quantiles and significant across the whole distribution except for those consumers on the very top or the very bottom, with point estimates ranging from 0.0% for those consumers in the lower quantiles (more technically efficient) to 3.0% for those in the upper quantiles of the consumption distribution (less technically efficient). With a regression discontinuity analysis and with a triple difference-in-difference, we find that the categorization effect is not statistically significant. Department of Agricultural and Resource Economics, Colorado State University, 1200 Center Ave. Mall, Fort Collins, CO 80523, slurbe@colostate.edu. Department of Agricultural and Resource Economics, Colorado State University, 1200 Center Ave. Mall, Fort Collins, CO 80523, jesse.burkhardt@colostate.edu. Department of Agricultural and Resource Economics, Colorado State University, 1200 Center Ave. Mall, Fort Collins, CO 80523, dale.manning@colostate.edu. Department of Agricultural and Resource Economics, Colorado State University, 1200 Center Ave. Mall, Fort Collins, CO 80523, cgoemans@rams.colostate.edu. 1

2 1 Introduction As providers of natural resources and energy face increasing scarcity, interest in incentivizing consumer conservation has surged. Because of political and legal constraints, such as cost-recovery pricing and equity concerns, many firms and utilities find it difficult to use pricing policies as the sole mechanism for promoting conservation. Instead, utilities typically use a portfolio approach to manage demand, increasingly relying on non-pecuniary interventions to influence behavior. In this context, Home Water and Energy Reports (HWERs) have been used as a means to reduce consumption. However, previous studies suggest that households respond to the treatment based on their efficiency of pre-treatment water use, but it is not clear if the signal about their efficiency comes from the social comparison, since a significant "categorization effect" could not be proven (Allcott (2011)). In other words, it remains unclear whether observed treatment effects in HWER studies are due to increased awareness of water use (e.g., simply receiving a letter in the mail) or due to social comparison (e.g., being categorized as "Great", "Good", or "Take Action" relative to one s peers). Understanding the mechanism through which HWERs are a successful means for reducing consumption would inform policy makers and utility managers on how to better craft non-price interventions to maximize conservation. If categories do play a role on how households react to the HWER, they can be designed to induce larger reductions. If categories do not play a role and the response is to information provision, HWERs should be redesigned according to the underlying mechanism. In this paper, we further investigate the apparent contradiction between the treatment effect observed on HWERs and the lack of a statistically significant categorization effect. To do so, we evaluate the first two years of a Randomized Controlled Trial (RCT) of a Home Water Report (HWR) administered by a city in the Western US. We first estimate the Average Treatment Effect (ATE) using a difference-in-difference method, assessing the overall success of the non- 2

3 pecuniary intervention. We then use quantile regression to examine the relationship between technical efficiency and the treatment effect by defining quantiles of of consumption net of time-invariant household characteristics. Finally, we evaluate whether being classified in different normative categories has an effect on the responses to the treatment, using a regression discontinuity approach Allcott (2011) as well as a triple difference to evaluate the presence of the "categorization effect". Consistent with previous literature (Allcott (2011); Delmas et al. (2013); Brent et al. (2015); Ferraro and Price (2013)), we find, after two years, an ATE of 2.4%. Based on the quantiles of the consumption net of pre-treatment household fixed effects, we find that treatment effects are negative for all quantiles and significant across the whole distribution except for those consumers on the very top or on the very bottom, with point estimates ranging from 0.0% for those consumers in the lower quantiles (more technically efficient) to 3.0% for those in the upper quantiles (less technically efficient) of the consumption distribution. Thus, we do not observe what previous literature has called the "boomerang effect", stating that those households that receive the message that their consumption is low relative to their neighbors could feel entitled to increase their consumption. We could not, nonetheless, reject the hypothesis that the coefficients are not statistically different from each other, indicating that even though we observe a trend in the point estimates, the relationship between technical efficiency and the treatment effect could not be proven. Using a regression discontinuity analysis and a triple difference, we test for the existence of the categorization effect on a bandwidth in which we can assume that households on one side or the other of the category cutoff are statistically identical. Consistent with Allcott (2011), we find that the categorization effect is not significant suggesting that other mechanisms drive the observed average treatment effects. Our results confirm the success of HWERs on, not only decreasing average consumption, but doing so across users across the spectrum of technical effi- 3

4 ciency. However, our results strengthen the argument against assuming that the mechanism behind HWER s success is social comparison, though we do not yet provide an alternative explanation. Further research should attempt to test alternative mechanisms to fill this gap, informing policy makers and utilities on how to better craft non-price interventions. This paper builds on a growing literature on the experimental effects of behavioral interventions. Previous studies have shown that Home Energy Reports (HERs) decrease mean electricity use (Allcott (2011)) and HWRs decrease water consumption (Ferraro and Price (2013); Brent et al. (2015)). Delmas et al. (2013) reviews the literature on the effect of information strategies on energy conservation, including social comparisons. The authors conclude that strategies based on social comparisons reduced energy use by 11.5% and are, on average, more effective than the rest of the information provision strategies aiming to lower consumption. In some studies, effects appear to persist over time, with no reduction in effect after two years of continual treatment (Allcott (2011); Brent et al. (2015)), while other studies show declining effects over time (Ferraro and Price (2013)).Allcott and Rogers (2014) expand on this idea and find that effects become more persistent as the intervention continues over time. Finally, Jessoe et al. (2017) demonstrate that the treatment effects can go beyond the targeted sector. The remainder of this paper is as follows. In Section 2 we outline the characteristics of the experiment and the data. In Section 3 we study the ATE, outlining the estimation strategy and results. In Section 4 we study the Heterogeneous Treatment Effects (HTEs). In Section 5 we study the Categorization Effect and in Section 6 we conclude and discuss the results. 4

5 2 Experiment Overview 2.1 Experiment The RCT used for this analysis started in September 2014 and is still ongoing. For the experiment, 7000 households were randomly chosen for the treatment group and 4000 households for the Control group. Households in the treatment group receive a HWR every two months, so for the first two years of treatment, households in the treatment group have received 12 HWRs. On the other hand, households in the Control group do not see their HWRs. HWRs include the water score and comparison to other households as well as conservation tips. Figure 1 is an example of the Water Score received by a household while Figure 2 is an example of the conservation tips. Figure 1: Example of a water score. Figure 2: Example of conservation tips. According to household s number of occupants, the size of the irrigable area, and the type of residence, every household is assigned to a cohort of similar houses. For each cohort, the 52nd and the 20th percentiles are calculated and, according to each household s consumption, a water score is assigned to them. If 5

6 Table 1: Summary Statistics (1) (2) (3) (4) (5) N mean sd min max gpd 321, ,180 Number of groups 10,065 10,065 10,065 10,065 10,065 the household s consumption is less than the 20th percentile, the utility assigns a water score equal to 1 and the message of "Great". If consumption is greater than the 20th percentile but less than the 52th percentile, the water score is 2 and the message received is "Good". Finally, if consumption is greater than the 52th percentile, and the household s per capita water consumption is larger than a certain value, the assigned water score is 3 and the message is "Take Action". This last feature of the experiment, which implies that the cutoff between water scores 2 and 3 is not clear, imposes to the analysis limitations that prevent us from studying the categorization effect on that cutoff. Conservation tips are specially crafted for households with specific characteristics and they consist of possible actions to reduce water consumption Data For each household participating in the experiment, either as treatment or as control, we observe monthly consumption, in gallons per day, their cohort s 20th percentile and median consumption, and the designated water score. The variable we are seeking to explain is consumption (gpd). The summary statistics, pooling the treatment and control groups, and pooling 8 months before treatment and 23 months after treatment, are as shown in table 1. An RCT is built on the assumption that, in the absence of a treatment, both treatment and control groups would have followed the same trend (Angrist and Pischke (2008); Deaton and Cartwright (2017); Ferraro and Hanauer (2014)). This identification assumption is then based on the treatment and the control groups being random samples from the same population (in this case, the universe of residential households in the municipality for which water is provided). To test 6

7 Table 2: Two sample t-test and F-test (1) (2) (3) (4) GROUPS Obs. Mean Std. Error Std. Dev Control 29, Treatment 51, for the validity of this assumption we compare sample means and standard deviations across the Treatment and Control groups. The results of these tests are shown in table 2. For the two sided t-test for means, the calculated p-value is Thus, we fail to reject the null hypothesis that the mean pre-treatment consumption of the treatment and control groups are equal. For the two sided F-test for standard deviations, the p-value is 0.85, so we failed to reject the null hypothesis that the ratio of standard deviations for treatment and control is different than 1. Having shown that means and standard deviations for the logarithm of consumption cannot be proven to be statistically different for treatment and control groups, we have no evidence to suggest that the groups were not the result of random sampling. However, Deaton and Cartwright (2017) make an interesting argument when they claim that if the sampling is indeed random, any significant difference in means or standard deviation would be in fact a false positive by construction, indicating that rejecting the null hypothesis does not necessarily mean that the samples were not randomly drawn from the same population. In this case, as we found that the difference in means and standard deviations is not statistically significant, we can use these findings as evidence to assume orthogonality of the treatment to other causes affecting the dependent variable. Therefore the expected value of the differences between the means of all other causes affecting both groups is zero. 7

8 3 Average Treatment Effect 3.1 Estimation To estimate the average treatment effect, we use a difference-in-difference model, where the specification is as follows: log(gpd) it = β 1 P t + β 2 T i + τ T i P t + µ my + υ i + ɛ it (1) The logarithm of household i s water consumption in month t (log(gpd) it ) depends on the treatment indicator T i, post-treatment indicator P t, and household and month-by-year fixed effects (υ i and µ my ). Treatment is defined as receiving a letter with the HWR on household water consumption in month t. To explain the mechanism behind estimating ATE on a difference-in-difference setting, we use the expected value operator on equation 1, conditional on being part of treatment or control groups and before and after treatment, as follows: E[log(gpd) it T = 1, P = 1] = β 1 + β 2 + τ + µ my + υ i (2) E[log(gpd) it T = 1, P = 0] = β 2 + µ my + υ i (3) E[log(gpd) it T = 0, P = 1] = β 1 + µ my + υ i (4) E[log(gpd) it T = 0, P = 0] = µ my + υ i (5) The difference-in-difference setting is then based on the difference between before and after for treatment and control groups: E[log(gpd) it T = 1, P = 1] E[log(gpd) it T = 1, P = 0] = β 1 + τ (6) 8

9 E[log(gpd) it T = 0, P = 1] E[log(gpd) it T = 0, P = 0] = β 1 (7) Finally, the difference-in-difference ATE is τ and results from the difference between equations 6 and 7. The parameters are estimated using OLS and standard errors are clustered at the household level to allow for dependence across months. Based on the randomized nature of the experiment, which we discussed in the previous section, we can expect the estimator to be unbiased (Deaton and Cartwright (2017)) only if there are not post-randomization confounding and selection biases. In medical sciences, this feature if often achieved by "policing" the experiment, requiring blindness of the participants and researchers. In our case, we can dismiss this source of bias under the assumption that there was no contamination between treatment and control group. Known as the "Stable unit treatment value assumption" (SUTVA), Cox (1958) states causal inference requires that the outcome of one unit should not be affected by the assignment of treatment to other units Results Table 3 presents the results of estimating ATE for the first 23 months in which households received 12 HWRs. T and P were omitted due to multicollinearity, with DD as the difference-in-difference variable T P. We find a statistically significant ATE of 2.4% for the first two years of treatment, as shown in table 3. Having the dependent variable in logarithm form allows us to interpret the estimated parameter as a proportional change. We can, then, contextualize this decrease in consumption by comparing it to the change in price needed to achieve the same reduction. According to Arbués et al. (2003), the elasticity of residential water consumption ranges in the literature from 0.1 to 1.8. Based on this estimation, the observed reduction in quantity is equivalent to an increase in prices between 0.2% and 4.3%. 9

10 Table 3: ATE (1) ATE DD *** ( ) Household FE Y Month by year FE Y Observations 321,448 R-squared Notes: Standard errors clustered by household. Star values: * 0.10 ** 0.05 *** However, as Deaton and Cartwright (2017) stated, it is important to discuss the external validity of the results. As the city used for the experiment is not the result of random selection, we cannot directly extrapolate our results to other regions, but we can infer what would be the result of scaling up the treatment to the whole population from which the samples were drawn. 4 Heterogeneous Treatment Effect 4.1 Estimation In this section we use quantile regression to study the heterogeneity in treatment effects at different points in the conditional distribution of the outcome variable net of pre-treatment household fixed effects. The goal is to make sure that we are capturing how the responses to the treatment are different based on pretreatment technical efficiency. In other words, being at the higher percentiles of the consumption distribution does not indicate per-se that a household is not being efficient, but it can also indicate that the household has certain characteristics that favor water use (number of occupants, size of yard, etc.). By subtracting the pre-treatment household s fixed effects from consumption, consumption associated with household s time invariant characteristics is left out and quantiles of this distribution are a more accurate representation of a household s technical 10

11 efficiency. We then implement a conditional quantile estimator for panel data, developed by Powell (2016), using robust standard errors, on the difference-indifference specification described in the previous section Results Figure 3 presents the results of estimating HTE for the first two years of treatment as a function of the quantiles of consumption net of pre-treatment household fixed effects. Figure 3: Quantile Regression. As shown in figure 3, we find that quantile treatment effects point estimates are negative across quantiles, but not statistically different than zero at a 90% level for those at the very bottom or very top of the distribution, where the confidence interval increases its width. Point estimates range from 0.0% for those consumers in the lower quantiles of the consumption net of pre-treatment fixed effects distribution (more technical efficient households) to 3.0% for those in the larger quantiles (less technical efficient households), with a quite stable treatment effect of around 2.4% across quantiles between 0.1 and 0.9. However, we could not reject the hypothesis that the coefficients are statistically different from each other, indicating that even though we observe a slight downward slope on the 11

12 point estimates, the relationship between technical efficiency and treatment effect could not be proven. Consistent with Allcott (2011), we did not find evidence of a "Boomerang Effect", that would imply that the most efficient users increase their consumption as a response to the HWRs. 5 Categorization Effect In this section, we test if the treatment effect is a function of the categorization that a household in the Treatment group receives. The existence of a "categorization effect" would imply that the mechanism behind the success of the HWR is social comparison. Having a clear cutoff between categories allows us to assume that, within a certain bandwidth, households on one side of the cutoff are statistically identical to those on the other side of the cutoff, and thus their treatment effects based on technical efficiency are also identical. The only difference is then on the water score they have received. We test for the categorization effect on the cutoff from "Great" to "Good". While this cutoff is based on a sharp fixed value (the 20th percentile of the household s cohort s consumption), the cutoff between "Good" and "Take Action" is less clear since it based on the 52th percentile of the household s cohort but adjusted based on the number of occupants in the house. This feature is imposing us limitations that we could not overcome with the available data. We use two alternative methods to test for the categorization effect. First, we use a Regression Discontinuity (RD) design, where we want to test if there is a change in the parameters of the functional form of the mean from one side of the cutoff to the other. Finally, we use a triple difference estimator to test if, within the same bandwidth, being categorized as "Great" or "Good" statistically alters the observed response. 12

13 5.1 Regression Discontinuity Estimation Knowing that there is a clear cutoff separating those who got one category or another, we can use an RD setting to study if there is a statistically significant categorization effect. Instead of comparing the trend on a control group with the trend on a treatment group, this identification exploits the arbitrary nature of the treatment when assigning water scores. For a given cohort, the assigned water score is a deterministic and discontinuous function of the distance to the cutoff. As we previously mentioned, we then rely on the assumption than for those treated households in the neighborhood of the discontinuity, we can extrapolate across covariate values as a control strategy (Angrist and Pischke (2008)). The model used for the estimation is as follows: log(gpd) it = β 0 + ρ I(D20 i t > 0) + β 1 D20 i t I(D20 i t < 0) +β 2 D20 i t I(D20 i t > 0) + µ my + υ i + ɛ it (8) : T i = 1, P it = 1, D20 i t < h, D20 i t 0 Where D20 i t is the distance to the 20th percentile cutoff in the previous month t, affecting the consumption of the current month t. I(D20 i t > 0) is an indicator function that is 1 when D20 i t > 0 and 0 otherwise, while I(D20 i t < 0) is an indicator function that is 1 when D20 i t < 0 and 0 otherwise. If D20 i t > 0, then the assigned water score is 2, if D20 i t < 0, then the assigned water score is 1. h is half the bandwidth in which we are assuming that households are statistically identical. Both D20 i t and h are measured in gallons per day. To explain the mechanism behind the RD setting, we use the expected value operator on equation 8, conditional on observing a positive or a negative distance to the cutoff calculated from the previous month s consumption: 13

14 E[log(gpd) it (D20 i t < 0)] = β 0 + β 1 D20 i t + µ my + υ i (9) E[log(gpd) it (D20 i t > 0)] = β 0 + ρ + β 2 D20 i t + µ my + υ i (10) We can then see that, to the left of the cutoff, the intercept is β 0 and the slope is β 1 while, to the right of the cutoff, the intercept is β 0 + ρ and the slope is β 2. The parameters of interest are then ρ and the difference between β 1 and β 2. The parameter ρ can be seen as the discontinuity in the intercept while, if there is a significant difference between β 1 and β 2, the regression discontinuity is manifested as a change in the slope. Finally, to better capture a change in the intercept, we estimate a RD specification but without allowing for different slopes, as follows: log(gpd) it = β 0 + ρ I(D20 i t > 0) + β 1 D20 i t + µ my + υ i + ɛ it (11) : T i = 1, P it = 1, D20 i t < h, D20 i t 0 If we use the expected value operator on equation 11, conditional on observing a positive or a negative distance to the cutoff calculated from the previous month s consumption: E[log(gpd) it (D20 i t < 0)] = β 0 + β 1 D20 i t + µ my + υ i (12) E[log(gpd) it (D20 i t > 0)] = β 0 + ρ + β 1 D20 i t + µ my + υ i (13) We can then see that, with this specification, to the left of the cutoff the intercept is β 0 while, to the right of the cutoff, the intercept is β 0 + ρ. The slope is, on both sides, β 1. The parameter of interest is then ρ, representing a location shift in the underlying populations on one side or the other of the cut-off. 14

15 5.1.2 Results Table 4 presents the results of estimating equation 8 (allowing for different slopes) for various values of the bandwidth h. Table 4: Regression Discontinuity (different slopes) (1) (2) (3) (4) (5) h=15 h=20 h=25 h=30 h=35 D20 i t I(D20 i t < 0) ** *** *** *** *** ( ) ( ) ( ) ( ) ( ) D20 i t I(D20 i t > 0) *** *** *** *** *** ( ) ( ) ( ) ( ) ( ) I(D20 i t > 0) (0.0170) (0.0139) (0.0124) (0.0111) (0.0103) Household FE Y Y Y Y Y Month by year FE Y Y Y Y Y Observations 9,756 13,260 16,637 19,735 22,654 R-squared Notes: Standard errors clustered by household. Star values: * 0.10 ** 0.05 *** Table 5 presents the results of estimating equation 11 (without allowing for different slopes) for various values of the bandwidth h. Table 5: Regression Discontinuity (same slope) (1) (2) (3) (4) (5) h=15 h=20 h=25 h=30 h=35 D20 i t *** *** *** *** *** ( ) ( ) ( ) ( ) ( ) I(D20 i t > 0) (0.0170) (0.0139) (0.0124) (0.0111) (0.0103) Household FE Y Y Y Y Y Month by year FE Y Y Y Y Y Observations 9,756 13,260 16,637 19,735 22,654 R-squared Notes: Standard errors clustered by household. Star values: * 0.10 ** 0.05 *** By looking at tables 4 and 5, we see that ρ is, for both specifications and for every considered bandwidth h, not statistically different than zero, indicating that there is no regression discontinuity manifested as a change in intercept. Slopes are, in both cases, positive. This result is as expected since it is capturing serial au- 15

16 tocorrelation. We could expect that the distance to the cutoff in month t would be correlated to the the distance to the cutoff in month t. Finally, when we allow for different slopes, the difference between the slope on the negative side and on the positive side (β 1 and β 2 ), is not statistically significant. In sum, we could not find evidence of a regression discontinuity indicating a categorization effect, not in the shape of a "bump" in the linear fit (ρ) nor in a change in slope. 5.2 Triple Difference Estimation As we previously mentioned, with the RD approach we rely on the assumption than for those treated households in the neighborhood of the discontinuity, we can extrapolate across covariate values as a control strategy. On the other hand, with the difference-in-difference approach we use for ATE and HTE, we assume orthogonality of the treatment to other causes affecting the dependent variable, thus the expected value of the differences between the means of all other causes affecting both groups is zero. Our last identification strategy for the categorization effect is a combination of both strategies, using a triple difference estimator for those observations within the specified bandwidth. The three differences are then between before and after, treated and untreated and negative and positive distance to the 20th percentile cutoff (water score equal to 1 or 2). This strategy can strengthen the results from the RD by incorporating an extra way to control for other causes affecting the dependent variable. The triple difference specification is as follows: log(gpd) it = β 0 + β 1 P t + β 2 T i + β 3 I(W S i t = 2) +β 4 T i I(W S i t = 2) + β 5 P t I(W S i t = 2) + β 6 T i P it (14) 16

17 +β 7 T i P it I(W S i t = 2) + µ my + υ i + ɛ it : D20 i t < h, D20 i t 0 As in the difference-in-difference model we use to calculate ATE, we use the expectation operator to explain the intuition. For those who got a water score equal to 2 and were treated: E[log(gpd) it T i = 1, P t = 0, W S i t = 2] = β 0 + β 2 + β 3 + β 4 + υ i (15) E[log(gpd) it T i = 1, P t = 1, W S i t = 2] = β 0 + β 1 + β 2 + β 3 + β 4 +β 5 + β 6 + β 7 + υ i (16) The difference between these two groups is β 1 + β 5 + β 6 + β 7. For those who got a water score equal to 2 and were untreated: E[log(gpd) it T i = 0, P t = 0, W S i t = 2] = β 0 + β 3 + υ i (17) E[log(gpd) it T i = 0, P t = 1, W S i t = 2] = β 0 + β 1 + β 3 + β 5 + υ i (18) The difference between these two groups is β 1 +β 5, and the overall differencein-difference for those who got a water score equal to 2 is β 6 + β 7. Finally, for those who got a water score equal to 1, using the expectation operator and the differences as we just explained, the overall difference-in-difference is β 6. The triple difference parameter is then β 7, which represents the extra treatment effect given by getting a "Great" instead of a "Good". 17

18 5.2.2 Results For different bandwidths h, the results of equation 14 are presented in table 6. The variable DDD is the difference-in-difference-in-difference variable T i P it I(W S i t = 2). Table 6: Difference in Difference in Difference (1) (2) (3) (4) (5) h=15 h=20 h=25 h=30 h=35 I(W S i t = 2) * (0.0489) (0.0406) (0.0360) (0.0327) (0.0302) T i P t (0.0477) (0.0404) (0.0357) (0.0325) (0.0304) T i I(W S i t = 2) (0.0643) (0.0526) (0.0461) (0.0417) (0.0385) P t I(W S i t = 2) (0.0497) (0.0413) (0.0366) (0.0333) (0.0308) DDD (0.0653) (0.0536) (0.0470) (0.0425) (0.0393) Household FE Y Y Y Y Y Month by year FE Y Y Y Y Y Observations 21,579 29,100 36,233 42,913 49,122 R-squared Notes: Standard errors clustered by household. Star values: * 0.10 ** 0.05 *** With the triple difference, the parameter of interest, β 7, associated with the DDD variable, is not statistically different from zero for any h considered. Therefore, these results do not support the existence of a categorization effect. 6 Conclusions and Discussions On average, HWRs work as a means to incentivize conservation, as shown by the ATE estimate here. Even though we have shown a trend between the point estimates of the treatment effect and the quantiles based on the household s technical efficiency, with our results could not prove that those estimates are different from each other. We found no evidence of a "boomerang effect", consistent with Allcott (2011) findings on HERs. Finally, we could not find evidence indicating 18

19 the existence of a "categorization effect" between categories "Great" and "Good". This result holds with both a RD approach and a triple difference estimation. The inexistence of a categorization effect would imply that extending the range of the Good or "Take action categories would not result in a greater treatment effect. Furthermore, the mechanism behind the effect might not be social comparison, but a response to information provision or simply to the knowledge that consumption is being monitored closely. Our work validates previous findings (Allcott (2011)), with a similar methodology on a different dataset and on HWRs instead of HERs. Finally our analysis includes a methodological extension based on the use of the triple difference estimation. With a deeper understanding behind the mechanisms through which HWRs incentivize conservation, we expect to inform policy makers and utilities on how to better craft non-pecuniary interventions. For future research, we expect to test for the existence of alternative mechanisms that could explain the observed reduction in consumption that occurs as a result of information provision. 19

20 References Allcott, H. (2011). Social norms and energy conservation. Journal of Public Economics, 95(9-10): Allcott, H. and Rogers, T. (2014). The short-run and long-run effects of behavioral interventions: Experimental evidence from energy conservation. American Economic Review, 104(10): Angrist, J. D. and Pischke, J.-S. (2008). Mostly harmless econometrics: An empiricist s companion. Princeton university press. Arbués, F., Garcıa-Valiñas, M. Á., and Martınez-Espiñeira, R. (2003). Estimation of residential water demand: a state-of-the-art review. The Journal of Socio- Economics, 32(1): Brent, D. A., Cook, J. H., and Olsen, S. (2015). Social comparisons, household water use, and participation in utility conservation programs: Evidence from three randomized trials. Journal of the Association of Environmental and Resource Economists, 2(4): Cox, D. R. (1958). Planning of experiments. Deaton, A. and Cartwright, N. (2017). Understanding and misunderstanding randomized controlled trials. Social Science & Medicine. Delmas, M. A., Fischlein, M., and Asensio, O. I. (2013). Information strategies and energy conservation behavior: A meta-analysis of experimental studies from 1975 to Energy Policy, 61: Ferraro, P. J. and Hanauer, M. M. (2014). Advances in measuring the environmental and social impacts of environmental programs. Annual review of environment and resources, 39:

21 Ferraro, P. J. and Price, M. K. (2013). Using nonpecuniary strategies to influence behavior: evidence from a large-scale field experiment. Review of Economics and Statistics, 95(1): Jessoe, K., Lade, G. E., Loge, F., and Spang, E. (2017). Spillovers from behavioral interventions: Experimental evidence from water and energy use. Working Paper. Powell, D. (2016). Quantile regression with nonadditive fixed effects. RAND Corporation. 21

Empirical approaches in public economics

Empirical approaches in public economics Empirical approaches in public economics ECON4624 Empirical Public Economics Fall 2016 Gaute Torsvik Outline for today The canonical problem Basic concepts of causal inference Randomized experiments Non-experimental

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University

More information

Quantitative Economics for the Evaluation of the European Policy

Quantitative Economics for the Evaluation of the European Policy Quantitative Economics for the Evaluation of the European Policy Dipartimento di Economia e Management Irene Brunetti Davide Fiaschi Angela Parenti 1 25th of September, 2017 1 ireneb@ec.unipi.it, davide.fiaschi@unipi.it,

More information

Finding Instrumental Variables: Identification Strategies. Amine Ouazad Ass. Professor of Economics

Finding Instrumental Variables: Identification Strategies. Amine Ouazad Ass. Professor of Economics Finding Instrumental Variables: Identification Strategies Amine Ouazad Ass. Professor of Economics Outline 1. Before/After 2. Difference-in-difference estimation 3. Regression Discontinuity Design BEFORE/AFTER

More information

Applied Microeconometrics Chapter 8 Regression Discontinuity (RD)

Applied Microeconometrics Chapter 8 Regression Discontinuity (RD) 1 / 26 Applied Microeconometrics Chapter 8 Regression Discontinuity (RD) Romuald Méango and Michele Battisti LMU, SoSe 2016 Overview What is it about? What are its assumptions? What are the main applications?

More information

12E016. Econometric Methods II 6 ECTS. Overview and Objectives

12E016. Econometric Methods II 6 ECTS. Overview and Objectives Overview and Objectives This course builds on and further extends the econometric and statistical content studied in the first quarter, with a special focus on techniques relevant to the specific field

More information

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Kosuke Imai Department of Politics Princeton University November 13, 2013 So far, we have essentially assumed

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Princeton University Asian Political Methodology Conference University of Sydney Joint

More information

REED TUTORIALS (Pty) LTD ECS3706 EXAM PACK

REED TUTORIALS (Pty) LTD ECS3706 EXAM PACK REED TUTORIALS (Pty) LTD ECS3706 EXAM PACK 1 ECONOMETRICS STUDY PACK MAY/JUNE 2016 Question 1 (a) (i) Describing economic reality (ii) Testing hypothesis about economic theory (iii) Forecasting future

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University Joint

More information

On the Use of Linear Fixed Effects Regression Models for Causal Inference

On the Use of Linear Fixed Effects Regression Models for Causal Inference On the Use of Linear Fixed Effects Regression Models for ausal Inference Kosuke Imai Department of Politics Princeton University Joint work with In Song Kim Atlantic ausal Inference onference Johns Hopkins

More information

Econometrics of causal inference. Throughout, we consider the simplest case of a linear outcome equation, and homogeneous

Econometrics of causal inference. Throughout, we consider the simplest case of a linear outcome equation, and homogeneous Econometrics of causal inference Throughout, we consider the simplest case of a linear outcome equation, and homogeneous effects: y = βx + ɛ (1) where y is some outcome, x is an explanatory variable, and

More information

The Economics of European Regions: Theory, Empirics, and Policy

The Economics of European Regions: Theory, Empirics, and Policy The Economics of European Regions: Theory, Empirics, and Policy Dipartimento di Economia e Management Davide Fiaschi Angela Parenti 1 1 davide.fiaschi@unipi.it, and aparenti@ec.unipi.it. Fiaschi-Parenti

More information

Applied Microeconometrics (L5): Panel Data-Basics

Applied Microeconometrics (L5): Panel Data-Basics Applied Microeconometrics (L5): Panel Data-Basics Nicholas Giannakopoulos University of Patras Department of Economics ngias@upatras.gr November 10, 2015 Nicholas Giannakopoulos (UPatras) MSc Applied Economics

More information

Controlling for Time Invariant Heterogeneity

Controlling for Time Invariant Heterogeneity Controlling for Time Invariant Heterogeneity Yona Rubinstein July 2016 Yona Rubinstein (LSE) Controlling for Time Invariant Heterogeneity 07/16 1 / 19 Observables and Unobservables Confounding Factors

More information

LECTURE 10. Introduction to Econometrics. Multicollinearity & Heteroskedasticity

LECTURE 10. Introduction to Econometrics. Multicollinearity & Heteroskedasticity LECTURE 10 Introduction to Econometrics Multicollinearity & Heteroskedasticity November 22, 2016 1 / 23 ON PREVIOUS LECTURES We discussed the specification of a regression equation Specification consists

More information

1 Impact Evaluation: Randomized Controlled Trial (RCT)

1 Impact Evaluation: Randomized Controlled Trial (RCT) Introductory Applied Econometrics EEP/IAS 118 Fall 2013 Daley Kutzman Section #12 11-20-13 Warm-Up Consider the two panel data regressions below, where i indexes individuals and t indexes time in months:

More information

New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation

New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation Jeff Wooldridge Cemmap Lectures, UCL, June 2009 1. The Basic Methodology 2. How Should We View Uncertainty in DD Settings?

More information

Lectures 5 & 6: Hypothesis Testing

Lectures 5 & 6: Hypothesis Testing Lectures 5 & 6: Hypothesis Testing in which you learn to apply the concept of statistical significance to OLS estimates, learn the concept of t values, how to use them in regression work and come across

More information

Applied Econometrics Lecture 1

Applied Econometrics Lecture 1 Lecture 1 1 1 Università di Urbino Università di Urbino PhD Programme in Global Studies Spring 2018 Outline of this module Beyond OLS (very brief sketch) Regression and causality: sources of endogeneity

More information

Introduction to Econometrics

Introduction to Econometrics Introduction to Econometrics T H I R D E D I T I O N Global Edition James H. Stock Harvard University Mark W. Watson Princeton University Boston Columbus Indianapolis New York San Francisco Upper Saddle

More information

Difference-in-Differences Estimation

Difference-in-Differences Estimation Difference-in-Differences Estimation Jeff Wooldridge Michigan State University Programme Evaluation for Policy Analysis Institute for Fiscal Studies June 2012 1. The Basic Methodology 2. How Should We

More information

AN EVALUATION OF PARAMETRIC AND NONPARAMETRIC VARIANCE ESTIMATORS IN COMPLETELY RANDOMIZED EXPERIMENTS. Stanley A. Lubanski. and. Peter M.

AN EVALUATION OF PARAMETRIC AND NONPARAMETRIC VARIANCE ESTIMATORS IN COMPLETELY RANDOMIZED EXPERIMENTS. Stanley A. Lubanski. and. Peter M. AN EVALUATION OF PARAMETRIC AND NONPARAMETRIC VARIANCE ESTIMATORS IN COMPLETELY RANDOMIZED EXPERIMENTS by Stanley A. Lubanski and Peter M. Steiner UNIVERSITY OF WISCONSIN-MADISON 018 Background To make

More information

Monday, October 15 Handout: Multiple Regression Analysis Introduction

Monday, October 15 Handout: Multiple Regression Analysis Introduction Amherst College Department of Economics Economics 360 Fall 2012 Monday, October 15 Handout: Multiple Regression Analysis Introduction Review Simple and Multiple Regression Analysis o Distinction between

More information

ECON Introductory Econometrics. Lecture 17: Experiments

ECON Introductory Econometrics. Lecture 17: Experiments ECON4150 - Introductory Econometrics Lecture 17: Experiments Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 13 Lecture outline 2 Why study experiments? The potential outcome framework.

More information

Transparent Structural Estimation. Matthew Gentzkow Fisher-Schultz Lecture (from work w/ Isaiah Andrews & Jesse M. Shapiro)

Transparent Structural Estimation. Matthew Gentzkow Fisher-Schultz Lecture (from work w/ Isaiah Andrews & Jesse M. Shapiro) Transparent Structural Estimation Matthew Gentzkow Fisher-Schultz Lecture (from work w/ Isaiah Andrews & Jesse M. Shapiro) 1 A hallmark of contemporary applied microeconomics is a conceptual framework

More information

Online Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha

Online Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha Online Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha January 18, 2010 A2 This appendix has six parts: 1. Proof that ab = c d

More information

Causal Inference with Big Data Sets

Causal Inference with Big Data Sets Causal Inference with Big Data Sets Marcelo Coca Perraillon University of Colorado AMC November 2016 1 / 1 Outlone Outline Big data Causal inference in economics and statistics Regression discontinuity

More information

What s New in Econometrics? Lecture 14 Quantile Methods

What s New in Econometrics? Lecture 14 Quantile Methods What s New in Econometrics? Lecture 14 Quantile Methods Jeff Wooldridge NBER Summer Institute, 2007 1. Reminders About Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile Regression

More information

Randomized trials for policy

Randomized trials for policy A review of the external validity of treatment effects Development Policy Research Unit, University of Cape Town Annual Bank Conference on Development Economics, 2-3 June 2014 Table of contents 1 Introduction

More information

Selection on Observables: Propensity Score Matching.

Selection on Observables: Propensity Score Matching. Selection on Observables: Propensity Score Matching. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 24/10/2017 I. Brunetti Labour Economics in an European Perspective 24/10/2017

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 1 Jakub Mućk Econometrics of Panel Data Meeting # 1 1 / 31 Outline 1 Course outline 2 Panel data Advantages of Panel Data Limitations of Panel Data 3 Pooled

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple

More information

Introduction to Econometrics. Regression with Panel Data

Introduction to Econometrics. Regression with Panel Data Introduction to Econometrics The statistical analysis of economic (and related) data STATS301 Regression with Panel Data Titulaire: Christopher Bruffaerts Assistant: Lorenzo Ricci 1 Regression with Panel

More information

EMERGING MARKETS - Lecture 2: Methodology refresher

EMERGING MARKETS - Lecture 2: Methodology refresher EMERGING MARKETS - Lecture 2: Methodology refresher Maria Perrotta April 4, 2013 SITE http://www.hhs.se/site/pages/default.aspx My contact: maria.perrotta@hhs.se Aim of this class There are many different

More information

BIOS 312: Precision of Statistical Inference

BIOS 312: Precision of Statistical Inference and Power/Sample Size and Standard Errors BIOS 312: of Statistical Inference Chris Slaughter Department of Biostatistics, Vanderbilt University School of Medicine January 3, 2013 Outline Overview and Power/Sample

More information

Introduction to Econometrics

Introduction to Econometrics Introduction to Econometrics STAT-S-301 Panel Data (2016/2017) Lecturer: Yves Dominicy Teaching Assistant: Elise Petit 1 Regression with Panel Data A panel dataset contains observations on multiple entities

More information

Applied Economics. Panel Data. Department of Economics Universidad Carlos III de Madrid

Applied Economics. Panel Data. Department of Economics Universidad Carlos III de Madrid Applied Economics Panel Data Department of Economics Universidad Carlos III de Madrid See also Wooldridge (chapter 13), and Stock and Watson (chapter 10) 1 / 38 Panel Data vs Repeated Cross-sections In

More information

External validity, causal interaction and randomised trials

External validity, causal interaction and randomised trials External validity, causal interaction and randomised trials Seán M. Muller University of Cape Town Evidence and Causality in the Sciences Conference University of Kent (Canterbury) 5 September 2012 Overview

More information

The Age-Time-Cohort Problem and the Identification of Structural Parameters in Life-Cycle Models

The Age-Time-Cohort Problem and the Identification of Structural Parameters in Life-Cycle Models Federal Reserve Bank of Minneapolis Research Department The Age-Time-Cohort Problem and the Identification of Structural Parameters in Life-Cycle Models Sam Schulhofer-Wohl February 2012 DRAFT: DO NOT

More information

A Course in Applied Econometrics Lecture 7: Cluster Sampling. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

A Course in Applied Econometrics Lecture 7: Cluster Sampling. Jeff Wooldridge IRP Lectures, UW Madison, August 2008 A Course in Applied Econometrics Lecture 7: Cluster Sampling Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. The Linear Model with Cluster Effects 2. Estimation with a Small Number of roups and

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 4 Jakub Mućk Econometrics of Panel Data Meeting # 4 1 / 30 Outline 1 Two-way Error Component Model Fixed effects model Random effects model 2 Non-spherical

More information

Price Discrimination through Refund Contracts in Airlines

Price Discrimination through Refund Contracts in Airlines Introduction Price Discrimination through Refund Contracts in Airlines Paan Jindapon Department of Economics and Finance The University of Texas - Pan American Department of Economics, Finance and Legal

More information

Lab 6 - Simple Regression

Lab 6 - Simple Regression Lab 6 - Simple Regression Spring 2017 Contents 1 Thinking About Regression 2 2 Regression Output 3 3 Fitted Values 5 4 Residuals 6 5 Functional Forms 8 Updated from Stata tutorials provided by Prof. Cichello

More information

STOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods Course code: EC40 Examiner: Lena Nekby Number of credits: 7,5 credits Date of exam: Saturday, May 9, 008 Examination time: 3

More information

Write your identification number on each paper and cover sheet (the number stated in the upper right hand corner on your exam cover).

Write your identification number on each paper and cover sheet (the number stated in the upper right hand corner on your exam cover). Formatmall skapad: 2011-12-01 Uppdaterad: 2015-03-06 / LP Department of Economics Course name: Empirical Methods in Economics 2 Course code: EC2404 Semester: Spring 2015 Type of exam: MAIN Examiner: Peter

More information

WISE International Masters

WISE International Masters WISE International Masters ECONOMETRICS Instructor: Brett Graham INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This examination paper contains 32 questions. You are

More information

PSC 504: Instrumental Variables

PSC 504: Instrumental Variables PSC 504: Instrumental Variables Matthew Blackwell 3/28/2013 Instrumental Variables and Structural Equation Modeling Setup e basic idea behind instrumental variables is that we have a treatment with unmeasured

More information

Culture Shocks and Consequences:

Culture Shocks and Consequences: Culture Shocks and Consequences: the connection between the arts and urban economic growth Stephen Sheppard Williams College Arts, New Growth Theory, and Economic Development Symposium The Brookings Institution,

More information

Regression Discontinuity Designs in Economics

Regression Discontinuity Designs in Economics Regression Discontinuity Designs in Economics Dr. Kamiljon T. Akramov IFPRI, Washington, DC, USA Training Course on Applied Econometric Analysis September 13-23, 2016, WIUT, Tashkent, Uzbekistan Outline

More information

INTRODUCTION TO BASIC LINEAR REGRESSION MODEL

INTRODUCTION TO BASIC LINEAR REGRESSION MODEL INTRODUCTION TO BASIC LINEAR REGRESSION MODEL 13 September 2011 Yogyakarta, Indonesia Cosimo Beverelli (World Trade Organization) 1 LINEAR REGRESSION MODEL In general, regression models estimate the effect

More information

dqd: A command for treatment effect estimation under alternative assumptions

dqd: A command for treatment effect estimation under alternative assumptions UC3M Working Papers Economics 14-07 April 2014 ISSN 2340-5031 Departamento de Economía Universidad Carlos III de Madrid Calle Madrid, 126 28903 Getafe (Spain) Fax (34) 916249875 dqd: A command for treatment

More information

Principles Underlying Evaluation Estimators

Principles Underlying Evaluation Estimators The Principles Underlying Evaluation Estimators James J. University of Chicago Econ 350, Winter 2019 The Basic Principles Underlying the Identification of the Main Econometric Evaluation Estimators Two

More information

FNCE 926 Empirical Methods in CF

FNCE 926 Empirical Methods in CF FNCE 926 Empirical Methods in CF Lecture 11 Standard Errors & Misc. Professor Todd Gormley Announcements Exercise #4 is due Final exam will be in-class on April 26 q After today, only two more classes

More information

Econometric Analysis of Cross Section and Panel Data

Econometric Analysis of Cross Section and Panel Data Econometric Analysis of Cross Section and Panel Data Jeffrey M. Wooldridge / The MIT Press Cambridge, Massachusetts London, England Contents Preface Acknowledgments xvii xxiii I INTRODUCTION AND BACKGROUND

More information

Econometrics Review questions for exam

Econometrics Review questions for exam Econometrics Review questions for exam Nathaniel Higgins nhiggins@jhu.edu, 1. Suppose you have a model: y = β 0 x 1 + u You propose the model above and then estimate the model using OLS to obtain: ŷ =

More information

Development. ECON 8830 Anant Nyshadham

Development. ECON 8830 Anant Nyshadham Development ECON 8830 Anant Nyshadham Projections & Regressions Linear Projections If we have many potentially related (jointly distributed) variables Outcome of interest Y Explanatory variable of interest

More information

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017 Introduction to Regression Analysis Dr. Devlina Chatterjee 11 th August, 2017 What is regression analysis? Regression analysis is a statistical technique for studying linear relationships. One dependent

More information

Basic Statistics. 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation).

Basic Statistics. 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation). Basic Statistics There are three types of error: 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation). 2. Systematic error - always too high or too low

More information

Lecture 4: Linear panel models

Lecture 4: Linear panel models Lecture 4: Linear panel models Luc Behaghel PSE February 2009 Luc Behaghel (PSE) Lecture 4 February 2009 1 / 47 Introduction Panel = repeated observations of the same individuals (e.g., rms, workers, countries)

More information

Difference-in-Differences Methods

Difference-in-Differences Methods Difference-in-Differences Methods Teppei Yamamoto Keio University Introduction to Causal Inference Spring 2016 1 Introduction: A Motivating Example 2 Identification 3 Estimation and Inference 4 Diagnostics

More information

Internal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9.

Internal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9. Section 7 Model Assessment This section is based on Stock and Watson s Chapter 9. Internal vs. external validity Internal validity refers to whether the analysis is valid for the population and sample

More information

Wooldridge, Introductory Econometrics, 3d ed. Chapter 16: Simultaneous equations models. An obvious reason for the endogeneity of explanatory

Wooldridge, Introductory Econometrics, 3d ed. Chapter 16: Simultaneous equations models. An obvious reason for the endogeneity of explanatory Wooldridge, Introductory Econometrics, 3d ed. Chapter 16: Simultaneous equations models An obvious reason for the endogeneity of explanatory variables in a regression model is simultaneity: that is, one

More information

Regression Discontinuity Designs

Regression Discontinuity Designs Regression Discontinuity Designs Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Regression Discontinuity Design Stat186/Gov2002 Fall 2018 1 / 1 Observational

More information

Chapter 10: Multiple Regression Analysis Introduction

Chapter 10: Multiple Regression Analysis Introduction Chapter 10: Multiple Regression Analysis Introduction Chapter 10 Outline Simple versus Multiple Regression Analysis Goal of Multiple Regression Analysis A One-Tailed Test: Downward Sloping Demand Theory

More information

Example: Forced Expiratory Volume (FEV) Program L13. Example: Forced Expiratory Volume (FEV) Example: Forced Expiratory Volume (FEV)

Example: Forced Expiratory Volume (FEV) Program L13. Example: Forced Expiratory Volume (FEV) Example: Forced Expiratory Volume (FEV) Program L13 Relationships between two variables Correlation, cont d Regression Relationships between more than two variables Multiple linear regression Two numerical variables Linear or curved relationship?

More information

A Practitioner s Guide to Cluster-Robust Inference

A Practitioner s Guide to Cluster-Robust Inference A Practitioner s Guide to Cluster-Robust Inference A. C. Cameron and D. L. Miller presented by Federico Curci March 4, 2015 Cameron Miller Cluster Clinic II March 4, 2015 1 / 20 In the previous episode

More information

Impact Evaluation of Rural Road Projects. Dominique van de Walle World Bank

Impact Evaluation of Rural Road Projects. Dominique van de Walle World Bank Impact Evaluation of Rural Road Projects Dominique van de Walle World Bank Introduction General consensus that roads are good for development & living standards A sizeable share of development aid and

More information

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Many economic models involve endogeneity: that is, a theoretical relationship does not fit

More information

Star Wars: The Empirics Strike Back Online Appendix

Star Wars: The Empirics Strike Back Online Appendix Star Wars: The Empirics Strike Back Online Appendix Abel Brodeur a Mathias Lé b Marc Sangnier c Yanos Zylberberg d June 2015 Reporting guidelines Rules we used during the data collecting process are described

More information

Research Design: Causal inference and counterfactuals

Research Design: Causal inference and counterfactuals Research Design: Causal inference and counterfactuals University College Dublin 8 March 2013 1 2 3 4 Outline 1 2 3 4 Inference In regression analysis we look at the relationship between (a set of) independent

More information

Small-sample cluster-robust variance estimators for two-stage least squares models

Small-sample cluster-robust variance estimators for two-stage least squares models Small-sample cluster-robust variance estimators for two-stage least squares models ames E. Pustejovsky The University of Texas at Austin Context In randomized field trials of educational interventions,

More information

Mgmt 469. Causality and Identification

Mgmt 469. Causality and Identification Mgmt 469 Causality and Identification As you have learned by now, a key issue in empirical research is identifying the direction of causality in the relationship between two variables. This problem often

More information

The Econometrics of Randomized Experiments

The Econometrics of Randomized Experiments The Econometrics of Randomized Experiments Susan Athey Guido W. Imbens Current version June 2016 Keywords: Regression Analyses, Random Assignment, Randomized Experiments, Potential Outcomes, Causality

More information

Making sense of Econometrics: Basics

Making sense of Econometrics: Basics Making sense of Econometrics: Basics Lecture 4: Qualitative influences and Heteroskedasticity Egypt Scholars Economic Society November 1, 2014 Assignment & feedback enter classroom at http://b.socrative.com/login/student/

More information

ECON 4230 Intermediate Econometric Theory Exam

ECON 4230 Intermediate Econometric Theory Exam ECON 4230 Intermediate Econometric Theory Exam Multiple Choice (20 pts). Circle the best answer. 1. The Classical assumption of mean zero errors is satisfied if the regression model a) is linear in the

More information

Controlling for latent confounding by confirmatory factor analysis (CFA) Blinded Blinded

Controlling for latent confounding by confirmatory factor analysis (CFA) Blinded Blinded Controlling for latent confounding by confirmatory factor analysis (CFA) Blinded Blinded 1 Background Latent confounder is common in social and behavioral science in which most of cases the selection mechanism

More information

Lecture 8. Using the CLR Model. Relation between patent applications and R&D spending. Variables

Lecture 8. Using the CLR Model. Relation between patent applications and R&D spending. Variables Lecture 8. Using the CLR Model Relation between patent applications and R&D spending Variables PATENTS = No. of patents (in 000) filed RDEP = Expenditure on research&development (in billions of 99 $) The

More information

New Developments in Econometrics Lecture 16: Quantile Estimation

New Developments in Econometrics Lecture 16: Quantile Estimation New Developments in Econometrics Lecture 16: Quantile Estimation Jeff Wooldridge Cemmap Lectures, UCL, June 2009 1. Review of Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile

More information

Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Econometrics Week 4 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 23 Recommended Reading For the today Serial correlation and heteroskedasticity in

More information

Panel Data Exercises Manuel Arellano. Using panel data, a researcher considers the estimation of the following system:

Panel Data Exercises Manuel Arellano. Using panel data, a researcher considers the estimation of the following system: Panel Data Exercises Manuel Arellano Exercise 1 Using panel data, a researcher considers the estimation of the following system: y 1t = α 1 + βx 1t + v 1t. (t =1,..., T ) y Nt = α N + βx Nt + v Nt where

More information

Lecture-1: Introduction to Econometrics

Lecture-1: Introduction to Econometrics Lecture-1: Introduction to Econometrics 1 Definition Econometrics may be defined as 2 the science in which the tools of economic theory, mathematics and statistical inference is applied to the analysis

More information

Review of Econometrics

Review of Econometrics Review of Econometrics Zheng Tian June 5th, 2017 1 The Essence of the OLS Estimation Multiple regression model involves the models as follows Y i = β 0 + β 1 X 1i + β 2 X 2i + + β k X ki + u i, i = 1,...,

More information

Flexible Estimation of Treatment Effect Parameters

Flexible Estimation of Treatment Effect Parameters Flexible Estimation of Treatment Effect Parameters Thomas MaCurdy a and Xiaohong Chen b and Han Hong c Introduction Many empirical studies of program evaluations are complicated by the presence of both

More information

14.32 Final : Spring 2001

14.32 Final : Spring 2001 14.32 Final : Spring 2001 Please read the entire exam before you begin. You have 3 hours. No books or notes should be used. Calculators are allowed. There are 105 points. Good luck! A. True/False/Sometimes

More information

LECTURE 11. Introduction to Econometrics. Autocorrelation

LECTURE 11. Introduction to Econometrics. Autocorrelation LECTURE 11 Introduction to Econometrics Autocorrelation November 29, 2016 1 / 24 ON PREVIOUS LECTURES We discussed the specification of a regression equation Specification consists of choosing: 1. correct

More information

SUPPLEMENT TO DEMOCRACY, REDISTRIBUTION, AND POLITICAL PARTICIPATION: EVIDENCE FROM SWEDEN (Econometrica, Vol. 82, No. 3, May 2014, )

SUPPLEMENT TO DEMOCRACY, REDISTRIBUTION, AND POLITICAL PARTICIPATION: EVIDENCE FROM SWEDEN (Econometrica, Vol. 82, No. 3, May 2014, ) Econometrica Supplementary Material SUPPLEMENT TO DEMOCRACY, REDISTRIBUTION, AND POLITICAL PARTICIPATION: EVIDENCE FROM SWEDEN 1919 1938 (Econometrica, Vol. 82, No. 3, May 2014, 961 993) BY BJÖRNTYREFORSHINNERICH

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

Please discuss each of the 3 problems on a separate sheet of paper, not just on a separate page!

Please discuss each of the 3 problems on a separate sheet of paper, not just on a separate page! Econometrics - Exam May 11, 2011 1 Exam Please discuss each of the 3 problems on a separate sheet of paper, not just on a separate page! Problem 1: (15 points) A researcher has data for the year 2000 from

More information

Tutorial 3: Power and Sample Size for the Two-sample t-test with Equal Variances. Acknowledgements:

Tutorial 3: Power and Sample Size for the Two-sample t-test with Equal Variances. Acknowledgements: Tutorial 3: Power and Sample Size for the Two-sample t-test with Equal Variances Anna E. Barón, Keith E. Muller, Sarah M. Kreidler, and Deborah H. Glueck Acknowledgements: The project was supported in

More information

An Alternative Assumption to Identify LATE in Regression Discontinuity Design

An Alternative Assumption to Identify LATE in Regression Discontinuity Design An Alternative Assumption to Identify LATE in Regression Discontinuity Design Yingying Dong University of California Irvine May 2014 Abstract One key assumption Imbens and Angrist (1994) use to identify

More information

review session gov 2000 gov 2000 () review session 1 / 38

review session gov 2000 gov 2000 () review session 1 / 38 review session gov 2000 gov 2000 () review session 1 / 38 Overview Random Variables and Probability Univariate Statistics Bivariate Statistics Multivariate Statistics Causal Inference gov 2000 () review

More information

Beyond the Target Customer: Social Effects of CRM Campaigns

Beyond the Target Customer: Social Effects of CRM Campaigns Beyond the Target Customer: Social Effects of CRM Campaigns Eva Ascarza, Peter Ebbes, Oded Netzer, Matthew Danielson Link to article: http://journals.ama.org/doi/abs/10.1509/jmr.15.0442 WEB APPENDICES

More information

ES103 Introduction to Econometrics

ES103 Introduction to Econometrics Anita Staneva May 16, ES103 2015Introduction to Econometrics.. Lecture 1 ES103 Introduction to Econometrics Lecture 1: Basic Data Handling and Anita Staneva Egypt Scholars Economic Society Outline Introduction

More information

FinQuiz Notes

FinQuiz Notes Reading 10 Multiple Regression and Issues in Regression Analysis 2. MULTIPLE LINEAR REGRESSION Multiple linear regression is a method used to model the linear relationship between a dependent variable

More information

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Agenda Introduction to Estimation Point estimation Interval estimation Introduction to Hypothesis Testing Concepts en terminology

More information

Econometrics -- Final Exam (Sample)

Econometrics -- Final Exam (Sample) Econometrics -- Final Exam (Sample) 1) The sample regression line estimated by OLS A) has an intercept that is equal to zero. B) is the same as the population regression line. C) cannot have negative and

More information

Final Exam. Question 1 (20 points) 2 (25 points) 3 (30 points) 4 (25 points) 5 (10 points) 6 (40 points) Total (150 points) Bonus question (10)

Final Exam. Question 1 (20 points) 2 (25 points) 3 (30 points) 4 (25 points) 5 (10 points) 6 (40 points) Total (150 points) Bonus question (10) Name Economics 170 Spring 2004 Honor pledge: I have neither given nor received aid on this exam including the preparation of my one page formula list and the preparation of the Stata assignment for the

More information

Wooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems

Wooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems Wooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems Functional form misspecification We may have a model that is correctly specified, in terms of including

More information

Contest Quiz 3. Question Sheet. In this quiz we will review concepts of linear regression covered in lecture 2.

Contest Quiz 3. Question Sheet. In this quiz we will review concepts of linear regression covered in lecture 2. Updated: November 17, 2011 Lecturer: Thilo Klein Contact: tk375@cam.ac.uk Contest Quiz 3 Question Sheet In this quiz we will review concepts of linear regression covered in lecture 2. NOTE: Please round

More information