MATCHING TECHNIQUES Technical Track Session VI Céline Ferré The World Bank

MATCHING TECHNIQUES Technical Track Sessin VI Céline Ferré The Wrld Bank

When can we use matching? What if the assignment t the treatment is nt dne randmly r based n an eligibility index, but n the basis f bservable variables? This is when matching methds cme in! Matching methds allw yu t cnstruct cmparisn grups when the assignment t the treatment (r the take-up f treatment) can be mdeled n the basis f bservable variables.

When can we use matching? Intuitin: the cmparisn grup needs t be as similar as pssible t the treatment grup, in terms f the bservables befre the start f the treatment. Avids pssible cnfunders frm imbalance in bservables. The methd assumes there are n remaining unbservable differences between treatment and cmparisn grups.

Key Questin What is the effect f treatment when the assignment t the treatment is based n bservable variables?

Uncnfundedness & Selectin n bservables Let X dente a matrix in which each rw is a vectr f pre-treatment bservable variables fr individual i. Uncnfundedness: Assignment t treatment is uncnfunded given pre-treatment variables X if Y 1, Y 0 D X Uncnfundedness is equivalent t saying that: (1) within each cell defined by X: treatment is randm (2) the selectin int treatment depends nly n the bservables X.

Average effects f treatment Assuming uncnfundedness given X Intuitin Estimate the treatment effect within each cell defined by X Take the weighted average ver the different cells Maths In yur handuts: Annex 1

Strategy fr estimating average effect f treatment Selectin n bservables Uncnfundedness suggests the fllwing strategy fr the estimatin f the average treatment effect δ Stratify the data int cells defined by each particular value f X Within each cell (i.e. cnditining n X) cmpute the difference between the average utcmes f the treated and the cntrls Average these differences with respect t the distributin f X in the ppulatin f treated units. Is this strategy feasible?

Is ur strategy feasible? The Dimensinality Prblem This may nt be feasible when The sample is small The set f cvariates is large Many f the cvariates have many values r are cntinuus This is what we call The dimensinality prblem

The Dimensinality Prblem Examples Hw many cells d we have with 2 binary X variables? And with 3 binary X variables? And with K binary X variables? Hw abut if we have 2 variables that take n 7 values each? As the number f cells grws, we ll get lack f cmmn supprt cells cntaining nly treated bservatins cells cntaining nly cntrls

An Alternative t slve the Dimensinality Prblem The prpensity scre allws t cnvert the multidimensinal setup f matching int a nedimensinal setup. In that way, it allws t reduce the dimensinality prblem. Rsenbaum and Rubin Rsenbaum and Rubin (1983) prpse an equivalent and feasible estimatin strategy based n the cncept f Prpensity Scre.

Matching based n the Prpensity Scre Definitin The prpensity scre is the cnditinal prbability f receiving the treatment given the pre-treatment variables: Lemma 1 Lemma 2 p(x) =Pr{D = 1 X} = E X {D X} If p(x) is the prpensity scre, then D X p(x) Given the prpensity scre, the pre-treatment variables are balanced between beneficiaries and nn- beneficiaries Y1, Y0 D X => Y 1, Y0 D p(x) Suppse that assignment t treatment is uncnfunded given the pre-treatment variables X. Then assignment t treatment is uncnfunded given the prpensity scre p(x).

Des the prpensity scre apprach slve the dimensinality prblem? YES! The balancing prperty f the prpensity scre (Lemma 1) ensures that: Observatins with the same prpensity scre have the same distributin f bservable cvariates independently f treatment status; and fr a given prpensity scre, assignment t treatment is randm and therefre treatment and cntrl units are bservatinally identical n average.

Implementatin f the estimatin strategy This suggests the fllwing strategy fr the estimatin f the average treatment effect n the treated, called δ Step 1 Estimate the prpensity scre (see Annex 3) Step 2 Restrict the analysis t the regin f cmmn supprt (key surce f bias in bservatinal studies) Step 3 Estimate the average treatment effect by matching based n the prpensity scre

Implementatin f the estimatin strategy Step 1 Estimate the prpensity scre (see annex 3) Estimate a lgit (r prbit) mdel f prgram participatin. Predicted values are the prpensity scres. E.g. With a lgit functin, see Annex 3. This step is necessary because the true prpensity scre is unknwn and therefre the prpensity scre has t be estimated.

When is prpensity scre matching apprpriate? Idea behind prpensity scre matching: estimatin f treatment effects requires a careful matching f treated and cntrls. If treated and cntrls are very different in terms f bservables this matching is nt sufficiently clse and reliable r it may even be impssible. The cmparisn f the estimated prpensity scres acrss treated and cntrls prvides a useful diagnstic tl t evaluate hw similar are treated and cntrls, and therefre hw reliable is the estimatin strategy.

Implementatin f the estimatin strategy Step 2 Restrict the analysis t the regin f cmmn supprt (key surce f bias in bservatinal studies)

S yu want prpensity scre t be the same fr treatments and cntrls The range f variatin f prpensity scres shuld be the same fr treated and cntrls. Cunt hw many cntrls have a prpensity scre lwer than the minimum r higher than the maximum f the prpensity scres f the treated and vice versa. Frequency f prpensity scres is the same fr treated and cntrl. Draw histgrams f the estimated prpensity scres fr the treated and cntrls. The bins crrespnd t the blcks cnstructed fr the estimatin f prpensity scres.

The issue f cmmn supprt Density Density f scres fr nn-participants Density f scres fr participants 0 Regin f cmmn supprt 1 Prpensity scre

Step 3: Estimate the average treatment effect by matching based n the prpensity scre Step 3 Estimate the average treatment effect by matching based n the prpensity scre Fr each participant find a sample f nn-participants that have similar prpensity scres. Cmpare the utcme indicatr fr each participant and its cmparisn grup. Calculate the mean f these individual gains t btain the average verall gain (Heckman et al., 1998). P NP ATT ( Y - W Y ) / P j1 ij ij0 j1 i1

Step 3: Estimate the average treatment effect by matching based n the prpensity scre Similar can be defined in many ways. These different weights crrespnd t different ways f ding matching: Stratificatin n the Scre Nearest neighbr matching n the Scre Radius matching n the Scre Kernel matching n the Scre Weighting n the basis f the Scre

T summarize: Matching is the bservatinal analgue f an experiment in which placement is independent f utcmes The key difference is that a pure experiment des nt require the essentially untestable assumptin f independence cnditinal n bservables. PSM requires N selectin n unbservables Gd data Cmmn Supprt Matching is ften cmbined with difference-in-differences methds Matching is perfrmed n baseline characteristics DD cntrls fr selectin based n time-invariant unbserved characteristics Cautin! Matching with ex pst data is very risky

References Ahmed, U.A., A.R. Quisumbing, M. Nasreen, J.F. Hddintt and E. Bryan (2009). Cmparing Fd and Cash Transfers t the Ultra Pr in Bangladesh. Internatinal Fd Plicy Research Institute, Research Mngraph 63. Cattane, M., S. Galiani, P. Gertler, S. Martinez and R. Titiunik (2009). Husing, Health and Happiness. American Ecnmic Jurnal: Ecnmic Plicy 1(1). Dehejia, R.H. and S. Wahba (1999), Causal Effects in Nn-experimental Studies: Reevaluating the Evaluatin f Training Prgrams, Jurnal f the American Statistical Assciatin, 94, 448, 1053-1062. Caliend, M. and S. Kpeinig (2008), Sme Practical Guidance fr the Implementatin f Prpensity Scre Matching, Jurnal f Ecnmic Surveys 22(1). Heckman, James J. H. Ichimura, and P. Tdd (1998), Matching as an ecnmetric evaluatin estimatr, Review f Ecnmic Studies, 65, 261-294. Imbens. G. W. (2004), Nnparametric Estimatin f Average Treatment Effects under Exgeneity: A Review, The Review f Ecnmic and Statistics 86(1).

References (cnt.) Jalan, J., and M. Ravallin (2003a), Estimating the Benefit Incidence f an Antipverty Prgram by Prpensity-Scre Matching, Jurnal f Business & Ecnmic Statistics 21(1): 19-30. Jalan, J., and M. Ravallin (2003b), Des Piped Water Reduce Diarrhea fr Children in Rural India, Jurnal f Ecnmetrics 112(1): 153-173. LaLnde, R. (1986) Evaluating the Ecnmetric Evaluatins f Training Prgrams with Experimental Data, American Ecnmic Review 76(4). Rsenbaum, P.R. and D.B. Rubin (1983), The Central Rle f the Prpensity Scre in Observatinal Studies fr Causal Effects, Bimetrika 70, 1, 41 55.

Thank Yu

Q & A

Annex 1: Average effects f treatment n the treated assuming uncnfundedness given X If we are willing t assume uncnfundedness: 0 0 0 1 1 1 E Y u D =0, X = E Y u D 1, X E Y u X i i i i i i i i E Y u D =0, X E Y u D 1, X E Y u X i i i i i i i i Using these expressins, we can define fr each cell defined by =average treatment effect n the treated in cell defined by X X E { D 1, X} i i i Ei Y1 ui Y0ui Di 1, X E Y u D 1, X E i 1 i i i can measure sample analg 1 Di i 0u 0 E Y u D 1, X can NOT measure sample analg Y u 1, X E Y D 0, X i i i i can measure sample analg i X

Annex 1: Average effects f treatment n the treated assuming uncnfundedness given X Nw what is the relatin between "average treatment effect n the treated"... and... X "average treatment effect n the treated within cell defined by X "? average treatment effect n the treated E D 1 i i i by the law f iterated expectatins E E D =1, X i X i i E E D =1, X E E X i i i X X X {average treatment effect n the treated within cell defined by X}

Annex 2: Average effects f treatment and the prpensity scre S let's match treatments and cntrls n the basis f the prpensity scre p(x) instead f X. 0 =0, = 0 1, 0 1 =0, 1 1, 1 E Y u D p X E Y u D p X E Y u p X i i i i i i i i i i i E Y u D p X E Y u D p X E Y u p X i i i i i i i i i i i Using these expressins, we can define f cell defined by p X =average treatment effect n the treated in cell defined by p X Ei{ i Di 1, p X } Ei Y1 ui Y0ui Di 1, p X E Y u D 1, p X E Y u D 1, p X p X i 1 i i can measure sample analg 1 Di 1, 0 Di 0, i i i i 0 i can NOT measure sample analg E Y u p X E Y u p X i can measure sample analg

Annex 2: Average effects f treatment and the prpensity scre Nw what is the relatin between p X p X "average treatment effect n the treated"... and... "average treatment effect n the treated within cell defined by "? average treatment effect n the treated E D 1 i i i by the law f iterated expectatins Ei Ep X i Di p X =1, E E D p X p X E i i i =1, p X p X E {treatment effect n the treated within cell defined by p X } p X

Annex 3: Estimatin f the prpensity scre Any standard prbability mdel can be used t estimate the prpensity scre, e.g. a lgit mdel: h( X i ) e Pr{ Di X i} h( X i ) 1 e (16) where h(xi) is a functin f cvariates with linear and higher rder terms.

Estimatin f the prpensity scre Which higher rder terms d yu include in h(xi)? This is determined slely by the need t btain an estimate f the prpensity scre that satisfies the balancing prperty. The specificatin f h(xi) is (1) mre parsimnius than the full set f interactins between bservables X (2) thugh nt t parsimnius: it still needs t satisfy the balancing prperty. Nte: the estimatin f the prpensity scres des nt need a behaviral interpretatin.

An algrithm fr estimating the prpensity scre 1. Start with a parsimnius lgit r prbit functin t estimate the scre. 2. Srt the data accrding t the estimated prpensity scre (frm lwest t highest). 3. Stratify all bservatins in blcks such that in each blck the estimated prpensity scres fr the treated and the cntrls are nt statistically different: a) start with five blcks f equal scre range {0-0.2,..., 0.8-1} b) test whether the means f the scres fr the treated and the cntrls are statistically different in each blck c) if yes, increase the number f blcks and test again d) if n, g t next step.

An algrithm fr estimating the prpensity scre (cntinued) 4. Test that the balancing prperty hlds in all blcks fr all cvariates: a) fr each cvariate, test whether the means (and pssibly higher rder mments) fr the treated and fr the cntrls are statistically different in all blcks; b) if ne cvariate is nt balanced in ne blck, split the blck and test again within each finer blck; c) if ne cvariate is nt balanced in all blcks, mdify the lgit estimatin f the prpensity scre adding mre interactin and higher rder terms and then test again. Nte: In all this prcedure the utcme has n rle. Use the STATA prgram pscre.ad, psmatch2.ad, match.ad (frm STATA type findit name ad )