Lecture Notes on Propensity Score Matching

Similar documents
0 0 = 1 0 = 0 1 = = 1 1 = 0 0 = 1

生物統計教育訓練 - 課程. Introduction to equivalence, superior, inferior studies in RCT 謝宗成副教授慈濟大學醫學科學研究所. TEL: ext 2015

Linear Regression. Applied Linear Regression Models (Kutner, Nachtsheim, Neter, Li) hsuhl (NUK) SDA Regression 1 / 34

Chapter 1 Linear Regression with One Predictor Variable

Algorithms and Complexity

授課大綱 課號課程名稱選別開課系級學分 結果預視

國立中正大學八十一學年度應用數學研究所 碩士班研究生招生考試試題

Chapter 22 Lecture. Essential University Physics Richard Wolfson 2 nd Edition. Electric Potential 電位 Pearson Education, Inc.

Chapter 20 Cell Division Summary

Earth System Science Programme. Academic Counseling 2018

= lim(x + 1) lim x 1 x 1 (x 2 + 1) 2 (for the latter let y = x2 + 1) lim

Chapter 6. Series-Parallel Circuits ISU EE. C.Y. Lee

HKDSE Chemistry Paper 2 Q.1 & Q.3

期中考前回顧 助教 : 王珊彗. Copyright 2009 Cengage Learning

KWUN TONG GOVERNMENT SECONDARY SCHOOL 觀塘官立中學 (Office) Shun Lee Estate Kwun Tong, Kowloon 上學期測驗

台灣大學開放式課程 有機化學乙 蔡蘊明教授 本著作除另有註明, 作者皆為蔡蘊明教授, 所有內容皆採用創用 CC 姓名標示 - 非商業使用 - 相同方式分享 3.0 台灣授權條款釋出

tan θ(t) = 5 [3 points] And, we are given that d [1 points] Therefore, the velocity of the plane is dx [4 points] (km/min.) [2 points] (The other way)

Chapter 1 Physics and Measurement

統計學 Spring 2011 授課教師 : 統計系余清祥日期 :2011 年 3 月 22 日第十三章 : 變異數分析與實驗設計

磁振影像原理與臨床研究應用 課程內容介紹 課程內容 參考書籍. Introduction of MRI course 磁振成像原理 ( 前 8 週 ) 射頻脈衝 組織對比 影像重建 脈衝波序 影像假影與安全 等

Differential Equations (DE)

雷射原理. The Principle of Laser. 授課教授 : 林彥勝博士 Contents

Chapter 1 Linear Regression with One Predictor Variable

相關分析. Scatter Diagram. Ch 13 線性迴歸與相關分析. Correlation Analysis. Correlation Analysis. Linear Regression And Correlation Analysis

Multiple sequence alignment (MSA)

Statistical Intervals and the Applications. Hsiuying Wang Institute of Statistics National Chiao Tung University Hsinchu, Taiwan

Statistics and Econometrics I

pseudo-code-2012.docx 2013/5/9

Ch.9 Liquids and Solids

ApTutorGroup. SAT II Chemistry Guides: Test Basics Scoring, Timing, Number of Questions Points Minutes Questions (Multiple Choice)

Candidates Performance in Paper I (Q1-4, )

1 dx (5%) andˆ x dx converges. x2 +1 a

Candidates Performance in Paper I (Q1-4, )

在雲層閃光放電之前就開始提前釋放出離子是非常重要的因素 所有 FOREND 放電式避雷針都有離子加速裝置支援離子產生器 在產品設計時, 為增加電場更大範圍, 使用電極支援大氣離子化,

EXPERMENT 9. To determination of Quinine by fluorescence spectroscopy. Introduction

2019 年第 51 屆國際化學奧林匹亞競賽 國內初選筆試 - 選擇題答案卷

適應控制與反覆控制應用在壓電致動器之研究 Adaptive and Repetitive Control of Piezoelectric Actuators

Advanced Engineering Mathematics 長榮大學科工系 105 級

原子模型 Atomic Model 有了正確的原子模型, 才會發明了雷射

Chapter 8 Lecture. Essential University Physics Richard Wolfson 2 nd Edition. Gravity 重力 Pearson Education, Inc. Slide 8-1

壓差式迴路式均熱片之研製 Fabrication of Pressure-Difference Loop Heat Spreader

心智科學大型研究設備共同使用服務計畫身體 心靈與文化整合影像研究中心. fmri 教育講習課程 I. Hands-on (2 nd level) Group Analysis to Factorial Design

REAXYS NEW REAXYS. RAEXYS 教育訓練 PPT HOW YOU THINK HOW YOU WORK

國立交通大學 電子工程學系電子研究所碩士班 碩士論文

Chapter 5-7 Errors, Random Errors, and Statistical Data in Chemical Analyses

GSAS 安裝使用簡介 楊仲準中原大學物理系. Department of Physics, Chung Yuan Christian University

國立成功大學 航空太空工程學系 碩士論文 研究生 : 柯宗良 指導教授 : 楊憲東

邏輯設計 Hw#6 請於 6/13( 五 ) 下課前繳交

醫用磁振學 MRM 課程介紹與原理複習. Congratulations! Syllabus. From Basics to Bedside. You are HERE! License of Radiological Technologist 盧家鋒助理教授國立陽明大學生物醫學影像暨放射科學系

統計學 ( 一 ) 第七章信賴區間估計 (Estimation Using Confidence Intervals) 授課教師 : 唐麗英教授 國立交通大學工業工程與管理學系聯絡電話 :(03)

基因演算法 學習速成 南台科技大學電機系趙春棠講解

行政院國家科學委員會補助專題研究計畫 成果報告 期中進度報告 ( 計畫名稱 )

ON FINITE DIMENSIONAL APPROXIMATION IN NONPARAMETRIC REGRESSION

論文與專利寫作暨學術 倫理期末報告 班級 : 碩化一甲學號 :MA 姓名 : 林郡澤老師 : 黃常寧

Elementary Number Theory An Algebraic Apporach

Ministry of Sciences and Technology, Chinese Academy of Sciences as well as some international pharmaceutical companies.

Study of Leaf Area as Functions of Age and Temperature in Rice (Oryza sativa L.) 1

Chapter 7. The Quantum- Mechanical Model of the Atom. Chapter 7 Lecture Lecture Presentation. Sherril Soman Grand Valley State University

Tutorial Courses

奈米微污染控制工作小組 協辦單位 台灣賽默飛世爾科技股份有限公司 報名方式 本參訪活動由郭啟文先生負責 報名信箱

允許學生個人 非營利性的圖書館或公立學校合理使用本基金會網站所提供之各項試題及其解答 可直接下載而不須申請. 重版 系統地複製或大量重製這些資料的任何部分, 必須獲得財團法人臺北市九章數學教育基金會的授權許可 申請此項授權請電郵

Permutation Tests for Difference between Two Multivariate Allometric Patterns

Hong Kong s temperature record: Is it in support of global warming? 香港的溫度記錄 : 全球暖化的證據?

The dynamic N1-methyladenosine methylome in eukaryotic messenger RNA 报告人 : 沈胤

Digital Integrated Circuits Lecture 5: Logical Effort

MECHANICS OF MATERIALS

FUNDAMENTALS OF FLUID MECHANICS Chapter 3 Fluids in Motion - The Bernoulli Equation

5.5 Using Entropy to Calculate the Natural Direction of a Process in an Isolated System

FUNDAMENTALS OF FLUID MECHANICS. Chapter 8 Pipe Flow. Jyh-Cherng. Shieh Department of Bio-Industrial

ANSYS 17 應用於半導體設備和製程的應用技術

Chapter 13. Enzyme Kinetics ( 動力學 ) and Specificity ( 特異性 專一性 ) Biochemistry by. Reginald Garrett and Charles Grisham

Earth System Science Programme 地球系統科學課程 Tel: Fax:

2. Suppose that a consumer has the utility function

Learning to Recommend with Location and Context

Frequency Response (Bode Plot) with MATLAB

Selection on Observables: Propensity Score Matching.

CHAPTER 2. Energy Bands and Carrier Concentration in Thermal Equilibrium

Ph.D. Qualified Examination

侯宏誼研究簡歷 一. 學歷民國 96 年 11 月 : 雲林科技大學工程科技博士民國 89 年 06 月 : 雲林科技大學環境與安全工程研究所工學碩士民國 87 年 06 月 : 東海大學環境科學系理學學士二 主要著作 ( 一 ) 英文學術性期刊論文

Ph.D. Qualified Examination

Regression Analysis. Institute of Statistics, National Tsing Hua University, Taiwan

MRDFG 的周期界的計算的提升計畫編號 :NSC E 執行期限 : 94 年 8 月 1 日至 94 年 7 月 31 日主持人 : 趙玉政治大學資管系計畫參與人員 :

Hybrid Simulations of Ground Motions for the Earthquake Scenarios of Shanchiao and Hengchun faults

氮化鋁鎵 / 氮化鎵異質結構的電性傳輸. Electrical transport in AlGaN/GaN heterostructures

Chapter 13 Thin-layer chromatography. Shin-Hun Juang, Ph.D.

在破裂多孔介質中的情形 底下是我們考慮的抛物線微分方程式. is a domain and = f. in f. . Denote absolute permeability by. P in. k in. p in. and. and. , and external source by

Reactive Fluid Dynamics 1 G-COE 科目 複雑システムのデザイン体系 第 1 回 植田利久 慶應義塾大学大学院理工学研究科開放環境科学専攻 2009 年 4 月 14 日. Keio University

d) There is a Web page that includes links to both Web page A and Web page B.

Numbers and Fundamental Arithmetic

INTERNATIONAL BIOLOGY OLYMPIAD

General Biology I. For M117 and D76 Students. 陳正繹 Dept. of Biology & Anatomy National Defense Medical Cente

日本政府 ( 文部科学省 ) 奨学金留学生申請書

Hong Kong Examinations and Assessment Authority & Curriculum Development Institute, EDB Teachers Meeting for HKDSE Physics SBA

CH 5 More on the analysis of consumer behavior

(i) Tangutorine 1 之全合成 : 經由關鍵的中間體 2; 化合物 2 具有. Aspidospermidine 39 之合成研究 : 經由螺環中間體 47

2012 AP Calculus BC 模拟试卷

Digital Image Processing

Ch. 6 Electronic Structure and The Periodic Table

Sparse Learning Under Regularization Framework

Transcription:

Lecture Notes on Propensity Score Matching Jin-Lung Lin This lecture note is intended solely for teaching. Some parts of the notes are taken from various sources listed below and no originality is claimed. 1 Introduction A specific question: Is taking math lessons after school helpful in improving score? 補習數學有用嗎? ( 參考 : 關秉寅 李敦義 (2008) 補習數學有用嗎? 一個 反事實 的分析 台灣社會學刊,41,97-148) A first attempt to answer this question would be computing the difference between the scores of those who took the after school lessons and those who don t. By so doing, one assumes that all the students are similar and are randomly selected to take after school lessons. In reality, these are two different groups with different characteristics that would affect the learning and scoring ability. In other words, there exist sample selection bias that seriously affects the validity of the analysis. Some basic concepts: Treatment = 補習數學 (D = 1); 沒有補習數學 (D = 0) Y (1) : 補習數學學生的數學成績 ; Y (0) : 沒有補習數學學生的數學成績 ATE (Average Treatment Effect) 如果所有的人國三都補數學的話, 數學成就會有差異嗎? ATT (Average Treatment Effect on the Treated): 如果國三補數學的人, 沒補的話, 數學成就會有差異嗎? ATU (Average Treatment Effect on the Untreated): 如果國三沒補數學的人, 補習的話, 數學成就會有差異嗎? General questions: Is the treatment (for whatever) effective? Fact: some people receive treatment. Counterfactual question: What would have happened to those who, in fact, did receive treatment, if they had not received treatment (or the converse)? In short, participants differ from nonparticipants and creates the selection bias. To minimize the 1

bias, we need to find a large group of nonparticipants those individuals who are similar to the participants in all relevant treatment characteristics. 2

Table 1: The counterfactual framework Potential outcomes Group Y (1) Y (0) Treatment effect (D=1) Observable E(Y (1) D = 1) Counterfactual E(Y (0) D = 1) Control group (D=0) Counterfactual E(Y (1) D = 0) Observable E(Y (0) D = 0) 2 Matching basics Roy-Rubin model main pillars: individual, treatment and potential outcome. For binary treatment, treatment indicator 1 if individual i receives treatment D i = 0 if individual i does not receive treatment Y i (D i ) is the potential outcome for individual i, i = 1,, N. Treatment effect τ i = Y i (1) Y i (0) Only one of Y i (1), Y i (0) is observed and the other unobservable outcome is called counterfactual outcome. It is impossible to estimate τ i for each i and we could only estimate the average treatment effect. τ i = Y i (1) Y i (0) τ AT E = E(τ) = E(Y (1) Y (0)) τ AT T = E(τ D = 1) = E(Y (1) D = 1) E(Y (0) D = 1) τ AT U = E(τ D = 0) = E(Y (1) D = 0) E(Y (0) D = 0) Population average treatment effect (ATE), τ AT E, answers the question What is the expected effect of the outcome if individuals in the population were randomly assigned to treatment? τ AT E is not interesting because it includes the effects on persons not intended. On the other hand, τ AT T, average effect of the treated is defined as the difference between expected outcome values with and without treatment for those who actually participate in treatment. It determines the realized gross gain from the programme and can be compared with its cost. E(Y (0) D = 1) is counterfactual (unobserved) and E(Y (0) D = 0) is usually not a good proxy. There exists selection bias. E(Y (1) D = 1) E(Y (0) D = 0) = τ AT T + E(Y (0) D = 1) E(Y (0) D = 0) τ AT T is only identified if the selection bias, E(Y (0) D = 1) E(Y (0) D = 0) = 0 3

Furthermore, let P (D = 1) = π, then τ AT E = E(τ) = E(Y (1) Y (0)) = [πe(y (1) D = 1) + (1 π)e(y (1) D = 0)] [πe(y (0) D = 1) + (1 π)e(y (0) D = 0)] = π[e(y (1) D = 1) E(Y (0) D = 1)] + (1 π)[e(y (1) D = 0) E(Y (0) D = 0)] = πe(τ D = 1) + (1 π)e(τ D = 0) = πat T + (1 π)at U Regards to previous example, 不同組的人有同樣的因果效應嗎? Yes, E[Y (1) D = 0] = E[Y (1) D = 1], No, E[Y (1) D = 0] E[Y (1) D = 1] is the baseline bias. 不同組的人在未接受 treatment 前是一樣的嗎? Yes, E[Y (0) D = 1] = E[Y (0) D = 0] No, E[Y (0) D = 1] E[Y (0) D = 0] is the differential effect bias Then, E[Y (1) D = 1] E[Y (0) D = 0] = E(τ) + π[e(y (0) D = 1) E(Y (0) D = 0)] + (1 π)[e(τ D = 1) E(τ D = 0)] Naive Estimate = average causal effect + baseline bias + differential effect bias Fundamental assumptions: Unconfoundedness and Common Support Assumption 1: Unconfoundedness: Y (0), Y (1) D X Given a set of observable covariates, X, which is not affected by treatment, potential outcomes are independent of treatment assignment. This implies that all variables that influence treatment assignment and potential outcomes simultaneously have to be observed by the researchers. Unconfoundedness is also called selection on observable or conditional independence. Assumption 2: Overlap: 0 < P (D = 1 X) < 1 Persons with the same X values have a positive probability of being participants and nonparticipants. Assumption 3: Unconfoundedness for controls: Y (0) D X Assumption 4: Weak overlap: P (D = 1 X) < 1. 4

Troubles: as the dimension of X increases, the unconfoundedness is difficult to hold. Rosenbaum and Rubin (1983) suggested using balancing score b(x). The propensity score, P (D = 1 X) = P (X), the probability for an individual to participate in a treatment given his observed covariates X, is one balancing score. Corollary 1. Unconcernedness given the propensity score: Y (0), Y (1) D P (X) Estimation strategy τ P SM AT T = E P (X) D=1 (E(Y (1) D = 1, P (X)) E(Y (0) D = 1, P (X)) PSM estimator is the mean difference in outcomes over the common support, appropriately weighted by the propensity score distribution of participants. 3 Implementation of Propensity Score Matching 3.1 Estimating the propensity score Two choices: 1. Model to be used for the estimation 2. Variables to be included in this model Model choice - Binary Treatment logit model probit model linear probability model Model choice - Multiple treatments multinominal probit model multinominal logit model Series of binomial model linear probability model variable choice Omitting important variables can seriously increase bias in the estimation. 5

Only variables that influence simultaneously the participation decision and the outcome variable should be included. Only variables unaffected by participation should be included in the model. participants and nonparticipants should stem from the same source (dataset). Should avoid including too many variables as execrates the support problem and increases the variance. 3.2 Steps of Implementation PSM Step 0: Decide between PSM and CVM (covariate matching) Step 1: Propensity Score estimation Step 2: Choose matching algorithm Step 3: Check overlap/common support Step 4: matching quality/effect estimation Step 5: sensitivity analysis 3.3 Matching algorithm Distance measures 1. Exact: 2. Mahalanobis: 0 if X i = X j M ij = if X i X j M ij = (X i X j ) Σ 1 (X i X j ) where Σ is the covariance matrix of X in the full control group. 3. Propensity score: 4. Linear propensity score: M ij = e i e j M ij = logit(e i ) logit(e j ) 6

5. Fine balance: (Z i Z j ) Σ 1 (Z i Z j ) if logit(e i ) logit(e j ) c M ij = if logit(e i ) logit(e j ) > c 6. Prognosis score The predicted outcome each individual would have under the control condition. where c is the caliper. 1. Nearest neighbor matching M i = min j P i P j, j I 0 nonparticipant with the value of M j that is closet to P i is selected as the match. each person in the treatment group choose individual(s) with the closest propensity score to them can do this with (most common) or without replacement not very efficient as discarding a lot of information from the control group 2. Kernel based matching each person in the treatment group is matched to a weighted sum of individuals who have similar propensity scores with greatest weight being given to people with closer scores Some kernel based matching use ALL people in non-treated group (e.g. Gaussian kernel) whereas others only use people within a certain probability user-specified bandwidth (e.g. Epanechnikov Choice of bandwidth involves a trade-off of bias with precision 3. Caliper matching A match for person i is selected only if M i M j < ɛ, j I 0 where ɛ prespecified tolerance, usually.25σ m. 1-to-1 Nearest neighbor within caliper 1-to-n Nearest neighbor within caliper 7

4. Radius matching 1-NN only or more 5. Stratification and interval Group sample into five categories based on propensity score (quintiles). Within each quintile, calculate mean outcome for treated and nontreated groups. Estimate the mean difference (average treatment effects) for the whole sample (i.e., all five groups) and variance using the following equations: ˆδ K k=1 Number of strata intervals n k N [Ȳ0k Ȳ1k], V ar(ˆδ) = K ( n k k=1 N )2 V ar[ȳ0k Ȳ1k] 6. Mahalanobis matching M ij = (X i X j ) Σ 1 (X i X j ) Mahalanobis metric matching without p-score Mahalanobis metric matching with p-score added (to X i and X j ) 7. Local linear regression matching 8. Spline matching 3.4 So what does PSM do? Propensity score is the probability of taking treatment given a vector of observed variables. p(x) = P r[d = 1 X = x] If we take individuals with the same propensity score, and divide them into two groups- those who were and weren t treated-the groups will be approximately balanced on the variables predicting the propensity score. Among those with the same predicted probability of treatment ˆp, those who get treated and not treated differ only on their error term in the propensity score equation. But this error term is approximately independent of the X s. The treatment assignment Dis independent of Y, given the strata created by X s. This is why balancing should occur. Y D X 8

Common support: the overlap condition for persons with the same x value in X are allowed to have a positive probability of being in treated and control groups. We only make inferences where we have sufficient data. Unlike ordinary regression, we dont extrapolate outside the range of the observed data points. Gives us weights for the control group to make them look as similar as possible in terms of X s as treatment group Nearest neighbor PSM these weights are integers Other methods non-integers Sum of weights for control group sums to number of observations in treatment group Use weighted difference in mean outcomes between treatment and control group to find effect So only have to matching once to find impact of treatment on all outcomes of interest - always use same weights 3.5 So how do we choose best method? If matching has worked, then none of the X s should differ between control and treatment group So do another weighted probit/logit and check this is the case - If PSM has worked - none of the X s should be significant in determining whether you are in the treatment group Check to see whether there are any significant differences in the weighted means of X s between pilot and control areas (simple t-test) Usually find that one method works better than the rest But sometimes find that groups are just too different and no matching methods can come up with plausible weights Check to see if some flexible regression method gives you same answer as preferred matching method 9

3.6 Imposing Common Support In order for matching to be valid we need to observe participants and nonparticipants with the same range of characteristics - i.e for all values of characteristics X there are treated and non-treated individuals If this cannot be achieved - treated units whose p is larger than the largest p in the non-treated pool are left unmatched 4 A practical example 關秉寅 李敦義 (2008) 補習數學有用嗎? 一個 反事實 的分析 台灣社會學刊,41,97-148 研究使用的資料 :TEPS 國中樣本 ( 公開使用版 ) 2001(N = 13,978 ) 2003(N = 13,247 ) 分析樣本 : 公立國中生 ;with common support (N = 10,013 ) 應變項 : 國三數學能力 IRT, 轉換成 NCE (normal curve equivalence) 分數 (Range: 1-99; Mean: 50; S.D.: 21.06) 自變項 (Treatment): 國三補習數學 26 個配對變項 : 個人特性及學習特質 : 性別 補習經驗 W1 數學 IRT 等 (motivation, ability) 家庭背景 : 父母教育程度 職業 教育期待等班級 / 學校學習氣氛及環境分析策略 : 只研究國三數學補習的效果國三補習主要是為了準備基測學校教育對數學能力的培養比較有影響力以階層性模型探討國三補習的參與及補習效果, 以瞭解過往補習研究可能有的限制, 以及比較 OLS 及 PSM 兩者估計 ATE 可能有的差異比較有及沒有 W1 數學 IRT 做為配對變項的差異誰參加國三數學補習? 個人特性及學習特質 : 先備能力較佳 過去沒補習經驗者 回家會複習功課 自己沒有補習的意願家庭背景 : 非原住民 與雙親同住 父母不是研究所學歷 白領職業 高收入 高父母教育期望 手足人數少班級 / 學校情況 : 位於在都市化程較高地區 班上讀書風氣盛 學業競爭激烈程度高國三補習數學有用嗎? Gross effect (OLS): 12.243( 分析樣本 with common support) After controlling all matching variables (OLS): 3.017 - an estimate of ATE PSM results (all matching variables included): Total population (ATE): 2.956 Treated (ATT): 2.258 Untreated (ATU): 3.580 PSM 的 ATE 估計大多比 OLS 的估計小 ATT 比 ATU 小都會受到未納入重要自變項的影響針對國三補習數學者與從未補習者配對成功的 4689 對中進行敏感度分析的結果是, 如果未觀察到之變項對影響補習參與與否的影響力, 即 Gamma (γ), 是介於 1.25 到 1.35 時, 就可能會變成不顯著 取這些數值的自然對數, 則分別為 0.223 及 0.300, 如與實際配對變項對是否補習之邏輯迴歸係數比較, 則此數值大約是完整家庭 (nuintact) 的係數 (.226) 也就是說, 這未觀察到變項對補習與否的影響力至少要像完整家庭一樣大, 才會影響 ATT 的變化 10

5 History In 1983, Rosenbaum and Rubin published their seminal paper that first proposed this approach.. From the 1970s, Heckman and his colleagues focused on the problem of selection biases, and traditional approaches to program evaluation, including randomized experiments, classical matching, and statistical controls. Heckman later developed Difference-in-differences method 6 Skeptics Howard Bloom, MDRC. Sees PSM as a somewhat improved version of simple matching, but with many of the same limitations. Inclusion of propensity scores can help reduce large biases, but significant biases may remain. Local comparison groups are best- PSM is no miracle maker (it cannot match unmeasured contextual variables). Short-term biases (2 years) are substantially less than medium term (3 to 5 year) biases- the value of comparison groups may deteriorate Michael Sosin, University of Chicago. Strong assumption that untreated cases were not treated at random. Argues for using multiple methods and not relying on PSM 7 Limitations Limitations of Propensity Scores. Large samples are required. Group overlap must be substantial. Hidden bias may remain because matching only controls for observed variables (to the extent that they are perfectly measured) (Shadish, Cook, & Campbell, 2002) 8 Criteria Identify treatment and comparison groups with substantial overlap. Match, as much as possible, on variables that are precisely measured and stable (to avoid extreme baseline scores that will regress toward the mean). Use a composite variable- e.g., a propensity score- which minimizes group differences across many scores 9 Risk They may undermine the argument for experimental designs- n argument that is hard enough to make, now. They may be used to act as if a panel survey is an experimental design, overestimating the certainty of findings based on the PSM. 11

The crucial difference of PSM from conventional matching: match subjects on one score rather than multiple variables: the propensity score is a monotone function of the discriminant score (Rosenbaum & Rubin, 1984). Run Logistic Regression:. Dependent variable: Y=1, if participate; Y = 0, otherwise.. Choose appropriate conditioning (instrumental) variables.. Obtain propensity score: predicted probability (p) or log[p/(1-p)].general Procedure Multivariate Match Each Participant to One or More Nonparticipation Propensity Score..Nearest neighbor matching..caliper matching..mahalanobis metric matching in conjunction with PSM..Stratification matching..difference-in-differences matching (kernel & local linear weights) Sources 關秉寅 (2010), 因果推論新思維 : 反事實分析架構, A New Paradigm for Causal Inference: The Counterfactual Framework, available at http://www3.nccu.edu.tw/ soci1005/counterfactuals%20and%20psm.ppt Caliendo, Marco and Sabine Kopeinig, SOME PRACTICAL GUIDANCE FOR THE IM- PLEMENTATION OF PROPENSITY SCORE MATCHING, Journal of Economic Surveys 2008, 22, 31-2. Dehejia, Rajeev H. and Sadek Wahba PROPENSITY SCORE-MATCHING METHODS FOR NONEXPERIMENTAL CAUSAL STUDIES, The Review of Economics and Statistics, 2002, 84, 151-161. Guo, S, R. Barth, and C. Gibbons (2004), Introduction to Propensity Score Matching: A New Device for Program Evaluation, available at http://ssw.unc.edu/vrc/lectures/psm_sswr_2004.pdf Stuart, Elizabeth A., Matching methods for causal inference: A review and a look forward, To appear in Statistical Science 12