SUPPLEMENTARY APPENDIX MATERIALS [not intended for publication]

Similar documents
Identify the scale of measurement most appropriate for each of the following variables. (Use A = nominal, B = ordinal, C = interval, D = ratio.

Tables and Figures. This draft, July 2, 2007

Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011)

ECON Introductory Econometrics. Lecture 17: Experiments

GROWING APART: THE CHANGING FIRM-SIZE WAGE PREMIUM AND ITS INEQUALITY CONSEQUENCES ONLINE APPENDIX

Ron Heck, Fall Week 3: Notes Building a Two-Level Model

Introducing Generalized Linear Models: Logistic Regression

Presented at ESRI Education User Conference, July 6-8, 2001, San Diego, CA

Dover- Sherborn High School Mathematics Curriculum Probability and Statistics

ECON Introductory Econometrics. Lecture 4: Linear Regression with One Regressor

SUPPLEMENT TO DEMOCRACY, REDISTRIBUTION, AND POLITICAL PARTICIPATION: EVIDENCE FROM SWEDEN (Econometrica, Vol. 82, No. 3, May 2014, )

PhD/MA Econometrics Examination January 2012 PART A

Class Notes: Week 8. Probit versus Logit Link Functions and Count Data

Tutorial 6: Tutorial on Translating between GLIMMPSE Power Analysis and Data Analysis. Acknowledgements:

14.74 Lecture 10: The returns to human capital: education

WELCOME! Lecture 14: Factor Analysis, part I Måns Thulin

Analysis of Panel Data: Introduction and Causal Inference with Panel Data

Technical Appendix C: Methods

Estimating a Piecewise Growth Model with Longitudinal Data that Contains Individual Mobility across Clusters

Using Decision Trees to Evaluate the Impact of Title 1 Programs in the Windham School District

Heteroskedasticity ECONOMETRICS (ECON 360) BEN VAN KAMMEN, PHD

Product and Inventory Management (35E00300) Forecasting Models Trend analysis

Physics Observational Methods in Astronomy

Introduction to Statistical Inference

Multiple Regression Analysis: Inference ECONOMETRICS (ECON 360) BEN VAN KAMMEN, PHD

Small-Sample Methods for Cluster-Robust Inference in School-Based Experiments

Least Squares Estimation of a Panel Data Model with Multifactor Error Structure and Endogenous Covariates

Midterm Exam 1, section 1. Tuesday, February hour, 30 minutes

MVE055/MSG Lecture 8

Passing-Bablok Regression for Method Comparison

Basic Linear Model. Chapters 4 and 4: Part II. Basic Linear Model

SREE WORKSHOP ON PRINCIPAL STRATIFICATION MARCH Avi Feller & Lindsay C. Page

PSY 250 8/29/2011. Sampling. Choosing your Sample. Sampling: Selecting Research Participants. Selecting a sample of participants from the population

Syllabus for MATHEMATICS FOR INTERNATIONAL RELATIONS

The World Bank Pakistan: Third Punjab Education Sector Project (P154524)

HighSchoolMathTeachers 2017 Page 1

Regression Discontinuity Design Econometric Issues

Tests for the Odds Ratio in a Matched Case-Control Design with a Quantitative X

Regression Discontinuity

Fixed Effects, Invariance, and Spatial Variation in Intergenerational Mobility

Sampling and Sample Size. Shawn Cole Harvard Business School

Rhode Island World-Class Standards Science Grade: K - Adopted: 2006

Gov 2002: 3. Randomization Inference

Algebra 1 Part 1 Curriculum Guide Dunmore School District Dunmore, PA

Overview. Overview. Overview. Specific Examples. General Examples. Bivariate Regression & Correlation

Tables and Figures to Accompany

ESSENTIAL CONCEPTS AND SKILL SETS OF THE IOWA CORE CURRICULUM

Discrete distribution. Fitting probability models to frequency data. Hypotheses for! 2 test. ! 2 Goodness-of-fit test

INTERVAL ESTIMATION OF THE DIFFERENCE BETWEEN TWO POPULATION PARAMETERS

STANDARDS OF LEARNING CONTENT REVIEW NOTES. ALGEBRA I Part I. 2 nd Nine Weeks,

EXAMINATION: QUANTITATIVE EMPIRICAL METHODS. Yale University. Department of Political Science

Causal Inference with Big Data Sets

YouGov Survey Results

ECNS 561 Multiple Regression Analysis

MATH 135 PRE-CALCULUS: ELEMENTARY FUNCTIONS COURSE SYLLABUS FALL 2012

DO NOT CITE WITHOUT AUTHOR S PERMISSION:

Supplemental Appendix to "Alternative Assumptions to Identify LATE in Fuzzy Regression Discontinuity Designs"

Online Appendix for Training Aspiring Entrepreneurs to Pitch Experienced Investors

AP STATISTICS: Summer Math Packet

The Sun Life Financial Canadian Open Mathematics Challenge Wednesday, November 2, 2011

Uptake of IGCSE subjects 2012

The long term eects of early educational selection

Who, What, Where and How

Instrumental Variables

Regression Discontinuity

emerge Network: CERC Survey Survey Sampling Data Preparation

emerge Network: CERC Survey Survey Sampling Data Preparation

Differences in Differences (Dif-in and Panel Data. Technical Session III. Manila, December Manila, December Christel Vermeersch

Addressing Analysis Issues REGRESSION-DISCONTINUITY (RD) DESIGN

Random Intercept Models

Simple Linear Regression: One Quantitative IV

Energy and U.S. Consumers

Step 2: Select Analyze, Mixed Models, and Linear.

Regression Discontinuity


Sample Items Grade 7 MATHEMATICS

8. AN EVALUATION OF THE URBAN SYSTEMIC INITIATIVE AND OTHER ACADEMIC REFORMS IN TEXAS: STATISTICAL MODELS FOR ANALYZING LARGE-SCALE DATA SETS

Technical Review of the Analytical Benefits to be Gained from Collecting Staff-Level Data on ECEC

Combining Experimental and Non-Experimental Design in Causal Inference

Analysis of 2x2 Cross-Over Designs using T-Tests

Multiple Regression Analysis: Inference MULTIPLE REGRESSION ANALYSIS: INFERENCE. Sampling Distributions of OLS Estimators

A Study of Statistical Power and Type I Errors in Testing a Factor Analytic. Model for Group Differences in Regression Intercepts

SONOMA COUNTY ASTRONOMICAL SOCIETY

Math 1040 Final Exam Form A Introduction to Statistics Fall Semester 2010

An Introduction to Econometrics. A Self-contained Approach. Frank Westhoff. The MIT Press Cambridge, Massachusetts London, England

Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12)

Algebra II. Unit 2 Suggested Time Frame. Analyzing Functions and Absolute Value. 19 Days

Complying with Title IX

Algebra 1 Part 2 K Curriculum Guide Dunmore School District Dunmore, PA

HB Methods for Combining Estimates from Multiple Surveys

Gov 2002: 9. Differences in Differences

PSY 250. Sampling. Representative Sample. Representativeness 7/23/2015. Sampling: Selecting Research Participants

Dengue Forecasting Project

Regression Discontinuity

Can a Pseudo Panel be a Substitute for a Genuine Panel?

Probability and Inference. POLI 205 Doing Research in Politics. Populations and Samples. Probability. Fall 2015

Approximately 45 minutes

Multiple group models for ordinal variables

Quantitative Analysis II PUBPOL 528C Spring 2017

Lecture 8 CORRELATION AND LINEAR REGRESSION

Transcription:

SUPPLEMENTARY APPENDIX MATERIALS [not intended for publication] Appendix Figure A1: Schooling and democracy trends in Kenya Panel A: Proportion of population completing primary schooling in Kenya, by birth cohort and gender (190-1980 birth cohorts, source: 1999 Kenya Census) Panel B: Freedom House Political Rights Index in Kenya and sub-saharan Africa (1973-008), where 1 denotes full democracy A1

Appendix Figure A: Girls Scholarship Program (GSP) Timeline (000-007) November 000: Baseline academic tests for cohorts 1 and in Busia District taken January 001: Schools invited to participate, lists of students enrolled in grades 5 and 6 in all program schools compiled (N schools=69, N students=39) March 001: Randomization of schools into treatment and control groups (using a computer random number generator) Treatment (N schools=34, N students=1640) Control (N schools=35, N students=165) March 001: Announcement of scholarship program in treatment schools; September and October 001: Reminder meeting for parents Ongoing student and teacher attendance checks November 001: Follow-up achievement tests for cohort 1 (N schools=69, N students=111); winners announced and scholarship awards distributed May-July 00: Background individual surveys collected for students in both cohorts November 00: Follow up achievement tests for cohort (N schools=69, N students=89); winners announced and scholarship awards distributed 005-007: Follow up individual surveys collected (N students=1756, and 1387 with complete data matched to 001 and 00 data) A

Appendix Table A1: Comparing estimates in the full sample, with and without covariates Coefficient Coefficient estimate (s.e.) on estimate (s.e.) on program indicator program indicator, OLS without covariates Dependent variable: (1) () Lack of autonomy mean effect -0.181 ** -0.199 ** (0.077) (0.077) Ethnic identity is not very important (0-1) -0.033 * -0.035 * (0.00) (0.00) Religious identity is not very important (0-1) 0.005 0.004 (0.006) (0.006) Democratic attitudes mean effect 0.03 0.041 (0.098) (0.096) Days read a newspaper in last week (0 to 7) 0.134 * 0.176 ** (0.074) (0.081) Political knowledge mean effect 0.03 ** 0.8 *** (0.085) (0.084) Satisfaction with authority mean effect -0.39 *** -0.51 *** (0.061) (0.064) Perceived political efficacy mean effect 0.055 0.043 (0.066) (0.067) Participation in politics and civic affairs mean effect -0.038-0.033 (0.073) (0.073) Agree with It is sometimes necessary to use violence in support of a just cause. (0-1) 0.059 ** 0.054 * (vs. The use of violence is never justified in politics. ) (0.09) (0.09) Notes: Each cell contains results from a separate regression. Significant at 90% (*), 95% (**), 99% (***) confidence. Details on the mean effect analysis are in the text. Standard errors are clustered by school. The sample is N=1,387 for all dependent variables. In column 1, the outcome variable is regressed on the GSP program (treatment) indicator, an indicator for student cohort, student age at time of the survey, educational attainment of each parent, and timing of the follow-up survey (coefficient estimates not shown). In column, the specification drops the student age at time of the survey, and educational attainment of each parent covariates. A3

Appendix Table A: Comparing estimates in the full sample vs. among those with low baseline test scores Coefficient estimate (s.e.) on program indicator OLS Coefficient estimate (s.e.) on program indicator, baseline test score < + s.d. Dependent variable: (1) () Lack of autonomy mean effect -0.04 ** -0.176 * (0.088) (0.091) Ethnic identity is not very important (0-1) -0.04-0.04 (0.04) (0.05) Religious identity is not very important (0-1) -0.007-0.005 (0.007) (0.007) Democratic attitudes mean effect -0.037-0.077 (0.098) (0.096) Days read a newspaper in last week (0 to 7) 0.109 0.116 (0.097) (0.105) Political knowledge mean effect 0.109 0.107 (0.094) (0.097) Satisfaction with authority mean effect -0.181 ** -0.147 (0.089) (0.088) Perceived political efficacy mean effect 0.083 0.086 (0.077) (0.081) Participation in politics and civic affairs mean effect -0.144-0.138 (0.096) (0.098) Agree with It is sometimes necessary to use violence in support of a just cause. (0-1) 0.03 0.09 (vs. The use of violence is never justified in politics. ) (0.036) (0.037) Notes: Each cell contains results from a separate regression. Significant at 90% (*), 95% (**), 99% (***) confidence. Details on the mean effect analysis are in the text. Standard errors are clustered by school. The sample in column 1 consists of all individuals with baseline test score data (N=919). In column, the sample is restricted to individuals with baseline normalized test scores less than + s.d. (N=881). In columns 1 and, the outcome variable is regressed on the GSP program (treatment) indicator, an indicator for student cohort, student age at time of the survey, educational attainment of each parent, and timing of the follow-up survey (coefficient estimates not shown). A4

Appendix Table A3: Effects in schools with different numbers of predicted GSP winners Coefficient estimate (s.e.) on program indicator OLS Coefficient estimate (s.e.) on interaction between the program indicator and predicted number of winners Dependent variable: (1) () Lack of autonomy mean effect -0.193 ** 0.005 (0.096) (0.009) Ethnic identity is not very important (0-1) -0.039 ** 0.005 ** (0.018) (0.00) Religious identity is not very important (0-1) -0.003-0.0004 (0.007) (0.0006) Democratic attitudes mean effect -0.040 0.009 (0.100) (0.013) Days read a newspaper in last week (0 to 7) 0.113-0.004 (0.077) (0.007) Political knowledge mean effect 0.05 ** 0.006 (0.095) (0.009) Satisfaction with authority mean effect -0.04 *** -0.001 (0.063) (0.005) Perceived political efficacy mean effect 0.070 0.00 (0.068) (0.008) Participation in politics and civic affairs mean effect -0.0-0.008 (0.073) (0.009) Agree with It is sometimes necessary to use violence in support of a just cause. 0.045 0.007 ** (0-1) (vs. The use of violence is never justified in politics. ) (0.07) (0.003) Notes: Each row contains results from a separate regression. Significant at 90% (*), 95% (**), 99% (***) confidence. Details on the mean effect analysis are in the text. Standard errors are clustered by school. The sample is N=1,369 for all dependent variables. The outcome variable is regressed on the GSP program (treatment) indicator, the GSP indicator interacted with the predicted number of GSP winners, an indicator for student cohort, student age at time of the survey, educational attainment of each parent, and timing of the follow-up survey (coefficient estimates not shown). The predicted number of GSP winners is estimated in treatment group schools by regressing the actual number of scholarship winners in the school on various quantiles of the baseline test score distribution for students in that school; the predicted number of winners is then assigned to both treatment and control schools. A5

Appendix B: Model appendix This appendix contains some of the algebra and analysis underlying the discussion of the model in section 6. Equation 7 implies that, in the absence of a merit scholarship program (as in our control group), the variance of education will be equal to: (eqn. A1) σ = σ B + β 1σ R H. 4 This implies that the willingness to accept authority for person i at time 1 will be: (eqn. A) RR 1,ii = RR 0,ii + γγ BB 0,ii+ββ 1 RR 0,ii +ββ MM ii. In the absence of a merit scholarship program, the variance of R1,i (suppressing subscripts) is: (eqn. A3) VVVVVV(RR 1 ) σσ RR1 = VVVVVV(RR 0 + γγhh ) = VVVVVV(RR 0) + γγ VVVVVV(HH ) + CCCCCC(RR 0, γγhh ) = σσ RR + γγ σσ HH + CCCCCC(RR 0, γγ BB 0+ββ 1 RR 0 ) = σσ RR + γγ σσ HH + [0 + CCCCCC(RR 0, γγ ββ 1RR 0 ) = σσ RR (1 + γγββ 1 ) + γγ σσ HH In the absence of a merit scholarship program, the covariance between H * and R1 will be: (eqn. A4) CCCCCC(RR 1, HH ) = CCCCCC(RR 0 + γγ BB 0+ββ 1 RR 0, BB 0+ββ 1 RR 0 ) = CCCCCC(RR 0, BB 0 ) + CCCCCC(RR 0, ββ 1RR 0 ) + γγγγγγγγ(bb 0+ββ 1 RR 0 ) = ββ 1σσ RR +γγσσ BB +γγββ 1 σσ RR 4 Taking the difference between equations 8 and 9 in section 6 allows us to estimate the bias, and solve for β1 in terms of known parameters: β (eqn. A5) γ OLS γ IV = 1 σ R σ B + β 1 σ R A6

This implies β 1 is (eqn. A6) β 1 = ( γ γ OLS IV )var(h * ) σ R It is possible to solve for β 1 based on observed parameters. To do so, we first rewrite equation A3 and then substitute the variance of R 0 into equation A6: (eqn. A7) (eqn. A8) To simplify let z = σ R = σ R 1 γ σ H ( 1+ γβ 1 ) β 1 = γ ( OLS γ IV )σ H σ R 1 γ σ H ( ) 1 + γβ 1 γ ( OLS γ IV )σ H σ R 1 γ σ H, (eqn. A9) ββ 1 = zz(1 + γγββ 1 ) Solving for β1 yields: (eqn. A10) 1 β 1 = 1 z γ Re-writing yields equation 10 in the text. A7