Statistical Analysis of the Item Count Technique

Similar documents
Statistical Analysis of List Experiments

Eliciting Truthful Answers to Sensitive Survey Questions: New Statistical Methods for List and Endorsement Experiments

Eliciting Truthful Answers to Sensitive Survey Questions: New Statistical Methods for List and Endorsement Experiments

STA 303 H1S / 1002 HS Winter 2011 Test March 7, ab 1cde 2abcde 2fghij 3

Lecture 14: Introduction to Poisson Regression

Modelling counts. Lecture 14: Introduction to Poisson Regression. Overview

Item Response Theory for Conjoint Survey Experiments

High Dimensional Propensity Score Estimation via Covariate Balancing

Causal Interaction in Factorial Experiments: Application to Conjoint Analysis

Dynamics in Social Networks and Causality

Covariate Balancing Propensity Score for General Treatment Regimes

Binary Logistic Regression

Chapter 4: Factor Analysis

Generalized Linear Models for Non-Normal Data

Econometrics Problem Set 10

Identification and Inference in Causal Mediation Analysis

Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes

The Logit Model: Estimation, Testing and Interpretation

Unpacking the Black-Box: Learning about Causal Mechanisms from Experimental and Observational Studies

Chapter 11. Regression with a Binary Dependent Variable

ECONOMETRICS HONOR S EXAM REVIEW SESSION

Models for Heterogeneous Choices

Binomial Model. Lecture 10: Introduction to Logistic Regression. Logistic Regression. Binomial Distribution. n independent trials

Lecture 10: Introduction to Logistic Regression

Statistical Analysis of Causal Mechanisms

CHAPTER 1: BINARY LOGIT MODEL

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018

A Mixture Modeling Approach for Empirical Testing of Competing Theories

STA6938-Logistic Regression Model

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

Econometrics Honor s Exam Review Session. Spring 2012 Eunice Han

Instrumental Variables

Goals. PSCI6000 Maximum Likelihood Estimation Multiple Response Model 1. Multinomial Dependent Variable. Random Utility Model

ST3241 Categorical Data Analysis I Multicategory Logit Models. Logit Models For Nominal Responses

Statistical Distribution Assumptions of General Linear Models

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Statistical Analysis of Causal Mechanisms

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score

Statistics 135 Fall 2008 Final Exam

On the Use of Linear Fixed Effects Regression Models for Causal Inference

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

1. Capitalize all surnames and attempt to match with Census list. 3. Split double-barreled names apart, and attempt to match first half of name.

WISE International Masters

Binary Dependent Variable. Regression with a

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Homework 1 Solutions

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

Comparison between conditional and marginal maximum likelihood for a class of item response models

Linear Regression With Special Variables

Statistical Analysis of Causal Mechanisms

Causal Interaction in Factorial Experiments: Application to Conjoint Analysis

Hypothesis testing. Data to decisions

Can a Pseudo Panel be a Substitute for a Genuine Panel?

Log-linear Models for Contingency Tables

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

Chapter 6. Logistic Regression. 6.1 A linear model for the log odds

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

Chapter 1 Statistical Inference

Lecture 41 Sections Mon, Apr 7, 2008

22s:152 Applied Linear Regression. Example: Study on lead levels in children. Ch. 14 (sec. 1) and Ch. 15 (sec. 1 & 4): Logistic Regression

Probability and Statistics Notes

Mixed Models for Longitudinal Ordinal and Nominal Outcomes

What is Latent Class Analysis. Tarani Chandola

Econometric Modelling Prof. Rudra P. Pradhan Department of Management Indian Institute of Technology, Kharagpur

Panel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Statistical Analysis of Causal Mechanisms for Randomized Experiments

Generalized logit models for nominal multinomial responses. Local odds ratios

Investigating Models with Two or Three Categories

Multilevel Statistical Models: 3 rd edition, 2003 Contents

STA 216, GLM, Lecture 16. October 29, 2007

Introduction to Statistical Inference

Model Estimation Example

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling

POLI 7050 Spring 2008 February 27, 2008 Unordered Response Models I

IEOR165 Discussion Week 5

disc choice5.tex; April 11, ffl See: King - Unifying Political Methodology ffl See: King/Tomz/Wittenberg (1998, APSA Meeting). ffl See: Alvarez

Hierarchical Generalized Linear Models. ERSH 8990 REMS Seminar on HLM Last Lecture!

BIOS 625 Fall 2015 Homework Set 3 Solutions

Exam Applied Statistical Regression. Good Luck!

Ecological Inference

Latent classes for preference data

Introduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution

Finding the Gold in Your Data

Regression Discontinuity Designs

Loglikelihood and Confidence Intervals

Truncation and Censoring

Anders Skrondal. Norwegian Institute of Public Health London School of Hygiene and Tropical Medicine. Based on joint work with Sophia Rabe-Hesketh

ECON Introductory Econometrics. Lecture 11: Binary dependent variables

Advanced Quantitative Research Methodology Lecture Notes: January Ecological 28, 2012 Inference1 / 38

General structural model Part 2: Categorical variables and beyond. Psychology 588: Covariance structure and factor models

Semiparametric Generalized Linear Models

We know from STAT.1030 that the relevant test statistic for equality of proportions is:

Psych 230. Psychological Measurement and Statistics

where Female = 0 for males, = 1 for females Age is measured in years (22, 23, ) GPA is measured in units on a four-point scale (0, 1.22, 3.45, etc.

Logistic Regression. Some slides from Craig Burkett. STA303/STA1002: Methods of Data Analysis II, Summer 2016 Michael Guerzhoy

EXAMINATION: QUANTITATIVE EMPIRICAL METHODS. Yale University. Department of Political Science

Potential Outcomes Model (POM)

High-Throughput Sequencing Course

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

Transcription:

Statistical Analysis of the Item Count Technique Kosuke Imai Department of Politics Princeton University Joint work with Graeme Blair May 4, 2011 Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 1 / 29

Motivation Validity of much empirical social science research relies upon accuracy of self-reported behavior and beliefs Challenge: eliciting truthful answers to sensitive survey questions e.g., racial prejudice, corruptions, fraud, support for militant groups Social desirability bias, privacy and safety concerns Lies and non-responses Solution: Use indirect rather than direct questioning 1 Randomization: Randomized response technique 2 Aggregation: Item count technique (list experiment) Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 2 / 29

Item Count Technique: An Example The 1991 National Race and Politics Survey (Sniderman et al.) Randomize the sample into the treatment and control groups The script for the control group Now I m going to read you three things that sometimes make people angry or upset. After I read all three, just tell me HOW MANY of them upset you. (I don t want to know which ones, just how many.) (1) the federal government increasing the tax on gasoline; (2) professional athletes getting million-dollar-plus salaries; (3) large corporations polluting the environment. Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 3 / 29

Item Count Technique: An Example The 1991 National Race and Politics Survey (Sniderman et al.) Randomize the sample into the treatment and control groups The script for the treatment group Now I m going to read you four things that sometimes make people angry or upset. After I read all four, just tell me HOW MANY of them upset you. (I don t want to know which ones, just how many.) (1) the federal government increasing the tax on gasoline; (2) professional athletes getting million-dollar-plus salaries; (3) large corporations polluting the environment; (4) a black family moving next door to you. Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 4 / 29

Methodological Challenges Item count technique is becoming popular: Kuklinski et al., 1997a,b; Sniderman and Carmines, 1997; Gilens et al., 1998; Kane et al., 2004; Tsuchiya et al., 2007; Streb et al., 2008; Corstange, 2009; Flavin and Keane, 2010; Glynn, 2010; Gonzalez-Ocantos et al., 2010; Holbrook and Krosnick, 2010; Janus, 2010; Redlawsk et al., 2010; Coutts and Jann, 2011 Standard practice: Use difference-in-means to estimate the proportion of those who answer yes to sensitive item Getting more out of item count technique: 1 Who are more likely to answer yes? 2 Who are answering differently to direct and indirect questioning? 3 Can we detect failures of item count technique? 4 Can we correct violations of key assumptions? Recoup the efficiency loss due to indirect questioning Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 5 / 29

Overview of the Project Goals: 1 Develop multivariate regression analysis methodology 2 Develop statistical tests to detect failures of item count technique 3 Develop methods to correct deviations from key assumption 4 Develop open-source software to implement the methods 5 Applications in Afghanistan (joint work with J. Lyall) References: 1 Imai, K. (in-press) Multivariate Regression Analysis for the Item Count Technique. Journal of the American Statistical Association 2 Blair, G. and K. Imai. Statistical Analysis of List Experiments. 3 Blair, G. and K. Imai. list: Statistical Methods for the Item Count Technique and List Experiment available at http://cran.r-project.org/package=list Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 6 / 29

Notation and Setup J: number of control items N: number of respondents T i : binary treatment indicator (1 = treatment, 0 = control) X i : pre-treatment covariates Latent potential responses to each item 1 Z ij (0): potential response to the jth item under control 2 Z ij (1): potential response to the jth item under treatment Zij : truthful latent response to the jth item Potential responses to list question 1 Y i (0) = J j=1 Z ij(0): potential response under control 2 Y i (1) = J+1 j=1 Z ij(1): potential response under treatment Y i = Y i (T i ): observed response Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 7 / 29

Identification Assumptions 1 Randomization of the Treatment 2 No Design Effect: The inclusion of the sensitive item does not affect answers to control items J Z ij (0) = j=1 J Z ij (1) j=1 3 No Liars: Answers about the sensitive item are truthful Z i,j+1 (1) = Z i,j+1 Under these assumptions, the diff-in-means estimator is unbiased Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 8 / 29

New Multivariate Regression Estimators Generalize the difference-in-means estimator The Linear least squares model: Y i = Xi γ + T }{{} i Xi δ +ɛ }{{} i control sensitive Further, generalize the estimator The nonlinear least squares model: Y i = f (X i, γ) + T i g(x i, δ) + ɛ i Logit models: f (x, γ) = J logit 1 (x γ) and g(x, δ) = logit 1 (x δ) Two-step estimation with standard error via method of moments Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 9 / 29

Extracting More Information from the Data Define a type of each respondent by (Y i (0), Zi,J+1 ) Y i (0): total number of yes for control items {0, 1,..., J} Z i,j+1 : truthful answer to the sensitive item {0, 1} A total of (2 (J + 1)) types Example: three control items (J = 3) Joint distribution is identified Y i Treatment group Control group 4 (3,1) 3 (2,1) (3,0) (3,1) (3,0) 2 (1,1) (2,0) (2,1) (2,0) 1 (0,1) (1,0) (1,1) (1,0) 0 (0,0) (0,1) (0,0) Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 10 / 29

Extracting More Information from the Data Define a type of each respondent by (Y i (0), Zi,J+1 ) Y i (0): total number of yes for control items {0, 1,..., J} : truthful answer to the sensitive item {0, 1} A total of (2 (J + 1)) types Example: three non-sensitive items (J = 3) Y i Treatment group Control group 4 (3,1) 3 (2,1) (3,0) (3,1) (3,0) 2 (1,1) (2,0) (2,1) (2,0) Z i,j+1 Joint distribution is identified: 1 (0,1) (1,0) (1,1) (1,0) 0 (0,0) (0,1) (0,0) Pr(type = (y, 1)) = Pr(Y i y T i = 0) Pr(Y i y T i = 1) Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 11 / 29

Extracting More Information from the Data Define a type of each respondent by (Y i (0), Zi,J+1 ) Y i (0): total number of yes for control items {0, 1,..., J} : truthful answer to the sensitive item {0, 1} A total of (2 (J + 1)) types Example: two control items (J = 3) Y i Treatment group Control group 4 (3,1) 3 (2,1) (3,0) (3,1) (3,0) 2 (1,1) (2,0) (2,1) (2,0) Z i,j+1 Joint distribution is identified: 1 (0,1) (1,0) (1,1) (1,0) 0 (0,0) (0,1) (0,0) Pr(type = (y, 1)) = Pr(Y i y T i = 0) Pr(Y i y T i = 1) Pr(type = (y, 0)) = Pr(Y i y T i = 1) Pr(Y i < y T i = 0) Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 12 / 29

The Maximum Likelihood Estimator Model for sensitive item as before: e.g., logistic regression Pr(Z i,j+1 = 1 X i = x) = logit 1 (x δ) Model for control items given the response to sensitive item: e.g., binomial or beta-binomial logistic regression Pr(Y i (0) = y X i = x, Z i,j+1 = z) = J logit 1 (x ψ z ) Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 13 / 29

The Likelihood Function g(x, δ) = Pr(Z i,j+1 = 1 X i = x) h z (y; x, ψ z ) = Pr(Y i (0) = y X i = x, Z i,j+1 = z) : The likelihood function: (1 g(x i, δ))h 0 (0; X i, ψ 0 ) g(x i, δ)h 1 (J; X i, ψ 1 ) i J (1,0) i J (1,J+1) J {g(x i, δ)h 1 (y 1; X i, ψ 1 ) + (1 g(x i, δ))h 0 (y; X i, ψ 0 )} y=1 i J (1,y) J {g(x i, δ)h 1 (y; X i, ψ 1 ) + (1 g(x i, δ))h 0 (y; X i, ψ 0 )} y=0 i J (0,y) where J (t, y) represents a set of respondents with (T i, Y i ) = (t, y) Maximizing this function is difficult Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 14 / 29

Missing Data Framework and the EM Algorithm Consider Zi,J+1 as missing data For some respondents, Zi,J+1 is observed The complete-data likelihood has a much simpler form: N i=1 { } Z g(x i, δ)h 1 (Y i 1; X i, ψ 1 ) T i h 1 (Y i ; X i, ψ 1 ) 1 T i,j+1 i {(1 g(x i, δ))h 0 (Y i ; X i, ψ 0 )} 1 Z i,j+1 The EM algorithm: only separate optimization of g(x, δ) and h z (y; x, ψ z ) is required weighted logistic regression weighted binomial logistic regression Both can be implemented in standard statistical software Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 15 / 29

Empirical Application: Racial Prejudice in the US Kuklinski et al. (1997 JOP): Southern whites are more prejudiced against blacks than non-southern whites no New South The limitation of the original analysis: So far our discussion has implicitly assumed that the higher level of prejudice among white southerners results from something uniquely southern, what many would call southern culture. This assumption could be wrong. If white southerners were older, less educated, and the like characteristics normally associated with greater prejudice then demographics would explain the regional difference in racial attitudes Need for a multivariate regression analysis Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 16 / 29

Estimated Proportion of Prejudiced Whites Estimated Proportion / Difference 0.5 0.4 0.3 0.2 0.1 0.0 South Difference Difference South 0.1 Non South Non South Difference in Means Maximum Likelihood Linear Least Squares Nonlinear Least Squares Maximum likelihood Without Covariates With Covariates Regression adjustments and MLE yield more efficient estimates Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 17 / 29

Studying Multiple Sensitive Items The 1991 National Race and Politics Survey includes another treatment group with the following sensitive item (4) "black leaders asking the government for affirmative action" Use of the same control items permits joint-modeling Same assumptions: No Design Effect and No Liars Extension to the design with K sensitive items: EM algorithm Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 18 / 29

Regression Results How do the patterns of generational changes differ between South and Non-South? Original analysis dichotomized the age variable without controlling for other factors Sensitive Items Control Items Black Family Affirmative Action Variables est. s.e. est. s.e. est. s.e. intercept 7.575 1.539 5.270 1.268 1.389 0.143 male 1.200 0.569 0.538 0.435 0.325 0.076 college 0.259 0.496 0.552 0.399 0.533 0.074 age 0.852 0.220 0.579 0.147 0.006 0.028 South 4.751 1.850 5.660 2.429 0.685 0.297 South age 0.643 0.347 0.833 0.418 0.093 0.061 control items Y i (0) 0.267 0.252 0.991 0.264 Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 19 / 29

Generational Changes in South and Non-South Black Family Affirmative Action 1.0 1.0 0.8 0.8 South Estimated Proportion 0.6 0.4 0.2 South Estimated Proportion 0.6 0.4 0.2 Non South 0.0 Non South 0.0 20 30 40 50 60 70 80 90 20 30 40 50 60 70 80 90 Age Age Age is important even after controlling for gender and education Gender is not in contradiction with the original analysis Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 20 / 29

Measuring Social Desirability Bias The 1994 Multi-Investigator Survey (Sniderman et al.) asks list question and later a direct sensitive question: Now I m going to ask you about another thing that sometimes makes people angry or upset. Do you get angry or upset when black leaders ask the government for affirmative action? Indirect response: model using item count technique Direct response: model using logistic regression Difference: measure of social desirability bias Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 21 / 29

Differences for the Affirmative Action Item 1.0 1.0 Non College Estimated Proportion / Difference in Proportions 0.8 0.6 0.4 0.2 0.0 0.2 0.4 List Difference List Direct Direct Estimated Difference in Proportions (social desirability bias) 0.8 0.6 0.4 0.2 0.0 0.2 0.4 College Difference Non Coll Coll Democrats Republicans Independents Democrats Republicans Independents Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 22 / 29

When Can Item Count Technique Fail? Recall the two assumptions: 1 No Design Effect: The inclusion of the sensitive item does not affect answers to non-sensitive items 2 No Liars: Answers about the sensitive item are truthful Design Effect: Respondents evaluate non-sensitive items relative to sensitive item Lies: Ceiling effect: too many yeses for non-sensitive items Floor effect: too many noes for non-sensitive items Both types of failures are difficult to detect Importance of choosing non-sensitive items Question: Can these failures be addressed statistically? Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 23 / 29

Hypothesis Test for Detecting Item Count Technique Failures Under the null hypothesis of no design effect and no liars, we expect all types (y, 1) > 0 and (y, 0) > 0 π 1 = Pr(type = (y, 1)) = Pr(Y i y T i = 0) Pr(Y i y T i = 1) 0 π 0 = Pr(type = (y, 0)) = Pr(Y i y T i = 1) Pr(Y i < y T i = 0) 0 Alternative hypothesis: At least one is negative A multivariate one-sided LR test for each t = 0, 1 ˆλ t = min(ˆπ t π t ) Σ 1 π t (ˆπ t π t ), subject to π t 0, t ˆλ t follows a mixture of χ 2 Difficult to characterize least favorable values under the joint null Bonferroni correction: Reject the joint null if min(ˆp 0, ˆp 1 ) α/2 GMS selection algorithm to increase statistical power Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 24 / 29

The Racial Prejudice Data Revisited Did the negative proportion arise by chance? Observed Data Estimated Proportion of Control Treatment Respondent Types Response counts prop. counts prop. ˆπ y0 s.e. ˆπ y1 s.e. 0 8 1.4% 19 3.0% 3.0% 0.7 1.7% 0.8 1 132 22.4 123 19.7 21.4 1.7 1.0 2.4 2 222 37.7 229 36.7 35.7 2.6 2.0 2.8 3 227 38.5 219 35.1 33.1 2.2 5.4 0.9 4 34 5.4 Total 589 624 93.2 6.8 Minimum p-value: 0.022 Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 25 / 29

Statistical Power of the Proposed Test * Expected Number of Yeses to 3 Non sensitive Items E(Yi(0)) 0.75 1.5 2.25 0.1 Statistical Power 0.0 0.25 0.5 0.75 1.0 n=500 n=1000 n=2000 0.0 0.25 0.5 0.75 1.0 n=500 n=1000 n=2000 0.0 0.25 0.5 0.75 1.0 n=500 n=1000 n=2000 0.6 0.3 0.0 0.3 0.6 0.6 0.3 0.0 0.3 0.6 0.6 0.3 0.0 0.3 0.6 Probability of Answering Yes to the Sensitive Item Pr(Zi, J+1) 0.75 0.5 0.25 Statistical Power Statistical Power Statistical Power 0.25 0.5 0.75 0.25 0.5 0.75 0.25 0.5 0.75 0.0 1.0 0.0 1.0 0.0 1.0 0.6 0.3 0.0 0.3 0.6 0.6 0.3 0.0 0.3 0.6 0.0 0.25 0.5 0.75 1.0 0.0 0.25 0.5 0.75 1.0 0.0 0.25 0.5 0.75 1.0 0.6 0.3 0.0 0.3 0.6 0.6 0.3 0.0 0.3 0.6 0.0 0.25 0.5 0.75 1.0 0.0 0.25 0.5 0.75 1.0 0.0 0.25 0.5 0.75 1.0 0.6 0.3 0.0 0.3 0.6 0.6 0.3 0.0 0.3 0.6 0.6 0.3 0.0 0.3 0.6 0.6 0.3 0.0 0.3 0.6 0.6 0.3 0.0 0.3 0.6 5 1.0 5 1.0 5 1.0 Imai (Princeton)Item erkosuke Count Technique UCI (Statistics) 26 / 29

Modeling Ceiling and Floor Effects Potential liars: Y i Treatment group Control group 4 (3,1) 3 (2,1) (3,0) (3,1) (3,1) (3,0) 2 (1,1) (2,0) (2,1) (2,0) 1 (0,1) (1,0) (1,1) (1,0) 0 (0,0) (0,1) (0,1) (0,0) Previous tests do not detect these liars: proportions would still be positive so long as there is no design effect Proposed strategy: model ceiling and/or floor effects under an additional assumption Identification assumption: conditional independence between items given covariates ML estimation can be extended to this situation Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 27 / 29

Modeling Ceiling and Floor Effects Both Ceiling Ceiling Effects Alone Floor Effects Alone and Floor Effects Variables est. s.e. est. s.e. est. s.e. Intercept 1.291 0.558 1.251 0.501 1.245 0.502 Age 0.294 0.101 0.314 0.092 0.313 0.092 College 0.345 0.336 0.605 0.298 0.606 0.298 Male 0.038 0.346 0.088 0.300 0.088 0.300 South 1.175 0.480 0.682 0.335 0.681 0.335 Prop. of liars Ceiling 0.0002 0.0017 0.0002 0.0016 Floor 0.0115 0.0000 0.0115 0.0000 Essentially no ceiling and floor effects Main conclusion for the affirmative action item seems robust This approach can be extended to model certain design effects Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 28 / 29

Concluding Remarks and Practical Suggestions Item count technique: alternative to the randomized response method Advantages: easy to use, easy to understand Challenges: 1 loss of information due to indirect questioning 2 difficulty of exploring multivariate relationship 3 potential violation of assumptions Our methods partially overcome the difficulties Suggestions for analysis: 1 Estimate proportions of types and test design effects 2 Conduct multivariate regression analyses 3 Investigate the robustness of findings to ceiling and floor effects Suggestions for design: 1 Select control items to avoid skewed response distribution 2 Avoid control items that are ambiguous and generate weak opinion 3 Conduct a pilot study and maximize statistical power Kosuke Imai (Princeton) Item Count Technique UCI (Statistics) 29 / 29