Survey nonresponse and the distribution of income

Size: px
Start display at page:

Download "Survey nonresponse and the distribution of income"

Transcription

1 Survey nonresponse and the distribution of income Emanuela Galasso* Development Research Group, World Bank Module 1. Sampling for Surveys

2 1: Why are we concerned about non response? 2: Implications for measurement of poverty and inequality 3: Evidence for the US Estimation methods Results 4: An example for China 2

3 1: Why do we care? 3

4 Types of nonresponse Item-nonresponse (participation to the survey but non-response on single questions) Imputation methods using matching Lillard et al. (1986); Little and Rubin (1987) 4

5 Types of nonresponse Item-nonresponse Imputation methods using matching Lillard et al. (1986); Little and Rubin (1987) The idea: Observations with complete data X Yes Y Yes missing data Yes No For sub-sample with complete data: Then impute missing data using: Y = M (X ) Y ˆ = Mˆ ( X ) 5

6 Types of nonresponse Unit-nonresponse ( non-compliance ) (non-participation to the survey altogether) 6

7 Unit-nonresponse: possible solutions Ex-ante: Replace non respondents with similar households Increase the sample size to compensate for it Using call-backs, monetary incentives: Van Praag et al. (1983), Alho (1990), Nijman and Verbeek (1992) Ex-post: Corrections by re-weighting the data Use imputation techniques (hot-deck, cold-deck, warm-deck, etc.) to simulate the answers of nonrespondents 7

8 Unit-nonresponse: possible solutions Ex-ante: Replace nonrespondents with similar households Increase the sample size to compensate for it Using call-backs, monetary incentives: Van Praag et al. (1983), Alho (1990), Nijman and Verbeek (1992) Ex-post: Corrections by re-weighting the data Use imputation techniques (hot-deck, cold-deck, warm-deck, etc.) to simulate the answers of nonrespondents None of the above 8

9 The best way to deal with unit-nonresponse is to prevent it Lohr, Sharon L. Sampling: Design & Analysis (1999)

10 Qualification Motivation Training Work Load Data collection method Interviewers Total Nonresponse Availability Socio-economic Type of survey Respondents Burden Economic Motivation Demographic Proxy 10 Source: Some factors affecting Non-Response. by R. Platek Survey Methodology

11 Rising concern about unitnonresponse High nonresponse rates of 10-30% are now common LSMS: 0-26% nonresponse (Scott and Steele, 2002) UK surveys: 15-30% US: 10-20% Concerns that the problem might be increasing 11

12 Nonresponse is a choice, so we need to understand behavior Survey participation is a matter of choice nobody is obliged to comply with the statistician s randomized assignment There is a perceived utility gain from compliance the satisfaction of doing one s civic duty But there is a cost too An income effect can be expected 12

13 Nonresponse bias in measuring poverty and inequality Compliance is unlikely to be random: Rich people have: higher opportunity cost of time more to hide (tax reasons) more likely to be away from home? multiple earners Poorest might also not comply: alienated from society? homeless 13

14 2: Implications for poverty and inequality measures 14

15 Implications for poverty F(y) is the true income distribution, density f(y) Fˆ ( y) is the observed distribution, density fˆ ( y) Note: ˆ ( F( y ) F y = and F( y ) F y ) = 0 p = p ) 0 r ˆ ( = r 15

16 Implications for poverty F(y) is the true income distribution, density f(y) Fˆ ( y) is the observed distribution, density fˆ ( y) Note: ˆ ( F( y ) F y = and F( y ) F y ) = 0 p = p ) 0 r ˆ ( = r Definition: correction factor w(y) such that: f ( y) = w( y) fˆ( y) y F ( y) = w( x) fˆ( x) dx y p 16

17 Implications for poverty cont., If compliance falls with income then poverty is overestimated for all measures and poverty lines. i.e., first-order dominance: if w (y) > 0 for all y (y P, y R ), F ( y) < Fˆ ( y) then for all y (y P, y R ) 17

18 First-order dominance w (y) > observed corrected

19 Example Poor Non-poor Estimated distribution (%) However, Response rate (%) True distribution of population (%) Correction factors

20 Implications for inequality If compliance falls with income (w (y) > 0) then the implications for inequality are ambiguous Lorenz curves intersect so some inequality measures will show higher inequality, some lower 20

21 Example of crossing Lorenz Curves

22 3: Evidence for the U.S. 22

23 Current Population Survey Source: CPS March supplement, , Census Bureau 3 types of non-interviews: type A: individual refused to respond or could not be reached what we define as non-response type B: housing unit vacant; type C: housing unit demolished we ignore type B/C in our analysis Year total number of households Type A households rate of nonresponse (%) ,574 4, % ,103 4, % ,763 3, % ,932 4, % ,831 6, % All years 303,203 23, % 23

24 Dependence of response rate on income Response rate and average per-capita income for 51 US states, CPS March supplement 2002 State Response Rate Average Income Maryland 86.77% $31,500 District Of Columbia 87.21% $34,999 Alaska 88.16% $26,564 New York 88.61% $26,013 New Jersey 88.71% $28,746 California 89.66% $26,822 Mississippi 95.08% $17,821 Indiana 95.21% $23,909 North Dakota 95.36% $20,154 Georgia 95.66% $23,893 West Virginia 96.65% $18,742 Alabama 97.24% $21,155 24

25 Dependence of response rate on income Response rate and average per-capita income for 51 US states, CPS March supplement % 96% probability of compliance 94% 92% 90% 88% 86% average state income 25

26 Estimation method In survey data, the income of non-responding households is by definition unobservable. However, we can observe the survey compliance rates by geographical areas. The observed characteristics of responding households, in conjunction with the observed compliance rates of the areas in which they live, allow one to estimate the household-specific probability of survey response. Thus we can correct for selective compliance by re-weighting the survey data. 26

27 Estimation method cont., {(X ij, m ij )} set of households in state j s.t. m ij households each carry characteristics X ij, where X ij includes e.g. ln(y ij ), a constant, etc. total number of households in state j: M j representative sample S j in state j with sampled households m j = Σ m ij for each sampled household ε there s a probability of response D εij {0,1} P ( D = 1 X, θ ) = P = logistic( X θ ) εij ij i i 27

28 Estimation method cont., The observed mass of respondents of group i in state/area j is E ( m ) = obs ij obs mij E [ ] = mij Pi Then summing up for a given j yields: Now let s define: i m obs ij P i = i=1 obs obs obs mij mij mij ψ j ( θ ) { E[ ]} = { m j} P P P i i m i ij w ij P i = m These are the individual weights j i i This is known! 28

29 Estimation method cont., obs obs obs mij mij mij ψ j ( θ ) { E[ ]} = { m j} P P P i i i i i where obviously [ ( θ )] = 0 E ψ j Then we can estimate θ = arg min Ψ( θ ) ψ W ( θ )' ψ ( θ ) ˆ 1 ( θ ) 29

30 Estimation method cont., Optimal weighting matrix W = Var(ψ(θ)) Hansen (1982) Assume for single state j: [ ] 2 ψ ( θ ) m σ Var = j j This can be estimated as Finally, Var ( ˆ) 2 θ = ˆ σ [ G' NG] 1 ˆ σ 2 = where ψ j ( θ ) w j 2 ψ G = θ ( θ ) 30

31 Alternative Specifications Specification Ψ(θ) min θ 1 θ 2 θ 3 1: P = logit (θ 1 + θ 2 ln (y ) + θ 3 ln (y ) 2 ) (85.95) (-15.90) (0.7320) 2: P = logit (θ 1 + θ 2 ln (y )) (3.51) (-0.329) 3: P = logit (θ 1 + θ 3 ln (y ) 2 ) (1.646) ( ) 4: P = logit (θ 2 ln (y ) + θ 3 ln (y ) 2 ) (0.289) ( ) 5: P = logit (θ 1 + θ 2 y ) : P = logit (θ 1 + θ 2 y + θ 3 y 2 ) 7: P = logit (θ 1 + θ 2 y + θ 3 ln (y )) (0.202) ( ) (0.463) ( ) ( ) (12.13) ( ) (-1.229) 31

32 Results From Specification 2 P = logit(θ 1 + θ 2 ln(y)) Year Ψ(θ) min θ 1 θ 2 Gini uncorr Gini corr ΔGini % 50.92% 5.43% (4.58) (-0.43) % 49.03% 3.82% (4.42) (-0.418) % 47.67% 3.37% (4.46) (-0.413) % 49.47% 4.48% (3.82) (-0.355) % 48.02% 3.66% (3.51) (-0.329) All % 49.07% 4.24% (1.89) (-0.177) 32

33 Graph of specification 2: Probability of compliance as a function of income 1 probability of compliance $0 $50,000 $100,000 $150,000 $200,000 $250,000 Y... income/capita 33

34 Empirical and Corrected Cumulative Income Distribution observed corrected 0 $0 $50,000 $100,000 $150,000 $200,000 $250,000 34

35 Income Distribution: Magnification observed corrected 0 $0 $2,500 $5,000 $7,500 $10,000 35

36 Correction by Percentile of Income 50% 40% implied correction of income 30% 20% 10% 0% population percentile 36

37 Empirical and Corrected Lorenz Curve observed corrected % total income % population 37

38 Lorenz Curves: Magnification observed corrected % population 38

39 Specifications with Other Variables Specifications 10 18, P = logit(θ 1 + θ 2 ln(y)+ θ 3 X 1 + θ 4 X 2 ): Specification Ψ(θ) min θ 1 θ 2 θ 3 θ 4 Gini corrected ΔGini 2: (baseline) % 4.24% (1.89) (-0.177) 10: X 1 = age X 2 = age 2 11: X 1 = age (3.52) (2.61) (-0.188) (-0.201) ( ) ( ) ( ) 49.09% 49.23% 4.26% 4.40% 12: X 2 = age (2.31) (0.198) ( ) 49.26% 4.43% 13: X 1 = (age>64) 14: X 1 = edu X 2 = edu 2 15: X 1 = edu (2.06) (7.59) (-0.188) (-0.334) ( ) (-1.447) ( ) 49.23% 48.52% 48.68% 4.40% 3.69% 3.85% (2.48) (-0.333) ( ) 16: X 1 = (edu>39) (1.93) (-0.233) (-1.187) 48.53% 3.70% 17: X 1 = sex (1.92) (-0.187) ( ) 48.84% 4.01% 18: X 1 = race % 3.43% (1.96) (-0.183) (0.1592) 19: X 1 = size % 4.32% (2.12) (-0.189) ( ) 20: X 1 = race % 3.55% X 2 = size (2.16) (-0.189) (0.1574) ( ) 39

40 4: China 40

41 Example for China Urban Household Survey of NBS Two stages in sampling Stage 1: Large national random sample with very short questionnairre and high repsonse rate Stage 2: Random sample drawn from Stage 1 sample, given very detailed survey, including daily diary, regular visits etc Use Stage 1 data to model determinants of compliance Then re-weight the data 41

42 Further reading Korinek, Anton, Johan Mistiaen and Martin Ravallion, An Econometric Method of Correcting for unit Nonresponse Bias in Surveys, Journal of Econometrics, (2007), 136: Korinek, Anton, Johan Mistiaen and Martin Ravallion, Survey Nonresponse and the Distribution of Income. Journal of Economic Inequality, (2006), 4:

Survey Nonresponse and the Distribution of Income

Survey Nonresponse and the Distribution of Income Public Disclosure Authorized Surve Nonresponse and the Distribution of Income Anton Korinek, Johan A. Mistiaen, and Martin Ravallion 1 Development Research Group, World Bank, 1818 H Street NW, Washington

More information

kmr: A Command to Correct Survey Weights for Unit Nonresponse using Group s Response Rates

kmr: A Command to Correct Survey Weights for Unit Nonresponse using Group s Response Rates kmr: A Command to Correct Survey Weights for Unit Nonresponse using Group s Response Rates Ercio Muñoz Stone Center on Socio-Economic Inequality and CUNY Graduate Center New York, USA emunozsaavedra@gc.cuny.edu

More information

VARIANCE ESTIMATION FOR NEAREST NEIGHBOR IMPUTATION FOR U.S. CENSUS LONG FORM DATA

VARIANCE ESTIMATION FOR NEAREST NEIGHBOR IMPUTATION FOR U.S. CENSUS LONG FORM DATA Submitted to the Annals of Applied Statistics VARIANCE ESTIMATION FOR NEAREST NEIGHBOR IMPUTATION FOR U.S. CENSUS LONG FORM DATA By Jae Kwang Kim, Wayne A. Fuller and William R. Bell Iowa State University

More information

Population Profiles

Population Profiles U N D E R S T A N D I N G A N D E X P L O R I N G D E M O G R A P H I C C H A N G E MAPPING AMERICA S FUTURES, BRIEF 6 2000 2010 Population Profiles Atlanta, Las Vegas, Washington, DC, and Youngstown Allison

More information

Poverty, Inequality and Growth: Empirical Issues

Poverty, Inequality and Growth: Empirical Issues Poverty, Inequality and Growth: Empirical Issues Start with a SWF V (x 1,x 2,...,x N ). Axiomatic approaches are commen, and axioms often include 1. V is non-decreasing 2. V is symmetric (anonymous) 3.

More information

Club Convergence and Clustering of U.S. State-Level CO 2 Emissions

Club Convergence and Clustering of U.S. State-Level CO 2 Emissions Methodological Club Convergence and Clustering of U.S. State-Level CO 2 Emissions J. Wesley Burnett Division of Resource Management West Virginia University Wednesday, August 31, 2013 Outline Motivation

More information

What is Survey Weighting? Chris Skinner University of Southampton

What is Survey Weighting? Chris Skinner University of Southampton What is Survey Weighting? Chris Skinner University of Southampton 1 Outline 1. Introduction 2. (Unresolved) Issues 3. Further reading etc. 2 Sampling 3 Representation 4 out of 8 1 out of 10 4 Weights 8/4

More information

Updating Small Area Welfare Indicators in the Absence of a New Census

Updating Small Area Welfare Indicators in the Absence of a New Census Updating Small Area Welfare Indicators in the Absence of a New Census DRAFT Johannes G. Hoogeveen Thomas Emwanu Paul Okiira Okwi* November 13, 2003 Abstract Elbers, Lanjouw and Lanjouw (2003a) show how,

More information

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Jae-Kwang Kim 1 Iowa State University June 26, 2013 1 Joint work with Shu Yang Introduction 1 Introduction

More information

Commuting paradox revisited - Compensation for commutes in two-earner households? Konferenz Verkehrsökonomik und -politik 2017

Commuting paradox revisited - Compensation for commutes in two-earner households? Konferenz Verkehrsökonomik und -politik 2017 Commuting paradox revisited - Compensation for commutes in two-earner households? Konferenz Verkehrsökonomik und -politik 2017 Ulrike Illmann Technische Universität Dresden June 29, 2017 Ulrike Illmann

More information

Introduction to Survey Data Integration

Introduction to Survey Data Integration Introduction to Survey Data Integration Jae-Kwang Kim Iowa State University May 20, 2014 Outline 1 Introduction 2 Survey Integration Examples 3 Basic Theory for Survey Integration 4 NASS application 5

More information

The impact of residential density on vehicle usage and fuel consumption*

The impact of residential density on vehicle usage and fuel consumption* The impact of residential density on vehicle usage and fuel consumption* Jinwon Kim and David Brownstone Dept. of Economics 3151 SSPA University of California Irvine, CA 92697-5100 Email: dbrownst@uci.edu

More information

Vanessa Nadalin Lucas Mation IPEA - Institute for Applied Economic Research Regional Studies Association Global Conference Fortaleza 2014

Vanessa Nadalin Lucas Mation IPEA - Institute for Applied Economic Research Regional Studies Association Global Conference Fortaleza 2014 Spatial distribution of sanitation and income inequality in Brazilian Slums Vanessa Nadalin Lucas Mation IPEA - Institute for Applied Economic Research Regional Studies Association Global Conference Fortaleza

More information

extreme weather, climate & preparedness in the american mind

extreme weather, climate & preparedness in the american mind extreme weather, climate & preparedness in the american mind Extreme Weather, Climate & Preparedness In the American Mind Interview dates: March 12, 2012 March 30, 2012. Interviews: 1,008 Adults (18+)

More information

A multivariate multilevel model for the analysis of TIMMS & PIRLS data

A multivariate multilevel model for the analysis of TIMMS & PIRLS data A multivariate multilevel model for the analysis of TIMMS & PIRLS data European Congress of Methodology July 23-25, 2014 - Utrecht Leonardo Grilli 1, Fulvia Pennoni 2, Carla Rampichini 1, Isabella Romeo

More information

More on Roy Model of Self-Selection

More on Roy Model of Self-Selection V. J. Hotz Rev. May 26, 2007 More on Roy Model of Self-Selection Results drawn on Heckman and Sedlacek JPE, 1985 and Heckman and Honoré, Econometrica, 1986. Two-sector model in which: Agents are income

More information

An Overview of the Pros and Cons of Linearization versus Replication in Establishment Surveys

An Overview of the Pros and Cons of Linearization versus Replication in Establishment Surveys An Overview of the Pros and Cons of Linearization versus Replication in Establishment Surveys Richard Valliant University of Michigan and Joint Program in Survey Methodology University of Maryland 1 Introduction

More information

UNIVERSITY OF WARWICK. Summer Examinations 2015/16. Econometrics 1

UNIVERSITY OF WARWICK. Summer Examinations 2015/16. Econometrics 1 UNIVERSITY OF WARWICK Summer Examinations 2015/16 Econometrics 1 Time Allowed: 3 Hours, plus 15 minutes reading time during which notes may be made (on the question paper) BUT NO ANSWERS MAY BE BEGUN.

More information

Chapter 5: Models used in conjunction with sampling. J. Kim, W. Fuller (ISU) Chapter 5: Models used in conjunction with sampling 1 / 70

Chapter 5: Models used in conjunction with sampling. J. Kim, W. Fuller (ISU) Chapter 5: Models used in conjunction with sampling 1 / 70 Chapter 5: Models used in conjunction with sampling J. Kim, W. Fuller (ISU) Chapter 5: Models used in conjunction with sampling 1 / 70 Nonresponse Unit Nonresponse: weight adjustment Item Nonresponse:

More information

Introduction to Survey Data Analysis

Introduction to Survey Data Analysis Introduction to Survey Data Analysis JULY 2011 Afsaneh Yazdani Preface Learning from Data Four-step process by which we can learn from data: 1. Defining the Problem 2. Collecting the Data 3. Summarizing

More information

Social Vulnerability Index. Susan L. Cutter Department of Geography, University of South Carolina

Social Vulnerability Index. Susan L. Cutter Department of Geography, University of South Carolina Social Vulnerability Index Susan L. Cutter Department of Geography, University of South Carolina scutter@sc.edu Great Lakes and St. Lawrence Cities Initiative Webinar December 3, 2014 Vulnerability The

More information

VALIDATING A SURVEY ESTIMATE - A COMPARISON OF THE GUYANA RURAL FARM HOUSEHOLD SURVEY AND INDEPENDENT RICE DATA

VALIDATING A SURVEY ESTIMATE - A COMPARISON OF THE GUYANA RURAL FARM HOUSEHOLD SURVEY AND INDEPENDENT RICE DATA VALIDATING A SURVEY ESTIMATE - A COMPARISON OF THE GUYANA RURAL FARM HOUSEHOLD SURVEY AND INDEPENDENT RICE DATA David J. Megill, U.S. Bureau of the Census I. Background While attempting to validate survey

More information

Data Collection. Lecture Notes in Transportation Systems Engineering. Prof. Tom V. Mathew. 1 Overview 1

Data Collection. Lecture Notes in Transportation Systems Engineering. Prof. Tom V. Mathew. 1 Overview 1 Data Collection Lecture Notes in Transportation Systems Engineering Prof. Tom V. Mathew Contents 1 Overview 1 2 Survey design 2 2.1 Information needed................................. 2 2.2 Study area.....................................

More information

Multivariate Statistics

Multivariate Statistics Multivariate Statistics Chapter 3: Principal Component Analysis Pedro Galeano Departamento de Estadística Universidad Carlos III de Madrid pedro.galeano@uc3m.es Course 2017/2018 Master in Mathematical

More information

Typical information required from the data collection can be grouped into four categories, enumerated as below.

Typical information required from the data collection can be grouped into four categories, enumerated as below. Chapter 6 Data Collection 6.1 Overview The four-stage modeling, an important tool for forecasting future demand and performance of a transportation system, was developed for evaluating large-scale infrastructure

More information

Gini Coefficient. A supplement to Mahlerʼs Guide to Loss Distributions. Exam C. prepared by Howard C. Mahler, FCAS Copyright 2017 by Howard C. Mahler.

Gini Coefficient. A supplement to Mahlerʼs Guide to Loss Distributions. Exam C. prepared by Howard C. Mahler, FCAS Copyright 2017 by Howard C. Mahler. Gini Coefficient A supplement to Mahlerʼs Guide to Loss Distributions Eam C prepared by Howard C. Mahler, FCAS Copyright 27 by Howard C. Mahler. Howard Mahler hmahler@mac.com www.howardmahler.com/teaching

More information

Eric V. Slud, Census Bureau & Univ. of Maryland Mathematics Department, University of Maryland, College Park MD 20742

Eric V. Slud, Census Bureau & Univ. of Maryland Mathematics Department, University of Maryland, College Park MD 20742 MODEL-ASSISTED WEIGHTING FOR SURVEYS WITH MULTIPLE RESPONSE MODE Eric V. Slud, Census Bureau & Univ. of Maryland Mathematics Department, University of Maryland, College Park MD 20742 Key words: American

More information

Might using the Internet while travelling affect car ownership plans of Millennials? Dr. David McArthur and Dr. Jinhyun Hong

Might using the Internet while travelling affect car ownership plans of Millennials? Dr. David McArthur and Dr. Jinhyun Hong Might using the Internet while travelling affect car ownership plans of Millennials? Dr. David McArthur and Dr. Jinhyun Hong Introduction Travel habits among Millennials (people born between 1980 and 2000)

More information

UK Data Archive Study Number British Social Attitudes, British Social Attitudes 2008 User Guide. 2 Data collection methods...

UK Data Archive Study Number British Social Attitudes, British Social Attitudes 2008 User Guide. 2 Data collection methods... UK Data Archive Study Number 6390 - British Social Attitudes, 2008 o NatCen National Centre for Social Research British Social Attitudes 2008 User Guide Author: Roger Stafford 1 Overview of the survey...

More information

Robust Non-Parametric Techniques to Estimate the Growth Elasticity of Poverty

Robust Non-Parametric Techniques to Estimate the Growth Elasticity of Poverty International Journal of Contemporary Mathematical Sciences Vol. 14, 2019, no. 1, 43-52 HIKARI Ltd, www.m-hikari.com https://doi.org/10.12988/ijcms.2019.936 Robust Non-Parametric Techniques to Estimate

More information

Summary Article: Poverty from Encyclopedia of Geography

Summary Article: Poverty from Encyclopedia of Geography Topic Page: Poverty Definition: poverty from Dictionary of Energy Social Issues. the fact of being poor; the absence of wealth. A term with a wide range of interpretations depending on which markers of

More information

A Method for Inferring Label Sampling Mechanisms in Semi-Supervised Learning

A Method for Inferring Label Sampling Mechanisms in Semi-Supervised Learning A Method for Inferring Label Sampling Mechanisms in Semi-Supervised Learning Saharon Rosset Data Analytics Research Group IBM T.J. Watson Research Center Yorktown Heights, NY 1598 srosset@us.ibm.com Hui

More information

MEASURING THE SOCIO-ECONOMIC BIPOLARIZATION PHENOMENON

MEASURING THE SOCIO-ECONOMIC BIPOLARIZATION PHENOMENON 2 MEASURING THE SOCIO-ECONOMIC BIPOLARIZATION PHENOMENON Abstract Ştefan ŞTEFĂNESCU * The present paper emphasizes the differences between the Gini concentration coefficient [3] and a new bipolarization

More information

Chapter 11. Regression with a Binary Dependent Variable

Chapter 11. Regression with a Binary Dependent Variable Chapter 11 Regression with a Binary Dependent Variable 2 Regression with a Binary Dependent Variable (SW Chapter 11) So far the dependent variable (Y) has been continuous: district-wide average test score

More information

CS229 Machine Learning Project: Allocating funds to the right people

CS229 Machine Learning Project: Allocating funds to the right people CS229 Machine Learning Project: Allocating funds to the right people Introduction Dugalic, Adem adugalic@stanford.edu Nguyen, Tram tram@stanford.edu Our project is motivated by a common issue in developing

More information

DEEP, University of Lausanne Lectures on Econometric Analysis of Count Data Pravin K. Trivedi May 2005

DEEP, University of Lausanne Lectures on Econometric Analysis of Count Data Pravin K. Trivedi May 2005 DEEP, University of Lausanne Lectures on Econometric Analysis of Count Data Pravin K. Trivedi May 2005 The lectures will survey the topic of count regression with emphasis on the role on unobserved heterogeneity.

More information

Income Inequality in Missouri 2000

Income Inequality in Missouri 2000 Through the growth in global media, people around the world are becoming more aware of and more vocal about the gap between the rich and the poor. Policy makers, researchers and academics are also increasingly

More information

Math 142 Lecture Notes. Section 7.1 Area between curves

Math 142 Lecture Notes. Section 7.1 Area between curves Math 4 Lecture Notes Section 7. Area between curves A) Introduction Now, we want to find the area between curves using the concept of definite integral. Let's assume we want to find the area between the

More information

Introduction to Econometrics. Assessing Studies Based on Multiple Regression

Introduction to Econometrics. Assessing Studies Based on Multiple Regression Introduction to Econometrics The statistical analysis of economic (and related) data STATS301 Assessing Studies Based on Multiple Regression Titulaire: Christopher Bruffaerts Assistant: Lorenzo Ricci 1

More information

GROWING APART: THE CHANGING FIRM-SIZE WAGE PREMIUM AND ITS INEQUALITY CONSEQUENCES ONLINE APPENDIX

GROWING APART: THE CHANGING FIRM-SIZE WAGE PREMIUM AND ITS INEQUALITY CONSEQUENCES ONLINE APPENDIX GROWING APART: THE CHANGING FIRM-SIZE WAGE PREMIUM AND ITS INEQUALITY CONSEQUENCES ONLINE APPENDIX The following document is the online appendix for the paper, Growing Apart: The Changing Firm-Size Wage

More information

Measuring Poverty. Introduction

Measuring Poverty. Introduction Measuring Poverty Introduction To measure something, we need to provide answers to the following basic questions: 1. What are we going to measure? Poverty? So, what is poverty? 2. Who wants to measure

More information

Sidestepping the box: Designing a supplemental poverty indicator for school neighborhoods

Sidestepping the box: Designing a supplemental poverty indicator for school neighborhoods Sidestepping the box: Designing a supplemental poverty indicator for school neighborhoods Doug Geverdt National Center for Education Statistics Laura Nixon U.S. Census Bureau 2018 Annual meeting of the

More information

TUESDAYS AT APA PLANNING AND HEALTH. SAGAR SHAH, PhD AMERICAN PLANNING ASSOCIATION SEPTEMBER 2017 DISCUSSING THE ROLE OF FACTORS INFLUENCING HEALTH

TUESDAYS AT APA PLANNING AND HEALTH. SAGAR SHAH, PhD AMERICAN PLANNING ASSOCIATION SEPTEMBER 2017 DISCUSSING THE ROLE OF FACTORS INFLUENCING HEALTH SAGAR SHAH, PhD sshah@planning.org AMERICAN PLANNING ASSOCIATION SEPTEMBER 2017 TUESDAYS AT APA PLANNING AND HEALTH DISCUSSING THE ROLE OF FACTORS INFLUENCING HEALTH Outline of the Presentation PLANNING

More information

TWO-WAY CONTINGENCY TABLES UNDER CONDITIONAL HOT DECK IMPUTATION

TWO-WAY CONTINGENCY TABLES UNDER CONDITIONAL HOT DECK IMPUTATION Statistica Sinica 13(2003), 613-623 TWO-WAY CONTINGENCY TABLES UNDER CONDITIONAL HOT DECK IMPUTATION Hansheng Wang and Jun Shao Peking University and University of Wisconsin Abstract: We consider the estimation

More information

Electronic Research Archive of Blekinge Institute of Technology

Electronic Research Archive of Blekinge Institute of Technology Electronic Research Archive of Blekinge Institute of Technology http://www.bth.se/fou/ This is an author produced version of a ournal paper. The paper has been peer-reviewed but may not include the final

More information

Table 1. Answers to income and consumption adequacy questions Percentage of responses: less than adequate more than adequate adequate Total income 68.7% 30.6% 0.7% Food consumption 46.6% 51.4% 2.0% Clothing

More information

GIS-Based Analysis of the Commuting Behavior and the Relationship between Commuting and Urban Form

GIS-Based Analysis of the Commuting Behavior and the Relationship between Commuting and Urban Form GIS-Based Analysis of the Commuting Behavior and the Relationship between Commuting and Urban Form 1. Abstract A prevailing view in the commuting is that commuting would reconstruct the urban form. By

More information

A Method for Inferring Label Sampling Mechanisms in Semi-Supervised Learning

A Method for Inferring Label Sampling Mechanisms in Semi-Supervised Learning A Method for Inferring Label Sampling Mechanisms in Semi-Supervised Learning Saharon Rosset Data Analytics Research Group IBM T.J. Watson Research Center Yorktown Heights, NY 1598 srosset@us.ibm.com Hui

More information

Minimax-Regret Sample Design in Anticipation of Missing Data, With Application to Panel Data. Jeff Dominitz RAND. and

Minimax-Regret Sample Design in Anticipation of Missing Data, With Application to Panel Data. Jeff Dominitz RAND. and Minimax-Regret Sample Design in Anticipation of Missing Data, With Application to Panel Data Jeff Dominitz RAND and Charles F. Manski Department of Economics and Institute for Policy Research, Northwestern

More information

PALS: Neighborhood Identification, City of Frederick, Maryland. David Boston Razia Choudhry Chris Davis Under the supervision of Chao Liu

PALS: Neighborhood Identification, City of Frederick, Maryland. David Boston Razia Choudhry Chris Davis Under the supervision of Chao Liu PALS: Neighborhood Identification, City of Frederick, Maryland David Boston Razia Choudhry Chris Davis Under the supervision of Chao Liu University of Maryland School of Architecture, Planning & Preservation

More information

THE USAGE OF GIS APPLICATIONS IN SURVEYS DEPARTMENT. Tools for planning and managing the work of interviewers

THE USAGE OF GIS APPLICATIONS IN SURVEYS DEPARTMENT. Tools for planning and managing the work of interviewers THE USAGE OF GIS APPLICATIONS IN SURVEYS DEPARTMENT Tools for planning and managing the work of interviewers In the Surveys Department, we use the GIS tool for field work. For example: visualization of

More information

Molinas. June 15, 2018

Molinas. June 15, 2018 ITT8 SAMBa Presentation June 15, 2018 ling Data The data we have include: Approx 30,000 questionnaire responses each with 234 questions during 1998-2017 A data set of 60 questions asked to 500,000 households

More information

2011 NATIONAL SURVEY ON DRUG USE AND HEALTH

2011 NATIONAL SURVEY ON DRUG USE AND HEALTH 2011 NATIONAL SURVEY ON DRUG USE AND HEALTH PERSON-LEVEL SAMPLING WEIGHT CALIBRATION Prepared for the 2011 Methodological Resource Book Contract No. HHSS283200800004C RTI Project No. 0211838.207.004 Phase

More information

HOUSEHOLD surveys collect information on incomes,

HOUSEHOLD surveys collect information on incomes, USING ENSUS AND SURVEY DATA TO ESTIMATE POVERTY AND INEQUALITY FOR SMALL AREAS Alessandro Tarozzi and Angus Deaton* Abstract Recent years have seen widespread use of small-area maps based on census data

More information

The Impact of Residential Density on Vehicle Usage and Fuel Consumption: Evidence from National Samples

The Impact of Residential Density on Vehicle Usage and Fuel Consumption: Evidence from National Samples The Impact of Residential Density on Vehicle Usage and Fuel Consumption: Evidence from National Samples Jinwon Kim Department of Transport, Technical University of Denmark and David Brownstone 1 Department

More information

Known unknowns : using multiple imputation to fill in the blanks for missing data

Known unknowns : using multiple imputation to fill in the blanks for missing data Known unknowns : using multiple imputation to fill in the blanks for missing data James Stanley Department of Public Health University of Otago, Wellington james.stanley@otago.ac.nz Acknowledgments Cancer

More information

CIRPÉE Centre interuniversitaire sur le risque, les politiques économiques et l emploi

CIRPÉE Centre interuniversitaire sur le risque, les politiques économiques et l emploi CIRPÉE Centre interuniversitaire sur le risque, les politiques économiques et l emploi Cahier de recherche/working Paper 06-34 Poverty, Inequality and Stochastic Dominance, Theory and Practice: Illustration

More information

Determining Changes in Welfare Distributions at the Micro-level: Updating Poverty Maps By Chris Elbers, Jean O. Lanjouw, and Peter Lanjouw 1

Determining Changes in Welfare Distributions at the Micro-level: Updating Poverty Maps By Chris Elbers, Jean O. Lanjouw, and Peter Lanjouw 1 Determining Changes in Welfare Distributions at the Micro-level: Updating Poverty Maps By Chris Elbers, Jean O. Lanjouw, and Peter Lanjouw 1 Income and wealth distributions have a prominent position in

More information

Multidimensional Control Totals for Poststratified Weights

Multidimensional Control Totals for Poststratified Weights Multidimensional Control Totals for Poststratified Weights Darryl V. Creel and Mansour Fahimi Joint Statistical Meetings Minneapolis, MN August 7-11, 2005 RTI International is a trade name of Research

More information

Dwelling Price Ranking vs. Socio-Economic Ranking: Possibility of Imputation

Dwelling Price Ranking vs. Socio-Economic Ranking: Possibility of Imputation Dwelling Price Ranking vs. Socio-Economic Ranking: Possibility of Imputation Larisa Fleishman Yury Gubman Aviad Tur-Sinai Israeli Central Bureau of Statistics The main goals 1. To examine if dwelling prices

More information

On Measuring Growth and Inequality. Components of Changes in Poverty. with Application to Thailand

On Measuring Growth and Inequality. Components of Changes in Poverty. with Application to Thailand On Measuring Growth and Inequality Components of Changes in Poverty with Application to Thailand decomp 5/10/97 ON MEASURING GROWTH AND INEQUALITY COMPONENTS OF POVERTY WITH APPLICATION TO THAILAND by

More information

New Developments in Nonresponse Adjustment Methods

New Developments in Nonresponse Adjustment Methods New Developments in Nonresponse Adjustment Methods Fannie Cobben January 23, 2009 1 Introduction In this paper, we describe two relatively new techniques to adjust for (unit) nonresponse bias: The sample

More information

Economic Theory of Spatial Costs. of Living Indices with. Application to Thailand

Economic Theory of Spatial Costs. of Living Indices with. Application to Thailand Economic Theory of Spatial Costs of Living Indices with Application to Thailand spatial 5/10/97 ECONOMIC THEORY OF SPATIAL COSTS OF LIVING INDICES WITH APPLICATION TO THAILAND by N.Kakwani* School of Economics

More information

Planned Missingness Designs and the American Community Survey (ACS)

Planned Missingness Designs and the American Community Survey (ACS) Planned Missingness Designs and the American Community Survey (ACS) Steven G. Heeringa Institute for Social Research University of Michigan Presentation to the National Academies of Sciences Workshop on

More information

PhD/MA Econometrics Examination January 2012 PART A

PhD/MA Econometrics Examination January 2012 PART A PhD/MA Econometrics Examination January 2012 PART A ANSWER ANY TWO QUESTIONS IN THIS SECTION NOTE: (1) The indicator function has the properties: (2) Question 1 Let, [defined as if using the indicator

More information

COSTA RICA Limon City-Port Project

COSTA RICA Limon City-Port Project photos by Stefania Abakerli COSTA RICA Limon City-Port Project 2008 2013 Cecilia Corvalan William Reuben Stefania Abakerli Background Request from GoCR following Caldera-Port Concession Reform Since the

More information

Nonrespondent subsample multiple imputation in two-phase random sampling for nonresponse

Nonrespondent subsample multiple imputation in two-phase random sampling for nonresponse Nonrespondent subsample multiple imputation in two-phase random sampling for nonresponse Nanhua Zhang Division of Biostatistics & Epidemiology Cincinnati Children s Hospital Medical Center (Joint work

More information

CRP 608 Winter 10 Class presentation February 04, Senior Research Associate Kirwan Institute for the Study of Race and Ethnicity

CRP 608 Winter 10 Class presentation February 04, Senior Research Associate Kirwan Institute for the Study of Race and Ethnicity CRP 608 Winter 10 Class presentation February 04, 2010 SAMIR GAMBHIR SAMIR GAMBHIR Senior Research Associate Kirwan Institute for the Study of Race and Ethnicity Background Kirwan Institute Our work Using

More information

Poverty and Distributional Outcomes of Shocks and Policies

Poverty and Distributional Outcomes of Shocks and Policies Poverty and Distributional Outcomes of Shocks and Policies B. Essama-Nssah Poverty Reduction Group (PRMPR) The World Bank May, 006 Foreword The effect of ignoring the interpersonal variations can, in fact,

More information

Interpreting coefficients for transformed variables

Interpreting coefficients for transformed variables Interpreting coefficients for transformed variables! Recall that when both independent and dependent variables are untransformed, an estimated coefficient represents the change in the dependent variable

More information

2005 Mortgage Broker Regulation Matrix

2005 Mortgage Broker Regulation Matrix 2005 Mortgage Broker Regulation Matrix Notes on individual states follow the table REG EXEMPTIONS LIC-EDU LIC-EXP LIC-EXAM LIC-CONT-EDU NET WORTH BOND MAN-LIC MAN-EDU MAN-EXP MAN-EXAM Alabama 1 0 2 0 0

More information

SOUTH COAST COASTAL RECREATION METHODS

SOUTH COAST COASTAL RECREATION METHODS SOUTH COAST COASTAL RECREATION METHODS A customized, web-based survey instrument, which utilizes Point 97 s Viewpoint survey and mapping technology, was used to collect spatially explicit data on coastal

More information

Notes on Heterogeneity, Aggregation, and Market Wage Functions: An Empirical Model of Self-Selection in the Labor Market

Notes on Heterogeneity, Aggregation, and Market Wage Functions: An Empirical Model of Self-Selection in the Labor Market Notes on Heterogeneity, Aggregation, and Market Wage Functions: An Empirical Model of Self-Selection in the Labor Market Heckman and Sedlacek, JPE 1985, 93(6), 1077-1125 James Heckman University of Chicago

More information

Chapter. Organizing and Summarizing Data. Copyright 2013, 2010 and 2007 Pearson Education, Inc.

Chapter. Organizing and Summarizing Data. Copyright 2013, 2010 and 2007 Pearson Education, Inc. Chapter 2 Organizing and Summarizing Data Section 2.1 Organizing Qualitative Data Objectives 1. Organize Qualitative Data in Tables 2. Construct Bar Graphs 3. Construct Pie Charts When data is collected

More information

A simple bivariate count data regression model. Abstract

A simple bivariate count data regression model. Abstract A simple bivariate count data regression model Shiferaw Gurmu Georgia State University John Elder North Dakota State University Abstract This paper develops a simple bivariate count data regression model

More information

Poverty and Hazard Linkages

Poverty and Hazard Linkages Poverty and Hazard Linkages Global Risk Identification Programme Proposal Development Coordination Meeting Friday, 19 May 2006 CIESIN Earth Institute Columbia University www.ciesin.columbia.edu Data Improvements

More information

Secondary Towns and Poverty Reduction: Refocusing the Urbanization Agenda

Secondary Towns and Poverty Reduction: Refocusing the Urbanization Agenda Secondary Towns and Poverty Reduction: Refocusing the Urbanization Agenda Luc Christiaensen and Ravi Kanbur World Bank Cornell Conference Washington, DC 18 19May, 2016 losure Authorized Public Disclosure

More information

Secondary Towns, Population and Welfare in Mexico

Secondary Towns, Population and Welfare in Mexico Public Disclosure Authorized Public Disclosure Authorized Public Disclosure Authorized Secondary Towns, Population and Welfare in Mexico Isidro Soloaga Department of Economics, Universidad Iberoamericana,

More information

A Joint Tour-Based Model of Vehicle Type Choice and Tour Length

A Joint Tour-Based Model of Vehicle Type Choice and Tour Length A Joint Tour-Based Model of Vehicle Type Choice and Tour Length Ram M. Pendyala School of Sustainable Engineering & the Built Environment Arizona State University Tempe, AZ Northwestern University, Evanston,

More information

Targeting the Poor. Towards evidence-based implementation. Johan A. Mistiaen (World Bank)

Targeting the Poor. Towards evidence-based implementation. Johan A. Mistiaen (World Bank) Targeting the Poor Towards evidence-based implementation Johan A. Mistiaen (World Bank) Presentation at Joint Poverty Assessment Workshop Nairobi - May 19, 2005 Outline Integrating Data Sources - GIS What

More information

Analyzing Pilot Studies with Missing Observations

Analyzing Pilot Studies with Missing Observations Analyzing Pilot Studies with Missing Observations Monnie McGee mmcgee@smu.edu. Department of Statistical Science Southern Methodist University, Dallas, Texas Co-authored with N. Bergasa (SUNY Downstate

More information

BOOK REVIEW Sampling: Design and Analysis. Sharon L. Lohr. 2nd Edition, International Publication,

BOOK REVIEW Sampling: Design and Analysis. Sharon L. Lohr. 2nd Edition, International Publication, STATISTICS IN TRANSITION-new series, August 2011 223 STATISTICS IN TRANSITION-new series, August 2011 Vol. 12, No. 1, pp. 223 230 BOOK REVIEW Sampling: Design and Analysis. Sharon L. Lohr. 2nd Edition,

More information

Market access and rural poverty in Tanzania

Market access and rural poverty in Tanzania Market access and rural poverty in Tanzania Nicholas Minot International Food Policy Research Institute 2033 K St. NW Washington, D.C., U.S.A. Phone: +1 202 862-8199 Email: n.minot@cgiar.org Contributed

More information

Incorporating Level of Effort Paradata in Nonresponse Adjustments. Paul Biemer RTI International University of North Carolina Chapel Hill

Incorporating Level of Effort Paradata in Nonresponse Adjustments. Paul Biemer RTI International University of North Carolina Chapel Hill Incorporating Level of Effort Paradata in Nonresponse Adjustments Paul Biemer RTI International University of North Carolina Chapel Hill Acknowledgements Patrick Chen, RTI International Kevin Wang, RTI

More information

CIV3703 Transport Engineering. Module 2 Transport Modelling

CIV3703 Transport Engineering. Module 2 Transport Modelling CIV3703 Transport Engineering Module Transport Modelling Objectives Upon successful completion of this module you should be able to: carry out trip generation calculations using linear regression and category

More information

Plausible Values for Latent Variables Using Mplus

Plausible Values for Latent Variables Using Mplus Plausible Values for Latent Variables Using Mplus Tihomir Asparouhov and Bengt Muthén August 21, 2010 1 1 Introduction Plausible values are imputed values for latent variables. All latent variables can

More information

12 Lorenz line and Gini index

12 Lorenz line and Gini index Gini index and Lorenz line (06-04-) DRAFT VERSIO /8 Lorenz line and Gini index In 905, in the paper Methods of measuring the concentration of wealth, Max Otto Lorenz, an American economist, introduced

More information

W-BASED VS LATENT VARIABLES SPATIAL AUTOREGRESSIVE MODELS: EVIDENCE FROM MONTE CARLO SIMULATIONS

W-BASED VS LATENT VARIABLES SPATIAL AUTOREGRESSIVE MODELS: EVIDENCE FROM MONTE CARLO SIMULATIONS 1 W-BASED VS LATENT VARIABLES SPATIAL AUTOREGRESSIVE MODELS: EVIDENCE FROM MONTE CARLO SIMULATIONS An Liu University of Groningen Henk Folmer University of Groningen Wageningen University Han Oud Radboud

More information

NEW YORK DEPARTMENT OF SANITATION. Spatial Analysis of Complaints

NEW YORK DEPARTMENT OF SANITATION. Spatial Analysis of Complaints NEW YORK DEPARTMENT OF SANITATION Spatial Analysis of Complaints Spatial Information Design Lab Columbia University Graduate School of Architecture, Planning and Preservation November 2007 Title New York

More information

A s i a n J o u r n a l o f M u l t i d i m e n s i o n a l R e s e a r c h KUZNETS CURVE: THE CASE OF THE INDIAN ECONOMY

A s i a n J o u r n a l o f M u l t i d i m e n s i o n a l R e s e a r c h KUZNETS CURVE: THE CASE OF THE INDIAN ECONOMY P u b l i s h e d b y : T R A N S A s i a n R e s e a r c h J o u r n a l s AJMR: A s i a n J o u r n a l o f M u l t i d i m e n s i o n a l R e s e a r c h (A Do u b le B lind Re fe r e e d & Re v ie

More information

INTRODUCTION TO TRANSPORTATION SYSTEMS

INTRODUCTION TO TRANSPORTATION SYSTEMS INTRODUCTION TO TRANSPORTATION SYSTEMS Lectures 5/6: Modeling/Equilibrium/Demand 1 OUTLINE 1. Conceptual view of TSA 2. Models: different roles and different types 3. Equilibrium 4. Demand Modeling References:

More information

Assisting Countries in the Collection and Analysis of National Statistics

Assisting Countries in the Collection and Analysis of National Statistics Assisting Countries in the Collection and Analysis of National Statistics Peter O. Way, Ph.D., Chief International Programs Center U.S. Census Bureau Censuses in the 21 st Century: Improving Data Utilization

More information

SOCIO-DEMOGRAPHIC INDICATORS FOR REGIONAL POPULATION POLICIES

SOCIO-DEMOGRAPHIC INDICATORS FOR REGIONAL POPULATION POLICIES SOCIO-DEMOGRAPHIC INDICATORS FOR REGIONAL POPULATION POLICIES A CANADIAN PERSPECTIVE Marc Termote Université de Montréal Potsdam, DART Conference, September 3, 2012 3 STEPS 1. Defining the problem 2. Identifying

More information

Effective estimation of population mean, ratio and product in two-phase sampling in presence of random non-response

Effective estimation of population mean, ratio and product in two-phase sampling in presence of random non-response Effective estimation of population mean, ratio and product in two-phase sampling in presence of random non-response G. N. Singh, A. K. Sharma, A.Bandyopadhyay and C.P aul Keywords Abstract This paper presents

More information

Two-phase sampling approach to fractional hot deck imputation

Two-phase sampling approach to fractional hot deck imputation Two-phase sampling approach to fractional hot deck imputation Jongho Im 1, Jae-Kwang Kim 1 and Wayne A. Fuller 1 Abstract Hot deck imputation is popular for handling item nonresponse in survey sampling.

More information

COSTA RICA Limon City-Port Project

COSTA RICA Limon City-Port Project COSTA RICA Limon City-Port Project 2008 2013 photos by Stefania Abakerli Stefania Abakerli Cecilia Corvalan Context Costa Rica Well-known tourist destination in CA (8.4% GDP) Long tradition of political

More information

Alexina Mason. Department of Epidemiology and Biostatistics Imperial College, London. 16 February 2010

Alexina Mason. Department of Epidemiology and Biostatistics Imperial College, London. 16 February 2010 Strategy for modelling non-random missing data mechanisms in longitudinal studies using Bayesian methods: application to income data from the Millennium Cohort Study Alexina Mason Department of Epidemiology

More information

Exploiting TIMSS and PIRLS combined data: multivariate multilevel modelling of student achievement

Exploiting TIMSS and PIRLS combined data: multivariate multilevel modelling of student achievement Exploiting TIMSS and PIRLS combined data: multivariate multilevel modelling of student achievement Second meeting of the FIRB 2012 project Mixture and latent variable models for causal-inference and analysis

More information

Statistics for Managers Using Microsoft Excel 5th Edition

Statistics for Managers Using Microsoft Excel 5th Edition Statistics for Managers Using Microsoft Ecel 5th Edition Chapter 7 Sampling and Statistics for Managers Using Microsoft Ecel, 5e 2008 Pearson Prentice-Hall, Inc. Chap 7-12 Why Sample? Selecting a sample

More information

Unless provided with information to the contrary, assume for each question below that the Classical Linear Model assumptions hold.

Unless provided with information to the contrary, assume for each question below that the Classical Linear Model assumptions hold. Economics 345: Applied Econometrics Section A01 University of Victoria Midterm Examination #2 Version 1 SOLUTIONS Spring 2015 Instructor: Martin Farnham Unless provided with information to the contrary,

More information