Small Area Confidence Bounds on Small Cell Proportions in Survey Populations

Size: px
Start display at page:

Download "Small Area Confidence Bounds on Small Cell Proportions in Survey Populations"

Transcription

1 Small Area Confidence Bounds on Small Cell Proportions in Survey Populations Aaron Gilary, Jerry Maples, U.S. Census Bureau U.S. Census Bureau Eric V. Slud, U.S. Census Bureau Univ. Maryland College Park DC-AAPOR / WSS 1

2 Outline General Problem Cell-Based vs. Model-Based Methods Alternative Definitions of Effective Sample Size Data Example Erroneous Enumeration of HUs in Census Coverage Measurement study (CCM) Numerical Results Conclusion and Future Plans 2

3 General Problem Setting: given direct (ratio) estimates for small proportions ˆπ i at level of cell or domain i (e.g., county) Problem: specify upper confidence bound for ˆπ i either bound in transformed measurement scale h(π i ) from data within domain i or model-based bound connecting values π i across domains using predictors x i Approach: using effective sample-sizes n i, adapt binomial/srs estimator V ar(ˆπ i ) = ˆπ i (1 ˆπ i )/n i 3

4 Applications demographic tables in ACS, the American Community Survey; area-level Erroneous Enumeration rates in CCM, Census Coverage Measurement; and rates in other Census Bureau surveys. 4

5 A Good Cell-Based Method Liu and Kott 2009, Survey Methodology π i = true proportion n i = effective sample size, yi Binom(n i, π i) ˆπ i = y i n i = y i n i direct estimator Transformation: asin( ˆπ) (Variance-stabilizing) centered at asin( π), Var 1/(4n i ) Obtain UCB on arcsin scale, transform back to prob. scale by sin 2 (x) 5

6 Ideas of Model-Based Methods π i includes { predicted part ηi = x i β unmodeled random component (1) Fay-Herriot: π i = sin 2 (η i +u i ), u i N(0, σ 2 u) (2) Logistic: π i = eη i +v i 1+e η i +v i, v i N(0, σ 2 v ) (3) Beta-Binomial: π i = Beta( τ eη i 1+e η i, τ 1+e η i ) Parameters σ 2 u, σ 2 v, (1 + τ) 1 quantify imprecision of π i in terms of η i 6

7 Model-Based Methods, Continued Recall y i /n i = ˆπ i Fay-Herriot: arcsin( ˆπ i ) N(arcsin( π i ), Logistic & Beta-Binomial: 1 4n i ) y i Binom(n i, π i) 7

8 Estimation in Model-Based Methods Point Predictor for π i generally different from direct estimator BLUP: E(π i y i ) EBLUP: substitute MLE for β & (σ u, σ v or τ) Upper Confidence Bound for π i based on V ar(eblup ) 8

9 Defining Effective Sample Size n i If variance V i n = π i(1 π i ) i n of direct π i estimator in area i i is reliably estimated and sampling fraction f 1: DEFFi = V i π i (1 π i ), n i = n i DEFFi What if ˆV i is erratic or π i too small? Proposal 1: Area size via higher-level DEFF With ˆV /n = V (ˆπ), and DEFF at higher (e.g., State) level DEFF = ˆV π(1 π), n i = n i DEFF 9

10 Effective Sample Size, Continued Proposal 2: Eff. size modified by sampling weights w ik indiv.-level sampling weights within area i n i = n i DEFF k w 2 ik ( k w ik ) 2 / j,k w 2 jk ( j,k w jk ) 2 Effective sizes for survey CIs : Liu & Kott 2009 in Bayesian analysis: Chen et al. 2011, Malec

11 Data Example Census Coverage Measurement (CCM): evaluation of Census performance. finds Erroneous Enumeration (EE) rates for counties in sample. 170k Housing Units (HUs) and 1,728 counties. Census publishes detailed estimates for 128 counties with pop. 500k 11

12 Specific CCM Details National EE rate 2.7% for metro areas (Olson, 2012). For our 128 counties, P(ˆπ i = 0) > 0. EE Rates (i = county, k = HU): ˆπ = Σ j,k W jk Y jk Σ j,k W jk, ˆπ i = Σ k W ik Y ik Σ k W ik 12

13 Specific CCM Modeling Details Area- or cell-level models, all covariates from Census. BIC model-selection penalizes max loglik by k ln(n) (k = # regr. coef s, n = total sample size). Our selected model had 5 predictor variables: state EE rate, a synthetic estimator; area rate of single-unit households; area rate of large multi-unit households; area rate of urban households; area enumeration rate. 13

14 Variability Across Counties n = Proposal 2 Eff. S.S., n = Proposal 1 Eff. S.S. Mean 1st Q Med. 3rd Q St DEFF Cou n Cou n /n NB: n, n based on counties with 20 HUs in sample. 14

15 Results Inclusion of n led to wider, more conservative estimates for the UCBs. The Fay-Herriot and Logistic methods had slightly higher means and medians for UCBs for areas with ˆπ 0, but the Cell and Beta-Binomial methods had the highest maximum UCBs. Overall, all four methods created upper bounds within a similar range. 15

16 UCB of Production Counties where ˆπ i = Cell FH LOG BB 16

17 Conclusion & Future Plans Conclusions: For our project, we chose the Cell-Based method. With the results so close, a simpler method without a specified regression model was preferrable. We used the Proposal 2 n i for accurately capturing the variance because it did not assume SRS or equal weights within area. Results were more conservative which was a priority in this case. Future plans: extend approach to other applications test performance of DEFF under different assumptions? generalized R package for Census Bureau use? 17

18 References Chen, C., Lumley, T. and Wakefield, J. (2011), The Use of Sampling Weights in Bayesian Hierarchical Models for Small Area Estimation. U. Wash. Stat. Dept. Tech. Rep. #583. Fay, R. and Herriot, R. (1979, JASA). Liu, Y. and Kott, P. (2009, Jour. Official Stat.) Malec, D. (2005, Jour. Official Stat.) Small Area Estimation... Housing Units. Maples, J., Bell, W. and Huang, E. (2009 ASA Proc.) GVF s for area-level variances Olson, Doug (2012) Census Coverage Measurement Report on Modeling. DSSD #2010-G-13. Prentice, R. (1986, JASA) Beta-binomial regression Slud, Eric V. (2012). Small-Area Confidence Bounds... in Tables. Jour. Indian. Soc. Agric. Stat. 18

19 Thank You! 19

20 A Cell-Based Method Variance-Stabilizing transformation to arcsin sqrt scale of estimated area proportion y i /n i = yi /n i For area i : π i = true proportion, n i = eff. samp. size y i Binom(n i, π i) N (n i π i, n i π i (1 π i )) arcsin y i n i N (arcsin π i, 1 4n i ) -method Transformed scale 90% CI: arcsin y i /n i ± 1.645/ 4 n i Transform back to the probability scale by sin 2 (x) 20

21 Models which Borrow Strength across Areas Notation: ˆπ i = y i n = y i i n, â i = arcsin ˆπ i, η i = x i β i area-level observed predictors x i u i N (0, σ 2 u), v i N (0, σ 2 v ) random effects (1) Fay-Herriot : â i = η i + u i + ɛ i, ɛ i N (0, 1 4n ) i (2) Logistic Random-Intercept : y i Bin(n i, e η i +v i 1+e η i +v i ) (3) Beta-Binomial : y i Bin(n i, π i), π i Beta( τ eη i 1+e η i, τ 1+e η i ) Targets for small-area prediction: sin 2 (η i + u i ) in (1); e η i +v i 1+e η i +v i in (2); and π i in (3). 21

22 Fay-Herriot Model UCB Mod1 (FH): EBLUP ˆπ i = sin 2 (ˆθ i /n i ), based on θ i = η i + µ i, ˆθ i = ˆγ i y i + (1 ˆγ i ) x iˆβ, ˆγ i = UCB i = sin 2 ( 1 n i ˆσ 2 u ˆσ 2 u + (4n i ) 1 (ˆθ i + z ) α 2 ((1 ˆγ i) ˆσ u 2 + (1 ˆγ i ) 2 x i ˆVˆβ x i} 1/2 ) Fay and Herriot (1979), Slud (2012) NB: includes sample variability of ˆβ, not ˆσ 2 u Rao (2003) has more inclusive formulas for related mse 22

23 Logistic Random-Intercept Model-Based UCB Mod2 (LgstRI): EBLUP ˆπ i = g(y i + 1, n i + 1, ˆη i, ˆω 2 ) g(y i, n i, ˆη, ˆω2 ) where ˆη i = x i ˆβ, ˆω 2 = ˆσ 2 v + x i ˆVˆβ x i g(k, n, η, ω 2 ) = e (η+ωz)k (1 + e η+ωz φ(z)dz, φ( ) N (0, 1) ) n [ g(y UCB i = ˆπ i i + 2, n i + 2, ˆη, ˆω2 ) g(yi, n i, ˆη, ˆω2 ) ˆπ 2 i ] 1/2 NB: includes sample variability of ˆβ, not ˆσ 2 v. Jiang and Lahiri 2006, Slud

24 Beta-Binomial Model-Based UCB Mod3 (Beta-Bin): Estimation via Posterior, µ i = eη i 1 + e η i, π i y i Beta(τ µ i + y i, τ (1 µ i) + n i y i ) Empirical Bayes ˆπ i = y i + ˆτ ˆµ i n i + ˆτ Bootstrap approach to UCB 24

Eric V. Slud, Census Bureau & Univ. of Maryland Mathematics Department, University of Maryland, College Park MD 20742

Eric V. Slud, Census Bureau & Univ. of Maryland Mathematics Department, University of Maryland, College Park MD 20742 COMPARISON OF AGGREGATE VERSUS UNIT-LEVEL MODELS FOR SMALL-AREA ESTIMATION Eric V. Slud, Census Bureau & Univ. of Maryland Mathematics Department, University of Maryland, College Park MD 20742 Key words:

More information

Coverage Properties of Confidence Intervals for Proportions in Complex Sample Surveys

Coverage Properties of Confidence Intervals for Proportions in Complex Sample Surveys Coverage Properties of Confidence Intervals for Proportions in Complex Sample Surveys Carolina Franco Roderick J. A. Little Thomas A. Louis Eric V. Slud Abstract The challenges of constructing confidence

More information

A measurement error model approach to small area estimation

A measurement error model approach to small area estimation A measurement error model approach to small area estimation Jae-kwang Kim 1 Spring, 2015 1 Joint work with Seunghwan Park and Seoyoung Kim Ouline Introduction Basic Theory Application to Korean LFS Discussion

More information

Small Area Estimation with Uncertain Random Effects

Small Area Estimation with Uncertain Random Effects Small Area Estimation with Uncertain Random Effects Gauri Sankar Datta 1 2 3 and Abhyuday Mandal 4 Abstract Random effects models play an important role in model-based small area estimation. Random effects

More information

Introduction to Survey Data Integration

Introduction to Survey Data Integration Introduction to Survey Data Integration Jae-Kwang Kim Iowa State University May 20, 2014 Outline 1 Introduction 2 Survey Integration Examples 3 Basic Theory for Survey Integration 4 NASS application 5

More information

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Jae-Kwang Kim 1 Iowa State University June 26, 2013 1 Joint work with Shu Yang Introduction 1 Introduction

More information

On Modifications to Linking Variance Estimators in the Fay-Herriot Model that Induce Robustness

On Modifications to Linking Variance Estimators in the Fay-Herriot Model that Induce Robustness Statistics and Applications {ISSN 2452-7395 (online)} Volume 16 No. 1, 2018 (New Series), pp 289-303 On Modifications to Linking Variance Estimators in the Fay-Herriot Model that Induce Robustness Snigdhansu

More information

Spatial Modeling and Prediction of County-Level Employment Growth Data

Spatial Modeling and Prediction of County-Level Employment Growth Data Spatial Modeling and Prediction of County-Level Employment Growth Data N. Ganesh Abstract For correlated sample survey estimates, a linear model with covariance matrix in which small areas are grouped

More information

Small Domains Estimation and Poverty Indicators. Carleton University, Ottawa, Canada

Small Domains Estimation and Poverty Indicators. Carleton University, Ottawa, Canada Small Domains Estimation and Poverty Indicators J. N. K. Rao Carleton University, Ottawa, Canada Invited paper for presentation at the International Seminar Population Estimates and Projections: Methodologies,

More information

Eric V. Slud, Census Bureau & Univ. of Maryland Mathematics Department, University of Maryland, College Park MD 20742

Eric V. Slud, Census Bureau & Univ. of Maryland Mathematics Department, University of Maryland, College Park MD 20742 MODEL-ASSISTED WEIGHTING FOR SURVEYS WITH MULTIPLE RESPONSE MODE Eric V. Slud, Census Bureau & Univ. of Maryland Mathematics Department, University of Maryland, College Park MD 20742 Key words: American

More information

Answer Key for STAT 200B HW No. 8

Answer Key for STAT 200B HW No. 8 Answer Key for STAT 200B HW No. 8 May 8, 2007 Problem 3.42 p. 708 The values of Ȳ for x 00, 0, 20, 30 are 5/40, 0, 20/50, and, respectively. From Corollary 3.5 it follows that MLE exists i G is identiable

More information

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A. 1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n

More information

Web Appendix for Hierarchical Adaptive Regression Kernels for Regression with Functional Predictors by D. B. Woodard, C. Crainiceanu, and D.

Web Appendix for Hierarchical Adaptive Regression Kernels for Regression with Functional Predictors by D. B. Woodard, C. Crainiceanu, and D. Web Appendix for Hierarchical Adaptive Regression Kernels for Regression with Functional Predictors by D. B. Woodard, C. Crainiceanu, and D. Ruppert A. EMPIRICAL ESTIMATE OF THE KERNEL MIXTURE Here we

More information

Estimation of Complex Small Area Parameters with Application to Poverty Indicators

Estimation of Complex Small Area Parameters with Application to Poverty Indicators 1 Estimation of Complex Small Area Parameters with Application to Poverty Indicators J.N.K. Rao School of Mathematics and Statistics, Carleton University (Joint work with Isabel Molina from Universidad

More information

A note on multiple imputation for general purpose estimation

A note on multiple imputation for general purpose estimation A note on multiple imputation for general purpose estimation Shu Yang Jae Kwang Kim SSC meeting June 16, 2015 Shu Yang, Jae Kwang Kim Multiple Imputation June 16, 2015 1 / 32 Introduction Basic Setup Assume

More information

Robust Hierarchical Bayes Small Area Estimation for Nested Error Regression Model

Robust Hierarchical Bayes Small Area Estimation for Nested Error Regression Model Robust Hierarchical Bayes Small Area Estimation for Nested Error Regression Model Adrijo Chakraborty, Gauri Sankar Datta,3 and Abhyuday Mandal NORC at the University of Chicago, Bethesda, MD 084, USA Department

More information

Contextual Effects in Modeling for Small Domains

Contextual Effects in Modeling for Small Domains University of Wollongong Research Online Applied Statistics Education and Research Collaboration (ASEARC) - Conference Papers Faculty of Engineering and Information Sciences 2011 Contextual Effects in

More information

Measurement error as missing data: the case of epidemiologic assays. Roderick J. Little

Measurement error as missing data: the case of epidemiologic assays. Roderick J. Little Measurement error as missing data: the case of epidemiologic assays Roderick J. Little Outline Discuss two related calibration topics where classical methods are deficient (A) Limit of quantification methods

More information

Chapter 5: Models used in conjunction with sampling. J. Kim, W. Fuller (ISU) Chapter 5: Models used in conjunction with sampling 1 / 70

Chapter 5: Models used in conjunction with sampling. J. Kim, W. Fuller (ISU) Chapter 5: Models used in conjunction with sampling 1 / 70 Chapter 5: Models used in conjunction with sampling J. Kim, W. Fuller (ISU) Chapter 5: Models used in conjunction with sampling 1 / 70 Nonresponse Unit Nonresponse: weight adjustment Item Nonresponse:

More information

Frequentist Accuracy of Bayesian Estimates

Frequentist Accuracy of Bayesian Estimates Frequentist Accuracy of Bayesian Estimates Bradley Efron Stanford University RSS Journal Webinar Objective Bayesian Inference Probability family F = {f µ (x), µ Ω} Parameter of interest: θ = t(µ) Prior

More information

Plausible Values for Latent Variables Using Mplus

Plausible Values for Latent Variables Using Mplus Plausible Values for Latent Variables Using Mplus Tihomir Asparouhov and Bengt Muthén August 21, 2010 1 1 Introduction Plausible values are imputed values for latent variables. All latent variables can

More information

Bayesian performance

Bayesian performance Bayesian performance Frequentist properties of estimators refer to the performance of an estimator (say the posterior mean) over repeated experiments under the same conditions. The posterior distribution

More information

Estimation of small-area proportions using covariates and survey data

Estimation of small-area proportions using covariates and survey data Journal of Statistical Planning and Inference 112 (2003) 89 98 www.elsevier.com/locate/jspi Estimation of small-area proportions using covariates and survey data Michael D. Larsen Department of Statistics,

More information

Empirical Bayes Analysis for a Hierarchical Negative Binomial Generalized Linear Model

Empirical Bayes Analysis for a Hierarchical Negative Binomial Generalized Linear Model International Journal of Contemporary Mathematical Sciences Vol. 9, 2014, no. 12, 591-612 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ijcms.2014.413 Empirical Bayes Analysis for a Hierarchical

More information

Bayesian Inference and the Parametric Bootstrap. Bradley Efron Stanford University

Bayesian Inference and the Parametric Bootstrap. Bradley Efron Stanford University Bayesian Inference and the Parametric Bootstrap Bradley Efron Stanford University Importance Sampling for Bayes Posterior Distribution Newton and Raftery (1994 JRSS-B) Nonparametric Bootstrap: good choice

More information

Master s Written Examination

Master s Written Examination Master s Written Examination Option: Statistics and Probability Spring 05 Full points may be obtained for correct answers to eight questions Each numbered question (which may have several parts) is worth

More information

Bayesian variable selection via. Penalized credible regions. Brian Reich, NCSU. Joint work with. Howard Bondell and Ander Wilson

Bayesian variable selection via. Penalized credible regions. Brian Reich, NCSU. Joint work with. Howard Bondell and Ander Wilson Bayesian variable selection via penalized credible regions Brian Reich, NC State Joint work with Howard Bondell and Ander Wilson Brian Reich, NCSU Penalized credible regions 1 Motivation big p, small n

More information

Stat 217 Final Exam. Name: May 1, 2002

Stat 217 Final Exam. Name: May 1, 2002 Stat 217 Final Exam Name: May 1, 2002 Problem 1. Three brands of batteries are under study. It is suspected that the lives (in weeks) of the three brands are different. Five batteries of each brand are

More information

Confidence Interval Estimation of Small Area Parameters Shrinking Both Means and Variances

Confidence Interval Estimation of Small Area Parameters Shrinking Both Means and Variances Confidence Interval Estimation of Small Area Parameters Shrinking Both Means and Variances Sarat C. Dass 1, Tapabrata Maiti 1, Hao Ren 1 and Samiran Sinha 2 1 Department of Statistics & Probability, Michigan

More information

Module 22: Bayesian Methods Lecture 9 A: Default prior selection

Module 22: Bayesian Methods Lecture 9 A: Default prior selection Module 22: Bayesian Methods Lecture 9 A: Default prior selection Peter Hoff Departments of Statistics and Biostatistics University of Washington Outline Jeffreys prior Unit information priors Empirical

More information

Accounting for Complex Sample Designs via Mixture Models

Accounting for Complex Sample Designs via Mixture Models Accounting for Complex Sample Designs via Finite Normal Mixture Models 1 1 University of Michigan School of Public Health August 2009 Talk Outline 1 2 Accommodating Sampling Weights in Mixture Models 3

More information

Categorical Predictor Variables

Categorical Predictor Variables Categorical Predictor Variables We often wish to use categorical (or qualitative) variables as covariates in a regression model. For binary variables (taking on only 2 values, e.g. sex), it is relatively

More information

Multivariate beta regression with application to small area estimation

Multivariate beta regression with application to small area estimation Multivariate beta regression with application to small area estimation Debora Ferreira de Souza debora@dme.ufrj.br Fernando Antônio da Silva Moura fmoura@im.ufrj.br Departamento de Métodos Estatísticos

More information

Linear Models A linear model is defined by the expression

Linear Models A linear model is defined by the expression Linear Models A linear model is defined by the expression x = F β + ɛ. where x = (x 1, x 2,..., x n ) is vector of size n usually known as the response vector. β = (β 1, β 2,..., β p ) is the transpose

More information

Beyond GLM and likelihood

Beyond GLM and likelihood Stat 6620: Applied Linear Models Department of Statistics Western Michigan University Statistics curriculum Core knowledge (modeling and estimation) Math stat 1 (probability, distributions, convergence

More information

Chapter 4: Imputation

Chapter 4: Imputation Chapter 4: Imputation Jae-Kwang Kim Department of Statistics, Iowa State University Outline 1 Introduction 2 Basic Theory for imputation 3 Variance estimation after imputation 4 Replication variance estimation

More information

STAT 3A03 Applied Regression With SAS Fall 2017

STAT 3A03 Applied Regression With SAS Fall 2017 STAT 3A03 Applied Regression With SAS Fall 2017 Assignment 2 Solution Set Q. 1 I will add subscripts relating to the question part to the parameters and their estimates as well as the errors and residuals.

More information

Estimation and Model Selection in Mixed Effects Models Part I. Adeline Samson 1

Estimation and Model Selection in Mixed Effects Models Part I. Adeline Samson 1 Estimation and Model Selection in Mixed Effects Models Part I Adeline Samson 1 1 University Paris Descartes Summer school 2009 - Lipari, Italy These slides are based on Marc Lavielle s slides Outline 1

More information

Domain Estimation of Survey Discontinuities

Domain Estimation of Survey Discontinuities Domain Estimation of Survey Discontinuities Nikos Tzavidis 1 Joint work with Paul Smith (University of Southampton), Timo Schmid, Natalia Rojas-Perilla, (Freie Universität Berlin), Jan van den Brackel

More information

Lecture 20 May 18, Empirical Bayes Interpretation [Efron & Morris 1973]

Lecture 20 May 18, Empirical Bayes Interpretation [Efron & Morris 1973] Stats 300C: Theory of Statistics Spring 2018 Lecture 20 May 18, 2018 Prof. Emmanuel Candes Scribe: Will Fithian and E. Candes 1 Outline 1. Stein s Phenomenon 2. Empirical Bayes Interpretation of James-Stein

More information

Stat 579: Generalized Linear Models and Extensions

Stat 579: Generalized Linear Models and Extensions Stat 579: Generalized Linear Models and Extensions Linear Mixed Models for Longitudinal Data Yan Lu April, 2018, week 15 1 / 38 Data structure t1 t2 tn i 1st subject y 11 y 12 y 1n1 Experimental 2nd subject

More information

Model-based Estimation of Poverty Indicators for Small Areas: Overview. J. N. K. Rao Carleton University, Ottawa, Canada

Model-based Estimation of Poverty Indicators for Small Areas: Overview. J. N. K. Rao Carleton University, Ottawa, Canada Model-based Estimation of Poverty Indicators for Small Areas: Overview J. N. K. Rao Carleton University, Ottawa, Canada Isabel Molina Universidad Carlos III de Madrid, Spain Paper presented at The First

More information

Linear Mixed Models. One-way layout REML. Likelihood. Another perspective. Relationship to classical ideas. Drawbacks.

Linear Mixed Models. One-way layout REML. Likelihood. Another perspective. Relationship to classical ideas. Drawbacks. Linear Mixed Models One-way layout Y = Xβ + Zb + ɛ where X and Z are specified design matrices, β is a vector of fixed effect coefficients, b and ɛ are random, mean zero, Gaussian if needed. Usually think

More information

Small area prediction based on unit level models when the covariate mean is measured with error

Small area prediction based on unit level models when the covariate mean is measured with error Graduate Theses and Dissertations Iowa State University Capstones, Theses and Dissertations 2015 Small area prediction based on unit level models when the covariate mean is measured with error Andreea

More information

Small Area Modeling of County Estimates for Corn and Soybean Yields in the US

Small Area Modeling of County Estimates for Corn and Soybean Yields in the US Small Area Modeling of County Estimates for Corn and Soybean Yields in the US Matt Williams National Agricultural Statistics Service United States Department of Agriculture Matt.Williams@nass.usda.gov

More information

Nonparametric Small Area Estimation via M-quantile Regression using Penalized Splines

Nonparametric Small Area Estimation via M-quantile Regression using Penalized Splines Nonparametric Small Estimation via M-quantile Regression using Penalized Splines Monica Pratesi 10 August 2008 Abstract The demand of reliable statistics for small areas, when only reduced sizes of the

More information

Biostatistics Advanced Methods in Biostatistics IV

Biostatistics Advanced Methods in Biostatistics IV Biostatistics 140.754 Advanced Methods in Biostatistics IV Jeffrey Leek Assistant Professor Department of Biostatistics jleek@jhsph.edu Lecture 12 1 / 36 Tip + Paper Tip: As a statistician the results

More information

BIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation

BIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation BIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation Yujin Chung November 29th, 2016 Fall 2016 Yujin Chung Lec13: MLE Fall 2016 1/24 Previous Parametric tests Mean comparisons (normality assumption)

More information

Some Curiosities Arising in Objective Bayesian Analysis

Some Curiosities Arising in Objective Bayesian Analysis . Some Curiosities Arising in Objective Bayesian Analysis Jim Berger Duke University Statistical and Applied Mathematical Institute Yale University May 15, 2009 1 Three vignettes related to John s work

More information

Chapter 01: Introduction to Small Area Estimation

Chapter 01: Introduction to Small Area Estimation Chapter 01: Introduction to Small Area Estimation Taps Maiti Department of Statistics & Probability Michigan State University Sept 23-24, 2013 Taps Maiti Department of Statistics & Probability Chapter

More information

ABSTRACT. Nadarajasundaram Ganesh Doctor of Philosophy, 2007

ABSTRACT. Nadarajasundaram Ganesh Doctor of Philosophy, 2007 ABSTRACT Title of dissertation: SMALL AREA ESTIMATION AND PREDICTION PROBLEMS: SPATIAL MODELS, BAYESIAN MULTIPLE COMPARISONS, AND ROBUST MSE ESTIMATION. Nadarajasundaram Ganesh Doctor of Philosophy, 007

More information

Statistical Inference Using Progressively Type-II Censored Data with Random Scheme

Statistical Inference Using Progressively Type-II Censored Data with Random Scheme International Mathematical Forum, 3, 28, no. 35, 1713-1725 Statistical Inference Using Progressively Type-II Censored Data with Random Scheme Ammar M. Sarhan 1 and A. Abuammoh Department of Statistics

More information

Lecture 13: More on Binary Data

Lecture 13: More on Binary Data Lecture 1: More on Binary Data Link functions for Binomial models Link η = g(π) π = g 1 (η) identity π η logarithmic log π e η logistic log ( π 1 π probit Φ 1 (π) Φ(η) log-log log( log π) exp( e η ) complementary

More information

STAT 740: Testing & Model Selection

STAT 740: Testing & Model Selection STAT 740: Testing & Model Selection Timothy Hanson Department of Statistics, University of South Carolina Stat 740: Statistical Computing 1 / 34 Testing & model choice, likelihood-based A common way to

More information

A union of Bayesian, frequentist and fiducial inferences by confidence distribution and artificial data sampling

A union of Bayesian, frequentist and fiducial inferences by confidence distribution and artificial data sampling A union of Bayesian, frequentist and fiducial inferences by confidence distribution and artificial data sampling Min-ge Xie Department of Statistics, Rutgers University Workshop on Higher-Order Asymptotics

More information

A BAYESIAN APPROACH TO JOINT SMALL AREA ESTIMATION

A BAYESIAN APPROACH TO JOINT SMALL AREA ESTIMATION A BAYESIAN APPROACH TO JOINT SMALL AREA ESTIMATION A DISSERTATION SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL OF THE UNIVERSITY OF MINNESOTA BY Yanping Qu IN PARTIAL FULFILLMENT OF THE REQUIREMENTS

More information

Modeling Real Estate Data using Quantile Regression

Modeling Real Estate Data using Quantile Regression Modeling Real Estate Data using Semiparametric Quantile Regression Department of Statistics University of Innsbruck September 9th, 2011 Overview 1 Application: 2 3 4 Hedonic regression data for house prices

More information

Applied Regression. Applied Regression. Chapter 2 Simple Linear Regression. Hongcheng Li. April, 6, 2013

Applied Regression. Applied Regression. Chapter 2 Simple Linear Regression. Hongcheng Li. April, 6, 2013 Applied Regression Chapter 2 Simple Linear Regression Hongcheng Li April, 6, 2013 Outline 1 Introduction of simple linear regression 2 Scatter plot 3 Simple linear regression model 4 Test of Hypothesis

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 15: Examples of hypothesis tests (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 32 The recipe 2 / 32 The hypothesis testing recipe In this lecture we repeatedly apply the

More information

g-priors for Linear Regression

g-priors for Linear Regression Stat60: Bayesian Modeling and Inference Lecture Date: March 15, 010 g-priors for Linear Regression Lecturer: Michael I. Jordan Scribe: Andrew H. Chan 1 Linear regression and g-priors In the last lecture,

More information

Review for Final Exam Stat 205: Statistics for the Life Sciences

Review for Final Exam Stat 205: Statistics for the Life Sciences Review for Final Exam Stat 205: Statistics for the Life Sciences Tim Hanson, Ph.D. University of South Carolina T. Hanson (USC) Stat 205: Statistics for the Life Sciences 1 / 20 Overview of Final Exam

More information

arxiv: v1 [math.st] 16 Jan 2017

arxiv: v1 [math.st] 16 Jan 2017 arxiv:1701.04176v1 [math.st] 16 Jan 2017 A New Model Variance Estimator for an Area Level Small Area Model to Solve Multiple Problems Simultaneously Masayo Yoshimori Hirose The Institute of Statistical

More information

Biostat Methods STAT 5820/6910 Handout #9a: Intro. to Meta-Analysis Methods

Biostat Methods STAT 5820/6910 Handout #9a: Intro. to Meta-Analysis Methods Biostat Methods STAT 5820/6910 Handout #9a: Intro. to Meta-Analysis Methods Meta-analysis describes statistical approach to systematically combine results from multiple studies [identified follong an exhaustive

More information

Fractional Imputation in Survey Sampling: A Comparative Review

Fractional Imputation in Survey Sampling: A Comparative Review Fractional Imputation in Survey Sampling: A Comparative Review Shu Yang Jae-Kwang Kim Iowa State University Joint Statistical Meetings, August 2015 Outline Introduction Fractional imputation Features Numerical

More information

36. Multisample U-statistics and jointly distributed U-statistics Lehmann 6.1

36. Multisample U-statistics and jointly distributed U-statistics Lehmann 6.1 36. Multisample U-statistics jointly distributed U-statistics Lehmann 6.1 In this topic, we generalize the idea of U-statistics in two different directions. First, we consider single U-statistics for situations

More information

Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design

Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design 1 / 32 Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design Changbao Wu Department of Statistics and Actuarial Science University of Waterloo (Joint work with Min Chen and Mary

More information

On the bias of the multiple-imputation variance estimator in survey sampling

On the bias of the multiple-imputation variance estimator in survey sampling J. R. Statist. Soc. B (2006) 68, Part 3, pp. 509 521 On the bias of the multiple-imputation variance estimator in survey sampling Jae Kwang Kim, Yonsei University, Seoul, Korea J. Michael Brick, Westat,

More information

Linear Regression. In this problem sheet, we consider the problem of linear regression with p predictors and one intercept,

Linear Regression. In this problem sheet, we consider the problem of linear regression with p predictors and one intercept, Linear Regression In this problem sheet, we consider the problem of linear regression with p predictors and one intercept, y = Xβ + ɛ, where y t = (y 1,..., y n ) is the column vector of target values,

More information

VARIANCE ESTIMATION FOR NEAREST NEIGHBOR IMPUTATION FOR U.S. CENSUS LONG FORM DATA

VARIANCE ESTIMATION FOR NEAREST NEIGHBOR IMPUTATION FOR U.S. CENSUS LONG FORM DATA Submitted to the Annals of Applied Statistics VARIANCE ESTIMATION FOR NEAREST NEIGHBOR IMPUTATION FOR U.S. CENSUS LONG FORM DATA By Jae Kwang Kim, Wayne A. Fuller and William R. Bell Iowa State University

More information

Functional Mixed Effects Spectral Analysis

Functional Mixed Effects Spectral Analysis Joint with Robert Krafty and Martica Hall June 4, 2014 Outline Introduction Motivating example Brief review Functional mixed effects spectral analysis Estimation Procedure Application Remarks Introduction

More information

Obnoxious lateness humor

Obnoxious lateness humor Obnoxious lateness humor 1 Using Bayesian Model Averaging For Addressing Model Uncertainty in Environmental Risk Assessment Louise Ryan and Melissa Whitney Department of Biostatistics Harvard School of

More information

IEOR E4703: Monte-Carlo Simulation

IEOR E4703: Monte-Carlo Simulation IEOR E4703: Monte-Carlo Simulation Output Analysis for Monte-Carlo Martin Haugh Department of Industrial Engineering and Operations Research Columbia University Email: martin.b.haugh@gmail.com Output Analysis

More information

Statistical Analysis of List Experiments

Statistical Analysis of List Experiments Statistical Analysis of List Experiments Graeme Blair Kosuke Imai Princeton University December 17, 2010 Blair and Imai (Princeton) List Experiments Political Methodology Seminar 1 / 32 Motivation Surveys

More information

Sampling and Estimation in Network Graphs

Sampling and Estimation in Network Graphs Sampling and Estimation in Network Graphs Gonzalo Mateos Dept. of ECE and Goergen Institute for Data Science University of Rochester gmateosb@ece.rochester.edu http://www.ece.rochester.edu/~gmateosb/ March

More information

STAT 704 Sections IRLS and Bootstrap

STAT 704 Sections IRLS and Bootstrap STAT 704 Sections 11.4-11.5. IRLS and John Grego Department of Statistics, University of South Carolina Stat 704: Data Analysis I 1 / 14 LOWESS IRLS LOWESS LOWESS (LOcally WEighted Scatterplot Smoothing)

More information

On Small Area Prediction Interval Problems

On Small Area Prediction Interval Problems On Small Area Prediction Interval Problems Snigdhansu Chatterjee, Parthasarathi Lahiri, Huilin Li University of Minnesota, University of Maryland, University of Maryland Abstract Empirical best linear

More information

Outline of GLMs. Definitions

Outline of GLMs. Definitions Outline of GLMs Definitions This is a short outline of GLM details, adapted from the book Nonparametric Regression and Generalized Linear Models, by Green and Silverman. The responses Y i have density

More information

FE670 Algorithmic Trading Strategies. Stevens Institute of Technology

FE670 Algorithmic Trading Strategies. Stevens Institute of Technology FE670 Algorithmic Trading Strategies Lecture 8. Robust Portfolio Optimization Steve Yang Stevens Institute of Technology 10/17/2013 Outline 1 Robust Mean-Variance Formulations 2 Uncertain in Expected Return

More information

LASSO-Type Penalization in the Framework of Generalized Additive Models for Location, Scale and Shape

LASSO-Type Penalization in the Framework of Generalized Additive Models for Location, Scale and Shape LASSO-Type Penalization in the Framework of Generalized Additive Models for Location, Scale and Shape Nikolaus Umlauf https://eeecon.uibk.ac.at/~umlauf/ Overview Joint work with Andreas Groll, Julien Hambuckers

More information

Multi-level Models: Idea

Multi-level Models: Idea Review of 140.656 Review Introduction to multi-level models The two-stage normal-normal model Two-stage linear models with random effects Three-stage linear models Two-stage logistic regression with random

More information

Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics. Jiti Gao

Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics. Jiti Gao Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics Jiti Gao Department of Statistics School of Mathematics and Statistics The University of Western Australia Crawley

More information

Linear Regression Models P8111

Linear Regression Models P8111 Linear Regression Models P8111 Lecture 25 Jeff Goldsmith April 26, 2016 1 of 37 Today s Lecture Logistic regression / GLMs Model framework Interpretation Estimation 2 of 37 Linear regression Course started

More information

Good Confidence Intervals for Categorical Data Analyses. Alan Agresti

Good Confidence Intervals for Categorical Data Analyses. Alan Agresti Good Confidence Intervals for Categorical Data Analyses Alan Agresti Department of Statistics, University of Florida visiting Statistics Department, Harvard University LSHTM, July 22, 2011 p. 1/36 Outline

More information

Default Priors and Effcient Posterior Computation in Bayesian

Default Priors and Effcient Posterior Computation in Bayesian Default Priors and Effcient Posterior Computation in Bayesian Factor Analysis January 16, 2010 Presented by Eric Wang, Duke University Background and Motivation A Brief Review of Parameter Expansion Literature

More information

1. Fisher Information

1. Fisher Information 1. Fisher Information Let f(x θ) be a density function with the property that log f(x θ) is differentiable in θ throughout the open p-dimensional parameter set Θ R p ; then the score statistic (or score

More information

Interval estimation. October 3, Basic ideas CLT and CI CI for a population mean CI for a population proportion CI for a Normal mean

Interval estimation. October 3, Basic ideas CLT and CI CI for a population mean CI for a population proportion CI for a Normal mean Interval estimation October 3, 2018 STAT 151 Class 7 Slide 1 Pandemic data Treatment outcome, X, from n = 100 patients in a pandemic: 1 = recovered and 0 = not recovered 1 1 1 0 0 0 1 1 1 0 0 1 0 1 0 0

More information

Simple and Multiple Linear Regression

Simple and Multiple Linear Regression Sta. 113 Chapter 12 and 13 of Devore March 12, 2010 Table of contents 1 Simple Linear Regression 2 Model Simple Linear Regression A simple linear regression model is given by Y = β 0 + β 1 x + ɛ where

More information

Miscellanea A note on multiple imputation under complex sampling

Miscellanea A note on multiple imputation under complex sampling Biometrika (2017), 104, 1,pp. 221 228 doi: 10.1093/biomet/asw058 Printed in Great Britain Advance Access publication 3 January 2017 Miscellanea A note on multiple imputation under complex sampling BY J.

More information

Chapter 10. Semi-Supervised Learning

Chapter 10. Semi-Supervised Learning Chapter 10. Semi-Supervised Learning Wei Pan Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN 55455 Email: weip@biostat.umn.edu PubH 7475/8475 c Wei Pan Outline

More information

Nonparametric Small Area Estimation Using Penalized Spline Regression

Nonparametric Small Area Estimation Using Penalized Spline Regression Nonparametric Small Area Estimation Using Penalized Spline Regression 0verview Spline-based nonparametric regression Nonparametric small area estimation Prediction mean squared error Bootstrapping small

More information

MAT2377. Rafa l Kulik. Version 2015/November/26. Rafa l Kulik

MAT2377. Rafa l Kulik. Version 2015/November/26. Rafa l Kulik MAT2377 Rafa l Kulik Version 2015/November/26 Rafa l Kulik Bivariate data and scatterplot Data: Hydrocarbon level (x) and Oxygen level (y): x: 0.99, 1.02, 1.15, 1.29, 1.46, 1.36, 0.87, 1.23, 1.55, 1.40,

More information

Regression Discontinuity Designs in Stata

Regression Discontinuity Designs in Stata Regression Discontinuity Designs in Stata Matias D. Cattaneo University of Michigan July 30, 2015 Overview Main goal: learn about treatment effect of policy or intervention. If treatment randomization

More information

The Multilevel Logit Model for Binary Dependent Variables Marco R. Steenbergen

The Multilevel Logit Model for Binary Dependent Variables Marco R. Steenbergen The Multilevel Logit Model for Binary Dependent Variables Marco R. Steenbergen January 23-24, 2012 Page 1 Part I The Single Level Logit Model: A Review Motivating Example Imagine we are interested in voting

More information

Multivariate Survival Analysis

Multivariate Survival Analysis Multivariate Survival Analysis Previously we have assumed that either (X i, δ i ) or (X i, δ i, Z i ), i = 1,..., n, are i.i.d.. This may not always be the case. Multivariate survival data can arise in

More information

Practical Econometrics. for. Finance and Economics. (Econometrics 2)

Practical Econometrics. for. Finance and Economics. (Econometrics 2) Practical Econometrics for Finance and Economics (Econometrics 2) Seppo Pynnönen and Bernd Pape Department of Mathematics and Statistics, University of Vaasa 1. Introduction 1.1 Econometrics Econometrics

More information

Bayesian methods in economics and finance

Bayesian methods in economics and finance 1/26 Bayesian methods in economics and finance Linear regression: Bayesian model selection and sparsity priors Linear Regression 2/26 Linear regression Model for relationship between (several) independent

More information

COS513 LECTURE 8 STATISTICAL CONCEPTS

COS513 LECTURE 8 STATISTICAL CONCEPTS COS513 LECTURE 8 STATISTICAL CONCEPTS NIKOLAI SLAVOV AND ANKUR PARIKH 1. MAKING MEANINGFUL STATEMENTS FROM JOINT PROBABILITY DISTRIBUTIONS. A graphical model (GM) represents a family of probability distributions

More information

Statement: With my signature I confirm that the solutions are the product of my own work. Name: Signature:.

Statement: With my signature I confirm that the solutions are the product of my own work. Name: Signature:. MATHEMATICAL STATISTICS Homework assignment Instructions Please turn in the homework with this cover page. You do not need to edit the solutions. Just make sure the handwriting is legible. You may discuss

More information

Answer Key for STAT 200B HW No. 7

Answer Key for STAT 200B HW No. 7 Answer Key for STAT 200B HW No. 7 May 5, 2007 Problem 2.2 p. 649 Assuming binomial 2-sample model ˆπ =.75, ˆπ 2 =.6. a ˆτ = ˆπ 2 ˆπ =.5. From Ex. 2.5a on page 644: ˆπ ˆπ + ˆπ 2 ˆπ 2.75.25.6.4 = + =.087;

More information

A Resampling Method on Pivotal Estimating Functions

A Resampling Method on Pivotal Estimating Functions A Resampling Method on Pivotal Estimating Functions Kun Nie Biostat 277,Winter 2004 March 17, 2004 Outline Introduction A General Resampling Method Examples - Quantile Regression -Rank Regression -Simulation

More information