Small Area Confidence Bounds on Small Cell Proportions in Survey Populations
|
|
- Duane Baker
- 5 years ago
- Views:
Transcription
1 Small Area Confidence Bounds on Small Cell Proportions in Survey Populations Aaron Gilary, Jerry Maples, U.S. Census Bureau U.S. Census Bureau Eric V. Slud, U.S. Census Bureau Univ. Maryland College Park DC-AAPOR / WSS 1
2 Outline General Problem Cell-Based vs. Model-Based Methods Alternative Definitions of Effective Sample Size Data Example Erroneous Enumeration of HUs in Census Coverage Measurement study (CCM) Numerical Results Conclusion and Future Plans 2
3 General Problem Setting: given direct (ratio) estimates for small proportions ˆπ i at level of cell or domain i (e.g., county) Problem: specify upper confidence bound for ˆπ i either bound in transformed measurement scale h(π i ) from data within domain i or model-based bound connecting values π i across domains using predictors x i Approach: using effective sample-sizes n i, adapt binomial/srs estimator V ar(ˆπ i ) = ˆπ i (1 ˆπ i )/n i 3
4 Applications demographic tables in ACS, the American Community Survey; area-level Erroneous Enumeration rates in CCM, Census Coverage Measurement; and rates in other Census Bureau surveys. 4
5 A Good Cell-Based Method Liu and Kott 2009, Survey Methodology π i = true proportion n i = effective sample size, yi Binom(n i, π i) ˆπ i = y i n i = y i n i direct estimator Transformation: asin( ˆπ) (Variance-stabilizing) centered at asin( π), Var 1/(4n i ) Obtain UCB on arcsin scale, transform back to prob. scale by sin 2 (x) 5
6 Ideas of Model-Based Methods π i includes { predicted part ηi = x i β unmodeled random component (1) Fay-Herriot: π i = sin 2 (η i +u i ), u i N(0, σ 2 u) (2) Logistic: π i = eη i +v i 1+e η i +v i, v i N(0, σ 2 v ) (3) Beta-Binomial: π i = Beta( τ eη i 1+e η i, τ 1+e η i ) Parameters σ 2 u, σ 2 v, (1 + τ) 1 quantify imprecision of π i in terms of η i 6
7 Model-Based Methods, Continued Recall y i /n i = ˆπ i Fay-Herriot: arcsin( ˆπ i ) N(arcsin( π i ), Logistic & Beta-Binomial: 1 4n i ) y i Binom(n i, π i) 7
8 Estimation in Model-Based Methods Point Predictor for π i generally different from direct estimator BLUP: E(π i y i ) EBLUP: substitute MLE for β & (σ u, σ v or τ) Upper Confidence Bound for π i based on V ar(eblup ) 8
9 Defining Effective Sample Size n i If variance V i n = π i(1 π i ) i n of direct π i estimator in area i i is reliably estimated and sampling fraction f 1: DEFFi = V i π i (1 π i ), n i = n i DEFFi What if ˆV i is erratic or π i too small? Proposal 1: Area size via higher-level DEFF With ˆV /n = V (ˆπ), and DEFF at higher (e.g., State) level DEFF = ˆV π(1 π), n i = n i DEFF 9
10 Effective Sample Size, Continued Proposal 2: Eff. size modified by sampling weights w ik indiv.-level sampling weights within area i n i = n i DEFF k w 2 ik ( k w ik ) 2 / j,k w 2 jk ( j,k w jk ) 2 Effective sizes for survey CIs : Liu & Kott 2009 in Bayesian analysis: Chen et al. 2011, Malec
11 Data Example Census Coverage Measurement (CCM): evaluation of Census performance. finds Erroneous Enumeration (EE) rates for counties in sample. 170k Housing Units (HUs) and 1,728 counties. Census publishes detailed estimates for 128 counties with pop. 500k 11
12 Specific CCM Details National EE rate 2.7% for metro areas (Olson, 2012). For our 128 counties, P(ˆπ i = 0) > 0. EE Rates (i = county, k = HU): ˆπ = Σ j,k W jk Y jk Σ j,k W jk, ˆπ i = Σ k W ik Y ik Σ k W ik 12
13 Specific CCM Modeling Details Area- or cell-level models, all covariates from Census. BIC model-selection penalizes max loglik by k ln(n) (k = # regr. coef s, n = total sample size). Our selected model had 5 predictor variables: state EE rate, a synthetic estimator; area rate of single-unit households; area rate of large multi-unit households; area rate of urban households; area enumeration rate. 13
14 Variability Across Counties n = Proposal 2 Eff. S.S., n = Proposal 1 Eff. S.S. Mean 1st Q Med. 3rd Q St DEFF Cou n Cou n /n NB: n, n based on counties with 20 HUs in sample. 14
15 Results Inclusion of n led to wider, more conservative estimates for the UCBs. The Fay-Herriot and Logistic methods had slightly higher means and medians for UCBs for areas with ˆπ 0, but the Cell and Beta-Binomial methods had the highest maximum UCBs. Overall, all four methods created upper bounds within a similar range. 15
16 UCB of Production Counties where ˆπ i = Cell FH LOG BB 16
17 Conclusion & Future Plans Conclusions: For our project, we chose the Cell-Based method. With the results so close, a simpler method without a specified regression model was preferrable. We used the Proposal 2 n i for accurately capturing the variance because it did not assume SRS or equal weights within area. Results were more conservative which was a priority in this case. Future plans: extend approach to other applications test performance of DEFF under different assumptions? generalized R package for Census Bureau use? 17
18 References Chen, C., Lumley, T. and Wakefield, J. (2011), The Use of Sampling Weights in Bayesian Hierarchical Models for Small Area Estimation. U. Wash. Stat. Dept. Tech. Rep. #583. Fay, R. and Herriot, R. (1979, JASA). Liu, Y. and Kott, P. (2009, Jour. Official Stat.) Malec, D. (2005, Jour. Official Stat.) Small Area Estimation... Housing Units. Maples, J., Bell, W. and Huang, E. (2009 ASA Proc.) GVF s for area-level variances Olson, Doug (2012) Census Coverage Measurement Report on Modeling. DSSD #2010-G-13. Prentice, R. (1986, JASA) Beta-binomial regression Slud, Eric V. (2012). Small-Area Confidence Bounds... in Tables. Jour. Indian. Soc. Agric. Stat. 18
19 Thank You! 19
20 A Cell-Based Method Variance-Stabilizing transformation to arcsin sqrt scale of estimated area proportion y i /n i = yi /n i For area i : π i = true proportion, n i = eff. samp. size y i Binom(n i, π i) N (n i π i, n i π i (1 π i )) arcsin y i n i N (arcsin π i, 1 4n i ) -method Transformed scale 90% CI: arcsin y i /n i ± 1.645/ 4 n i Transform back to the probability scale by sin 2 (x) 20
21 Models which Borrow Strength across Areas Notation: ˆπ i = y i n = y i i n, â i = arcsin ˆπ i, η i = x i β i area-level observed predictors x i u i N (0, σ 2 u), v i N (0, σ 2 v ) random effects (1) Fay-Herriot : â i = η i + u i + ɛ i, ɛ i N (0, 1 4n ) i (2) Logistic Random-Intercept : y i Bin(n i, e η i +v i 1+e η i +v i ) (3) Beta-Binomial : y i Bin(n i, π i), π i Beta( τ eη i 1+e η i, τ 1+e η i ) Targets for small-area prediction: sin 2 (η i + u i ) in (1); e η i +v i 1+e η i +v i in (2); and π i in (3). 21
22 Fay-Herriot Model UCB Mod1 (FH): EBLUP ˆπ i = sin 2 (ˆθ i /n i ), based on θ i = η i + µ i, ˆθ i = ˆγ i y i + (1 ˆγ i ) x iˆβ, ˆγ i = UCB i = sin 2 ( 1 n i ˆσ 2 u ˆσ 2 u + (4n i ) 1 (ˆθ i + z ) α 2 ((1 ˆγ i) ˆσ u 2 + (1 ˆγ i ) 2 x i ˆVˆβ x i} 1/2 ) Fay and Herriot (1979), Slud (2012) NB: includes sample variability of ˆβ, not ˆσ 2 u Rao (2003) has more inclusive formulas for related mse 22
23 Logistic Random-Intercept Model-Based UCB Mod2 (LgstRI): EBLUP ˆπ i = g(y i + 1, n i + 1, ˆη i, ˆω 2 ) g(y i, n i, ˆη, ˆω2 ) where ˆη i = x i ˆβ, ˆω 2 = ˆσ 2 v + x i ˆVˆβ x i g(k, n, η, ω 2 ) = e (η+ωz)k (1 + e η+ωz φ(z)dz, φ( ) N (0, 1) ) n [ g(y UCB i = ˆπ i i + 2, n i + 2, ˆη, ˆω2 ) g(yi, n i, ˆη, ˆω2 ) ˆπ 2 i ] 1/2 NB: includes sample variability of ˆβ, not ˆσ 2 v. Jiang and Lahiri 2006, Slud
24 Beta-Binomial Model-Based UCB Mod3 (Beta-Bin): Estimation via Posterior, µ i = eη i 1 + e η i, π i y i Beta(τ µ i + y i, τ (1 µ i) + n i y i ) Empirical Bayes ˆπ i = y i + ˆτ ˆµ i n i + ˆτ Bootstrap approach to UCB 24
Eric V. Slud, Census Bureau & Univ. of Maryland Mathematics Department, University of Maryland, College Park MD 20742
COMPARISON OF AGGREGATE VERSUS UNIT-LEVEL MODELS FOR SMALL-AREA ESTIMATION Eric V. Slud, Census Bureau & Univ. of Maryland Mathematics Department, University of Maryland, College Park MD 20742 Key words:
More informationCoverage Properties of Confidence Intervals for Proportions in Complex Sample Surveys
Coverage Properties of Confidence Intervals for Proportions in Complex Sample Surveys Carolina Franco Roderick J. A. Little Thomas A. Louis Eric V. Slud Abstract The challenges of constructing confidence
More informationA measurement error model approach to small area estimation
A measurement error model approach to small area estimation Jae-kwang Kim 1 Spring, 2015 1 Joint work with Seunghwan Park and Seoyoung Kim Ouline Introduction Basic Theory Application to Korean LFS Discussion
More informationSmall Area Estimation with Uncertain Random Effects
Small Area Estimation with Uncertain Random Effects Gauri Sankar Datta 1 2 3 and Abhyuday Mandal 4 Abstract Random effects models play an important role in model-based small area estimation. Random effects
More informationIntroduction to Survey Data Integration
Introduction to Survey Data Integration Jae-Kwang Kim Iowa State University May 20, 2014 Outline 1 Introduction 2 Survey Integration Examples 3 Basic Theory for Survey Integration 4 NASS application 5
More informationFractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling
Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Jae-Kwang Kim 1 Iowa State University June 26, 2013 1 Joint work with Shu Yang Introduction 1 Introduction
More informationOn Modifications to Linking Variance Estimators in the Fay-Herriot Model that Induce Robustness
Statistics and Applications {ISSN 2452-7395 (online)} Volume 16 No. 1, 2018 (New Series), pp 289-303 On Modifications to Linking Variance Estimators in the Fay-Herriot Model that Induce Robustness Snigdhansu
More informationSpatial Modeling and Prediction of County-Level Employment Growth Data
Spatial Modeling and Prediction of County-Level Employment Growth Data N. Ganesh Abstract For correlated sample survey estimates, a linear model with covariance matrix in which small areas are grouped
More informationSmall Domains Estimation and Poverty Indicators. Carleton University, Ottawa, Canada
Small Domains Estimation and Poverty Indicators J. N. K. Rao Carleton University, Ottawa, Canada Invited paper for presentation at the International Seminar Population Estimates and Projections: Methodologies,
More informationEric V. Slud, Census Bureau & Univ. of Maryland Mathematics Department, University of Maryland, College Park MD 20742
MODEL-ASSISTED WEIGHTING FOR SURVEYS WITH MULTIPLE RESPONSE MODE Eric V. Slud, Census Bureau & Univ. of Maryland Mathematics Department, University of Maryland, College Park MD 20742 Key words: American
More informationAnswer Key for STAT 200B HW No. 8
Answer Key for STAT 200B HW No. 8 May 8, 2007 Problem 3.42 p. 708 The values of Ȳ for x 00, 0, 20, 30 are 5/40, 0, 20/50, and, respectively. From Corollary 3.5 it follows that MLE exists i G is identiable
More informationFall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.
1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n
More informationWeb Appendix for Hierarchical Adaptive Regression Kernels for Regression with Functional Predictors by D. B. Woodard, C. Crainiceanu, and D.
Web Appendix for Hierarchical Adaptive Regression Kernels for Regression with Functional Predictors by D. B. Woodard, C. Crainiceanu, and D. Ruppert A. EMPIRICAL ESTIMATE OF THE KERNEL MIXTURE Here we
More informationEstimation of Complex Small Area Parameters with Application to Poverty Indicators
1 Estimation of Complex Small Area Parameters with Application to Poverty Indicators J.N.K. Rao School of Mathematics and Statistics, Carleton University (Joint work with Isabel Molina from Universidad
More informationA note on multiple imputation for general purpose estimation
A note on multiple imputation for general purpose estimation Shu Yang Jae Kwang Kim SSC meeting June 16, 2015 Shu Yang, Jae Kwang Kim Multiple Imputation June 16, 2015 1 / 32 Introduction Basic Setup Assume
More informationRobust Hierarchical Bayes Small Area Estimation for Nested Error Regression Model
Robust Hierarchical Bayes Small Area Estimation for Nested Error Regression Model Adrijo Chakraborty, Gauri Sankar Datta,3 and Abhyuday Mandal NORC at the University of Chicago, Bethesda, MD 084, USA Department
More informationContextual Effects in Modeling for Small Domains
University of Wollongong Research Online Applied Statistics Education and Research Collaboration (ASEARC) - Conference Papers Faculty of Engineering and Information Sciences 2011 Contextual Effects in
More informationMeasurement error as missing data: the case of epidemiologic assays. Roderick J. Little
Measurement error as missing data: the case of epidemiologic assays Roderick J. Little Outline Discuss two related calibration topics where classical methods are deficient (A) Limit of quantification methods
More informationChapter 5: Models used in conjunction with sampling. J. Kim, W. Fuller (ISU) Chapter 5: Models used in conjunction with sampling 1 / 70
Chapter 5: Models used in conjunction with sampling J. Kim, W. Fuller (ISU) Chapter 5: Models used in conjunction with sampling 1 / 70 Nonresponse Unit Nonresponse: weight adjustment Item Nonresponse:
More informationFrequentist Accuracy of Bayesian Estimates
Frequentist Accuracy of Bayesian Estimates Bradley Efron Stanford University RSS Journal Webinar Objective Bayesian Inference Probability family F = {f µ (x), µ Ω} Parameter of interest: θ = t(µ) Prior
More informationPlausible Values for Latent Variables Using Mplus
Plausible Values for Latent Variables Using Mplus Tihomir Asparouhov and Bengt Muthén August 21, 2010 1 1 Introduction Plausible values are imputed values for latent variables. All latent variables can
More informationBayesian performance
Bayesian performance Frequentist properties of estimators refer to the performance of an estimator (say the posterior mean) over repeated experiments under the same conditions. The posterior distribution
More informationEstimation of small-area proportions using covariates and survey data
Journal of Statistical Planning and Inference 112 (2003) 89 98 www.elsevier.com/locate/jspi Estimation of small-area proportions using covariates and survey data Michael D. Larsen Department of Statistics,
More informationEmpirical Bayes Analysis for a Hierarchical Negative Binomial Generalized Linear Model
International Journal of Contemporary Mathematical Sciences Vol. 9, 2014, no. 12, 591-612 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ijcms.2014.413 Empirical Bayes Analysis for a Hierarchical
More informationBayesian Inference and the Parametric Bootstrap. Bradley Efron Stanford University
Bayesian Inference and the Parametric Bootstrap Bradley Efron Stanford University Importance Sampling for Bayes Posterior Distribution Newton and Raftery (1994 JRSS-B) Nonparametric Bootstrap: good choice
More informationMaster s Written Examination
Master s Written Examination Option: Statistics and Probability Spring 05 Full points may be obtained for correct answers to eight questions Each numbered question (which may have several parts) is worth
More informationBayesian variable selection via. Penalized credible regions. Brian Reich, NCSU. Joint work with. Howard Bondell and Ander Wilson
Bayesian variable selection via penalized credible regions Brian Reich, NC State Joint work with Howard Bondell and Ander Wilson Brian Reich, NCSU Penalized credible regions 1 Motivation big p, small n
More informationStat 217 Final Exam. Name: May 1, 2002
Stat 217 Final Exam Name: May 1, 2002 Problem 1. Three brands of batteries are under study. It is suspected that the lives (in weeks) of the three brands are different. Five batteries of each brand are
More informationConfidence Interval Estimation of Small Area Parameters Shrinking Both Means and Variances
Confidence Interval Estimation of Small Area Parameters Shrinking Both Means and Variances Sarat C. Dass 1, Tapabrata Maiti 1, Hao Ren 1 and Samiran Sinha 2 1 Department of Statistics & Probability, Michigan
More informationModule 22: Bayesian Methods Lecture 9 A: Default prior selection
Module 22: Bayesian Methods Lecture 9 A: Default prior selection Peter Hoff Departments of Statistics and Biostatistics University of Washington Outline Jeffreys prior Unit information priors Empirical
More informationAccounting for Complex Sample Designs via Mixture Models
Accounting for Complex Sample Designs via Finite Normal Mixture Models 1 1 University of Michigan School of Public Health August 2009 Talk Outline 1 2 Accommodating Sampling Weights in Mixture Models 3
More informationCategorical Predictor Variables
Categorical Predictor Variables We often wish to use categorical (or qualitative) variables as covariates in a regression model. For binary variables (taking on only 2 values, e.g. sex), it is relatively
More informationMultivariate beta regression with application to small area estimation
Multivariate beta regression with application to small area estimation Debora Ferreira de Souza debora@dme.ufrj.br Fernando Antônio da Silva Moura fmoura@im.ufrj.br Departamento de Métodos Estatísticos
More informationLinear Models A linear model is defined by the expression
Linear Models A linear model is defined by the expression x = F β + ɛ. where x = (x 1, x 2,..., x n ) is vector of size n usually known as the response vector. β = (β 1, β 2,..., β p ) is the transpose
More informationBeyond GLM and likelihood
Stat 6620: Applied Linear Models Department of Statistics Western Michigan University Statistics curriculum Core knowledge (modeling and estimation) Math stat 1 (probability, distributions, convergence
More informationChapter 4: Imputation
Chapter 4: Imputation Jae-Kwang Kim Department of Statistics, Iowa State University Outline 1 Introduction 2 Basic Theory for imputation 3 Variance estimation after imputation 4 Replication variance estimation
More informationSTAT 3A03 Applied Regression With SAS Fall 2017
STAT 3A03 Applied Regression With SAS Fall 2017 Assignment 2 Solution Set Q. 1 I will add subscripts relating to the question part to the parameters and their estimates as well as the errors and residuals.
More informationEstimation and Model Selection in Mixed Effects Models Part I. Adeline Samson 1
Estimation and Model Selection in Mixed Effects Models Part I Adeline Samson 1 1 University Paris Descartes Summer school 2009 - Lipari, Italy These slides are based on Marc Lavielle s slides Outline 1
More informationDomain Estimation of Survey Discontinuities
Domain Estimation of Survey Discontinuities Nikos Tzavidis 1 Joint work with Paul Smith (University of Southampton), Timo Schmid, Natalia Rojas-Perilla, (Freie Universität Berlin), Jan van den Brackel
More informationLecture 20 May 18, Empirical Bayes Interpretation [Efron & Morris 1973]
Stats 300C: Theory of Statistics Spring 2018 Lecture 20 May 18, 2018 Prof. Emmanuel Candes Scribe: Will Fithian and E. Candes 1 Outline 1. Stein s Phenomenon 2. Empirical Bayes Interpretation of James-Stein
More informationStat 579: Generalized Linear Models and Extensions
Stat 579: Generalized Linear Models and Extensions Linear Mixed Models for Longitudinal Data Yan Lu April, 2018, week 15 1 / 38 Data structure t1 t2 tn i 1st subject y 11 y 12 y 1n1 Experimental 2nd subject
More informationModel-based Estimation of Poverty Indicators for Small Areas: Overview. J. N. K. Rao Carleton University, Ottawa, Canada
Model-based Estimation of Poverty Indicators for Small Areas: Overview J. N. K. Rao Carleton University, Ottawa, Canada Isabel Molina Universidad Carlos III de Madrid, Spain Paper presented at The First
More informationLinear Mixed Models. One-way layout REML. Likelihood. Another perspective. Relationship to classical ideas. Drawbacks.
Linear Mixed Models One-way layout Y = Xβ + Zb + ɛ where X and Z are specified design matrices, β is a vector of fixed effect coefficients, b and ɛ are random, mean zero, Gaussian if needed. Usually think
More informationSmall area prediction based on unit level models when the covariate mean is measured with error
Graduate Theses and Dissertations Iowa State University Capstones, Theses and Dissertations 2015 Small area prediction based on unit level models when the covariate mean is measured with error Andreea
More informationSmall Area Modeling of County Estimates for Corn and Soybean Yields in the US
Small Area Modeling of County Estimates for Corn and Soybean Yields in the US Matt Williams National Agricultural Statistics Service United States Department of Agriculture Matt.Williams@nass.usda.gov
More informationNonparametric Small Area Estimation via M-quantile Regression using Penalized Splines
Nonparametric Small Estimation via M-quantile Regression using Penalized Splines Monica Pratesi 10 August 2008 Abstract The demand of reliable statistics for small areas, when only reduced sizes of the
More informationBiostatistics Advanced Methods in Biostatistics IV
Biostatistics 140.754 Advanced Methods in Biostatistics IV Jeffrey Leek Assistant Professor Department of Biostatistics jleek@jhsph.edu Lecture 12 1 / 36 Tip + Paper Tip: As a statistician the results
More informationBIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation
BIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation Yujin Chung November 29th, 2016 Fall 2016 Yujin Chung Lec13: MLE Fall 2016 1/24 Previous Parametric tests Mean comparisons (normality assumption)
More informationSome Curiosities Arising in Objective Bayesian Analysis
. Some Curiosities Arising in Objective Bayesian Analysis Jim Berger Duke University Statistical and Applied Mathematical Institute Yale University May 15, 2009 1 Three vignettes related to John s work
More informationChapter 01: Introduction to Small Area Estimation
Chapter 01: Introduction to Small Area Estimation Taps Maiti Department of Statistics & Probability Michigan State University Sept 23-24, 2013 Taps Maiti Department of Statistics & Probability Chapter
More informationABSTRACT. Nadarajasundaram Ganesh Doctor of Philosophy, 2007
ABSTRACT Title of dissertation: SMALL AREA ESTIMATION AND PREDICTION PROBLEMS: SPATIAL MODELS, BAYESIAN MULTIPLE COMPARISONS, AND ROBUST MSE ESTIMATION. Nadarajasundaram Ganesh Doctor of Philosophy, 007
More informationStatistical Inference Using Progressively Type-II Censored Data with Random Scheme
International Mathematical Forum, 3, 28, no. 35, 1713-1725 Statistical Inference Using Progressively Type-II Censored Data with Random Scheme Ammar M. Sarhan 1 and A. Abuammoh Department of Statistics
More informationLecture 13: More on Binary Data
Lecture 1: More on Binary Data Link functions for Binomial models Link η = g(π) π = g 1 (η) identity π η logarithmic log π e η logistic log ( π 1 π probit Φ 1 (π) Φ(η) log-log log( log π) exp( e η ) complementary
More informationSTAT 740: Testing & Model Selection
STAT 740: Testing & Model Selection Timothy Hanson Department of Statistics, University of South Carolina Stat 740: Statistical Computing 1 / 34 Testing & model choice, likelihood-based A common way to
More informationA union of Bayesian, frequentist and fiducial inferences by confidence distribution and artificial data sampling
A union of Bayesian, frequentist and fiducial inferences by confidence distribution and artificial data sampling Min-ge Xie Department of Statistics, Rutgers University Workshop on Higher-Order Asymptotics
More informationA BAYESIAN APPROACH TO JOINT SMALL AREA ESTIMATION
A BAYESIAN APPROACH TO JOINT SMALL AREA ESTIMATION A DISSERTATION SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL OF THE UNIVERSITY OF MINNESOTA BY Yanping Qu IN PARTIAL FULFILLMENT OF THE REQUIREMENTS
More informationModeling Real Estate Data using Quantile Regression
Modeling Real Estate Data using Semiparametric Quantile Regression Department of Statistics University of Innsbruck September 9th, 2011 Overview 1 Application: 2 3 4 Hedonic regression data for house prices
More informationApplied Regression. Applied Regression. Chapter 2 Simple Linear Regression. Hongcheng Li. April, 6, 2013
Applied Regression Chapter 2 Simple Linear Regression Hongcheng Li April, 6, 2013 Outline 1 Introduction of simple linear regression 2 Scatter plot 3 Simple linear regression model 4 Test of Hypothesis
More informationMS&E 226: Small Data
MS&E 226: Small Data Lecture 15: Examples of hypothesis tests (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 32 The recipe 2 / 32 The hypothesis testing recipe In this lecture we repeatedly apply the
More informationg-priors for Linear Regression
Stat60: Bayesian Modeling and Inference Lecture Date: March 15, 010 g-priors for Linear Regression Lecturer: Michael I. Jordan Scribe: Andrew H. Chan 1 Linear regression and g-priors In the last lecture,
More informationReview for Final Exam Stat 205: Statistics for the Life Sciences
Review for Final Exam Stat 205: Statistics for the Life Sciences Tim Hanson, Ph.D. University of South Carolina T. Hanson (USC) Stat 205: Statistics for the Life Sciences 1 / 20 Overview of Final Exam
More informationarxiv: v1 [math.st] 16 Jan 2017
arxiv:1701.04176v1 [math.st] 16 Jan 2017 A New Model Variance Estimator for an Area Level Small Area Model to Solve Multiple Problems Simultaneously Masayo Yoshimori Hirose The Institute of Statistical
More informationBiostat Methods STAT 5820/6910 Handout #9a: Intro. to Meta-Analysis Methods
Biostat Methods STAT 5820/6910 Handout #9a: Intro. to Meta-Analysis Methods Meta-analysis describes statistical approach to systematically combine results from multiple studies [identified follong an exhaustive
More informationFractional Imputation in Survey Sampling: A Comparative Review
Fractional Imputation in Survey Sampling: A Comparative Review Shu Yang Jae-Kwang Kim Iowa State University Joint Statistical Meetings, August 2015 Outline Introduction Fractional imputation Features Numerical
More information36. Multisample U-statistics and jointly distributed U-statistics Lehmann 6.1
36. Multisample U-statistics jointly distributed U-statistics Lehmann 6.1 In this topic, we generalize the idea of U-statistics in two different directions. First, we consider single U-statistics for situations
More informationEmpirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design
1 / 32 Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design Changbao Wu Department of Statistics and Actuarial Science University of Waterloo (Joint work with Min Chen and Mary
More informationOn the bias of the multiple-imputation variance estimator in survey sampling
J. R. Statist. Soc. B (2006) 68, Part 3, pp. 509 521 On the bias of the multiple-imputation variance estimator in survey sampling Jae Kwang Kim, Yonsei University, Seoul, Korea J. Michael Brick, Westat,
More informationLinear Regression. In this problem sheet, we consider the problem of linear regression with p predictors and one intercept,
Linear Regression In this problem sheet, we consider the problem of linear regression with p predictors and one intercept, y = Xβ + ɛ, where y t = (y 1,..., y n ) is the column vector of target values,
More informationVARIANCE ESTIMATION FOR NEAREST NEIGHBOR IMPUTATION FOR U.S. CENSUS LONG FORM DATA
Submitted to the Annals of Applied Statistics VARIANCE ESTIMATION FOR NEAREST NEIGHBOR IMPUTATION FOR U.S. CENSUS LONG FORM DATA By Jae Kwang Kim, Wayne A. Fuller and William R. Bell Iowa State University
More informationFunctional Mixed Effects Spectral Analysis
Joint with Robert Krafty and Martica Hall June 4, 2014 Outline Introduction Motivating example Brief review Functional mixed effects spectral analysis Estimation Procedure Application Remarks Introduction
More informationObnoxious lateness humor
Obnoxious lateness humor 1 Using Bayesian Model Averaging For Addressing Model Uncertainty in Environmental Risk Assessment Louise Ryan and Melissa Whitney Department of Biostatistics Harvard School of
More informationIEOR E4703: Monte-Carlo Simulation
IEOR E4703: Monte-Carlo Simulation Output Analysis for Monte-Carlo Martin Haugh Department of Industrial Engineering and Operations Research Columbia University Email: martin.b.haugh@gmail.com Output Analysis
More informationStatistical Analysis of List Experiments
Statistical Analysis of List Experiments Graeme Blair Kosuke Imai Princeton University December 17, 2010 Blair and Imai (Princeton) List Experiments Political Methodology Seminar 1 / 32 Motivation Surveys
More informationSampling and Estimation in Network Graphs
Sampling and Estimation in Network Graphs Gonzalo Mateos Dept. of ECE and Goergen Institute for Data Science University of Rochester gmateosb@ece.rochester.edu http://www.ece.rochester.edu/~gmateosb/ March
More informationSTAT 704 Sections IRLS and Bootstrap
STAT 704 Sections 11.4-11.5. IRLS and John Grego Department of Statistics, University of South Carolina Stat 704: Data Analysis I 1 / 14 LOWESS IRLS LOWESS LOWESS (LOcally WEighted Scatterplot Smoothing)
More informationOn Small Area Prediction Interval Problems
On Small Area Prediction Interval Problems Snigdhansu Chatterjee, Parthasarathi Lahiri, Huilin Li University of Minnesota, University of Maryland, University of Maryland Abstract Empirical best linear
More informationOutline of GLMs. Definitions
Outline of GLMs Definitions This is a short outline of GLM details, adapted from the book Nonparametric Regression and Generalized Linear Models, by Green and Silverman. The responses Y i have density
More informationFE670 Algorithmic Trading Strategies. Stevens Institute of Technology
FE670 Algorithmic Trading Strategies Lecture 8. Robust Portfolio Optimization Steve Yang Stevens Institute of Technology 10/17/2013 Outline 1 Robust Mean-Variance Formulations 2 Uncertain in Expected Return
More informationLASSO-Type Penalization in the Framework of Generalized Additive Models for Location, Scale and Shape
LASSO-Type Penalization in the Framework of Generalized Additive Models for Location, Scale and Shape Nikolaus Umlauf https://eeecon.uibk.ac.at/~umlauf/ Overview Joint work with Andreas Groll, Julien Hambuckers
More informationMulti-level Models: Idea
Review of 140.656 Review Introduction to multi-level models The two-stage normal-normal model Two-stage linear models with random effects Three-stage linear models Two-stage logistic regression with random
More informationModel Specification Testing in Nonparametric and Semiparametric Time Series Econometrics. Jiti Gao
Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics Jiti Gao Department of Statistics School of Mathematics and Statistics The University of Western Australia Crawley
More informationLinear Regression Models P8111
Linear Regression Models P8111 Lecture 25 Jeff Goldsmith April 26, 2016 1 of 37 Today s Lecture Logistic regression / GLMs Model framework Interpretation Estimation 2 of 37 Linear regression Course started
More informationGood Confidence Intervals for Categorical Data Analyses. Alan Agresti
Good Confidence Intervals for Categorical Data Analyses Alan Agresti Department of Statistics, University of Florida visiting Statistics Department, Harvard University LSHTM, July 22, 2011 p. 1/36 Outline
More informationDefault Priors and Effcient Posterior Computation in Bayesian
Default Priors and Effcient Posterior Computation in Bayesian Factor Analysis January 16, 2010 Presented by Eric Wang, Duke University Background and Motivation A Brief Review of Parameter Expansion Literature
More information1. Fisher Information
1. Fisher Information Let f(x θ) be a density function with the property that log f(x θ) is differentiable in θ throughout the open p-dimensional parameter set Θ R p ; then the score statistic (or score
More informationInterval estimation. October 3, Basic ideas CLT and CI CI for a population mean CI for a population proportion CI for a Normal mean
Interval estimation October 3, 2018 STAT 151 Class 7 Slide 1 Pandemic data Treatment outcome, X, from n = 100 patients in a pandemic: 1 = recovered and 0 = not recovered 1 1 1 0 0 0 1 1 1 0 0 1 0 1 0 0
More informationSimple and Multiple Linear Regression
Sta. 113 Chapter 12 and 13 of Devore March 12, 2010 Table of contents 1 Simple Linear Regression 2 Model Simple Linear Regression A simple linear regression model is given by Y = β 0 + β 1 x + ɛ where
More informationMiscellanea A note on multiple imputation under complex sampling
Biometrika (2017), 104, 1,pp. 221 228 doi: 10.1093/biomet/asw058 Printed in Great Britain Advance Access publication 3 January 2017 Miscellanea A note on multiple imputation under complex sampling BY J.
More informationChapter 10. Semi-Supervised Learning
Chapter 10. Semi-Supervised Learning Wei Pan Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN 55455 Email: weip@biostat.umn.edu PubH 7475/8475 c Wei Pan Outline
More informationNonparametric Small Area Estimation Using Penalized Spline Regression
Nonparametric Small Area Estimation Using Penalized Spline Regression 0verview Spline-based nonparametric regression Nonparametric small area estimation Prediction mean squared error Bootstrapping small
More informationMAT2377. Rafa l Kulik. Version 2015/November/26. Rafa l Kulik
MAT2377 Rafa l Kulik Version 2015/November/26 Rafa l Kulik Bivariate data and scatterplot Data: Hydrocarbon level (x) and Oxygen level (y): x: 0.99, 1.02, 1.15, 1.29, 1.46, 1.36, 0.87, 1.23, 1.55, 1.40,
More informationRegression Discontinuity Designs in Stata
Regression Discontinuity Designs in Stata Matias D. Cattaneo University of Michigan July 30, 2015 Overview Main goal: learn about treatment effect of policy or intervention. If treatment randomization
More informationThe Multilevel Logit Model for Binary Dependent Variables Marco R. Steenbergen
The Multilevel Logit Model for Binary Dependent Variables Marco R. Steenbergen January 23-24, 2012 Page 1 Part I The Single Level Logit Model: A Review Motivating Example Imagine we are interested in voting
More informationMultivariate Survival Analysis
Multivariate Survival Analysis Previously we have assumed that either (X i, δ i ) or (X i, δ i, Z i ), i = 1,..., n, are i.i.d.. This may not always be the case. Multivariate survival data can arise in
More informationPractical Econometrics. for. Finance and Economics. (Econometrics 2)
Practical Econometrics for Finance and Economics (Econometrics 2) Seppo Pynnönen and Bernd Pape Department of Mathematics and Statistics, University of Vaasa 1. Introduction 1.1 Econometrics Econometrics
More informationBayesian methods in economics and finance
1/26 Bayesian methods in economics and finance Linear regression: Bayesian model selection and sparsity priors Linear Regression 2/26 Linear regression Model for relationship between (several) independent
More informationCOS513 LECTURE 8 STATISTICAL CONCEPTS
COS513 LECTURE 8 STATISTICAL CONCEPTS NIKOLAI SLAVOV AND ANKUR PARIKH 1. MAKING MEANINGFUL STATEMENTS FROM JOINT PROBABILITY DISTRIBUTIONS. A graphical model (GM) represents a family of probability distributions
More informationStatement: With my signature I confirm that the solutions are the product of my own work. Name: Signature:.
MATHEMATICAL STATISTICS Homework assignment Instructions Please turn in the homework with this cover page. You do not need to edit the solutions. Just make sure the handwriting is legible. You may discuss
More informationAnswer Key for STAT 200B HW No. 7
Answer Key for STAT 200B HW No. 7 May 5, 2007 Problem 2.2 p. 649 Assuming binomial 2-sample model ˆπ =.75, ˆπ 2 =.6. a ˆτ = ˆπ 2 ˆπ =.5. From Ex. 2.5a on page 644: ˆπ ˆπ + ˆπ 2 ˆπ 2.75.25.6.4 = + =.087;
More informationA Resampling Method on Pivotal Estimating Functions
A Resampling Method on Pivotal Estimating Functions Kun Nie Biostat 277,Winter 2004 March 17, 2004 Outline Introduction A General Resampling Method Examples - Quantile Regression -Rank Regression -Simulation
More information