Model-assisted Estimation of Forest Resources with Generalized Additive Models

Size: px
Start display at page:

Download "Model-assisted Estimation of Forest Resources with Generalized Additive Models"

Transcription

1 Model-assisted Estimation of Forest Resources with Generalized Additive Models Jean Opsomer, Jay Breidt, Gretchen Moisen, Göran Kauermann August 9,

2 Outline 1. Forest surveys 2. Sampling from spatial domain 3. Model-assisted estimation 4. GAM estimation for forest inventory data 5. Variance estimation for systematic samples 2

3 Research Project Collaboration between academic and US Forest Service statisticians Goal: apply on-going modeling efforts by Forest Service staff to improve efficiency of survey estimators 3

4 1. Forest Inventory and Analysis (FIA) Forest Inventory and Analysis is annual survey of all forest lands in US Multi-phase survey, including field visits phase with approximately 1 plot/ 6,000 acres Expensive: $68million in 2004 (nation-wide) 4

5 Inference for Surveys? Specific Inference expensive, high quality targeted to specific application and/or scientific question using custom-built method (or model) to achieve best possible estimator for particular variable(s) willing to defend estimates/ inference 5

6 Inference for Surveys Generic Inference cheap, reasonable quality, good for many purposes using method appropriate for large number of variables provide reasonable answers to many possible scientific questions validity of estimates resistant to model misspecification; model independent C o r n F l a k e s N E T W T. 1 2 O Z. 6

7 Survey Estimation Classical methods depend only on sampling design (Horvitz-Thompson; Hájek) Improved methods are still design-based but take advantage of auxiliary information ratio, regression, post-stratification model-assisted (Särndal et al, 1992) calibration (Deville and Särndal, 1992) nonparametric (Breidt and Opsomer, 2000), nonlinear/generalized (Wu and Sitter, 2001),... 7

8 Current Dataset 2.5 million ha ecological region in Utah Contains 968 FIA field plots on 5x5km grid FIA plots embedded in 24,980 remote sensing locations on 1x1km grid 8

9 Current Dataset (2) Remote Sensing Variables Elevation Slope Aspect Location Vegetation Index TM spectral bands Field Plot Variables Forest/non-forest Total wood volume Tree basal area Biomass Percent crown cover Mean diameter... 9

10 Systematic Sampling Common in natural resource and other spatial surveys Advantages: Simple to implement, intuitive Easy to nest within GIS environment Ensures proportional representation of domains Optimal for certain stationary processes 10

11 Systematic Sampling (2) Disadvantages Inflexible, can miss rare features in region Does not capture spatial relationships at fine scales (modeling) No design-based variance estimator 11

12 2. Sampling from Spatial Domain Phase I sample is systematic from continuous domain U D = [0, L 1 ] [0, L 2 ] Phase II sample is G 1 G 2 systematic (discrete) sub-sample of G 1 G 1 Conditional on, only 25 possible phase II sample 12

13 Sampling from Spatial Domain (2) Phase I sample G 1 (u), with u = (u 1, u 2 ) uniform random variable on [0, 1] [0, 1] and sampling intervals(δ 1, δ 2 ) G 1 (u) = {(u 1 + i 1 )δ 1, (u 2 + i 2 )δ 2 ) : i 1, i 2 = 0, 1,...} Phase II sample G 2 (u, d), where d = (d 1, d 2 ) discrete uniform on [1, 2,..., h 1 ] [1, 2,..., h 2 ] G 2 (u, d) = {(u 1 + d 1 + j 1 h 1 )δ 1, (u 2 + d 2 + j 2 h 2 )δ 2 ) : j 1, j 2 = 0, 1,...} 13

14 Population Characteristics Interested in estimating finite population total for variable z(v) on D θ z = D z(v)dv Total θ z can be gridded into cells D i1 i 2 θ z = i 1 i 2 D i1i2 z(v)dv = δ 1 δ 2 [0,1] [0,1] 14 s G 1 (u) z(s)du

15 Survey Estimation Phase I expansion estimator ˆθ 1z (u) = z(s) s G 1 (u) 1/(δ 1 δ 2 ) (unfeasible for Phase II variables) Two-phase expansion estimator ˆθ 2z (u, d) = s G 2 (u,d) z(s) 1/(δ 1 δ 2 h 1 h 2 ) Both unbiased, have exact variance formula 15

16 3. Model-Assisted Estimation Variables X(v) observed on Phase I can improve precision of survey estimators for Phase II variables Model-assisted approach provides convenient framework for incorporating auxiliary information within design-based (generic) inference 16

17 Model-Assisted Estimation (2) 1. Assume working model E ξ (z(v)) = µ(x(v)) 2. Fit model on {z(s), X(s) : s G 2 (u, d)} to predict ˆµ(s), s G 1 (u) 3. Construct model-assisted estimator ˆθ MA,z = s G 1 (u) ˆµ(s) 1/(δ 1 δ 2 ) + s G 2 (u,d) z(s) ˆµ(s) 1/(δ 1 δ 2 h 1 h 2 ) 17

18 Properties of Model- Assisted Estimator Estimator ˆθ MA,z is approximately design unbiased for large classes of models, with approximate design variance Var(ˆθ MA,z ) Var(ˆθ 1z (u)) + D 2 n 2 2 ( 1 1 h 1 h 2 ) E(S 2 (u)) S 2 1 (u) = h 1 h 2 1 t d1 d (u) = 2 h 1 d 1 =1 h 2 d 2 =1 (t d1 d 2 (u) t(u)) 2 (z(s) ˆµ(s)) s G 2 (u,d) 18

19 Applying Model- Assisted Estimation In typical survey context, many variables of interest instead of single z(v) Express estimator ˆθ MA,z = ˆθ MA,z in the form w(s)z(s) s G 2 (u,d) (automatic for linear estimators) Survey variables related to Phase I variables X(v) will benefit from improved efficiency 19

20 4. Estimation for Forest Inventory Data Forest Service researchers are investigating predictive models for forest characteristics based on remote sensing data Key variable in this survey: FOREST indicator I FOR (v) Many other variables not recorded when I FOR (v) = 0 20

21 GAM Variables for FOREST (X,Y) ELEV90CU TRASP90 SLP90CU coordinates (bivariate) elevation aspect (transformed) slope MRLCOOB5 TM satellite band 5 NDVI NLDC7 vegetation index (TM) vegetation classes (TM) 21

22 GAM Model for FOREST Model E ξ (I FOR (v)) µ FOR (v) = g(m 1 (x 1 (v)) m 6 (x 6 (v)) + x 7 (v) β) with g( ) logistic link and x k (v) Phase I variables Fitted in S-Plus using gam() with lo() smoothers, to obtain prediction ˆµ FOR (s) for s G 1 (u) 22

23 FOREST Model Components lo(xs, Ys, span = 0.5) Ys Xs s(elev90cu, df = 4) ELEV90CU s(trasp90, df = 4) s(slp90cu, df = 4) TRASP SLP90CU s(mrlc00b5, df = 4) s(ndvi, df = 4) MRLC00B NDVI

24 Other Phase II Variables NVOLTOT BA BIOMASS total wood volume (cuft/acre) tree basal area (per acre) total wood biomass (ton/acre) CRCOV percent crown cover (%) QMDALL quadratic mean diameter (in) 24

25 Modeling Other Variables Approaches considered: 1. Classical model-assisted 2. Model-assisted with FOREST prediction as auxiliary variable 3. Model-assisted with FOREST prediction indicator 25

26 We didn t do Classical model-assisted E ξ (z(v)) = m 1 (x 1 (v)) m 6 (x 6 (v)) with +x 7 (v) β m 1 ( ),..., m 6 ( ) parametric or nonparametric 2. Model-assisted with FOREST prediction as auxiliary variable E ξ (z(v)) = m 1 (x 1 (v)) m 6 (x 6 (v)) +x 7 (v) β + ˆµ FOR (v)γ 26

27 Selected Method Construct indicator for FOREST prediction { 1 if ˆµFOR (v) ˆθ 2,FOR (u, d)/ D Î FOR (v) = 0 otherwise and FOREST-interaction Phase I variables x k F (v) = ÎFOR(v)x k (v) Working model E ξ (z(v) µ(x(v)) = X F (v)β model predicts 0 whenever 27 Î FOR (v) = 0

28 Selected Method (2) Estimator constructed as ˆθ MA,z = s G 1 (u) ˆµ(s) 1/(δ 1 δ 2 ) + s G 2 (u,d) z(s) ˆµ(s) 1/(δ 1 δ 2 h 1 h 2 ) = w(s)z(s) s G 2 (u,d) Only approximately linear, due to presence of Î FOR (v) on RHS 28

29 Calibration Properties Weights w(s) calibrated for Phase I totals of auxiliary variables x k F (v) x k F (s) w(s)x k F (s) = s G 2 (u,d) s G 1 (u) 1/(δ 1 δ 2 ) and approximately for ˆµ FOR (v) and x k (v) Estimators for domains U h D improved over simpler estimators, by incorporating forest/ non-forest prediction locally 29

30 Comparing Estimators 1. EXP: expansion estimator ˆθ 2z (u, d) 2. PS: Model-assisted using NLDC7 categories only (=post-stratified, current FIA method) 3. REG: Model-assisted with linear model using Phase I variables 4. GAM/REGI: Model-assisted with linear model using Phase I variables interacted with GAM forest/non-forest prediction 30

31 Results Estimated Est. Relative Study Estimated Standard Efficiency of Variable Estimator Mean Error GAM/REGI FOREST EXP (forest/ PS non-forest REG binary) GAM NVOLTOT EXP (total wood PS volume in REG cuft/acre) REGI BA EXP (tree basal PS area per REG acre) REGI BIOMASS EXP (total wood PS biomass in REG tons/acre) REGI CRCOV EXP (percent PS crown REG cover) REGI QMDALL EXP (quadratic PS mean diameter REG in inches) REGI

32 5. Variance Estimation for Systematic Samples No design-based variance estimator for systematic sampling in each phase, sample contains only one ˆθ 1z (u) = of all possible grids in population/phase s G 1 (u) θ z = [0,1] [0,1] z(s) 1/(δ 1 δ 2 ) s G 1 (u) ˆθ 2z (u, d) = z(s) 1/(δ 1 δ 2 ) du s G 2 (u,d) z(s) 1/(δ 1 δ 2 h 1 h 2 ) 32

33 Simple Random Sampling Approximation Efficiency comparison relied on simple random sampling approximation for variance estimation For large numbers of possible samples and stationary populations, approximation is good on average Deviations can be significant for individual samples 33

34 Simple Random Sampling Approximation (2) Stationarity is reasonable for modelassisted estimators (model removes trend) But: only 25 possible samples in Phase II Approximate variance relies on asymptotic arguments 34

35 Alternative Approach to Assess Efficiency Gains Ignore Phase I variance component Var(ˆθ 1z (u)): identical across all estimators Generate synthetic population and compute exact Phase II variance over 25 samples avoid both asymptotic and simple random sampling approximations depends on appropriateness of model 35

36 Synthetic Population Higher order polynomial models fitted to sample data logistic model for FOREST remaining variables fitted only on locations with I FOR (v) = 1 Predict variables for Phase I Sample means of variables approximately match those of the original population 36

37 Approximation Bias Simulated Variable FOREST (forest/ non-forest binary) NVOLTOT (total wood volume in cuft/acre) BA (tree basal area per acre) BIOMASS (total wood biomass in tons/acre) CRCOV (percent crown cover) QMDALL (quadratic mean diameter in inches) Estimator EXP PS REG GAM EXP PS REG REGI EXP PS REG REGI EXP PS REG REGI EXP PS REG REGI EXP PS REG REGI Percent Bias of Variance Estimator

38 Relative Efficiency Relative Simulated Efficiency of Variable Estimator GAM/REGI FOREST EXP 4.51 (forest/ PS 3.13 non-forest binary) GAM REG 1.92 NVOLTOT binary) GAM EXP 1.31 (total wood PS 1.14 volume in REG 1.07 cuft/acre) BA REGI EXP 2.02 (tree basal PS 1.55 area per REG 1.19 BIOMASS acre) REGI EXP 1.67 (total wood PS 1.12 biomass in REG 1.14 tons/acre) CRCOV REGI EXP 1.49 (percent PS 1.36 cover) crown REGI 1.17 QMDALL cover) REGI EXP 2.55 (quadratic PS 1.79 meanindiameter inches) REGI 1.20

39 6. Conclusions: modelassisted estimation Model-assisted framework provides flexible approach to incorporate sophisticated models in survey estimation Nonparametric models make it possible to capture complex patterns in forest resource data Use on-going spatial modeling efforts by forestry researchers to improve tabular data 39

40 6. Conclusions: systematic sampling Systematic sampling is popular in natural resource surveys, but does not allow for a design-based variance estimator Synthetic population approach provided ad hoc solution in this case On-going research: predict design-based variance under nonparametric model (Li, 2006) 40

41 Almost-final version of this paper available at Contact info:

Model-assisted Estimation of Forest Resources with Generalized Additive Models

Model-assisted Estimation of Forest Resources with Generalized Additive Models Model-assisted Estimation of Forest Resources with Generalized Additive Models Jean D. Opsomer, F. Jay Breidt, Gretchen G. Moisen, and Göran Kauermann March 26, 2003 Abstract Multi-phase surveys are often

More information

Model-Assisted Estimation of Forest Resources With Generalized Additive Models

Model-Assisted Estimation of Forest Resources With Generalized Additive Models Model-Assisted Estimation of Forest Resources With Generalized Additive Models Jean D. OPSOMER, F.JayBREIDT, Gretchen G. MOISEN, and Göran KAUERMANN Multiphase surveys are often conducted in forest inventories,

More information

Two Applications of Nonparametric Regression in Survey Estimation

Two Applications of Nonparametric Regression in Survey Estimation Two Applications of Nonparametric Regression in Survey Estimation 1/56 Jean Opsomer Iowa State University Joint work with Jay Breidt, Colorado State University Gerda Claeskens, Université Catholique de

More information

Additional results for model-based nonparametric variance estimation for systematic sampling in a forestry survey

Additional results for model-based nonparametric variance estimation for systematic sampling in a forestry survey Additional results for model-based nonparametric variance estimation for systematic sampling in a forestry survey J.D. Opsomer Colorado State University M. Francisco-Fernández Universidad de A Coruña July

More information

F. Jay Breidt Colorado State University

F. Jay Breidt Colorado State University Model-assisted survey regression estimation with the lasso 1 F. Jay Breidt Colorado State University Opening Workshop on Computational Methods in Social Sciences SAMSI August 2013 This research was supported

More information

Finite Population Sampling and Inference

Finite Population Sampling and Inference Finite Population Sampling and Inference A Prediction Approach RICHARD VALLIANT ALAN H. DORFMAN RICHARD M. ROYALL A Wiley-Interscience Publication JOHN WILEY & SONS, INC. New York Chichester Weinheim Brisbane

More information

REPLICATION VARIANCE ESTIMATION FOR THE NATIONAL RESOURCES INVENTORY

REPLICATION VARIANCE ESTIMATION FOR THE NATIONAL RESOURCES INVENTORY REPLICATION VARIANCE ESTIMATION FOR THE NATIONAL RESOURCES INVENTORY J.D. Opsomer, W.A. Fuller and X. Li Iowa State University, Ames, IA 50011, USA 1. Introduction Replication methods are often used in

More information

Data Integration for Big Data Analysis for finite population inference

Data Integration for Big Data Analysis for finite population inference for Big Data Analysis for finite population inference Jae-kwang Kim ISU January 23, 2018 1 / 36 What is big data? 2 / 36 Data do not speak for themselves Knowledge Reproducibility Information Intepretation

More information

Nonparametric Regression Estimation of Finite Population Totals under Two-Stage Sampling

Nonparametric Regression Estimation of Finite Population Totals under Two-Stage Sampling Nonparametric Regression Estimation of Finite Population Totals under Two-Stage Sampling Ji-Yeon Kim Iowa State University F. Jay Breidt Colorado State University Jean D. Opsomer Colorado State University

More information

NONPARAMETRIC ENDOGENOUS POST-STRATIFICATION ESTIMATION

NONPARAMETRIC ENDOGENOUS POST-STRATIFICATION ESTIMATION Statistica Sinica 2011): Preprint 1 NONPARAMETRIC ENDOGENOUS POST-STRATIFICATION ESTIMATION Mark Dahlke 1, F. Jay Breidt 1, Jean D. Opsomer 1 and Ingrid Van Keilegom 2 1 Colorado State University and 2

More information

A Forest Inventory ESTimation and Analysis tool. Tracey Frescino Paul Patterson Elizabeth Freeman Gretchen Moisen

A Forest Inventory ESTimation and Analysis tool. Tracey Frescino Paul Patterson Elizabeth Freeman Gretchen Moisen A Forest Inventory ESTimation and Analysis tool Tracey Frescino Paul Patterson Elizabeth Freeman Gretchen Moisen FIA s Data and Estimation Tools FIA DataMart - Web-based tool to access FIA database through

More information

Small Area Modeling of County Estimates for Corn and Soybean Yields in the US

Small Area Modeling of County Estimates for Corn and Soybean Yields in the US Small Area Modeling of County Estimates for Corn and Soybean Yields in the US Matt Williams National Agricultural Statistics Service United States Department of Agriculture Matt.Williams@nass.usda.gov

More information

Empirical Likelihood Methods for Sample Survey Data: An Overview

Empirical Likelihood Methods for Sample Survey Data: An Overview AUSTRIAN JOURNAL OF STATISTICS Volume 35 (2006), Number 2&3, 191 196 Empirical Likelihood Methods for Sample Survey Data: An Overview J. N. K. Rao Carleton University, Ottawa, Canada Abstract: The use

More information

Penalized Balanced Sampling. Jay Breidt

Penalized Balanced Sampling. Jay Breidt Penalized Balanced Sampling Jay Breidt Colorado State University Joint work with Guillaume Chauvet (ENSAI) February 4, 2010 1 / 44 Linear Mixed Models Let U = {1, 2,...,N}. Consider linear mixed models

More information

Comments on Design-Based Prediction Using Auxilliary Information under Random Permutation Models (by Wenjun Li (5/21/03) Ed Stanek

Comments on Design-Based Prediction Using Auxilliary Information under Random Permutation Models (by Wenjun Li (5/21/03) Ed Stanek Comments on Design-Based Prediction Using Auxilliary Information under Random Permutation Models (by Wenjun Li (5/2/03) Ed Stanek Here are comments on the Draft Manuscript. They are all suggestions that

More information

A comparison of stratified simple random sampling and sampling with probability proportional to size

A comparison of stratified simple random sampling and sampling with probability proportional to size A comparison of stratified simple random sampling and sampling with probability proportional to size Edgar Bueno Dan Hedlin Per Gösta Andersson 1 Introduction When planning the sampling strategy (i.e.

More information

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Jae-Kwang Kim 1 Iowa State University June 26, 2013 1 Joint work with Shu Yang Introduction 1 Introduction

More information

Spatial Lasso with Application to GIS Model Selection. F. Jay Breidt Colorado State University

Spatial Lasso with Application to GIS Model Selection. F. Jay Breidt Colorado State University Spatial Lasso with Application to GIS Model Selection F. Jay Breidt Colorado State University with Hsin-Cheng Huang, Nan-Jung Hsu, and Dave Theobald September 25 The work reported here was developed under

More information

Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design

Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design 1 / 32 Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design Changbao Wu Department of Statistics and Actuarial Science University of Waterloo (Joint work with Min Chen and Mary

More information

Empirical Likelihood Methods

Empirical Likelihood Methods Handbook of Statistics, Volume 29 Sample Surveys: Theory, Methods and Inference Empirical Likelihood Methods J.N.K. Rao and Changbao Wu (February 14, 2008, Final Version) 1 Likelihood-based Approaches

More information

Why do we Care About Forest Sampling?

Why do we Care About Forest Sampling? Fundamentals of Forest Sampling Why Forest Sampling Sampling Theory Terminology Why Use a Sample? Readings: Avery and Burkhart Sampling Chapters Elzinga (website) USDA Sampling Handbook 232 (website) Why

More information

Fractional Imputation in Survey Sampling: A Comparative Review

Fractional Imputation in Survey Sampling: A Comparative Review Fractional Imputation in Survey Sampling: A Comparative Review Shu Yang Jae-Kwang Kim Iowa State University Joint Statistical Meetings, August 2015 Outline Introduction Fractional imputation Features Numerical

More information

Propensity Score Methods for Causal Inference

Propensity Score Methods for Causal Inference John Pura BIOS790 October 2, 2015 Causal inference Philosophical problem, statistical solution Important in various disciplines (e.g. Koch s postulates, Bradford Hill criteria, Granger causality) Good

More information

Likelihood-Based Methods

Likelihood-Based Methods Likelihood-Based Methods Handbook of Spatial Statistics, Chapter 4 Susheela Singh September 22, 2016 OVERVIEW INTRODUCTION MAXIMUM LIKELIHOOD ESTIMATION (ML) RESTRICTED MAXIMUM LIKELIHOOD ESTIMATION (REML)

More information

arxiv: v2 [math.st] 20 Jun 2014

arxiv: v2 [math.st] 20 Jun 2014 A solution in small area estimation problems Andrius Čiginas and Tomas Rudys Vilnius University Institute of Mathematics and Informatics, LT-08663 Vilnius, Lithuania arxiv:1306.2814v2 [math.st] 20 Jun

More information

Model Assisted Survey Sampling

Model Assisted Survey Sampling Carl-Erik Sarndal Jan Wretman Bengt Swensson Model Assisted Survey Sampling Springer Preface v PARTI Principles of Estimation for Finite Populations and Important Sampling Designs CHAPTER 1 Survey Sampling

More information

Remote Sensing of Environment

Remote Sensing of Environment Remote Sensing of Environment 128 (2013) 268 275 Contents lists available at SciVerse ScienceDirect Remote Sensing of Environment journal homepage: www.elsevier.com/locate/rse Inference for lidar-assisted

More information

Biostat 2065 Analysis of Incomplete Data

Biostat 2065 Analysis of Incomplete Data Biostat 2065 Analysis of Incomplete Data Gong Tang Dept of Biostatistics University of Pittsburgh September 13 & 15, 2005 1. Complete-case analysis (I) Complete-case analysis refers to analysis based on

More information

Combining data from two independent surveys: model-assisted approach

Combining data from two independent surveys: model-assisted approach Combining data from two independent surveys: model-assisted approach Jae Kwang Kim 1 Iowa State University January 20, 2012 1 Joint work with J.N.K. Rao, Carleton University Reference Kim, J.K. and Rao,

More information

New estimation methodology for the Norwegian Labour Force Survey

New estimation methodology for the Norwegian Labour Force Survey Notater Documents 2018/16 Melike Oguz-Alper New estimation methodology for the Norwegian Labour Force Survey Documents 2018/16 Melike Oguz Alper New estimation methodology for the Norwegian Labour Force

More information

Nonparametric Small Area Estimation Using Penalized Spline Regression

Nonparametric Small Area Estimation Using Penalized Spline Regression Nonparametric Small Area Estimation Using Penalized Spline Regression 0verview Spline-based nonparametric regression Nonparametric small area estimation Prediction mean squared error Bootstrapping small

More information

Cross-validation in model-assisted estimation

Cross-validation in model-assisted estimation Graduate Theses and Dissertations Iowa State University Capstones, Theses and Dissertations 009 Cross-validation in model-assisted estimation Lifeng You Iowa State University Follow this and additional

More information

Bridging the two cultures: Latent variable statistical modeling with boosted regression trees

Bridging the two cultures: Latent variable statistical modeling with boosted regression trees Bridging the two cultures: Latent variable statistical modeling with boosted regression trees Thomas G. Dietterich and Rebecca Hutchinson Oregon State University Corvallis, Oregon, USA 1 A Species Distribution

More information

Spatial bias modeling with application to assessing remotely-sensed aerosol as a proxy for particulate matter

Spatial bias modeling with application to assessing remotely-sensed aerosol as a proxy for particulate matter Spatial bias modeling with application to assessing remotely-sensed aerosol as a proxy for particulate matter Chris Paciorek Department of Biostatistics Harvard School of Public Health application joint

More information

One-phase estimation techniques

One-phase estimation techniques One-phase estimation techniques Based on Horwitz-Thompson theorem for continuous populations Radim Adolt ÚHÚL Brandýs nad Labem, Czech Republic USEWOOD WG2, Training school in Dublin, 16.-19. September

More information

Optimal Auxiliary Variable Assisted Two-Phase Sampling Designs

Optimal Auxiliary Variable Assisted Two-Phase Sampling Designs MASTER S THESIS Optimal Auxiliary Variable Assisted Two-Phase Sampling Designs HENRIK IMBERG Department of Mathematical Sciences Division of Mathematical Statistics CHALMERS UNIVERSITY OF TECHNOLOGY UNIVERSITY

More information

Modification and Improvement of Empirical Likelihood for Missing Response Problem

Modification and Improvement of Empirical Likelihood for Missing Response Problem UW Biostatistics Working Paper Series 12-30-2010 Modification and Improvement of Empirical Likelihood for Missing Response Problem Kwun Chuen Gary Chan University of Washington - Seattle Campus, kcgchan@u.washington.edu

More information

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Kosuke Imai Department of Politics Princeton University November 13, 2013 So far, we have essentially assumed

More information

Introduction to Statistical Analysis

Introduction to Statistical Analysis Introduction to Statistical Analysis Changyu Shen Richard A. and Susan F. Smith Center for Outcomes Research in Cardiology Beth Israel Deaconess Medical Center Harvard Medical School Objectives Descriptive

More information

Comparison of Imputation Procedures for Replacing Denied-access Plots

Comparison of Imputation Procedures for Replacing Denied-access Plots Comparison of Imputation Procedures for Replacing Denied-access Plots Susan L. King 1 Abstract. In forest inventories, missing plots are caused by hazardous terrain, inaccessible locations, or denied access.

More information

Hierarchical Modelling for Multivariate Spatial Data

Hierarchical Modelling for Multivariate Spatial Data Hierarchical Modelling for Multivariate Spatial Data Geography 890, Hierarchical Bayesian Models for Environmental Spatial Data Analysis February 15, 2011 1 Point-referenced spatial data often come as

More information

Combining multiple observational data sources to estimate causal eects

Combining multiple observational data sources to estimate causal eects Department of Statistics, North Carolina State University Combining multiple observational data sources to estimate causal eects Shu Yang* syang24@ncsuedu Joint work with Peng Ding UC Berkeley May 23,

More information

1. Introduction. Keywords: Auxiliary information; Nonparametric regression; Pseudo empirical likelihood; Model-assisted approach; MARS.

1. Introduction. Keywords: Auxiliary information; Nonparametric regression; Pseudo empirical likelihood; Model-assisted approach; MARS. Nonparametric Methods for Sample Surveys of Environmental Populations Metodi nonparametrici nell inferenza per popolazioni finite di carattere ambientale Giorgio E. Montanari Dipartimento di Economia,

More information

TECHNIQUE FOR RANKING POTENTIAL PREDICTOR LAYERS FOR USE IN REMOTE SENSING ANALYSIS. Andrew Lister, Mike Hoppus, and Rachel Riemam

TECHNIQUE FOR RANKING POTENTIAL PREDICTOR LAYERS FOR USE IN REMOTE SENSING ANALYSIS. Andrew Lister, Mike Hoppus, and Rachel Riemam TECHNIQUE FOR RANKING POTENTIAL PREDICTOR LAYERS FOR USE IN REMOTE SENSING ANALYSIS Andrew Lister, Mike Hoppus, and Rachel Riemam ABSTRACT. Spatial modeling using GIS-based predictor layers often requires

More information

Graybill Conference Poster Session Introductions

Graybill Conference Poster Session Introductions Graybill Conference Poster Session Introductions 2013 Graybill Conference in Modern Survey Statistics Colorado State University Fort Collins, CO June 10, 2013 Small Area Estimation with Incomplete Auxiliary

More information

Lecture 3: Statistical Decision Theory (Part II)

Lecture 3: Statistical Decision Theory (Part II) Lecture 3: Statistical Decision Theory (Part II) Hao Helen Zhang Hao Helen Zhang Lecture 3: Statistical Decision Theory (Part II) 1 / 27 Outline of This Note Part I: Statistics Decision Theory (Classical

More information

A comparison of weighted estimators for the population mean. Ye Yang Weighting in surveys group

A comparison of weighted estimators for the population mean. Ye Yang Weighting in surveys group A comparison of weighted estimators for the population mean Ye Yang Weighting in surveys group Motivation Survey sample in which auxiliary variables are known for the population and an outcome variable

More information

Hierarchical Modeling for Multivariate Spatial Data

Hierarchical Modeling for Multivariate Spatial Data Hierarchical Modeling for Multivariate Spatial Data Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department

More information

New Developments in Nonresponse Adjustment Methods

New Developments in Nonresponse Adjustment Methods New Developments in Nonresponse Adjustment Methods Fannie Cobben January 23, 2009 1 Introduction In this paper, we describe two relatively new techniques to adjust for (unit) nonresponse bias: The sample

More information

In this chapter, we provide an introduction to covariate shift adaptation toward machine learning in a non-stationary environment.

In this chapter, we provide an introduction to covariate shift adaptation toward machine learning in a non-stationary environment. 1 Introduction and Problem Formulation In this chapter, we provide an introduction to covariate shift adaptation toward machine learning in a non-stationary environment. 1.1 Machine Learning under Covariate

More information

ENGRG Introduction to GIS

ENGRG Introduction to GIS ENGRG 59910 Introduction to GIS Michael Piasecki October 13, 2017 Lecture 06: Spatial Analysis Outline Today Concepts What is spatial interpolation Why is necessary Sample of interpolation (size and pattern)

More information

Statistics: Learning models from data

Statistics: Learning models from data DS-GA 1002 Lecture notes 5 October 19, 2015 Statistics: Learning models from data Learning models from data that are assumed to be generated probabilistically from a certain unknown distribution is a crucial

More information

Jong-Min Kim* and Jon E. Anderson. Statistics Discipline Division of Science and Mathematics University of Minnesota at Morris

Jong-Min Kim* and Jon E. Anderson. Statistics Discipline Division of Science and Mathematics University of Minnesota at Morris Jackknife Variance Estimation of the Regression and Calibration Estimator for Two 2-Phase Samples Jong-Min Kim* and Jon E. Anderson jongmink@morris.umn.edu Statistics Discipline Division of Science and

More information

Directorate E: Sectoral and regional statistics Unit E-4: Regional statistics and geographical information LUCAS 2018.

Directorate E: Sectoral and regional statistics Unit E-4: Regional statistics and geographical information LUCAS 2018. EUROPEAN COMMISSION EUROSTAT Directorate E: Sectoral and regional statistics Unit E-4: Regional statistics and geographical information Doc. WG/LCU 52 LUCAS 2018 Eurostat Unit E4 Working Group for Land

More information

Sensitivity of FIA Volume Estimates to Changes in Stratum Weights and Number of Strata. Data. Methods. James A. Westfall and Michael Hoppus 1

Sensitivity of FIA Volume Estimates to Changes in Stratum Weights and Number of Strata. Data. Methods. James A. Westfall and Michael Hoppus 1 Sensitivity of FIA Volume Estimates to Changes in Stratum Weights and Number of Strata James A. Westfall and Michael Hoppus 1 Abstract. In the Northeast region, the USDA Forest Service Forest Inventory

More information

Incorporating Boosted Regression Trees into Ecological Latent Variable Models

Incorporating Boosted Regression Trees into Ecological Latent Variable Models Incorporating Boosted Regression Trees into Ecological Latent Variable Models Rebecca A. Hutchinson, Li-Ping Liu, Thomas G. Dietterich School of EECS, Oregon State University Motivation Species Distribution

More information

A comparison of stratified simple random sampling and sampling with probability proportional to size

A comparison of stratified simple random sampling and sampling with probability proportional to size A comparison of stratified simple random sampling and sampling with probability proportional to size Edgar Bueno Dan Hedlin Per Gösta Andersson Department of Statistics Stockholm University Introduction

More information

Nonresponse weighting adjustment using estimated response probability

Nonresponse weighting adjustment using estimated response probability Nonresponse weighting adjustment using estimated response probability Jae-kwang Kim Yonsei University, Seoul, Korea December 26, 2006 Introduction Nonresponse Unit nonresponse Item nonresponse Basic strategy

More information

BIAS-ROBUSTNESS AND EFFICIENCY OF MODEL-BASED INFERENCE IN SURVEY SAMPLING

BIAS-ROBUSTNESS AND EFFICIENCY OF MODEL-BASED INFERENCE IN SURVEY SAMPLING Statistica Sinica 22 (2012), 777-794 doi:http://dx.doi.org/10.5705/ss.2010.238 BIAS-ROBUSTNESS AND EFFICIENCY OF MODEL-BASED INFERENCE IN SURVEY SAMPLING Desislava Nedyalova and Yves Tillé University of

More information

AGOG 485/585 /APLN 533 Spring Lecture 5: MODIS land cover product (MCD12Q1). Additional sources of MODIS data

AGOG 485/585 /APLN 533 Spring Lecture 5: MODIS land cover product (MCD12Q1). Additional sources of MODIS data AGOG 485/585 /APLN 533 Spring 2019 Lecture 5: MODIS land cover product (MCD12Q1). Additional sources of MODIS data Outline Current status of land cover products Overview of the MCD12Q1 algorithm Mapping

More information

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates Anastasios (Butch) Tsiatis Department of Statistics North Carolina State University http://www.stat.ncsu.edu/

More information

REPLICATION VARIANCE ESTIMATION FOR TWO-PHASE SAMPLES

REPLICATION VARIANCE ESTIMATION FOR TWO-PHASE SAMPLES Statistica Sinica 8(1998), 1153-1164 REPLICATION VARIANCE ESTIMATION FOR TWO-PHASE SAMPLES Wayne A. Fuller Iowa State University Abstract: The estimation of the variance of the regression estimator for

More information

Chapter 8: Estimation 1

Chapter 8: Estimation 1 Chapter 8: Estimation 1 Jae-Kwang Kim Iowa State University Fall, 2014 Kim (ISU) Ch. 8: Estimation 1 Fall, 2014 1 / 33 Introduction 1 Introduction 2 Ratio estimation 3 Regression estimator Kim (ISU) Ch.

More information

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Joe Schafer Office of the Associate Director for Research and Methodology U.S. Census

More information

Hierarchical Modeling for non-gaussian Spatial Data

Hierarchical Modeling for non-gaussian Spatial Data Hierarchical Modeling for non-gaussian Spatial Data Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department

More information

Critical assessment and addressed challenges in deploying innovative technologies in small-scale irrigation scheme in Uganda

Critical assessment and addressed challenges in deploying innovative technologies in small-scale irrigation scheme in Uganda Critical assessment and addressed challenges in deploying innovative technologies in small-scale irrigation scheme in Uganda Session II. Improving water-efficient irrigation: Prospects and difficulties

More information

Parametric Techniques Lecture 3

Parametric Techniques Lecture 3 Parametric Techniques Lecture 3 Jason Corso SUNY at Buffalo 22 January 2009 J. Corso (SUNY at Buffalo) Parametric Techniques Lecture 3 22 January 2009 1 / 39 Introduction In Lecture 2, we learned how to

More information

Estimating the Marginal Odds Ratio in Observational Studies

Estimating the Marginal Odds Ratio in Observational Studies Estimating the Marginal Odds Ratio in Observational Studies Travis Loux Christiana Drake Department of Statistics University of California, Davis June 20, 2011 Outline The Counterfactual Model Odds Ratios

More information

Econometrics I. Lecture 10: Nonparametric Estimation with Kernels. Paul T. Scott NYU Stern. Fall 2018

Econometrics I. Lecture 10: Nonparametric Estimation with Kernels. Paul T. Scott NYU Stern. Fall 2018 Econometrics I Lecture 10: Nonparametric Estimation with Kernels Paul T. Scott NYU Stern Fall 2018 Paul T. Scott NYU Stern Econometrics I Fall 2018 1 / 12 Nonparametric Regression: Intuition Let s get

More information

Calibration estimation in survey sampling

Calibration estimation in survey sampling Calibration estimation in survey sampling Jae Kwang Kim Mingue Park September 8, 2009 Abstract Calibration estimation, where the sampling weights are adjusted to make certain estimators match known population

More information

Weighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai

Weighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai Weighting Methods Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Weighting Methods Stat186/Gov2002 Fall 2018 1 / 13 Motivation Matching methods for improving

More information

The Wide Dynamic Range Vegetation Index and its Potential Utility for Gap Analysis

The Wide Dynamic Range Vegetation Index and its Potential Utility for Gap Analysis Summary StatMod provides an easy-to-use and inexpensive tool for spatially applying the classification rules generated from the CT algorithm in S-PLUS. While the focus of this article was to use StatMod

More information

In Praise of the Listwise-Deletion Method (Perhaps with Reweighting)

In Praise of the Listwise-Deletion Method (Perhaps with Reweighting) In Praise of the Listwise-Deletion Method (Perhaps with Reweighting) Phillip S. Kott RTI International NISS Worshop on the Analysis of Complex Survey Data With Missing Item Values October 17, 2014 1 RTI

More information

Inferences for the Ratio: Fieller s Interval, Log Ratio, and Large Sample Based Confidence Intervals

Inferences for the Ratio: Fieller s Interval, Log Ratio, and Large Sample Based Confidence Intervals Inferences for the Ratio: Fieller s Interval, Log Ratio, and Large Sample Based Confidence Intervals Michael Sherman Department of Statistics, 3143 TAMU, Texas A&M University, College Station, Texas 77843,

More information

Parametric Techniques

Parametric Techniques Parametric Techniques Jason J. Corso SUNY at Buffalo J. Corso (SUNY at Buffalo) Parametric Techniques 1 / 39 Introduction When covering Bayesian Decision Theory, we assumed the full probabilistic structure

More information

On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models

On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models Thomas Kneib Institute of Statistics and Econometrics Georg-August-University Göttingen Department of Statistics

More information

Better Bootstrap Confidence Intervals

Better Bootstrap Confidence Intervals by Bradley Efron University of Washington, Department of Statistics April 12, 2012 An example Suppose we wish to make inference on some parameter θ T (F ) (e.g. θ = E F X ), based on data We might suppose

More information

Introduction to GIS I

Introduction to GIS I Introduction to GIS Introduction How to answer geographical questions such as follows: What is the population of a particular city? What are the characteristics of the soils in a particular land parcel?

More information

Transformation and Smoothing in Sample Survey Data

Transformation and Smoothing in Sample Survey Data Scandinavian Journal of Statistics, Vol. 37: 496 513, 2010 doi: 10.1111/j.1467-9469.2010.00691.x Published by Blackwell Publishing Ltd. Transformation and Smoothing in Sample Survey Data YANYUAN MA Department

More information

BOOK REVIEW Sampling: Design and Analysis. Sharon L. Lohr. 2nd Edition, International Publication,

BOOK REVIEW Sampling: Design and Analysis. Sharon L. Lohr. 2nd Edition, International Publication, STATISTICS IN TRANSITION-new series, August 2011 223 STATISTICS IN TRANSITION-new series, August 2011 Vol. 12, No. 1, pp. 223 230 BOOK REVIEW Sampling: Design and Analysis. Sharon L. Lohr. 2nd Edition,

More information

Calibration estimation using exponential tilting in sample surveys

Calibration estimation using exponential tilting in sample surveys Calibration estimation using exponential tilting in sample surveys Jae Kwang Kim February 23, 2010 Abstract We consider the problem of parameter estimation with auxiliary information, where the auxiliary

More information

An Efficient Estimation Method for Longitudinal Surveys with Monotone Missing Data

An Efficient Estimation Method for Longitudinal Surveys with Monotone Missing Data An Efficient Estimation Method for Longitudinal Surveys with Monotone Missing Data Jae-Kwang Kim 1 Iowa State University June 28, 2012 1 Joint work with Dr. Ming Zhou (when he was a PhD student at ISU)

More information

Nonparametric regression estimation under complex sampling designs

Nonparametric regression estimation under complex sampling designs Retrospective Theses and Dissertations 2004 Nonparametric regression estimation under complex sampling designs Ji-Yeon Kim Iowa State University Follow this and additional works at: http://lib.dr.iastate.edu/rtd

More information

Covariance function estimation in Gaussian process regression

Covariance function estimation in Gaussian process regression Covariance function estimation in Gaussian process regression François Bachoc Department of Statistics and Operations Research, University of Vienna WU Research Seminar - May 2015 François Bachoc Gaussian

More information

Number of (Nested) Shopping Carts Three nested shopping carts are shown.

Number of (Nested) Shopping Carts Three nested shopping carts are shown. Randy must fit shopping carts into an area that has a length of 82 feet and a width of one shopping cart. He made some measurements necessary for his computations. The table shows the length of a set of

More information

Adaptive two-stage sequential double sampling

Adaptive two-stage sequential double sampling Adaptive two-stage sequential double sampling Bardia Panahbehagh Afshin Parvardeh Babak Mohammadi March 4, 208 arxiv:803.04484v [math.st] 2 Mar 208 Abstract In many surveys inexpensive auxiliary variables

More information

Advanced Methods for Agricultural and Agroenvironmental. Emily Berg, Zhengyuan Zhu, Sarah Nusser, and Wayne Fuller

Advanced Methods for Agricultural and Agroenvironmental. Emily Berg, Zhengyuan Zhu, Sarah Nusser, and Wayne Fuller Advanced Methods for Agricultural and Agroenvironmental Monitoring Emily Berg, Zhengyuan Zhu, Sarah Nusser, and Wayne Fuller Outline 1. Introduction to the National Resources Inventory 2. Hierarchical

More information

Using Non-Linear Mixed Models for Agricultural Data

Using Non-Linear Mixed Models for Agricultural Data Using Non-Linear Mixed Models for Agricultural Data Fernando E. Miguez Energy Biosciences Institute Crop Sciences University of Illinois, Urbana-Champaign miguez@illinois.edu Oct 8 th, 8 Outline 1 Introduction

More information

Estimation for two-phase designs: semiparametric models and Z theorems

Estimation for two-phase designs: semiparametric models and Z theorems Estimation for two-phase designs:semiparametric models and Z theorems p. 1/27 Estimation for two-phase designs: semiparametric models and Z theorems Jon A. Wellner University of Washington Estimation for

More information

What s New in Econometrics. Lecture 1

What s New in Econometrics. Lecture 1 What s New in Econometrics Lecture 1 Estimation of Average Treatment Effects Under Unconfoundedness Guido Imbens NBER Summer Institute, 2007 Outline 1. Introduction 2. Potential Outcomes 3. Estimands and

More information

An Introduction to Nonstationary Time Series Analysis

An Introduction to Nonstationary Time Series Analysis An Introduction to Analysis Ting Zhang 1 tingz@bu.edu Department of Mathematics and Statistics Boston University August 15, 2016 Boston University/Keio University Workshop 2016 A Presentation Friendly

More information

Bootstrap inference for the finite population total under complex sampling designs

Bootstrap inference for the finite population total under complex sampling designs Bootstrap inference for the finite population total under complex sampling designs Zhonglei Wang (Joint work with Dr. Jae Kwang Kim) Center for Survey Statistics and Methodology Iowa State University Jan.

More information

Developing Calibration Weights and Variance Estimates for a Survey of Drug-Related Emergency-Room Visits

Developing Calibration Weights and Variance Estimates for a Survey of Drug-Related Emergency-Room Visits Developing Calibration Weights and Variance Estimates for a Survey of Drug-Related Emergency-Room Visits Phillip S. Kott RTI International Charles D. Day SAMHSA RTI International is a trade name of Research

More information

Estimating basal area, trees, and above ground biomass per acre for common tree species across the Uncompahgre Plateau using NAIP CIR imagery

Estimating basal area, trees, and above ground biomass per acre for common tree species across the Uncompahgre Plateau using NAIP CIR imagery Estimating basal area, trees, and above ground biomass per acre for common tree species across the Uncompahgre Plateau using NAIP CIR imagery By John Hogland Nathaniel Anderson Greg Jones Obective Develop

More information

Spatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields

Spatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields Spatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields 1 Introduction Jo Eidsvik Department of Mathematical Sciences, NTNU, Norway. (joeid@math.ntnu.no) February

More information

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Alan Gelfand 1 and Andrew O. Finley 2 1 Department of Statistical Science, Duke University, Durham, North

More information

Lecture 2: Poisson point processes: properties and statistical inference

Lecture 2: Poisson point processes: properties and statistical inference Lecture 2: Poisson point processes: properties and statistical inference Jean-François Coeurjolly http://www-ljk.imag.fr/membres/jean-francois.coeurjolly/ 1 / 20 Definition, properties and simulation Statistical

More information

Manitoba s Elevation (LiDAR) & Imagery Datasets. Acquisition Plans & Opportunities for Collaboration

Manitoba s Elevation (LiDAR) & Imagery Datasets. Acquisition Plans & Opportunities for Collaboration Manitoba s Elevation (LiDAR) & Imagery Datasets Acquisition Plans & Opportunities for Collaboration Manitoba Planning Conference May 2017 Presentation Outline Manitoba s Elevation (LiDAR) and Imagery Datasets

More information

Water information system advances American River basin. Roger Bales, Martha Conklin, Steve Glaser, Bob Rice & collaborators UC: SNRI & CITRIS

Water information system advances American River basin. Roger Bales, Martha Conklin, Steve Glaser, Bob Rice & collaborators UC: SNRI & CITRIS Water information system advances American River basin Roger Bales, Martha Conklin, Steve Glaser, Bob Rice & collaborators UC: SNRI & CITRIS Opportunities Unprecedented level of information from low-cost

More information

Module 2.1 Monitoring activity data for forests using remote sensing

Module 2.1 Monitoring activity data for forests using remote sensing Module 2.1 Monitoring activity data for forests using remote sensing Module developers: Frédéric Achard, European Commission (EC) Joint Research Centre (JRC) Jukka Miettinen, EC JRC Brice Mora, Wageningen

More information